; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015305 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015305
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionP-loop containing nucleoside triphosphate hydrolases superfamily protein
Genome locationscaffold2:2447731..2450269
RNA-Seq ExpressionMS015305
SyntenyMS015305
Gene Ontology termsNA
InterPro domainsIPR008978 - HSP20-like chaperone
IPR025723 - Anion-transporting ATPase-like domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR040612 - ArsA, HSP20-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446550.1 PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo]1.3e-20180.61Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFGNPIPIS+  RT   ++    R + +Q+SK+ MD   Q PTRLLTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVI NQDPT EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC HNLSAVRLETTQMLLEPLK+LKQADSRLNMTQGVLEGVVGEEL VLPGMDS+FS+L LE+F+GFS  M QRD+K  YDIVIYDG+ TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SK RLYLKY+RS AEKTDLGRLATPSIL LVDEAM ISRPGSHL GRTSTDIWE LE +LE+GSSAF+EP KF C+IVMDPTSPASVQSALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAGAQI GA AF SSH +AE+ + LKE FSPLSLAF+P+FSIGS VDWNTVL DASSKGPRDLLSSSKS  SSL+ PVKFDPGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
         KSEIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_022150659.1 uncharacterized protein At1g26090, chloroplastic [Momordica charantia]5.8e-24797.59Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
        MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI

Query:  GNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
        GNSPVECGHNLSAVRLETTQMLLEPLKQL+QADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
Subjt:  GNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR

Query:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
        IMGAASKARLYLKYMRSAAEKTDLGRLATPSIL LVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
Subjt:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKS
        CTIQAGAQISGAFAFISSH+DAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTL MPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKS

Query:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_022956773.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata]4.1e-20079.52Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+LKQADSRLNMTQG LEG+VGEELG+LPGMDS+FSVL LE+FLG S  MAQ D+K  YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSI+ LVDEAM IS PGSHLSGRTSTD W+ALERMLE+GSSA +EP +F CFIVMDPTSPASV+SA R
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAGAQISGAFA ISS +DAES +RLKENFSPLSL FMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+L SPVKF+PGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_022979170.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima]2.6e-20280.39Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIG+SPVEC HNLSAVRLETTQMLLEPLK+LKQADS LNMTQG LEGVVGEELG+LPGMDS+FSVL LE+FLGFS  MAQ D+KA YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSIL LVDEAM IS PGSHLSGRTSTD W+ALE MLE+GSSA +EP +F CFIVMDPTSPASV+SALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAGAQISGAFA ISS +DAES +RLKENF PL LAFMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+LLSPVKFDPGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_038891424.1 uncharacterized protein At1g26090, chloroplastic [Benincasa hispida]2.2e-20181.26Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSL +S SFFGNPIPIS+  RT       R R + +Q+SKEI D   Q PTR+LTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVIHNQDPT EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+LKQADSRLNMTQG+LEGVVGEELGVLPG DS+FS+L LE+FLGFS  M QRD+K  YD+VIYDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSIL LVDEAM ISRPGSHLS RTSTDIWEALE +LE+GSSAF+EP KF CFIVMDPTSPASVQSALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAG QISGA AFISSH+ AES + LKE FSPLSLAFMP+FS GS VDWNTVL DASSKGPRDLLS SKS  SSLLSPVKFDPGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
         KSEIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAK  DR LVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

TrEMBL top hitse value%identityAlignment
A0A1S3BET7 uncharacterized protein At1g26090, chloroplastic6.1e-20280.61Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFGNPIPIS+  RT   ++    R + +Q+SK+ MD   Q PTRLLTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVI NQDPT EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC HNLSAVRLETTQMLLEPLK+LKQADSRLNMTQGVLEGVVGEEL VLPGMDS+FS+L LE+F+GFS  M QRD+K  YDIVIYDG+ TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SK RLYLKY+RS AEKTDLGRLATPSIL LVDEAM ISRPGSHL GRTSTDIWE LE +LE+GSSAF+EP KF C+IVMDPTSPASVQSALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAGAQI GA AF SSH +AE+ + LKE FSPLSLAF+P+FSIGS VDWNTVL DASSKGPRDLLSSSKS  SSL+ PVKFDPGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
         KSEIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

A0A5A7STS2 ArsA_ATPase domain-containing protein2.6e-19283.57Show/hide
Query:  EIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGV
        ++  Q PTRLLTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVI NQDPT EYLLDCKIGNSPVEC HNLSAVRLETTQMLLEPLK+LKQADSRLNMTQGV
Subjt:  EIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGV

Query:  LEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGI
        LEGVVGEEL VLPGMDS+FS+L LE+F+GFS  M QRD+K  YDIVIYDG+ TEETIR++GA SK RLYLKY+RS AEKTDLGRLATPSIL LVDEAM I
Subjt:  LEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGI

Query:  SRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPK
        SRPGSHL GRTSTDIWE LE +LE+GSSAF+EP KF C+IVMDPTSPASVQSALRYWGCTIQAGAQI GA AF SSH +AE+ + LKE FSPLSLAF+P+
Subjt:  SRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPK

Query:  FSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVG
        FSIGS VDWNTVL DASSKGPRDLLSSSKS  SSL+ PVKFDPGN+SVTLLMPGF KSEIKLYQARS      GGSELLVEAGDQRRVISLPKEIQGKVG
Subjt:  FSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVG

Query:  GAKFMDRSLVITMR
        GAKFMDRSLVITMR
Subjt:  GAKFMDRSLVITMR

A0A6J1D944 uncharacterized protein At1g26090, chloroplastic2.8e-24797.59Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
        MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI

Query:  GNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
        GNSPVECGHNLSAVRLETTQMLLEPLKQL+QADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
Subjt:  GNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR

Query:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
        IMGAASKARLYLKYMRSAAEKTDLGRLATPSIL LVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
Subjt:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKS
        CTIQAGAQISGAFAFISSH+DAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTL MPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKS

Query:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

A0A6J1GY43 uncharacterized protein At1g26090, chloroplastic2.0e-20079.52Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+LKQADSRLNMTQG LEG+VGEELG+LPGMDS+FSVL LE+FLG S  MAQ D+K  YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSI+ LVDEAM IS PGSHLSGRTSTD W+ALERMLE+GSSA +EP +F CFIVMDPTSPASV+SA R
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAGAQISGAFA ISS +DAES +RLKENFSPLSL FMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+L SPVKF+PGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

A0A6J1ISG8 uncharacterized protein At1g26090, chloroplastic1.2e-20280.39Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIG+SPVEC HNLSAVRLETTQMLLEPLK+LKQADS LNMTQG LEGVVGEELG+LPGMDS+FSVL LE+FLGFS  MAQ D+KA YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSIL LVDEAM IS PGSHLSGRTSTD W+ALE MLE+GSSA +EP +F CFIVMDPTSPASV+SALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF
        YWGCTIQAGAQISGAFA ISS +DAES +RLKENF PL LAFMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+LLSPVKFDPGN+SVTLLMPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

SwissProt top hitse value%identityAlignment
O52027 Putative arsenical pump-driving ATPase3.2e-0628.4Show/hide
Query:  SKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGH-NLSAVRLETTQMLLE----PLKQLKQADSR
        ++E++  + TR L F GKGG GK++ A   A   A AG  T +V  +       + +  +G+ P   G  NL A R++  + L E     L  +++    
Subjt:  SKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGH-NLSAVRLETTQMLLE----PLKQLKQADSR

Query:  LNMTQGVLEGVVG--EELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIM
         + TQ  +E  V   EE    P  + + +   LEKF+ + E       +  YDIV++D   T  T+R++
Subjt:  LNMTQGVLEGVVG--EELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIM

Q46366 Putative arsenical pump-driving ATPase1.0e-1222.46Show/hide
Query:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKTS +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L +    +++  +R+ M QGV  GV+ +E
Subjt:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEE

Query:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLS
        + +LPGM+ +FS+L ++++               YD ++ D   T ET+R++              S  +    G  A  ++   +     +S+P S +S
Subjt:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLS

Query:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPK
         + +      D  E+++++   LE      ++  K    +VM+     S++  +R        G ++      ++  +DA+  S   E +  +   ++ +
Subjt:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPK

Query:  FSIG-SPVDWNTV-LHDASSKGPRDLL--------SSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFL-LYRGGSELLVEAGDQRRVI
           G SP+    + ++D    G + L          +  S +     P+KF        +      + ++KL  A    + ++  G EL V+ G+QR++I
Subjt:  FSIG-SPVDWNTV-LHDASSKGPRDLL--------SSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFL-LYRGGSELLVEAGDQRRVI

Query:  SLPKEIQG-KVGGAKFMDRSLVI
        +LP  + G + G A F D+ L I
Subjt:  SLPKEIQG-KVGGAKFMDRSLVI

Q46465 Putative arsenical pump-driving ATPase9.3e-1423.09Show/hide
Query:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKTS +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L E    +++  +R+ M QGV  GV+ +E
Subjt:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEE

Query:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLS
        + +LPGM+ +FS+L ++++               YD ++ D   T ET+R++              S  +    G  A  ++   +     +S+P S +S
Subjt:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLS

Query:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVM--DPTSPASVQSALRY---WGCTIQ-------AGAQISGAFAFISSHVDAESVSRLKE
         + +      D  E+++++   LE      ++  K    +VM  +  S      AL Y   +G  +          AQ +  +      +  + +  ++E
Subjt:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVM--DPTSPASVQSALRY---WGCTIQ-------AGAQISGAFAFISSHVDAESVSRLKE

Query:  NFSPLSLAFMPKF--SIGSPVDWNTVLHDA-SSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFL------LYRGGSELL
         FSPL +  +  +   I          HD      P D++            P+KF               K +I   Q +  F       ++  G EL 
Subjt:  NFSPLSLAFMPKF--SIGSPVDWNTVLHDA-SSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFL------LYRGGSELL

Query:  VEAGDQRRVISLPKEIQG-KVGGAKFMDRSLVI
        V+ G+QR++I+LP  + G + G A F D+ L I
Subjt:  VEAGDQRRVISLPKEIQG-KVGGAKFMDRSLVI

Q55794 Putative arsenical pump-driving ATPase1.5e-1121.63Show/hide
Query:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEE
        R++   GKGG GKTS A       A  G +T ++  +   +     D ++G+ P     NL    L+    L      +K+  +++   +G L+GV  EE
Subjt:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEE

Query:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLS
        L +LPGMD +F ++           M +   +A+YD++I D   T   +R++        Y++      +   +     P +  L     G S P   + 
Subjt:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLS

Query:  GRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTI-QAGAQISGAFAFISSHVDAESVSRLK-----------ENFSPLSLA
             + +E +E +        ++ ++    +V +P      +S   +   ++      +  A   +   +D     R K           +NF PL + 
Subjt:  GRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTI-QAGAQISGAFAFISSHVDAESVSRLK-----------ENFSPLSLA

Query:  FMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQ
          P FS    +     L        +D   S   +  + ++ V+    + S+ L +PG  K +I+L +          G EL V  G+ RR + LP+ + 
Subjt:  FMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQ

Query:  G-KVGGAKFMDRSLVI
             GAK  D  L I
Subjt:  G-KVGGAKFMDRSLVI

Q6DYE4 Uncharacterized protein At1g26090, chloroplastic6.4e-14057.2Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS
        + +S L  +S   N +PI   +RT   + +R+RRA  +   SS+++ D      QK T+ +TFLGKGGSGKT++AVFAAQH+ALAGL TCLVIHNQDP++
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS

Query:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD
        E+LL  KIG SP     NLS +RLETT+MLLEPLKQLKQAD+RLNMTQGVLEGVVGEELGVLPGMDS+FS+L LE+ +GF     +++ K   +D++IYD
Subjt:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD

Query:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS
        GISTEET+R++G +SK RLY KY+RS AEKTDLGRL +PSI+  VDE+M I+   S   G TS  +W+ LER LE G+SA+ +P +F  F+VMDP +P S
Subjt:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS

Query:  VQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT
        V++ALRYWGCT+QAG+ +SGAFA  SSH+ ++     K +F PL  A        + +DW+ +L D ++   R+LLS + SH +SL   V FD   + VT
Subjt:  VQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT

Query:  LLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        L MPGFEKSEIKLYQ       YRGGSELL+EAGDQRRVI LP +IQGKVGGAKF+DRSL++TMR
Subjt:  LLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

Arabidopsis top hitse value%identityAlignment
AT1G26090.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.6e-14157.2Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS
        + +S L  +S   N +PI   +RT   + +R+RRA  +   SS+++ D      QK T+ +TFLGKGGSGKT++AVFAAQH+ALAGL TCLVIHNQDP++
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS

Query:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD
        E+LL  KIG SP     NLS +RLETT+MLLEPLKQLKQAD+RLNMTQGVLEGVVGEELGVLPGMDS+FS+L LE+ +GF     +++ K   +D++IYD
Subjt:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD

Query:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS
        GISTEET+R++G +SK RLY KY+RS AEKTDLGRL +PSI+  VDE+M I+   S   G TS  +W+ LER LE G+SA+ +P +F  F+VMDP +P S
Subjt:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS

Query:  VQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT
        V++ALRYWGCT+QAG+ +SGAFA  SSH+ ++     K +F PL  A        + +DW+ +L D ++   R+LLS + SH +SL   V FD   + VT
Subjt:  VQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT

Query:  LLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        L MPGFEKSEIKLYQ       YRGGSELL+EAGDQRRVI LP +IQGKVGGAKF+DRSL++TMR
Subjt:  LLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGTCTCTGCTATATTCCACTTCTTTCTTTGGAAACCCAATTCCCATTTCAATGCCAATTCGAACCGGAAGAGCAGCATCTACTCGCAGGAGAAGAGCTCTGCC
AGTCCAGTCTTCCAAAGAGATTATGGACCAGAAACCAACCAGGCTGCTCACTTTTCTTGGCAAAGGCGGCTCGGGGAAGACCTCTTCAGCGGTATTCGCCGCTCAGCACT
TTGCATTGGCTGGACTGCGGACGTGTCTGGTGATACATAATCAAGACCCTACGTCTGAGTATCTTCTGGATTGTAAAATTGGGAATTCTCCCGTCGAATGCGGTCACAAC
CTCTCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAACAGCTAAAGCAAGCAGATTCTCGTCTTAACATGACACAAGGAGTTCTTGAAGGGGTGGT
TGGAGAAGAGCTTGGAGTACTTCCAGGAATGGATTCTGTCTTCTCGGTACTTCTACTTGAGAAATTTCTTGGGTTCTCAGAGAATATGGCCCAAAGAGACCGAAAAGCTA
ACTATGACATAGTAATATATGACGGTATCAGCACCGAGGAAACAATAAGGATCATGGGAGCGGCCAGTAAAGCGAGGTTGTACCTAAAATATATGAGGAGCGCTGCTGAA
AAAACCGATCTTGGGAGATTGGCTACTCCTTCAATTTTGAGTCTTGTTGATGAAGCCATGGGTATAAGCAGGCCAGGCTCCCATCTCAGTGGTAGAACCAGTACAGATAT
ATGGGAGGCACTGGAACGCATGTTAGAGAGAGGGTCTTCTGCATTTTCAGAGCCAAGTAAATTTGGCTGCTTTATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGT
CTGCATTACGGTACTGGGGTTGCACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTTCATTTCTTCACACGTGGATGCAGAATCCGTTTCTAGGTTGAAGGAG
AACTTTTCCCCTTTATCTTTGGCCTTTATGCCAAAGTTCTCAATTGGTTCCCCTGTAGATTGGAACACAGTTCTGCATGATGCATCAAGTAAAGGCCCGAGAGACCTTCT
GTCTTCGTCAAAAAGCCATATCAGCAGTCTGCTATCACCTGTAAAATTCGATCCTGGAAACAGATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCAGAAATCAAGC
TTTACCAGGCACGTTCATTCTTTCTATTATATAGGGGAGGGTCTGAGCTGTTGGTAGAAGCTGGAGATCAGAGGCGTGTCATTTCTCTGCCTAAAGAAATTCAAGGGAAG
GTGGGTGGTGCCAAGTTCATGGACAGAAGTCTTGTGATCACAATGCGT
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGTCTCTGCTATATTCCACTTCTTTCTTTGGAAACCCAATTCCCATTTCAATGCCAATTCGAACCGGAAGAGCAGCATCTACTCGCAGGAGAAGAGCTCTGCC
AGTCCAGTCTTCCAAAGAGATTATGGACCAGAAACCAACCAGGCTGCTCACTTTTCTTGGCAAAGGCGGCTCGGGGAAGACCTCTTCAGCGGTATTCGCCGCTCAGCACT
TTGCATTGGCTGGACTGCGGACGTGTCTGGTGATACATAATCAAGACCCTACGTCTGAGTATCTTCTGGATTGTAAAATTGGGAATTCTCCCGTCGAATGCGGTCACAAC
CTCTCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAACAGCTAAAGCAAGCAGATTCTCGTCTTAACATGACACAAGGAGTTCTTGAAGGGGTGGT
TGGAGAAGAGCTTGGAGTACTTCCAGGAATGGATTCTGTCTTCTCGGTACTTCTACTTGAGAAATTTCTTGGGTTCTCAGAGAATATGGCCCAAAGAGACCGAAAAGCTA
ACTATGACATAGTAATATATGACGGTATCAGCACCGAGGAAACAATAAGGATCATGGGAGCGGCCAGTAAAGCGAGGTTGTACCTAAAATATATGAGGAGCGCTGCTGAA
AAAACCGATCTTGGGAGATTGGCTACTCCTTCAATTTTGAGTCTTGTTGATGAAGCCATGGGTATAAGCAGGCCAGGCTCCCATCTCAGTGGTAGAACCAGTACAGATAT
ATGGGAGGCACTGGAACGCATGTTAGAGAGAGGGTCTTCTGCATTTTCAGAGCCAAGTAAATTTGGCTGCTTTATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGT
CTGCATTACGGTACTGGGGTTGCACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTTCATTTCTTCACACGTGGATGCAGAATCCGTTTCTAGGTTGAAGGAG
AACTTTTCCCCTTTATCTTTGGCCTTTATGCCAAAGTTCTCAATTGGTTCCCCTGTAGATTGGAACACAGTTCTGCATGATGCATCAAGTAAAGGCCCGAGAGACCTTCT
GTCTTCGTCAAAAAGCCATATCAGCAGTCTGCTATCACCTGTAAAATTCGATCCTGGAAACAGATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCAGAAATCAAGC
TTTACCAGGCACGTTCATTCTTTCTATTATATAGGGGAGGGTCTGAGCTGTTGGTAGAAGCTGGAGATCAGAGGCGTGTCATTTCTCTGCCTAAAGAAATTCAAGGGAAG
GTGGGTGGTGCCAAGTTCATGGACAGAAGTCTTGTGATCACAATGCGT
Protein sequenceShow/hide protein sequence
MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHN
LSAVRLETTQMLLEPLKQLKQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAE
KTDLGRLATPSILSLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHVDAESVSRLKE
NFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLLMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGK
VGGAKFMDRSLVITMR