CuGenDBv2

CaUC05G102760 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC05G102760
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: P-loop containing nucleoside triphosphate hydrolases superfamily protein
Genome location: Ciama_Chr05:33569178..33573488
RNA-Seq Expression: CaUC05G102760
Synteny: CaUC05G102760
Gene Ontology terms: NA
InterPro domains:
    IPR008978 - HSP20-like chaperone
    IPR025723 - Anion-transporting ATPase-like domain
    IPR027417 - P-loop containing nucleoside triphosphate hydrolase
    IPR040612 - ArsA, HSP20-like domain
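
The genome location field above uses a `sequence:start..end` notation. As a minimal sketch (not part of the database record), the snippet below splits such a string into its parts and computes the span length. The names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'Ciama_Chr05:33569178..33573488'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("Ciama_Chr05:33569178..33573488")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4311 bp on Ciama_Chr05.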


Homology
GenBank top hits | e value | % identity | Alignment
XP_004135118.2 uncharacterized protein At1g26090, chloroplastic [Cucumis sativus] | 3.6e-210 | 76.94
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPIS+RT T  C  R + LQASK+ TDVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLV+ NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        NSPVECSHNL+AVRLETTQMLLEPLKRLKQADSRLNMTQGVLEG                 VVGEEL VLPG DSIFS+LQLERF+GFSGIMGQRDQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YD+VIYDGICTEETIRMIGATSK                                  RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISR GSHL G
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS DIWE LEH+LEKGSSAFAEPRKFSC+IVMDPTSPASVQSALRYWGCTIQAGAQI GALAFISS  NAE++ASLKEKFSPLSLAF+PQFSTGSSVDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVLRD SS GPRDLLS SK+LT SL  PVKFDPGNKSVTLLMPGFGKSEIKLY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKF    L++
Subjt:  QGKVGGAKFTGFLLIV

XP_008446550.1 PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo] | 7.3e-211 | 77.52
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPISIRT T PC  R + LQASK+  DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEG                 VVGEEL VLPG DSIFS+LQLERF+GFSGIMGQRDQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YDIVIYDG+CTEETIRMIGATSK                                  RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISR GSHL G
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS DIWE LEH+LEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQI GALAF SS  NAE++ASLKEKFSPLSLAF+PQFS GSSVDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVLRD SS+GPRDLLS SKSLT SL  PVKFDPGNKSVTLLMPGFGKSEIKLY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKF    L++
Subjt:  QGKVGGAKFTGFLLIV

XP_022979170.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima] | 7.6e-200 | 74.03
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT T+PCR R M ++ASKE+TDVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        +SPVECSHNLSAVRLETTQMLLEPLKRLKQADS LNMTQG LEG                 VVGEELG+LPG DSIFS+LQLERFLGFSGIM Q DQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YDIV+YDGICTEETIRMIGATSKA                                 RLYLKYLRSIAEKTDLGRLATPSILRLVDEAM+IS  GSHLSG
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS D W+ALEHMLEKGSSA AEPR+FSC+IVMDPTSPASV+SALRYWGCTIQAGAQISGA A ISS L+AES A LKE F PL LAFMPQ S GS VDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVL D SS+GPR+LLS SKS + +L SPVKFDPGNKSVTLLMPGF KSEI+LY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKF    L++
Subjt:  QGKVGGAKFTGFLLIV

XP_023529730.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita pepo subsp. pepo] | 9.0e-201 | 73.84
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT T+PCR R M ++ASKE+TDVSSQN  RMLTFLGKGGSGKTTSAVF A+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQG LEG                 +VGEELG+LPG DSIFS+LQLERFLG SGIM Q DQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YDIV+YDGICTEETIRMIGATSKA                                 RLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM+IS  GSHLSG
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS D W+ALEHMLEKGSSA AEPR+FSC+IVMDPTSPASV+SA RYWGCTIQAGAQISGA A ISS L+AES A LKE FSPL LAFMPQ S GS VDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVL D SS+GPR+LLS SKS + +LPSPVKF+PGNKSVTLLMPGF KSEI+LY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKFT   L++
Subjt:  QGKVGGAKFTGFLLIV

XP_038891424.1 uncharacterized protein At1g26090, chloroplastic [Benincasa hispida] | 2.5e-219 | 80.43
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSL FSASFFGNPIPISIRT T+PCR RF+ LQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG+LEG                 VVGEELGVLPGTDSIFS+LQLERFLGFSGIMGQRDQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YD+VIYDGICTEETIRMIGATSKA                                 RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISR GSHLS 
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS DIWEALEH+LEKGSSAFAEPRKFSC+IVMDPTSPASVQSALRYWGCTIQAG QISGALAFISS L+AESTASLKEKFSPLSLAFMPQFSTGSSVDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVLRD SS+GPRDLLSLSKS+T SL SPVKFDPGNKSVTLLMPGFGKSEIKLY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAK T   L++
Subjt:  QGKVGGAKFTGFLLIV

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BET7 uncharacterized protein At1g26090, chloroplastic | 3.6e-211 | 77.52
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPISIRT T PC  R + LQASK+  DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEG                 VVGEEL VLPG DSIFS+LQLERF+GFSGIMGQRDQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YDIVIYDG+CTEETIRMIGATSK                                  RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISR GSHL G
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS DIWE LEH+LEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQI GALAF SS  NAE++ASLKEKFSPLSLAF+PQFS GSSVDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVLRD SS+GPRDLLS SKSLT SL  PVKFDPGNKSVTLLMPGFGKSEIKLY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKF    L++
Subjt:  QGKVGGAKFTGFLLIV

A0A5A7STS2 ArsA_ATPase domain-containing protein | 5.2e-194 | 77.68
Query:  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV
        DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV
Subjt:  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV

Query:  LEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCT
        LEG                 VVGEEL VLPG DSIFS+LQLERF+GFSGIMGQRDQK KYDIVIYDG+CTEETIRMIGATSK                  
Subjt:  LEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCT

Query:  NWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASV
                        RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISR GSHL GRTS DIWE LEH+LEKGSSAFAEPRKFSCYIVMDPTSPASV
Subjt:  NWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASV

Query:  QSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTL
        QSALRYWGCTIQAGAQI GALAF SS  NAE++ASLKEKFSPLSLAF+PQFS GSSVDWNTVLRD SS+GPRDLLS SKSLT SL  PVKFDPGNKSVTL
Subjt:  QSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTL

Query:  LMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTGFLLIV
        LMPGFGKSEIKLYQA+                     GGSELLVEAGDQRRVISLPKEIQGKVGGAKF    L++
Subjt:  LMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTGFLLIV

A0A5D3CFE3 ArsA_ATPase domain-containing protein | 3.2e-188 | 76
Query:  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV
        DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV
Subjt:  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV

Query:  LEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCT
        LEG                 VVGEEL VLPG DSIFS+LQLERF+GFSGIMGQRDQK KYDIVIYDG+CTEETIRMIGATSK                  
Subjt:  LEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCT

Query:  NWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASV
                        RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISR GSHL GRTS DIWE LEH+LEKGSSAFAEPRKFSCYIVMDPTSPASV
Subjt:  NWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASV

Query:  QSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTL
        QSALRYWGCTIQAGAQI GALAF SS  NAE++ASLKEKFSPLSLAF+PQFS GSSVDWNTVLRD SS+GPRDLLS SKSLT SL  PVKFDPGNKSVTL
Subjt:  QSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTL

Query:  LMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTGFLLIV
        LMPGFGK                              GGSELLVEAGDQRRVISLPKEIQGKVGGAKF    L++
Subjt:  LMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTGFLLIV

A0A6J1GY43 uncharacterized protein At1g26090, chloroplastic | 5.3e-199 | 73.45
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT T+PCR R M ++ASKE+TDVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG LEG                 +VGEELG+LPG DSIFS+LQLERFLG SGIM Q DQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YDIV+YDGICTEETIRMIGATSKA                                 RLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM IS  GSHLSG
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS D W+ALE MLEKGSSA AEPR+FSC+IVMDPTSPASV+SA RYWGCTIQAGAQISGA A ISS L+AES A LKE FSPLSL FMPQ S GS VDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVL D SS+GPR+LLS SKS + +LPSPVKF+PGNKSVTLLMPGF KSEI+LY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKF    L++
Subjt:  QGKVGGAKFTGFLLIV

A0A6J1ISG8 uncharacterized protein At1g26090, chloroplastic | 3.7e-200 | 74.03
Query:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT T+PCR R M ++ASKE+TDVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK
        +SPVECSHNLSAVRLETTQMLLEPLKRLKQADS LNMTQG LEG                 VVGEELG+LPG DSIFS+LQLERFLGFSGIM Q DQK K
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGK

Query:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG
        YDIV+YDGICTEETIRMIGATSKA                                 RLYLKYLRSIAEKTDLGRLATPSILRLVDEAM+IS  GSHLSG
Subjt:  YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSG

Query:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW
        RTS D W+ALEHMLEKGSSA AEPR+FSC+IVMDPTSPASV+SALRYWGCTIQAGAQISGA A ISS L+AES A LKE F PL LAFMPQ S GS VDW
Subjt:  RTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDW

Query:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI
        NTVL D SS+GPR+LLS SKS + +L SPVKFDPGNKSVTLLMPGF KSEI+LY                     QYRGGSELLVEAGDQRRVISLPKEI
Subjt:  NTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEI

Query:  QGKVGGAKFTGFLLIV
        QGKVGGAKF    L++
Subjt:  QGKVGGAKFTGFLLIV

SwissProt top hits | e value | % identity | Alignment
O50593 Arsenical pump-driving ATPase | 8.6e-05 | 26.86
Query:  QNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNS--PVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLE
        QN    L F GKGG GKT+ +   A H A  G R  LV  +       + D  IGN+  PV     LSA+        ++P +  +Q  +R      +++
Subjt:  QNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNS--PVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLE

Query:  GVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMI
         +  LL +D      ++N + E+L     T       ++  F  F+G++       ++D +I+D   T  TIR++
Subjt:  GVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMI

O66908 Putative arsenical pump-driving ATPase 1 | 1.5e-04 | 25.53
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDC---------KIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQG
        R++ F GKGG GKTT  + AA  + LS L   +++ + DP    L D          K    P++ + NL    ++    + E ++R             
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDC---------KIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQG

Query:  VLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGATSKARFEM
           G V+   E L     L  ++ +EL +LPG + I SLL + ++           ++G +D++I D   T E+IR +   +  ++ M
Subjt:  VLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGATSKARFEM

Q46366 Putative arsenical pump-driving ATPase | 3.4e-09 | 26.04
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLL
        R+LTF GKGG GKT+ +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L +    +++  +R+ M QGV        
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLL

Query:  WEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMI
                    V+ +E+ +LPG + +FSLL+++R+             G YD ++ D   T ET+R++
Subjt:  WEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMI

Q46465 Putative arsenical pump-driving ATPase | 2.0e-09 | 26.63
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLL
        R+LTF GKGG GKT+ +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L E    +++  +R+ M QGV        
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLL

Query:  WEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMI
                    V+ +E+ +LPG + +FSLL+++R+             G YD ++ D   T ET+R++
Subjt:  WEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMI

Q6DYE4 Uncharacterized protein At1g26090, chloroplastic | 7.1e-124 | 50
Query:  MASSLLFSASFFGNPIPISIRTGTSPC----RIRFMTLQASKEITDV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY
        + +S L  +S   N +PI +RT T       R  ++   +S+++ D    SSQ  T+ +TFLGKGGSGKTT+AVFAAQH+AL+GL TCLVIHNQDP+ E+
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPC----RIRFMTLQASKEITDV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY

Query:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMG
        LL  KIG SP   + NLS +RLETT+MLLEPLK+LKQAD+RLNMTQGVLEG                 VVGEELGVLPG DSIFS+L+LER +GF     
Subjt:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMG

Query:  QRDQKGK-YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSIS
        +++ KGK +D++IYDGI TEET+RMIG +SK                                  RLY KYLRS+AEKTDLGRL +PSI+R VDE+M+I+
Subjt:  QRDQKGK-YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSIS

Query:  RSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQF
         + S   G TS  +W+ LE  LE G+SA+ +P +F  ++VMDP +P SV++ALRYWGCT+QAG+ +SGA A  SS L ++     K  F PL  A     
Subjt:  RSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQF

Query:  STGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRR
         T + +DW+ +L D ++   R+LLS + S   SL   V FD   K VTL MPGF KSEIKLY                     QYRGGSELL+EAGDQRR
Subjt:  STGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRR

Query:  VISLPKEIQGKVGGAKFTGFLLIV
        VI LP +IQGKVGGAKF    LIV
Subjt:  VISLPKEIQGKVGGAKFTGFLLIV

Arabidopsis top hits | e value | % identity | Alignment
AT1G26090.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein | 5.0e-125 | 50
Query:  MASSLLFSASFFGNPIPISIRTGTSPC----RIRFMTLQASKEITDV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY
        + +S L  +S   N +PI +RT T       R  ++   +S+++ D    SSQ  T+ +TFLGKGGSGKTT+AVFAAQH+AL+GL TCLVIHNQDP+ E+
Subjt:  MASSLLFSASFFGNPIPISIRTGTSPC----RIRFMTLQASKEITDV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY

Query:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMG
        LL  KIG SP   + NLS +RLETT+MLLEPLK+LKQAD+RLNMTQGVLEG                 VVGEELGVLPG DSIFS+L+LER +GF     
Subjt:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMG

Query:  QRDQKGK-YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSIS
        +++ KGK +D++IYDGI TEET+RMIG +SK                                  RLY KYLRS+AEKTDLGRL +PSI+R VDE+M+I+
Subjt:  QRDQKGK-YDIVIYDGICTEETIRMIGATSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSIS

Query:  RSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQF
         + S   G TS  +W+ LE  LE G+SA+ +P +F  ++VMDP +P SV++ALRYWGCT+QAG+ +SGA A  SS L ++     K  F PL  A     
Subjt:  RSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCYIVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQF

Query:  STGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRR
         T + +DW+ +L D ++   R+LLS + S   SL   V FD   K VTL MPGF KSEIKLY                     QYRGGSELL+EAGDQRR
Subjt:  STGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVTLLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRR

Query:  VISLPKEIQGKVGGAKFTGFLLIV
        VI LP +IQGKVGGAKF    LIV
Subjt:  VISLPKEIQGKVGGAKFTGFLLIV


Sequences
CDS sequence
ATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAGGAACATCTCCATGTAGGATAAGATTTATGACCCTTCAGGCTTC
AAAAGAGATTACAGACGTTTCTTCTCAAAACCCAACCCGGATGCTCACTTTTCTTGGCAAAGGCGGCTCCGGAAAGACCACTTCCGCGGTATTCGCCGCTCAGCATTTTG
CATTGTCTGGACTGCGCACATGTCTGGTGATACATAATCAAGACCCTACTCCTGAGTATCTTCTTGATTGCAAAATTGGAAATTCTCCTGTTGAATGCAGTCACAACCTC
TCTGCTGTTAGGTTGGAAACCACTCAAATGCTACTTGAACCTCTCAAACGGCTAAAGCAAGCTGATTCTCGTCTTAATATGACACAAGGAGTTCTTGAAGGGGTAGTATT
CCTACTTTGGGAGGATTTGCTATGTAATATGTACCTTCTTAATGTTGTTGGAGAAGAGCTTGGGGTACTTCCAGGAACAGATTCTATCTTTTCGCTACTTCAGCTTGAGA
GATTCCTTGGGTTCTCAGGTATTATGGGTCAAAGGGACCAAAAAGGTAAATATGACATAGTAATATATGATGGTATCTGCACCGAAGAAACAATAAGAATGATTGGAGCA
ACCAGTAAAGCAAGGTTTGAGATGCTTTCTCTCTCCATATATAGTTCCCCCCACCCCTGCACAAATTGGAAAAAGAGAAAGAAAATCCGTATTACTAAAAGAACTTTCGA
TAGGTTGTACTTAAAATATCTGAGGAGCATTGCTGAAAAAACTGATCTGGGGAGGTTGGCTACTCCTTCAATTCTGAGGCTTGTTGATGAAGCCATGAGTATAAGCAGGT
CAGGCTCCCATCTCAGCGGTAGAACTAGTGCTGATATATGGGAGGCACTGGAACACATGTTAGAGAAAGGGTCTTCTGCATTTGCAGAGCCAAGAAAATTTAGCTGCTAT
ATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGTCTGCATTACGGTACTGGGGTTGTACTATTCAAGCTGGTGCACAAATATCTGGTGCACTTGCTTTCATTTCTTC
AGACTTGAATGCAGAATCCACTGCTAGTTTGAAGGAGAAATTTTCACCCTTGTCTTTGGCCTTTATGCCACAGTTCTCAACTGGTTCCTCAGTAGATTGGAACACAGTTC
TGCGCGATGTGTCAAGTAGAGGCCCAAGGGATCTTCTTTCTTTGTCAAAAAGCCTCACCGGCAGTCTGCCGTCACCTGTAAAATTCGATCCTGGAAATAAATCAGTTACA
CTTCTCATGCCAGGCTTCGGGAAGTCAGAAATCAAGCTTTATCAGGCACAAGAGAAGATCATCAAATACTGCTCTATTAAGTATGCATTAGACTTGATTAATGCTCAATA
TAGGGGAGGATCTGAGCTGTTAGTGGAAGCTGGGGATCAGAGGCGTGTAATTTCATTGCCTAAAGAAATTCAAGGGAAGGTTGGTGGTGCCAAGTTCACGGGCTTCCTGC
TTATTGTCATCTCCTATCTGTGTTGTATGAATTCGGCTCCTCAACTCTTCAAAAGCCTTCATGCTCTCTGCGTCAAAGCCTCCCGCAAACTGGAAACATATCGAATCCAA
TATGTCAGATACACTTGTAATTTGACTTGTCACAAGTTCTCTAGTAAGAACCATGTCGTTGTAAGGTATACTTACTTCTCAAGCAAGGAGTACATTTTCTTTAAAGCTGC
TTCACATGGGAGTTTAGGTTCATCAACAAACGTGGTGACCCGCTTCTCCAACTTCATAAGGTCCTGGTATTCAAAAGATGCCTCTCTCAATGCATCTGCCTTTCCTTCTG
GCCAATCAAAATGCTTCAGGACTGCCCTCTCATCAACCTTGATCAAAGATGAAGAAAAGTACACTATTAAACAACGAGATAGA
mRNA sequence
CGGGAATGCCGGATTTTTGCCGAGCCTAACAAGATTCAGGAAAAGTGTTTAGCTACTTCACATGCTTGTGTTTGGTAATGTCTTAATATAAACTCATCCACAGATTTCAC
CCACCTACGCCATTGTTGCTGTTGCAGAGATCTATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAGGAACATCTCC
ATGTAGGATAAGATTTATGACCCTTCAGGCTTCAAAAGAGATTACAGACGTTTCTTCTCAAAACCCAACCCGGATGCTCACTTTTCTTGGCAAAGGCGGCTCCGGAAAGA
CCACTTCCGCGGTATTCGCCGCTCAGCATTTTGCATTGTCTGGACTGCGCACATGTCTGGTGATACATAATCAAGACCCTACTCCTGAGTATCTTCTTGATTGCAAAATT
GGAAATTCTCCTGTTGAATGCAGTCACAACCTCTCTGCTGTTAGGTTGGAAACCACTCAAATGCTACTTGAACCTCTCAAACGGCTAAAGCAAGCTGATTCTCGTCTTAA
TATGACACAAGGAGTTCTTGAAGGGGTAGTATTCCTACTTTGGGAGGATTTGCTATGTAATATGTACCTTCTTAATGTTGTTGGAGAAGAGCTTGGGGTACTTCCAGGAA
CAGATTCTATCTTTTCGCTACTTCAGCTTGAGAGATTCCTTGGGTTCTCAGGTATTATGGGTCAAAGGGACCAAAAAGGTAAATATGACATAGTAATATATGATGGTATC
TGCACCGAAGAAACAATAAGAATGATTGGAGCAACCAGTAAAGCAAGGTTTGAGATGCTTTCTCTCTCCATATATAGTTCCCCCCACCCCTGCACAAATTGGAAAAAGAG
AAAGAAAATCCGTATTACTAAAAGAACTTTCGATAGGTTGTACTTAAAATATCTGAGGAGCATTGCTGAAAAAACTGATCTGGGGAGGTTGGCTACTCCTTCAATTCTGA
GGCTTGTTGATGAAGCCATGAGTATAAGCAGGTCAGGCTCCCATCTCAGCGGTAGAACTAGTGCTGATATATGGGAGGCACTGGAACACATGTTAGAGAAAGGGTCTTCT
GCATTTGCAGAGCCAAGAAAATTTAGCTGCTATATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGTCTGCATTACGGTACTGGGGTTGTACTATTCAAGCTGGTGC
ACAAATATCTGGTGCACTTGCTTTCATTTCTTCAGACTTGAATGCAGAATCCACTGCTAGTTTGAAGGAGAAATTTTCACCCTTGTCTTTGGCCTTTATGCCACAGTTCT
CAACTGGTTCCTCAGTAGATTGGAACACAGTTCTGCGCGATGTGTCAAGTAGAGGCCCAAGGGATCTTCTTTCTTTGTCAAAAAGCCTCACCGGCAGTCTGCCGTCACCT
GTAAAATTCGATCCTGGAAATAAATCAGTTACACTTCTCATGCCAGGCTTCGGGAAGTCAGAAATCAAGCTTTATCAGGCACAAGAGAAGATCATCAAATACTGCTCTAT
TAAGTATGCATTAGACTTGATTAATGCTCAATATAGGGGAGGATCTGAGCTGTTAGTGGAAGCTGGGGATCAGAGGCGTGTAATTTCATTGCCTAAAGAAATTCAAGGGA
AGGTTGGTGGTGCCAAGTTCACGGGCTTCCTGCTTATTGTCATCTCCTATCTGTGTTGTATGAATTCGGCTCCTCAACTCTTCAAAAGCCTTCATGCTCTCTGCGTCAAA
GCCTCCCGCAAACTGGAAACATATCGAATCCAATATGTCAGATACACTTGTAATTTGACTTGTCACAAGTTCTCTAGTAAGAACCATGTCGTTGTAAGGTATACTTACTT
CTCAAGCAAGGAGTACATTTTCTTTAAAGCTGCTTCACATGGGAGTTTAGGTTCATCAACAAACGTGGTGACCCGCTTCTCCAACTTCATAAGGTCCTGGTATTCAAAAG
ATGCCTCTCTCAATGCATCTGCCTTTCCTTCTGGCCAATCAAAATGCTTCAGGACTGCCCTCTCATCAACCTTGATCAAAGATGAAGAAAAGTACACTATTAAACAACGA
GATAGA
Protein sequence
MASSLLFSASFFGNPIPISIRTGTSPCRIRFMTLQASKEITDVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNL
SAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVFLLWEDLLCNMYLLNVVGEELGVLPGTDSIFSLLQLERFLGFSGIMGQRDQKGKYDIVIYDGICTEETIRMIGA
TSKARFEMLSLSIYSSPHPCTNWKKRKKIRITKRTFDRLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRSGSHLSGRTSADIWEALEHMLEKGSSAFAEPRKFSCY
IVMDPTSPASVQSALRYWGCTIQAGAQISGALAFISSDLNAESTASLKEKFSPLSLAFMPQFSTGSSVDWNTVLRDVSSRGPRDLLSLSKSLTGSLPSPVKFDPGNKSVT
LLMPGFGKSEIKLYQAQEKIIKYCSIKYALDLINAQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTGFLLIVISYLCCMNSAPQLFKSLHALCVKASRKLETYRIQ
YVRYTCNLTCHKFSSKNHVVVRYTYFSSKEYIFFKAASHGSLGSSTNVVTRFSNFIRSWYSKDASLNASAFPSGQSKCFRTALSSTLIKDEEKYTIKQRDR