; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023420 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023420
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionS-acyltransferase
Genome locationtig00000892:3118103..3134503
RNA-Seq ExpressionSgr023420
SyntenySgr023420
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006612 - protein targeting to membrane (biological process)
GO:0018230 - peptidyl-L-cysteine S-palmitoylation (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0005794 - Golgi apparatus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0019706 - protein-cysteine S-palmitoyltransferase activity (molecular function)
InterPro domainsIPR001594 - Palmitoyltransferase, DHHC domain
IPR003851 - Zinc finger, Dof-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605466.1 Palmitoyltransferase ZDHHC17, partial [Cucurbita argyrosperma subsp. sororia]6.6e-15483.83Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WPELIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVR+AA
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY PSPSD++AQSS  EIDT HQETE+LM+SP LRRSTEA   LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQ  ++S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

XP_022148494.1 probable protein S-acyltransferase 15 [Momordica charantia]2.4e-15684.73Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M +LRDR +PADWPELIGR VVSL+LVLLTQ TQFLVPQFFSD SVFLQLL+SALLLLAV+   GWCRRLLRVRASAPAFVFFSILFIW+VY AIVRQAA
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        S+LMDLLFNGQ + LIFGL RMLSSDPGLVSY P PSDVIAQS V E DT HQETE L  SP+L  STEA IGLG+RVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQY AKSEVPDGDGTRLE SFSRKV  STMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        VQSE GQ S E+RF+NPY KGV QNVKEF+ASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

XP_022947310.1 probable palmitoyltransferase ZDHHC12 isoform X1 [Cucurbita moschata]1.6e-15584.43Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WP LIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVR+ A
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY PSP DVIAQSS LEIDT HQETE L +SP LRRSTEA   LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQYAA+S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

XP_023532258.1 probable palmitoyltransferase ZDHHC12 isoform X1 [Cucurbita pepo subsp. pepo]1.6e-15584.13Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WPELIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVRQAA
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY P PSD++AQSS  EIDT HQETE L +SP LRRSTEA   LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLF+VLLIGFLWTEA YLVCLSQYAA+S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

XP_023532259.1 probable palmitoyltransferase ZDHHC12 isoform X2 [Cucurbita pepo subsp. pepo]2.1e-15283.23Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WPELIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVRQAA
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY P PSD++AQSS  EIDT HQETE L +SP LRRSTEA   LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLF+VLLIGFLWTEA YLVCLSQ  ++S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

TrEMBL top hitse value%identityAlignment
A0A6J1D461 S-acyltransferase1.2e-15684.73Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M +LRDR +PADWPELIGR VVSL+LVLLTQ TQFLVPQFFSD SVFLQLL+SALLLLAV+   GWCRRLLRVRASAPAFVFFSILFIW+VY AIVRQAA
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        S+LMDLLFNGQ + LIFGL RMLSSDPGLVSY P PSDVIAQS V E DT HQETE L  SP+L  STEA IGLG+RVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQY AKSEVPDGDGTRLE SFSRKV  STMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        VQSE GQ S E+RF+NPY KGV QNVKEF+ASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

A0A6J1G625 S-acyltransferase1.0e-15283.53Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WP LIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVR+ A
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY PSP DVIAQSS LEIDT HQETE L +SP LRRSTEA   LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQ  ++S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

A0A6J1G639 S-acyltransferase2.2e-14781.14Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WP LIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVR+ A
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY PSP DVIAQSS LEIDT HQ+TE                 LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQYAA+S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

A0A6J1G6I4 S-acyltransferase7.6e-15684.43Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WP LIGR VVS ILVLLTQ +QFLVPQFF D  VF+QLLLSALLLLAV G  GWCRRLLRVRASAPAFVFFS+LF+WVVY+AIVR+ A
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY PSP DVIAQSS LEIDT HQETE L +SP LRRSTEA   LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQYAA+S VPDGDGTRLEIS SRKVASSTMLFSILQLVWQVVFL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

A0A6J1L1G6 S-acyltransferase1.5e-14881.14Show/hide
Query:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA
        M EL DRKK A+WPELIGR VVS ILVLLTQC+QFLVPQFF D  VF+QLLLSALLLLAV G+ GWCRRLLRVRASAPAFVF S+LF+WVVY+AIVRQAA
Subjt:  MAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSVFLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAA

Query:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN
        ++LMDLLFNGQIILLIFGL RMLSSDPGLVSY PSPSD+IAQSS  EIDT HQ+TE                 LGDRVRCCQICKAYVKGFDHHCPAFGN
Subjt:  SNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETELLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGN

Query:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF
        CIGHKNYLLFMVLLIGFLWTEA YLVCLSQYAA+  VPDGDGTRLEIS SRKVASSTMLFSILQLVWQV+FL WH+YCICFNIRTDEWI+WKRYPEFQIF
Subjt:  CIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLVWQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIF

Query:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ
        +QS  G+ SSE+RF+NPYDKGV QN+KEFMASK+
Subjt:  VQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ

SwissProt top hitse value%identityAlignment
O80928 Dof zinc finger protein DOF2.44.8e-3842.97Show/hide
Query:  MAHSSLPIYLDPPNWQQSNQAPTANDHQD-----------------PRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDST
        M  SS+  YLD  NWQ   QAP +N + D                 P+Q     P P      G G  GSIR GSM  RAR A +  PEAALKCPRC+ST
Subjt:  MAHSSLPIYLDPPNWQQSNQAPTANDHQD-----------------PRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDST

Query:  NTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGA-----IISAGCNIDSSSTMGNFPS----
        NTKFCYFNNYSL+QPRHFCKTCRRYWTRGGALRNVPVGGG R+N ++ K+N + + ++T T  +   SSG A     I+S+    +  S +    S    
Subjt:  NTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGA-----IISAGCNIDSSSTMGNFPS----

Query:  QPPEFPSLPSL------QHHLSRFGAGNTGLNFTGIQLGNMSSGREL----DQWRS
          P +  L  L       +++S    G    +   I +G  +SG  L    D+WRS
Subjt:  QPPEFPSLPSL------QHHLSRFGAGNTGLNFTGIQLGNMSSGREL----DQWRS

Q8L9V6 Dof zinc finger protein DOF1.11.4e-2936.72Show/hide
Query:  LSPFLPPPPS--QPAHGSGVTGSIRP----GSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRK
        LS  LPP  +   P H    T +  P    GSMA RAR A IP     LKCPRCDS+NTKFCY+NNY+L+QPRHFCK CRRYWT+GGALRNVPVGGG R+
Subjt:  LSPFLPPPPS--QPAHGSGVTGSIRP----GSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRK

Query:  NKKKKKTNRSKSPSATDTEMSNPRS---SGGAIIS----------------AGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGNTGLNFTG-IQL
        N KK K    KS S++  + S+  +   S G + +                 G  ++ ++T GN  +Q  +  S   +   L      NT    TG I  
Subjt:  NKKKKKTNRSKSPSATDTEMSNPRS---SGGAIIS----------------AGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGNTGLNFTG-IQL

Query:  GNMSSGRELDQWRSQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFSRISQQQQLKNEEQNQNQEQGISNFLRPVMGSVSEANQ-
         N ++  E +   S       A  +P    Y    Q + N G+N   +      +      SR+ Q   +K EEQ       ++N  RPV G  S  NQ 
Subjt:  GNMSSGRELDQWRSQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFSRISQQQQLKNEEQNQNQEQGISNFLRPVMGSVSEANQ-

Query:  ---FW
           FW
Subjt:  ---FW

Q9LZ56 Dof zinc finger protein DOF5.11.2e-3648.37Show/hide
Query:  MAHSSLPIYLD-PPNWQQSNQAPTA----NDHQDPRQLSPFLPPPPSQP---------------AHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRC
        M  SS P Y D   NWQQ +Q  T       +   +Q  P  P PP Q                A   G  G IRPGSMA RARLA IP PE ALKCPRC
Subjt:  MAHSSLPIYLD-PPNWQQSNQAPTA----NDHQDPRQLSPFLPPPPSQP---------------AHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRC

Query:  DSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFP
        DSTNTKFCYFNNYSL+QPRHFCK CRRYWTRGGALR+VPVGGG R+N   K+T  S       T   N +S   A  +   +      M N    PP   
Subjt:  DSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFP

Query:  SLPSLQHHLSRFGAG
        S  SL   LS + AG
Subjt:  SLPSLQHHLSRFGAG

Q9M2U1 Dof zinc finger protein DOF3.64.0e-4541.25Show/hide
Query:  MAHSSLPI-YLDPPNWQQ---SNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLS
        M  SSLP+   D  NWQQ    +Q       Q+P      L  PP+    GS      R  SM  RAR+A +P PEAAL CPRCDSTNTKFCYFNNYSL+
Subjt:  MAHSSLPI-YLDPPNWQQ---SNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLS

Query:  QPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGN
        QPRHFCKTCRRYWTRGG+LRNVPVGGGFR+NK+ K  +RSKS     T+ +   SS  +  S   N     + G  P      P LP LQ  L  + + N
Subjt:  QPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGN

Query:  TGLNFTGIQLGNM-----SSGRELDQWR------SQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFS-RISQQQQLKNEEQNQN
        TGL+F G Q+ NM     SSG  LD WR      +QQ PF++                      N++  VQ  + L  L       +Q + +K EE +Q+
Subjt:  TGLNFTGIQLGNM-----SSGRELDQWR------SQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFS-RISQQQQLKNEEQNQN

Query:  QEQ---GISNFLRPVMGSVS
        + +   G++N  R  +G+++
Subjt:  QEQ---GISNFLRPVMGSVS

Q9ZV33 Dof zinc finger protein DOF2.21.0e-3537.32Show/hide
Query:  MAHSSLPIYLDPP-NWQQSNQAPTANDHQDPRQ------------LSPFLPPPPS------QPAHGSGVTGSIRPGSMAYRARLAMIPQ-PEAALKCPRC
        M  SS+  +LDPP NW QS   P  + H    Q            LS   P  P+      + A  + V  S   G  A RARLA   Q PE ALKCPRC
Subjt:  MAHSSLPIYLDPP-NWQQSNQAPTANDHQDPRQ------------LSPFLPPPPS------QPAHGSGVTGSIRPGSMAYRARLAMIPQ-PEAALKCPRC

Query:  DSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFP
        DS NTKFCYFNNY+L+QPRHFCK CRRYWTRGGALRNVPVGGG R+NKK K  N SKS S++  + S       ++++A    ++S+      SQ   FP
Subjt:  DSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFP

Query:  SLPSLQHHLSRFGAGNTGLNFTGIQLGNMSSGRELDQWRSQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPF-------------
         LP+LQ+       G  GLN   I   N  +G     + +    F       P            + GS+S  A+    +   L  F             
Subjt:  SLPSLQHHLSRFGAGNTGLNFTGIQLGNMSSGRELDQWRSQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPF-------------

Query:  SRISQQQQLKNEEQNQNQEQGISNFLRPVMGSVS---EANQFW
        +R+SQ  Q+K E+ +      + N  RPV G  S   ++NQ+W
Subjt:  SRISQQQQLKNEEQNQNQEQGISNFLRPVMGSVS---EANQFW

Arabidopsis top hitse value%identityAlignment
AT2G37590.1 DNA binding with one finger 2.43.4e-3942.97Show/hide
Query:  MAHSSLPIYLDPPNWQQSNQAPTANDHQD-----------------PRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDST
        M  SS+  YLD  NWQ   QAP +N + D                 P+Q     P P      G G  GSIR GSM  RAR A +  PEAALKCPRC+ST
Subjt:  MAHSSLPIYLDPPNWQQSNQAPTANDHQD-----------------PRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDST

Query:  NTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGA-----IISAGCNIDSSSTMGNFPS----
        NTKFCYFNNYSL+QPRHFCKTCRRYWTRGGALRNVPVGGG R+N ++ K+N + + ++T T  +   SSG A     I+S+    +  S +    S    
Subjt:  NTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGA-----IISAGCNIDSSSTMGNFPS----

Query:  QPPEFPSLPSL------QHHLSRFGAGNTGLNFTGIQLGNMSSGREL----DQWRS
          P +  L  L       +++S    G    +   I +G  +SG  L    D+WRS
Subjt:  QPPEFPSLPSL------QHHLSRFGAGNTGLNFTGIQLGNMSSGREL----DQWRS

AT3G55370.1 OBF-binding protein 32.9e-4641.25Show/hide
Query:  MAHSSLPI-YLDPPNWQQ---SNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLS
        M  SSLP+   D  NWQQ    +Q       Q+P      L  PP+    GS      R  SM  RAR+A +P PEAAL CPRCDSTNTKFCYFNNYSL+
Subjt:  MAHSSLPI-YLDPPNWQQ---SNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLS

Query:  QPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGN
        QPRHFCKTCRRYWTRGG+LRNVPVGGGFR+NK+ K  +RSKS     T+ +   SS  +  S   N     + G  P      P LP LQ  L  + + N
Subjt:  QPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGN

Query:  TGLNFTGIQLGNM-----SSGRELDQWR------SQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFS-RISQQQQLKNEEQNQN
        TGL+F G Q+ NM     SSG  LD WR      +QQ PF++                      N++  VQ  + L  L       +Q + +K EE +Q+
Subjt:  TGLNFTGIQLGNM-----SSGRELDQWR------SQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFS-RISQQQQLKNEEQNQN

Query:  QEQ---GISNFLRPVMGSVS
        + +   G++N  R  +G+++
Subjt:  QEQ---GISNFLRPVMGSVS

AT3G55370.2 OBF-binding protein 31.6e-4943.16Show/hide
Query:  MAHSSLPI-YLDPPNWQQ---SNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLS
        M  SSLP+   D  NWQQ    +Q       Q+P      L  PP+    GS      R  SM  RAR+A +P PEAAL CPRCDSTNTKFCYFNNYSL+
Subjt:  MAHSSLPI-YLDPPNWQQ---SNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLS

Query:  QPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGN
        QPRHFCKTCRRYWTRGG+LRNVPVGGGFR+NK+ K  +RSKS     T+ +   SS  +  S   N     + G  P      P LP LQ  L  + + N
Subjt:  QPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGN

Query:  TGLNFTGIQLGNM-----SSGRELDQWR------SQQPPFIV--AGL-EPPAAAYP-LNIQTEVNFGSNSSAAVQYCHQL--QNLTPFS----RISQQQQ
        TGL+F G Q+ NM     SSG  LD WR      +QQ PF++   GL +   A YP L  +  VN G +   +  Y +QL  + L  FS      +Q + 
Subjt:  TGLNFTGIQLGNM-----SSGRELDQWR------SQQPPFIV--AGL-EPPAAAYP-LNIQTEVNFGSNSSAAVQYCHQL--QNLTPFS----RISQQQQ

Query:  LKNEEQNQNQEQ---GISNFLRPVMGSVS
        +K EE +Q++ +   G++N  R  +G+++
Subjt:  LKNEEQNQNQEQ---GISNFLRPVMGSVS

AT3G55370.3 OBF-binding protein 36.6e-4340.94Show/hide
Query:  NQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNV
        +Q       Q+P      L  PP+    GS      R  SM  RAR+A +P PEAAL CPRCDSTNTKFCYFNNYSL+QPRHFCKTCRRYWTRGG+LRNV
Subjt:  NQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNV

Query:  PVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGNTGLNFTGIQLGNM-----SSGR
        PVGGGFR+NK+ K  +RSKS     T+ +   SS  +  S   N     + G  P      P LP LQ  L  + + NTGL+F G Q+ NM     SSG 
Subjt:  PVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGNTGLNFTGIQLGNM-----SSGR

Query:  ELDQWR------SQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFS-RISQQQQLKNEEQNQNQEQ---GISNFLRPVMGSVS
         LD WR      +QQ PF++                      N++  VQ  + L  L       +Q + +K EE +Q++ +   G++N  R  +G+++
Subjt:  ELDQWR------SQQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFS-RISQQQQLKNEEQNQNQEQ---GISNFLRPVMGSVS

AT5G02460.1 Dof-type zinc finger DNA-binding family protein8.4e-3848.37Show/hide
Query:  MAHSSLPIYLD-PPNWQQSNQAPTA----NDHQDPRQLSPFLPPPPSQP---------------AHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRC
        M  SS P Y D   NWQQ +Q  T       +   +Q  P  P PP Q                A   G  G IRPGSMA RARLA IP PE ALKCPRC
Subjt:  MAHSSLPIYLD-PPNWQQSNQAPTA----NDHQDPRQLSPFLPPPPSQP---------------AHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRC

Query:  DSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFP
        DSTNTKFCYFNNYSL+QPRHFCK CRRYWTRGGALR+VPVGGG R+N   K+T  S       T   N +S   A  +   +      M N    PP   
Subjt:  DSTNTKFCYFNNYSLSQPRHFCKTCRRYWTRGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFP

Query:  SLPSLQHHLSRFGAG
        S  SL   LS + AG
Subjt:  SLPSLQHHLSRFGAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCACTCTTCTCTTCCAATCTATTTAGATCCCCCGAATTGGCAGCAGTCGAATCAAGCACCAACAGCTAACGATCATCAAGACCCACGCCAGCTTTCTCCGTTTCT
GCCACCGCCACCGTCTCAGCCTGCTCATGGCAGCGGCGTTACAGGCTCCATCAGGCCGGGTTCCATGGCGTACCGAGCGCGGCTGGCCATGATACCGCAACCGGAGGCGG
CACTCAAGTGCCCCCGCTGCGACTCCACCAACACCAAATTCTGCTACTTCAACAACTACAGCCTTTCTCAGCCGCGACACTTCTGCAAAACCTGTCGCCGCTACTGGACC
CGCGGCGGCGCTCTTAGAAACGTCCCCGTCGGAGGAGGGTTTCGAAAAAACAAGAAGAAGAAGAAAACTAACCGATCCAAGTCGCCCTCAGCCACAGATACTGAAATGAG
TAATCCGAGATCAAGCGGTGGTGCAATAATCTCAGCGGGCTGCAACATTGACTCGAGCAGCACAATGGGCAATTTCCCATCTCAGCCTCCTGAGTTTCCTTCGCTGCCAT
CTCTGCAACATCATCTTTCTCGGTTCGGAGCTGGAAACACAGGCCTGAACTTCACCGGAATTCAACTCGGAAACATGAGTTCCGGTCGGGAATTGGATCAATGGCGTTCA
CAGCAACCTCCGTTCATCGTCGCCGGACTCGAACCGCCGGCAGCTGCATACCCATTAAATATTCAAACCGAAGTTAACTTTGGCTCTAACTCGTCAGCTGCTGTACAGTA
TTGTCATCAACTCCAGAATTTGACACCTTTTTCGAGGATTTCTCAGCAGCAGCAGCTGAAAAACGAAGAACAAAACCAAAACCAAGAACAAGGAATATCGAACTTCTTGC
GACCAGTAATGGGTTCCGTTTCAGAAGCCAACCAGTTCTGGGGAGTTCTCATCAGTCGTCTTCAAGCCAATCGTAAAACGACATGTCGGCACGGCGGGGCGCTTGGCCGG
CGGCTGCCTTCTTTCCTTCCCCGGCGAGCCCAATTCTTGAAGCAGCACAAACTGATCACTCCGAGCTCCTTAGCCTCCACGAGATGTCGTTTGTCTCCCACACAAAGATG
CACCAAAACTGCAGCTGCATTCTCCCTGTTCCTAGGGGATCCGGTCGCAATGAAATCTACTAAAACAGGGACAGCTGCTGCAGCTCCGATGGCAGCCCGCCCTTCGGAGT
TGCTGGCCAGGATGGCTAAAATTGCCAGGGCCTCATCTACCATCCCTATTCTAGATTCTGTCAACAGCTGCATCAATATGGAAACCACACCACCCCTCACTGCCTTTATC
TTCAGTGTAATTAGTGGGGGAATGGCTCCAGATGCACCAATTGTAACTTTATATTCGTCTACCACAGAAAGGCTGAAAAGAGTGGCTGCAGCATTTTCACGTGCTTCCAT
ACTCCCCCACTTCAGCACATGGACTATACCAGGAGGTCTGTTGTTGAAAGCAGGCCGACAAGGAGAGGGATTGCACCAGCTTCAGCAATGGCCACACGATTATTTGCGTT
GCGTTTGGCAAGAAAGCGAATCTCACCGGCAGCAGATCGCTTGTCTTCAATGTTACCTGATCAGGAGACGTGAGTTCACTGGGCTGAGAACTGCTGGTTCGTTTTGGTGG
TTCAATCCCACTGGCCTCACACCATTGAGCTATGAGACTACGTCTCTTAATATTTTTGCCAGTGCAAAATCTGGAAGTAAAACACAAGCTGATGGAAACTCCGAGGTCAT
ACCTAATGCTTCAATTTATCGACTGGTACCTTGGAAGTTTCTTAAGGAAATTGTGGCTGAGCCGGGATTCTCTATTAGTGGAAGATGGCCATCGAAACATTATTTCCAGA
AGTGTAAAACCACTGTACAGCCTGATCAAAATGTTCTGTTACGAATACAAATGGAAGCGGAAGTACGCTCTGGAAATCTCAGTAACCGTCAATCTGCGTTCGTTTTCTTC
CTTCTTCCACTTAGCCAAATAGACATCCGAAGCAAGAAAAGAGAAAACACTAAATCCGAACGAAATTCCAAAATCACTTGTAATTGTAAACCAATTCACCACGCCTGCAC
AACATCGCGAGGCAGAACTGCGGACGCTTATGAAACAAATAACACACAAACGAGCTTCCTTCAACGAAGCAAGAGCTCTCACGGTGTCATCAGACAGCGCCTGCTTCGTA
TCCCTTATTTCCTCGAACATTGGGATCAACAGCTTCATCGATCAAACTTCGAACGCCGCAGCCATTCTCCTCCTCCATATCTCCTCAGTCAGCCAAACAGAACCTCTCAA
ACAACCGCCTATAGCAAAGAAAACGCATCAAATCCTCCAAAATCTTTGCGTGTTCGCCAGCTGCTTCTCGCCGCGGTGTTCGGAAAATCTCTTCATAATCGCCCCCGCGG
GAGCGACCATCATCATTCATCGGAGTCTTCTCTCTTCCGAAGCATAATCAAGTACTACGATGAATTTTCAGTTTGGAAAGAAATGGCGGAATTACGTGATCGGAAGAAGC
CGGCGGATTGGCCGGAGCTAATCGGTCGTTGGGTAGTCTCTCTCATACTTGTTTTGCTAACTCAATGTACCCAATTTCTAGTTCCCCAATTCTTCTCCGATTCGTCCGTC
TTCCTTCAACTCCTACTTTCAGCTCTACTGCTTCTGGCAGTTGCAGGCATTGGGGGATGGTGCAGGCGGCTTCTTCGAGTCCGCGCGTCAGCTCCAGCTTTCGTTTTCTT
CAGTATACTTTTTATTTGGGTGGTTTACGTAGCTATTGTTCGACAAGCTGCTTCGAATTTGATGGATCTTCTGTTTAATGGGCAGATAATCTTGCTCATCTTTGGCCTTT
CCAGGATGTTATCCAGTGATCCTGGTTTGGTGTCATATGGTCCATCTCCTTCAGATGTAATTGCTCAAAGTTCAGTTTTAGAAATTGACACTCAGCATCAGGAAACAGAA
CTTCTAATGAATAGTCCTAATCTCAGGCGTTCAACTGAAGCAGGTATTGGATTGGGTGATAGGGTGAGATGCTGCCAGATCTGCAAAGCATATGTTAAAGGCTTTGACCA
CCATTGTCCTGCATTTGGAAACTGTATTGGGCATAAAAATTATCTCCTCTTCATGGTCCTACTAATTGGATTTCTCTGGACTGAAGCTGCTTACTTAGTGTGCTTATCTC
AATATGCCGCAAAGTCGGAGGTTCCTGATGGTGATGGAACTAGGTTAGAGATTAGTTTCTCAAGGAAAGTGGCCAGCAGCACCATGCTATTCTCCATTCTACAATTAGTA
TGGCAGGTAGTATTCTTGTGGTGGCATATATATTGTATTTGCTTCAACATCAGAACAGATGAATGGATTAACTGGAAGAGGTATCCAGAATTTCAAATTTTTGTCCAGTC
TGAGCTGGGCCAATGGTCCAGTGAAATGAGGTTCAGAAACCCATATGACAAAGGAGTTTTTCAGAATGTGAAGGAGTTTATGGCGTCAAAACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCACTCTTCTCTTCCAATCTATTTAGATCCCCCGAATTGGCAGCAGTCGAATCAAGCACCAACAGCTAACGATCATCAAGACCCACGCCAGCTTTCTCCGTTTCT
GCCACCGCCACCGTCTCAGCCTGCTCATGGCAGCGGCGTTACAGGCTCCATCAGGCCGGGTTCCATGGCGTACCGAGCGCGGCTGGCCATGATACCGCAACCGGAGGCGG
CACTCAAGTGCCCCCGCTGCGACTCCACCAACACCAAATTCTGCTACTTCAACAACTACAGCCTTTCTCAGCCGCGACACTTCTGCAAAACCTGTCGCCGCTACTGGACC
CGCGGCGGCGCTCTTAGAAACGTCCCCGTCGGAGGAGGGTTTCGAAAAAACAAGAAGAAGAAGAAAACTAACCGATCCAAGTCGCCCTCAGCCACAGATACTGAAATGAG
TAATCCGAGATCAAGCGGTGGTGCAATAATCTCAGCGGGCTGCAACATTGACTCGAGCAGCACAATGGGCAATTTCCCATCTCAGCCTCCTGAGTTTCCTTCGCTGCCAT
CTCTGCAACATCATCTTTCTCGGTTCGGAGCTGGAAACACAGGCCTGAACTTCACCGGAATTCAACTCGGAAACATGAGTTCCGGTCGGGAATTGGATCAATGGCGTTCA
CAGCAACCTCCGTTCATCGTCGCCGGACTCGAACCGCCGGCAGCTGCATACCCATTAAATATTCAAACCGAAGTTAACTTTGGCTCTAACTCGTCAGCTGCTGTACAGTA
TTGTCATCAACTCCAGAATTTGACACCTTTTTCGAGGATTTCTCAGCAGCAGCAGCTGAAAAACGAAGAACAAAACCAAAACCAAGAACAAGGAATATCGAACTTCTTGC
GACCAGTAATGGGTTCCGTTTCAGAAGCCAACCAGTTCTGGGGAGTTCTCATCAGTCGTCTTCAAGCCAATCGTAAAACGACATGTCGGCACGGCGGGGCGCTTGGCCGG
CGGCTGCCTTCTTTCCTTCCCCGGCGAGCCCAATTCTTGAAGCAGCACAAACTGATCACTCCGAGCTCCTTAGCCTCCACGAGATGTCGTTTGTCTCCCACACAAAGATG
CACCAAAACTGCAGCTGCATTCTCCCTGTTCCTAGGGGATCCGGTCGCAATGAAATCTACTAAAACAGGGACAGCTGCTGCAGCTCCGATGGCAGCCCGCCCTTCGGAGT
TGCTGGCCAGGATGGCTAAAATTGCCAGGGCCTCATCTACCATCCCTATTCTAGATTCTGTCAACAGCTGCATCAATATGGAAACCACACCACCCCTCACTGCCTTTATC
TTCAGTGTAATTAGTGGGGGAATGGCTCCAGATGCACCAATTGTAACTTTATATTCGTCTACCACAGAAAGGCTGAAAAGAGTGGCTGCAGCATTTTCACGTGCTTCCAT
ACTCCCCCACTTCAGCACATGGACTATACCAGGAGGTCTGTTGTTGAAAGCAGGCCGACAAGGAGAGGGATTGCACCAGCTTCAGCAATGGCCACACGATTATTTGCGTT
GCGTTTGGCAAGAAAGCGAATCTCACCGGCAGCAGATCGCTTGTCTTCAATGTTACCTGATCAGGAGACGTGAGTTCACTGGGCTGAGAACTGCTGGTTCGTTTTGGTGG
TTCAATCCCACTGGCCTCACACCATTGAGCTATGAGACTACGTCTCTTAATATTTTTGCCAGTGCAAAATCTGGAAGTAAAACACAAGCTGATGGAAACTCCGAGGTCAT
ACCTAATGCTTCAATTTATCGACTGGTACCTTGGAAGTTTCTTAAGGAAATTGTGGCTGAGCCGGGATTCTCTATTAGTGGAAGATGGCCATCGAAACATTATTTCCAGA
AGTGTAAAACCACTGTACAGCCTGATCAAAATGTTCTGTTACGAATACAAATGGAAGCGGAAGTACGCTCTGGAAATCTCAGTAACCGTCAATCTGCGTTCGTTTTCTTC
CTTCTTCCACTTAGCCAAATAGACATCCGAAGCAAGAAAAGAGAAAACACTAAATCCGAACGAAATTCCAAAATCACTTGTAATTGTAAACCAATTCACCACGCCTGCAC
AACATCGCGAGGCAGAACTGCGGACGCTTATGAAACAAATAACACACAAACGAGCTTCCTTCAACGAAGCAAGAGCTCTCACGGTGTCATCAGACAGCGCCTGCTTCGTA
TCCCTTATTTCCTCGAACATTGGGATCAACAGCTTCATCGATCAAACTTCGAACGCCGCAGCCATTCTCCTCCTCCATATCTCCTCAGTCAGCCAAACAGAACCTCTCAA
ACAACCGCCTATAGCAAAGAAAACGCATCAAATCCTCCAAAATCTTTGCGTGTTCGCCAGCTGCTTCTCGCCGCGGTGTTCGGAAAATCTCTTCATAATCGCCCCCGCGG
GAGCGACCATCATCATTCATCGGAGTCTTCTCTCTTCCGAAGCATAATCAAGTACTACGATGAATTTTCAGTTTGGAAAGAAATGGCGGAATTACGTGATCGGAAGAAGC
CGGCGGATTGGCCGGAGCTAATCGGTCGTTGGGTAGTCTCTCTCATACTTGTTTTGCTAACTCAATGTACCCAATTTCTAGTTCCCCAATTCTTCTCCGATTCGTCCGTC
TTCCTTCAACTCCTACTTTCAGCTCTACTGCTTCTGGCAGTTGCAGGCATTGGGGGATGGTGCAGGCGGCTTCTTCGAGTCCGCGCGTCAGCTCCAGCTTTCGTTTTCTT
CAGTATACTTTTTATTTGGGTGGTTTACGTAGCTATTGTTCGACAAGCTGCTTCGAATTTGATGGATCTTCTGTTTAATGGGCAGATAATCTTGCTCATCTTTGGCCTTT
CCAGGATGTTATCCAGTGATCCTGGTTTGGTGTCATATGGTCCATCTCCTTCAGATGTAATTGCTCAAAGTTCAGTTTTAGAAATTGACACTCAGCATCAGGAAACAGAA
CTTCTAATGAATAGTCCTAATCTCAGGCGTTCAACTGAAGCAGGTATTGGATTGGGTGATAGGGTGAGATGCTGCCAGATCTGCAAAGCATATGTTAAAGGCTTTGACCA
CCATTGTCCTGCATTTGGAAACTGTATTGGGCATAAAAATTATCTCCTCTTCATGGTCCTACTAATTGGATTTCTCTGGACTGAAGCTGCTTACTTAGTGTGCTTATCTC
AATATGCCGCAAAGTCGGAGGTTCCTGATGGTGATGGAACTAGGTTAGAGATTAGTTTCTCAAGGAAAGTGGCCAGCAGCACCATGCTATTCTCCATTCTACAATTAGTA
TGGCAGGTAGTATTCTTGTGGTGGCATATATATTGTATTTGCTTCAACATCAGAACAGATGAATGGATTAACTGGAAGAGGTATCCAGAATTTCAAATTTTTGTCCAGTC
TGAGCTGGGCCAATGGTCCAGTGAAATGAGGTTCAGAAACCCATATGACAAAGGAGTTTTTCAGAATGTGAAGGAGTTTATGGCGTCAAAACAATGA
Protein sequenceShow/hide protein sequence
MAHSSLPIYLDPPNWQQSNQAPTANDHQDPRQLSPFLPPPPSQPAHGSGVTGSIRPGSMAYRARLAMIPQPEAALKCPRCDSTNTKFCYFNNYSLSQPRHFCKTCRRYWT
RGGALRNVPVGGGFRKNKKKKKTNRSKSPSATDTEMSNPRSSGGAIISAGCNIDSSSTMGNFPSQPPEFPSLPSLQHHLSRFGAGNTGLNFTGIQLGNMSSGRELDQWRS
QQPPFIVAGLEPPAAAYPLNIQTEVNFGSNSSAAVQYCHQLQNLTPFSRISQQQQLKNEEQNQNQEQGISNFLRPVMGSVSEANQFWGVLISRLQANRKTTCRHGGALGR
RLPSFLPRRAQFLKQHKLITPSSLASTRCRLSPTQRCTKTAAAFSLFLGDPVAMKSTKTGTAAAAPMAARPSELLARMAKIARASSTIPILDSVNSCINMETTPPLTAFI
FSVISGGMAPDAPIVTLYSSTTERLKRVAAAFSRASILPHFSTWTIPGGLLLKAGRQGEGLHQLQQWPHDYLRCVWQESESHRQQIACLQCYLIRRREFTGLRTAGSFWW
FNPTGLTPLSYETTSLNIFASAKSGSKTQADGNSEVIPNASIYRLVPWKFLKEIVAEPGFSISGRWPSKHYFQKCKTTVQPDQNVLLRIQMEAEVRSGNLSNRQSAFVFF
LLPLSQIDIRSKKRENTKSERNSKITCNCKPIHHACTTSRGRTADAYETNNTQTSFLQRSKSSHGVIRQRLLRIPYFLEHWDQQLHRSNFERRSHSPPPYLLSQPNRTSQ
TTAYSKENASNPPKSLRVRQLLLAAVFGKSLHNRPRGSDHHHSSESSLFRSIIKYYDEFSVWKEMAELRDRKKPADWPELIGRWVVSLILVLLTQCTQFLVPQFFSDSSV
FLQLLLSALLLLAVAGIGGWCRRLLRVRASAPAFVFFSILFIWVVYVAIVRQAASNLMDLLFNGQIILLIFGLSRMLSSDPGLVSYGPSPSDVIAQSSVLEIDTQHQETE
LLMNSPNLRRSTEAGIGLGDRVRCCQICKAYVKGFDHHCPAFGNCIGHKNYLLFMVLLIGFLWTEAAYLVCLSQYAAKSEVPDGDGTRLEISFSRKVASSTMLFSILQLV
WQVVFLWWHIYCICFNIRTDEWINWKRYPEFQIFVQSELGQWSSEMRFRNPYDKGVFQNVKEFMASKQ