; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025329 (gene) of Chayote v1 genome

Gene IDSed0025329
OrganismSechium edule (Chayote v1)
DescriptionC2H2-like zinc finger protein
Genome locationLG03:13885629..13889139
RNA-Seq ExpressionSed0025329
SyntenySed0025329
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR022755 - Zinc finger, double-stranded RNA binding
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595262.1 Zinc finger protein BALDIBIS, partial [Cucurbita argyrosperma subsp. sororia]1.0e-11669.36Show/hide
Query:  DPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCE
        DP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQ+S+K + KKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCE
Subjt:  DPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCE

Query:  KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFN
        KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES RLI    ANS PNF+ NS ++S+F    NSQGGIFVQSD+NN     
Subjt:  KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFN

Query:  HFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLKAAQLGSTKSNPSCSGNSVGVM----SSFEQNE--
           INN  P     PPSNLF+ISNRIL  NK +S   V   S CS G          MSATALLLKAA  GSTKSNPS SG SVGVM    SSF+Q++  
Subjt:  HFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLKAAQLGSTKSNPSCSGNSVGVM----SSFEQNE--

Query:  ------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG
                    GLRDFLGT            SLMEM+K G   SS+MGLS+F+ N RG
Subjt:  ------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG

XP_022962752.1 protein indeterminate-domain 9-like [Cucurbita moschata]9.9e-13370.15Show/hide
Query:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE
        ++SSQLYHLPHNPNPNP+ N   PKK+RN PGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQ+S+K + KKKVYICPE
Subjt:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE

Query:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES---SRLIANSKPNF
        KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES   S + ANS PNF
Subjt:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES---SRLIANSKPNF

Query:  IGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLKA
        I NS ++S+F    NSQGGIFVQSD+NN      F INN  P     PPSNLF+ISNRIL  NK +S   V   S CS G          MSATALLLKA
Subjt:  IGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLKA

Query:  AQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG
        A  GSTKSNPS SGNSVGVM    SSF+Q++              GLRDFLGT            SLMEM+K G   SS+MGLS+F+ N RG
Subjt:  AQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG

XP_022972578.1 protein indeterminate-domain 9-like [Cucurbita maxima]2.1e-13068.5Show/hide
Query:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE
        ++SSQLYHLPHNPNPN       PKK+RNLPGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQ+S+K + KKKVYICPE
Subjt:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE

Query:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----------
        KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES RLI          
Subjt:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----------

Query:  -ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MS
         ANS PNFI NS ++S+F    NSQGGIFVQSD+NN      F IN   P     PPSNLF+ISNRIL  NK +S   V   S CSGG          MS
Subjt:  -ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MS

Query:  ATALLLKAAQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG
        ATALLLKAA  GS KS PS SGNSVGVM    SSF+Q++              GLRDFLGT            SLMEM+K G   SS+MGLS+F+ N RG
Subjt:  ATALLLKAAQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG

XP_023518386.1 protein indeterminate-domain 9-like [Cucurbita pepo subsp. pepo]2.4e-13169.47Show/hide
Query:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE
        ++SSQLYHLPHNPNPNP+ N   PKK+RN PGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQ+S+K + KKKVYICPE
Subjt:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE

Query:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----ANSKPN
        KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES RLI    ANS PN
Subjt:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----ANSKPN

Query:  FIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLK
        FI NS ++S+F    NSQGGIFVQS +NN      F INN  P     PPSNLF+ SN IL  NK +S   V   S CS G          MSATALLLK
Subjt:  FIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLK

Query:  AAQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG
        AA  GSTKSNPS SGNSVGVM    SSF+Q++              GLRDFLGT            SL+EM+K G   SS+MGLS+F+ N RG
Subjt:  AAQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG

XP_038883199.1 zinc finger protein BALDIBIS-like [Benincasa hispida]4.1e-11860.18Show/hide
Query:  MMSGIDFS-GPSLVGEF--VPQKSSQLY--HLPHNPNPNPHNNSS----LPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHR
        MMSGIDFS   S +GEF    Q+SSQLY  HLPH+PNPN + +SS    LPKK+RNLPGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHR
Subjt:  MMSGIDFS-GPSLVGEF--VPQKSSQLY--HLPHNPNPNPHNNSS----LPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHR

Query:  RGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI
        RGHNLPWKLRQ+S+K+  KKKVYICPEK CVHHDP+RALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI
Subjt:  RGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI

Query:  THRAFCDALADESSRLI-----ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINN------NFPPSNLFEISNRILS-SNK
        THRAFCDALA+ES++LI     ANS+ NFI N+ +                 ++NNNN + N    N+ +  +N        PPSNL++ISN+ILS +NK
Subjt:  THRAFCDALADESSRLI-----ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINN------NFPPSNLFEISNRILS-SNK

Query:  KNSSK-------------------------IVSGCS-------GGMSATALLLKAAQLGSTKSNPSCS--GNSVGVMSSFEQN--------------EGL
         NSSK                         + S CS       GGMSATALL+KAAQLGSTKSN S S   NSVGV+ S   N               GL
Subjt:  KNSSK-------------------------IVSGCS-------GGMSATALLLKAAQLGSTKSNPSCS--GNSVGVMSSFEQN--------------EGL

Query:  RDFLGTNS-----------LMEMTK---SGSSDMGLSQFVRN
        RDF G              LMEMTK   S SS+MGLS F+ N
Subjt:  RDFLGTNS-----------LMEMTK---SGSSDMGLSQFVRN

TrEMBL top hitse value%identityAlignment
A0A0A0KIB8 C2H2-type domain-containing protein5.7e-10253.68Show/hide
Query:  MMSGIDFS-GPSLVGEF-----VPQKSSQLYHLPH------NPNPNPHN-------------NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICE
        MMSGIDFS   + +GEF       Q++SQL H  H      NPNP P+                SLPKK+RNLPGKPDP+AEVI LSPN+LMATNRFICE
Subjt:  MMSGIDFS-GPSLVGEF-----VPQKSSQLYHLPH------NPNPNPHN-------------NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQ--AKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGT
        ICNKGFQRDQNLQLHRRGHNLPWKLRQ+SSK+    KKKVYICPEK CVHHDP+RALGDLTGIKKH+SRKHGEKKWKCEKC KKYAVQSDWKAHSKTCGT
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQ--AKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGT

Query:  REYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-----NSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLF
        R+YKCDCGTLFSRKDSFITHRAFCDALA+ES++LI+     NS+ NFI +           N+   I  Q  ++N  +  +  I  ++P     P +NLF
Subjt:  REYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-----NSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLF

Query:  EISNRILSSNKKNSS-------------------------------------KIVSGCSGGMSATALLLKAAQLGSTKSNPS--CS-GNSVGVMSSFE--
        +ISN++LS N KN++                                         G  GGMSATALLLKAAQLGSTKSN +  CS  NSVGV+ S    
Subjt:  EISNRILSSNKKNSS-------------------------------------KIVSGCSGGMSATALLLKAAQLGSTKSNPS--CS-GNSVGVMSSFE--

Query:  --------------QNEGLRDFLGTNS--------LM--EMTK---SGSSDMG-LSQFVRND
                      +  GL+DF G           LM  EMTK   S SS+MG +SQF+ N+
Subjt:  --------------QNEGLRDFLGTNS--------LM--EMTK---SGSSDMG-LSQFVRND

A0A1S3BM97 protein indeterminate-domain 91.3e-10657.3Show/hide
Query:  MMSGIDFS-GPSLVGEF-----VPQKSSQLYH------LPHNPNPNPHN-----------NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEIC
        MMSGIDFS   + +GEF       Q++SQLYH      LP NPNP P+              SLPKK+RNLPGKPDP+AEVI LSPN+LMATNRFICE+C
Subjt:  MMSGIDFS-GPSLVGEF-----VPQKSSQLYH------LPHNPNPNPHN-----------NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEIC

Query:  NKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQ--AKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTRE
        NKGFQRDQNLQLHRRGHNLPWKLRQ+SSK+    KKKVYICPEK CVHHDP+RALGDLTGIKKH+SRKHGEKKWKCEKC KKYAVQSDWKAHSKTCGTR+
Subjt:  NKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQ--AKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTRE

Query:  YKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-----NSKPNFIGNSFDQSIFNH---DPNSQGG-IFVQSDNNNNTNF-NHFKINNLIPINNNFPPS
        YKCDCGTLFSRKDSFITHRAFCDALA+ES++LI+     NS+ +FI N+ + SI      D N +   +F Q  +  +  F N  +I +L   NNN  PS
Subjt:  YKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-----NSKPNFIGNSFDQSIFNH---DPNSQGG-IFVQSDNNNNTNF-NHFKINNLIPINNNFPPS

Query:  NLFEISNRILSS---------------NKKNSSKIVSGCSGGMSATALLLKAAQLGSTKS--NPSCSG-NSVGVMSSFE-----------------QNEG
            I N  LSS               +   S + V+G  GGMSATALLLKAAQLGSTKS  N  CSG NSVGV+ S                   +  G
Subjt:  NLFEISNRILSS---------------NKKNSSKIVSGCSGGMSATALLLKAAQLGSTKS--NPSCSG-NSVGVMSSFE-----------------QNEG

Query:  LRDFLGTNSL------------MEMTK---SGSSDMGLSQFVRND
        LRDF G   L            MEMTK   S SS+MG SQF+ N+
Subjt:  LRDFLGTNSL------------MEMTK---SGSSDMGLSQFVRND

A0A6J1CRP6 protein indeterminate-domain 9-like2.5e-11357.32Show/hide
Query:  MMSGIDFSG-PSLVGEFVPQKSSQLYHLPHNPNPNPHNN----------SSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQL
        MMSGIDF   PS VGE    +SSQLY LP NPNPNP+ N           SLPKK+RNLPGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQL
Subjt:  MMSGIDFSG-PSLVGEFVPQKSSQLYHLPHNPNPNPHNN----------SSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQL

Query:  HRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDS
        HRRGHNLPWKLRQ+S+K + KKKVYICPEKSCVHHD ARALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDS
Subjt:  HRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDS

Query:  FITHRAFCDALADESSRLI-----ANSKPNFIG-NSFDQSIF--------------NHDPNSQG-------GIFVQSDNNNNTNFNHF--KINNLIPIN-
        FITHRAFCDALA+ES+RLI     AN KPNF G NS + S                N + NSQG       G+FV      + NF+HF    NNL+P   
Subjt:  FITHRAFCDALADESSRLI-----ANSKPNFIG-NSFDQSIF--------------NHDPNSQG-------GIFVQSDNNNNTNFNHF--KINNLIPIN-

Query:  ---NNFP-------PSNLFEISNRILSSNKKNSSK-----------------IVSGCSGGMSATALLLKAAQLGSTKSNP-SCSGNSVGVMSS-------
           + FP        +NLFEI NRIL    KN SK                    G  GG+SATALLLKAAQLGST+S+P S SG+SVGVM S       
Subjt:  ---NNFP-------PSNLFEISNRILSSNKKNSSK-----------------IVSGCSGGMSATALLLKAAQLGSTKSNP-SCSGNSVGVMSS-------

Query:  --------------------FEQ--NEGL-RDFLGTNS-------------LMEMTKSGSSDM-----GLSQFVRNDR
                            FE   N GL RDFLG+               LM++TK  +S+M     G +QF+ NDR
Subjt:  --------------------FEQ--NEGL-RDFLGTNS-------------LMEMTKSGSSDM-----GLSQFVRNDR

A0A6J1HDF4 protein indeterminate-domain 9-like4.8e-13370.15Show/hide
Query:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE
        ++SSQLYHLPHNPNPNP+ N   PKK+RN PGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQ+S+K + KKKVYICPE
Subjt:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE

Query:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES---SRLIANSKPNF
        KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES   S + ANS PNF
Subjt:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES---SRLIANSKPNF

Query:  IGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLKA
        I NS ++S+F    NSQGGIFVQSD+NN      F INN  P     PPSNLF+ISNRIL  NK +S   V   S CS G          MSATALLLKA
Subjt:  IGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MSATALLLKA

Query:  AQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG
        A  GSTKSNPS SGNSVGVM    SSF+Q++              GLRDFLGT            SLMEM+K G   SS+MGLS+F+ N RG
Subjt:  AQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG

A0A6J1IAE3 protein indeterminate-domain 9-like1.0e-13068.5Show/hide
Query:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE
        ++SSQLYHLPHNPNPN       PKK+RNLPGKPDP+AEVI LSPN+LMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQ+S+K + KKKVYICPE
Subjt:  QKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPE

Query:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----------
        KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADES RLI          
Subjt:  KSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLI----------

Query:  -ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MS
         ANS PNFI NS ++S+F    NSQGGIFVQSD+NN      F IN   P     PPSNLF+ISNRIL  NK +S   V   S CSGG          MS
Subjt:  -ANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIV---SGCSGG----------MS

Query:  ATALLLKAAQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG
        ATALLLKAA  GS KS PS SGNSVGVM    SSF+Q++              GLRDFLGT            SLMEM+K G   SS+MGLS+F+ N RG
Subjt:  ATALLLKAAQLGSTKSNPSCSGNSVGVM----SSFEQNE--------------GLRDFLGTN-----------SLMEMTKSG---SSDMGLSQFVRNDRG

SwissProt top hitse value%identityAlignment
O22759 Protein indeterminate-domain 123.3e-8654.6Show/hide
Query:  HNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTG
        H  +  PKK+R LPG PDP+AEVI LSP +L+ATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QK++K+Q KKKVY+CPE +C HH P+RALGDLTG
Subjt:  HNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTG

Query:  IKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGI
        IKKHF RKHGEKKWKCEKCSK YAVQSDWKAH+K CGTR+Y+CDCGTLFSRKD+FITHRAFCDALA+ES+RL + S  N         + N +PN QG  
Subjt:  IKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGI

Query:  FVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKSNPSCSGNSVGVMSSFEQNEGLRDFL
                    +HF  N           S+L   S+ +      +++ + +  +  +SATALL KA  L ST         S+G          + +FL
Subjt:  FVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKSNPSCSGNSVGVMSSFEQNEGLRDFL

Query:  GTNSLMEMTKSGSSD
        G + +M MT + SS+
Subjt:  GTNSLMEMTKSGSSD

Q700D2 Zinc finger protein JACKDAW5.0e-8764.34Show/hide
Query:  MMSGIDFSGPSLVGEFVPQKSSQLYHLPH---------NPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHR
        M+ G  FS  S +G FV Q+ + L+HL           NPNPN   NSS  KK+RN PG PDP+A+VI LSP +LMATNRF+CEICNKGFQRDQNLQLHR
Subjt:  MMSGIDFSGPSLVGEFVPQKSSQLYHLPH---------NPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHR

Query:  RGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI
        RGHNLPWKL+Q+S ++  KKKVYICP K+CVHHD +RALGDLTGIKKH+SRKHGEKKWKCEKCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFI
Subjt:  RGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI

Query:  THRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNH
        THRAFCDAL +E +R+ + S  N + ++ + +  N           +S+  NN N  H
Subjt:  THRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNH

Q8H1F5 Protein indeterminate-domain 71.4e-8461.35Show/hide
Query:  KKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSR
        K++RN PG PDPEAEV+ LSP +LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+Q+S+KD  +KKVY+CPE  CVHH P+RALGDLTGIKKHF R
Subjt:  KKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSR

Query:  KHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNN
        KHGEKKWKCEKCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALA+ES+R    + PN I      S  +H   +Q  I   S + 
Subjt:  KHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNN

Query:  N---NTNF----------NHFK-INNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKS
        N   N+N           +H++ I   +  +N  P  N   +   + SS     S      S  MSATALL KAAQ+GSTKS
Subjt:  N---NTNF----------NHFK-INNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKS

Q944L3 Zinc finger protein BALDIBIS6.3e-9050.38Show/hide
Query:  HLPHNPNPNPH-NNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHH
        H+  NPNPNP+  +S+  K++RNLPG PDP+AEVI LSPNSLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+Q+++K+Q KKKVYICPEK+CVHH
Subjt:  HLPHNPNPNPH-NNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHH

Query:  DPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-NSKPNFIGNSFDQS
        DPARALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALA+ES+R ++    P ++ N+ D  
Subjt:  DPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-NSKPNFIGNSFDQS

Query:  I------FNHDPNSQGGIFVQSDNNN-NTNFNHFK-INNLIPIN---NNFPPS---------NLFEISNR-----ILSSNKKNSSKIV------------
        +       NH          Q D    NTN N+   +   +P N   ++  PS         NL+ +  +     +L+ N  N++ I+            
Subjt:  I------FNHDPNSQGGIFVQSDNNN-NTNFNHFK-INNLIPIN---NNFPPS---------NLFEISNR-----ILSSNKKNSSKIV------------

Query:  -------------------------SGCSGGMSATALLLKAAQLGSTKSNPSCSGN-SVGVMSSFEQNEG--------------LRDFLGTNS
                                  G    MSATALL KAAQ+GS +S+ S S + + G+M+S   N+                RDFLG  S
Subjt:  -------------------------SGCSGGMSATALLLKAAQLGSTKSNPSCSGN-SVGVMSSFEQNEG--------------LRDFLGTNS

Q9LRW7 Protein indeterminate-domain 111.6e-8556.74Show/hide
Query:  NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIK
        +S   KKRRN PG PDPE+EVI LSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+Q+S+K+  +KKVY+CPE SCVHHDP+RALGDLTGIK
Subjt:  NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIK

Query:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA------NSKPNFIGNSFDQSIFNHDPNS
        KHF RKHGEKKWKC+KCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALA+E++R +       N++PN +      S  +H   +
Subjt:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA------NSKPNFIGNSFDQSIFNHDPNS

Query:  QGGIFVQSDNNNNTNFN-----HFKINNLIPINNN--------FP-------------------PSNLFEISNRILSSNKKNSS------KIVSGCSGGM
        Q  I V S ++++ N N     HF  NN    N+N        FP                   P  L    + + SSN   S+       + S  S  M
Subjt:  QGGIFVQSDNNNNTNFN-----HFKINNLIPINNN--------FP-------------------PSNLFEISNRILSSNKKNSS------KIVSGCSGGM

Query:  SATALLLKAAQLGSTKSNP
        SATALL KAAQ+GSTK+ P
Subjt:  SATALLLKAAQLGSTKSNP

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 79.7e-8661.35Show/hide
Query:  KKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSR
        K++RN PG PDPEAEV+ LSP +LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+Q+S+KD  +KKVY+CPE  CVHH P+RALGDLTGIKKHF R
Subjt:  KKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSR

Query:  KHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNN
        KHGEKKWKCEKCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALA+ES+R    + PN I      S  +H   +Q  I   S + 
Subjt:  KHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNN

Query:  N---NTNF----------NHFK-INNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKS
        N   N+N           +H++ I   +  +N  P  N   +   + SS     S      S  MSATALL KAAQ+GSTKS
Subjt:  N---NTNF----------NHFK-INNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKS

AT3G13810.1 indeterminate(ID)-domain 111.1e-8656.74Show/hide
Query:  NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIK
        +S   KKRRN PG PDPE+EVI LSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+Q+S+K+  +KKVY+CPE SCVHHDP+RALGDLTGIK
Subjt:  NSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIK

Query:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA------NSKPNFIGNSFDQSIFNHDPNS
        KHF RKHGEKKWKC+KCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALA+E++R +       N++PN +      S  +H   +
Subjt:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA------NSKPNFIGNSFDQSIFNHDPNS

Query:  QGGIFVQSDNNNNTNFN-----HFKINNLIPINNN--------FP-------------------PSNLFEISNRILSSNKKNSS------KIVSGCSGGM
        Q  I V S ++++ N N     HF  NN    N+N        FP                   P  L    + + SSN   S+       + S  S  M
Subjt:  QGGIFVQSDNNNNTNFN-----HFKINNLIPINNN--------FP-------------------PSNLFEISNRILSSNKKNSS------KIVSGCSGGM

Query:  SATALLLKAAQLGSTKSNP
        SATALL KAAQ+GSTK+ P
Subjt:  SATALLLKAAQLGSTKSNP

AT3G45260.1 C2H2-like zinc finger protein4.5e-9150.38Show/hide
Query:  HLPHNPNPNPH-NNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHH
        H+  NPNPNP+  +S+  K++RNLPG PDP+AEVI LSPNSLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+Q+++K+Q KKKVYICPEK+CVHH
Subjt:  HLPHNPNPNPH-NNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHH

Query:  DPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-NSKPNFIGNSFDQS
        DPARALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALA+ES+R ++    P ++ N+ D  
Subjt:  DPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIA-NSKPNFIGNSFDQS

Query:  I------FNHDPNSQGGIFVQSDNNN-NTNFNHFK-INNLIPIN---NNFPPS---------NLFEISNR-----ILSSNKKNSSKIV------------
        +       NH          Q D    NTN N+   +   +P N   ++  PS         NL+ +  +     +L+ N  N++ I+            
Subjt:  I------FNHDPNSQGGIFVQSDNNN-NTNFNHFK-INNLIPIN---NNFPPS---------NLFEISNR-----ILSSNKKNSSKIV------------

Query:  -------------------------SGCSGGMSATALLLKAAQLGSTKSNPSCSGN-SVGVMSSFEQNEG--------------LRDFLGTNS
                                  G    MSATALL KAAQ+GS +S+ S S + + G+M+S   N+                RDFLG  S
Subjt:  -------------------------SGCSGGMSATALLLKAAQLGSTKSNPSCSGN-SVGVMSSFEQNEG--------------LRDFLGTNS

AT4G02670.1 indeterminate(ID)-domain 122.3e-8754.6Show/hide
Query:  HNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTG
        H  +  PKK+R LPG PDP+AEVI LSP +L+ATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QK++K+Q KKKVY+CPE +C HH P+RALGDLTG
Subjt:  HNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTG

Query:  IKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGI
        IKKHF RKHGEKKWKCEKCSK YAVQSDWKAH+K CGTR+Y+CDCGTLFSRKD+FITHRAFCDALA+ES+RL + S  N         + N +PN QG  
Subjt:  IKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGI

Query:  FVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKSNPSCSGNSVGVMSSFEQNEGLRDFL
                    +HF  N           S+L   S+ +      +++ + +  +  +SATALL KA  L ST         S+G          + +FL
Subjt:  FVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKSNPSCSGNSVGVMSSFEQNEGLRDFL

Query:  GTNSLMEMTKSGSSD
        G + +M MT + SS+
Subjt:  GTNSLMEMTKSGSSD

AT5G03150.1 C2H2-like zinc finger protein3.6e-8864.34Show/hide
Query:  MMSGIDFSGPSLVGEFVPQKSSQLYHLPH---------NPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHR
        M+ G  FS  S +G FV Q+ + L+HL           NPNPN   NSS  KK+RN PG PDP+A+VI LSP +LMATNRF+CEICNKGFQRDQNLQLHR
Subjt:  MMSGIDFSGPSLVGEFVPQKSSQLYHLPH---------NPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHR

Query:  RGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI
        RGHNLPWKL+Q+S ++  KKKVYICP K+CVHHD +RALGDLTGIKKH+SRKHGEKKWKCEKCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFI
Subjt:  RGHNLPWKLRQKSSKDQAKKKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFI

Query:  THRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNH
        THRAFCDAL +E +R+ + S  N + ++ + +  N           +S+  NN N  H
Subjt:  THRAFCDALADESSRLIANSKPNFIGNSFDQSIFNHDPNSQGGIFVQSDNNNNTNFNH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTCTGGCATTGACTTTTCTGGTCCTTCTCTTGTTGGTGAGTTTGTTCCTCAAAAAAGCTCTCAACTTTATCATCTTCCTCATAACCCTAATCCAAACCCTCATAA
TAACTCTTCTCTCCCCAAGAAAAGGAGAAATTTGCCAGGAAAACCAGATCCAGAGGCAGAGGTAATTGGTCTATCTCCCAACAGTCTAATGGCAACAAACAGATTCATAT
GTGAAATATGCAACAAAGGGTTCCAAAGGGATCAAAACTTGCAGCTTCACAGAAGAGGGCACAATCTGCCATGGAAGCTAAGGCAGAAAAGCAGCAAAGATCAAGCAAAA
AAGAAAGTTTATATCTGCCCTGAAAAGTCTTGTGTTCATCATGATCCAGCTAGAGCTCTTGGGGACTTAACTGGAATTAAAAAGCATTTTAGTAGAAAACATGGGGAAAA
GAAATGGAAATGTGAGAAATGTTCTAAGAAATATGCTGTTCAATCTGACTGGAAAGCTCACTCCAAAACTTGTGGAACCAGAGAGTACAAATGTGACTGTGGAACACTCT
TTTCCAGGAAAGATAGCTTCATTACTCACAGAGCATTTTGTGACGCTTTAGCAGATGAAAGTTCAAGGCTCATTGCAAATTCAAAGCCAAATTTCATTGGCAATTCATTC
GACCAATCAATCTTCAATCATGATCCCAATTCCCAAGGCGGAATCTTTGTCCAATCGGATAATAATAATAATACTAATTTCAATCATTTCAAAATTAACAACCTCATTCC
CATCAACAACAACTTCCCGCCTTCTAATCTCTTCGAAATCTCGAACCGAATTCTCAGTTCGAACAAAAAAAACTCTTCCAAAATAGTGTCAGGTTGCTCGGGAGGAATGT
CGGCCACGGCACTGCTTCTTAAGGCGGCACAATTGGGGTCGACAAAAAGTAATCCGTCATGCTCGGGGAATAGTGTGGGGGTTATGAGTTCGTTTGAGCAGAACGAAGGG
TTGAGGGATTTTCTTGGCACGAATTCGTTAATGGAGATGACGAAATCCGGGAGCTCGGATATGGGGTTGAGCCAATTCGTTAGGAATGATCGAGGATGA
mRNA sequenceShow/hide mRNA sequence
CACAAATACAATATACTCTCTCTTTTAGTATCTTCCTCTTCTTCTTCTTCTTTGAGGGTTTTGTTTACAACTTAGTTTGAGTATGTTTCTTCTTGTCTTTGGCCATAAGT
CTTAAAAAAATTTCATCTTTTTCTCTTGATTTGGAGCTTCATCAAAAGCATATAAAAGCAAAACCCCCTTTGGATTTTTGTTTTTTTGAATTATTTTCTTCTTTTCTTGT
AGAATCTTTAATTAATTTGTTTGTAAGCTAATTTATGCAATGATGTCTGGCATTGACTTTTCTGGTCCTTCTCTTGTTGGTGAGTTTGTTCCTCAAAAAAGCTCTCAACT
TTATCATCTTCCTCATAACCCTAATCCAAACCCTCATAATAACTCTTCTCTCCCCAAGAAAAGGAGAAATTTGCCAGGAAAACCAGATCCAGAGGCAGAGGTAATTGGTC
TATCTCCCAACAGTCTAATGGCAACAAACAGATTCATATGTGAAATATGCAACAAAGGGTTCCAAAGGGATCAAAACTTGCAGCTTCACAGAAGAGGGCACAATCTGCCA
TGGAAGCTAAGGCAGAAAAGCAGCAAAGATCAAGCAAAAAAGAAAGTTTATATCTGCCCTGAAAAGTCTTGTGTTCATCATGATCCAGCTAGAGCTCTTGGGGACTTAAC
TGGAATTAAAAAGCATTTTAGTAGAAAACATGGGGAAAAGAAATGGAAATGTGAGAAATGTTCTAAGAAATATGCTGTTCAATCTGACTGGAAAGCTCACTCCAAAACTT
GTGGAACCAGAGAGTACAAATGTGACTGTGGAACACTCTTTTCCAGGAAAGATAGCTTCATTACTCACAGAGCATTTTGTGACGCTTTAGCAGATGAAAGTTCAAGGCTC
ATTGCAAATTCAAAGCCAAATTTCATTGGCAATTCATTCGACCAATCAATCTTCAATCATGATCCCAATTCCCAAGGCGGAATCTTTGTCCAATCGGATAATAATAATAA
TACTAATTTCAATCATTTCAAAATTAACAACCTCATTCCCATCAACAACAACTTCCCGCCTTCTAATCTCTTCGAAATCTCGAACCGAATTCTCAGTTCGAACAAAAAAA
ACTCTTCCAAAATAGTGTCAGGTTGCTCGGGAGGAATGTCGGCCACGGCACTGCTTCTTAAGGCGGCACAATTGGGGTCGACAAAAAGTAATCCGTCATGCTCGGGGAAT
AGTGTGGGGGTTATGAGTTCGTTTGAGCAGAACGAAGGGTTGAGGGATTTTCTTGGCACGAATTCGTTAATGGAGATGACGAAATCCGGGAGCTCGGATATGGGGTTGAG
CCAATTCGTTAGGAATGATCGAGGATGAGGAAAATGCTTGAGATGTTTGTTGTTATTTTAGGATTGAGATATTAGAAGCAAGATCGAGAGGGGGATTTTATAAGGGAAGG
GGAAAATGTCATGTATTGATAATATATAATAAAGTTTTTGGAGGAAATTTTGAAATATTTGAATGCTTCTTAATATGAATTTGAAATTCTATTGTGCATGTATCTTTGTA
TTAAAGAATGAAAGTATTTTTCTGTTTTTCTTTGTGAAAA
Protein sequenceShow/hide protein sequence
MMSGIDFSGPSLVGEFVPQKSSQLYHLPHNPNPNPHNNSSLPKKRRNLPGKPDPEAEVIGLSPNSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQKSSKDQAK
KKVYICPEKSCVHHDPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALADESSRLIANSKPNFIGNSF
DQSIFNHDPNSQGGIFVQSDNNNNTNFNHFKINNLIPINNNFPPSNLFEISNRILSSNKKNSSKIVSGCSGGMSATALLLKAAQLGSTKSNPSCSGNSVGVMSSFEQNEG
LRDFLGTNSLMEMTKSGSSDMGLSQFVRNDRG