; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001971 (gene) of Chayote v1 genome

Gene IDSed0001971
OrganismSechium edule (Chayote v1)
Descriptionprotein indeterminate-domain 12-like
Genome locationLG04:47585521..47591794
RNA-Seq ExpressionSed0001971
SyntenySed0001971
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588479.1 Protein indeterminate-domain 12, partial [Cucurbita argyrosperma subsp. sororia]1.5e-14173.21Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
        MSK SGCSDDG  L   N+ +HGSSQ HQ   +SV KKKRNLPGTP   PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT

Query:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE
        SGS+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHRA+CDAL EE
Subjt:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE

Query:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA
        ++K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSP     NGF+TTMA
Subjt:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA

Query:  PSSFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        PSSFGG+LP+            GGV  EMIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF   GAMG
Subjt:  PSSFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

XP_022925600.1 protein indeterminate-domain 7-like isoform X1 [Cucurbita moschata]2.7e-14674.49Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
        MSK SGCSDDG  L   N+ +HGSSQ HQ   +SVPKKKRNLPGTP   PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT

Query:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE
        SGS+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHRA+CDAL EE
Subjt:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE

Query:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA
        ++K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNGF+TTMA
Subjt:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA

Query:  PSSFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        PSSFGG+LP+            GGV  EMIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  PSSFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

XP_022925609.1 protein indeterminate-domain 7-like isoform X2 [Cucurbita moschata]8.4e-14874.87Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
        MSK SGCSDDG  L   N+ +HGSSQ HQ   +SVPKKKRNLPGTP PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG

Query:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS
        S+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHRA+CDAL EE++
Subjt:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS

Query:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS
        K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNGF+TTMAPS
Subjt:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS

Query:  SFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        SFGG+LP+            GGV  EMIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  SFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

XP_022969709.1 protein indeterminate-domain 7-like isoform X1 [Cucurbita maxima]3.9e-14573.98Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
        MSK SGCSDDG  L   N+ SHGSSQ HQ   +SVPKKKRNLPGTP   PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT

Query:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE
        SGS+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHR +CDAL EE
Subjt:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE

Query:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA
        ++K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNG +TTMA
Subjt:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA

Query:  PSSFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        PSSFGG+LP+            GGV+  MIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  PSSFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

XP_022969710.1 protein indeterminate-domain 7-like isoform X2 [Cucurbita maxima]1.2e-14674.36Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
        MSK SGCSDDG  L   N+ SHGSSQ HQ   +SVPKKKRNLPGTP PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG

Query:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS
        S+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHR +CDAL EE++
Subjt:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS

Query:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS
        K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNG +TTMAPS
Subjt:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS

Query:  SFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        SFGG+LP+            GGV+  MIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  SFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

TrEMBL top hitse value%identityAlignment
A0A6J1CY07 protein indeterminate-domain 7-like isoform X16.1e-12862.24Show/hide
Query:  MSKMSGCSDDG--EKLGNEFSHGSSQSHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT--SG
        MS++SGCS+DG  EKLGN+FSH SSQ    +  PKKKRNLPGTP PDA VVALSPETLM  NRFVCEICNKGF+RDQNLQLHRRGHNLPWKLKQRT  SG
Subjt:  MSKMSGCSDDG--EKLGNEFSHGSSQSHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT--SG

Query:  SETKRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSK
        SE KRRVYVCPE +CVHH+PARALGDLTG+KKHFSRKHGEKKWKCEKCSKKYAVQSDLKAH+KTCGSKEYKCDCGTIFSRRDSFITHRAFCDAL EEN+K
Subjt:  SETKRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSK

Query:  MA-----------SFQP----------ALPHHLMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPS-SSLTSATALLQKAAQMGAVL--------S
        +A            FQ            LPH LMPIPP P N+SA +MFSSS       +   ND  FPS S+L SATALLQKA QMGA +        S
Subjt:  MA-----------SFQP----------ALPHHLMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPS-SSLTSATALLQKAAQMGAVL--------S

Query:  SPIIHHKNGFITTMAPSSFGGLLP---------------------------SNGGVEMIDDISMSNEMF-----EAARKR----------MTLNLLGAEG
        S +  HK G++T MAPSSFGGL P                           S   VEMI+DIS  N MF     EA RK+          MTL+LLGAEG
Subjt:  SPIIHHKNGFITTMAPSSFGGLLP---------------------------SNGGVEMIDDISMSNEMF-----EAARKR----------MTLNLLGAEG

Query:  GKGRNFQVQESVGFGGLMENMSYFHQQGAMGGRHTWD
         K  +        FGGLMENMSYF  Q AM GRHTW+
Subjt:  GKGRNFQVQESVGFGGLMENMSYFHQQGAMGGRHTWD

A0A6J1ECN6 protein indeterminate-domain 7-like isoform X24.1e-14874.87Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
        MSK SGCSDDG  L   N+ +HGSSQ HQ   +SVPKKKRNLPGTP PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG

Query:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS
        S+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHRA+CDAL EE++
Subjt:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS

Query:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS
        K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNGF+TTMAPS
Subjt:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS

Query:  SFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        SFGG+LP+            GGV  EMIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  SFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

A0A6J1EIJ6 protein indeterminate-domain 7-like isoform X11.3e-14674.49Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
        MSK SGCSDDG  L   N+ +HGSSQ HQ   +SVPKKKRNLPGTP   PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT

Query:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE
        SGS+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHRA+CDAL EE
Subjt:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE

Query:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA
        ++K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNGF+TTMA
Subjt:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA

Query:  PSSFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        PSSFGG+LP+            GGV  EMIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  PSSFGGLLPS-----------NGGV--EMIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

A0A6J1I0Q2 protein indeterminate-domain 7-like isoform X25.9e-14774.36Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
        MSK SGCSDDG  L   N+ SHGSSQ HQ   +SVPKKKRNLPGTP PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSG

Query:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS
        S+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHR +CDAL EE++
Subjt:  SET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENS

Query:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS
        K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNG +TTMAPS
Subjt:  K-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPS

Query:  SFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        SFGG+LP+            GGV+  MIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  SFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

A0A6J1I3G2 protein indeterminate-domain 7-like isoform X11.9e-14573.98Show/hide
Query:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
        MSK SGCSDDG  L   N+ SHGSSQ HQ   +SVPKKKRNLPGTP   PDAEVVALSP TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT
Subjt:  MSKMSGCSDDGEKLG--NEFSHGSSQSHQ--IISVPKKKRNLPGTP--GPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRT

Query:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE
        SGS+T +RRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEK WKCEKCSKKYAV SDLKAH KTCGSKEYKCDCGT+FSRRDSFITHR +CDAL EE
Subjt:  SGSET-KRRVYVCPETSCVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEE

Query:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA
        ++K       MA+  P  LPHH LMPIPP PFNLSA SMFSSS+         SND HFPSSS   L SATALLQKAAQMGA +SSPII  KNG +TTMA
Subjt:  NSK-------MASFQP-ALPHH-LMPIPPNPFNLSARSMFSSSLPDNSAPSHRSNDLHFPSSS---LTSATALLQKAAQMGAVLSSPIIHHKNGFITTMA

Query:  PSSFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG
        PSSFGG+LP+            GGV+  MIDDI M N MF     E ARK MTL+LLGAEGGKGR FQ +ESVGFGG+MEN+ YF  QGAMG
Subjt:  PSSFGGLLPS-----------NGGVE--MIDDISMSNEMF-----EAARKRMTLNLLGAEGGKGRNFQVQESVGFGGLMENMSYFHQQGAMG

SwissProt top hitse value%identityAlignment
O22759 Protein indeterminate-domain 122.0e-8360Show/hide
Query:  SHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLT
        +H     PKKKR LPG P PDAEV+ALSP+TL+A NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQ+ +  + K++VYVCPET+C HH+P+RALGDLT
Subjt:  SHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLT

Query:  GIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA---------LPHHLMPIPPNP
        GIKKHF RKHGEKKWKCEKCSK YAVQSD KAH+K CG+++Y+CDCGT+FSR+D+FITHRAFCDALAEE++++ S   +           HH M      
Subjt:  GIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA---------LPHHLMPIPPNP

Query:  FNLSARSMFSSSLPDNSAPSHRSNDLHFPSSSLTSATALLQKAAQM------GAVLSSPIIHHKN
        FN S+  +F+SS P    PS  +  L  P ++  SATALLQKA  +      G   +  I HH++
Subjt:  FNLSARSMFSSSLPDNSAPSHRSNDLHFPSSSLTSATALLQKAAQM------GAVLSSPIIHHKN

Q944L3 Zinc finger protein BALDIBIS1.4e-8178.36Show/hide
Query:  KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKHFSR
        K+KRNLPG P PDAEV+ALSP +LM  NRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQRT+  + K++VY+CPE +CVHH+PARALGDLTGIKKHFSR
Subjt:  KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKHFSR

Query:  KHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA
        KHGEKKWKC+KCSKKYAV SD KAHSK CG+KEY+CDCGT+FSR+DSFITHRAFCDALAEE+++  S  PA
Subjt:  KHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA

Q9LVQ7 Zinc finger protein ENHYDROUS7.0e-8173.56Show/hide
Query:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH
        SV KKKRNLPG P PDAEV+ALSP+TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKL+QR S  E +++VYVCP + CVHH+P+RALGDLTGIKKH
Subjt:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH

Query:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPAL-PHHLMPIPPNPFNLSARSMFSS-S
        F RKHGEKKWKCEKCSKKYAVQSD KAHSK CG+KEYKCDCGT+FSRRDSFITHRAFCDALAEE++K  +    L P  +    P     S  ++ SS S
Subjt:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPAL-PHHLMPIPPNPFNLSARSMFSS-S

Query:  LPDNSAPS
        LP +S PS
Subjt:  LPDNSAPS

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 11.8e-8145.93Show/hide
Query:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH
        S  KKKRNLPG P P++EV+ALSP+TL+A NRFVCEICNKGFQRDQNLQLHRRGHNLPWKL+Q+ S  E K++VYVCPE SCVHH+P+RALGDLTGIKKH
Subjt:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH

Query:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSK---------------------------------
        F RKHGEKKWKC+KCSKKYAVQSD KAHSK CG+KEYKCDCGT+FSRRDSFITHRAFCDALAEEN++                                 
Subjt:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSK---------------------------------

Query:  ----------MASFQPALPHHLMPIPPNPFNL---------------------------SARSMFSSS-----LPDNSAPSHRSNDL---HFPSSSLTSA
                    S  P  P  ++   P P +L                           S++S+F+SS     +    + SH S+ L    F +    SA
Subjt:  ----------MASFQPALPHHLMPIPPNPFNL---------------------------SARSMFSSS-----LPDNSAPSHRSNDL---HFPSSSLTSA

Query:  TALLQKAAQMGAVLSSPIIHHKNGFITT-------MAPSSFGGLLPSNG-GVEMIDDISMSNEMFEAARKRMTLNLLG----AEGGKGRNFQVQESVGFG
        TALLQKAAQMGA  S   + H  G +++       + P   G  LP  G     + ++ M N       K+ TL+ LG       G G +  +   VG G
Subjt:  TALLQKAAQMGAVLSSPIIHHKNGFITT-------MAPSSFGGLLPSNG-GVEMIDDISMSNEMFEAARKRMTLNLLG----AEGGKGRNFQVQESVGFG

Query:  GLMENMSYFHQQGAMGGR
        G   +M+     G   G+
Subjt:  GLMENMSYFHQQGAMGGR

Q9ZWA6 Zinc finger protein MAGPIE1.4e-8178.33Show/hide
Query:  HGSSQSHQIISVP--KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPA
        H     H+ ++ P  KKKRNLPG P P+AEV+ALSP+TLMA NRF+CEIC KGFQRDQNLQLHRRGHNLPWKLKQRTS  E ++RVYVCPE SCVHH+P 
Subjt:  HGSSQSHQIISVP--KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPA

Query:  RALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKM
        RALGDLTGIKKHF RKHGEKKWKCEKC+K+YAVQSD KAHSKTCG++EY+CDCGTIFSRRDSFITHRAFCDALAEE +++
Subjt:  RALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKM

Arabidopsis top hitse value%identityAlignment
AT1G03840.1 C2H2 and C2HC zinc fingers superfamily protein1.0e-8278.33Show/hide
Query:  HGSSQSHQIISVP--KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPA
        H     H+ ++ P  KKKRNLPG P P+AEV+ALSP+TLMA NRF+CEIC KGFQRDQNLQLHRRGHNLPWKLKQRTS  E ++RVYVCPE SCVHH+P 
Subjt:  HGSSQSHQIISVP--KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPA

Query:  RALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKM
        RALGDLTGIKKHF RKHGEKKWKCEKC+K+YAVQSD KAHSKTCG++EY+CDCGTIFSRRDSFITHRAFCDALAEE +++
Subjt:  RALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKM

AT3G45260.1 C2H2-like zinc finger protein1.0e-8278.36Show/hide
Query:  KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKHFSR
        K+KRNLPG P PDAEV+ALSP +LM  NRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQRT+  + K++VY+CPE +CVHH+PARALGDLTGIKKHFSR
Subjt:  KKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKHFSR

Query:  KHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA
        KHGEKKWKC+KCSKKYAV SD KAHSK CG+KEY+CDCGT+FSR+DSFITHRAFCDALAEE+++  S  PA
Subjt:  KHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA

AT3G50700.1 indeterminate(ID)-domain 21.3e-8245.93Show/hide
Query:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH
        S  KKKRNLPG P P++EV+ALSP+TL+A NRFVCEICNKGFQRDQNLQLHRRGHNLPWKL+Q+ S  E K++VYVCPE SCVHH+P+RALGDLTGIKKH
Subjt:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH

Query:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSK---------------------------------
        F RKHGEKKWKC+KCSKKYAVQSD KAHSK CG+KEYKCDCGT+FSRRDSFITHRAFCDALAEEN++                                 
Subjt:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSK---------------------------------

Query:  ----------MASFQPALPHHLMPIPPNPFNL---------------------------SARSMFSSS-----LPDNSAPSHRSNDL---HFPSSSLTSA
                    S  P  P  ++   P P +L                           S++S+F+SS     +    + SH S+ L    F +    SA
Subjt:  ----------MASFQPALPHHLMPIPPNPFNL---------------------------SARSMFSSS-----LPDNSAPSHRSNDL---HFPSSSLTSA

Query:  TALLQKAAQMGAVLSSPIIHHKNGFITT-------MAPSSFGGLLPSNG-GVEMIDDISMSNEMFEAARKRMTLNLLG----AEGGKGRNFQVQESVGFG
        TALLQKAAQMGA  S   + H  G +++       + P   G  LP  G     + ++ M N       K+ TL+ LG       G G +  +   VG G
Subjt:  TALLQKAAQMGAVLSSPIIHHKNGFITT-------MAPSSFGGLLPSNG-GVEMIDDISMSNEMFEAARKRMTLNLLG----AEGGKGRNFQVQESVGFG

Query:  GLMENMSYFHQQGAMGGR
        G   +M+     G   G+
Subjt:  GLMENMSYFHQQGAMGGR

AT4G02670.1 indeterminate(ID)-domain 121.4e-8460Show/hide
Query:  SHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLT
        +H     PKKKR LPG P PDAEV+ALSP+TL+A NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQ+ +  + K++VYVCPET+C HH+P+RALGDLT
Subjt:  SHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLT

Query:  GIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA---------LPHHLMPIPPNP
        GIKKHF RKHGEKKWKCEKCSK YAVQSD KAH+K CG+++Y+CDCGT+FSR+D+FITHRAFCDALAEE++++ S   +           HH M      
Subjt:  GIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPA---------LPHHLMPIPPNP

Query:  FNLSARSMFSSSLPDNSAPSHRSNDLHFPSSSLTSATALLQKAAQM------GAVLSSPIIHHKN
        FN S+  +F+SS P    PS  +  L  P ++  SATALLQKA  +      G   +  I HH++
Subjt:  FNLSARSMFSSSLPDNSAPSHRSNDLHFPSSSLTSATALLQKAAQM------GAVLSSPIIHHKN

AT5G66730.1 C2H2-like zinc finger protein5.0e-8273.56Show/hide
Query:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH
        SV KKKRNLPG P PDAEV+ALSP+TLMA NRFVCEICNKGFQRDQNLQLHRRGHNLPWKL+QR S  E +++VYVCP + CVHH+P+RALGDLTGIKKH
Subjt:  SVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETSCVHHNPARALGDLTGIKKH

Query:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPAL-PHHLMPIPPNPFNLSARSMFSS-S
        F RKHGEKKWKCEKCSKKYAVQSD KAHSK CG+KEYKCDCGT+FSRRDSFITHRAFCDALAEE++K  +    L P  +    P     S  ++ SS S
Subjt:  FSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPAL-PHHLMPIPPNPFNLSARSMFSS-S

Query:  LPDNSAPS
        LP +S PS
Subjt:  LPDNSAPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAAAATGTCAGGGTGTTCTGATGATGGTGAGAAATTAGGGAATGAATTCAGCCATGGAAGCTCTCAATCTCATCAGATCATTTCAGTCCCAAAGAAGAAAAGGAA
CTTACCGGGAACACCAGGCCCAGATGCTGAAGTGGTAGCGCTATCTCCAGAAACCCTAATGGCGAAAAATAGGTTTGTTTGTGAAATATGTAACAAGGGATTTCAGAGAG
ACCAGAACTTGCAGCTGCATAGGCGAGGGCACAACCTGCCGTGGAAGTTGAAGCAGCGAACGAGCGGGAGCGAGACGAAGAGGAGGGTGTACGTGTGCCCCGAAACATCG
TGCGTCCACCACAATCCCGCGAGAGCCCTCGGAGACCTCACCGGAATCAAGAAGCATTTTAGTAGAAAACATGGGGAGAAGAAGTGGAAATGTGAGAAATGCTCAAAGAA
GTATGCAGTTCAATCCGACTTGAAAGCTCACTCCAAAACCTGTGGCTCCAAAGAATACAAATGCGACTGTGGCACAATCTTTTCCAGGAGAGATAGCTTCATCACTCACA
GAGCATTCTGCGATGCCTTAGCAGAAGAAAACAGCAAAATGGCTTCCTTTCAACCTGCCCTTCCCCACCACCTCATGCCAATCCCTCCCAACCCCTTCAACCTCTCTGCT
CGATCCATGTTTTCGTCTTCCCTTCCCGACAACTCCGCCCCGTCCCATCGATCAAACGACCTCCATTTTCCCTCCTCCTCACTCACGTCCGCCACCGCCCTTCTACAGAA
AGCGGCACAGATGGGAGCTGTACTGAGTTCGCCTATCATCCATCATAAGAACGGTTTCATTACGACAATGGCTCCTTCTTCCTTCGGTGGGCTGTTGCCTTCTAATGGGG
GAGTAGAGATGATCGACGACATTTCGATGTCGAACGAGATGTTCGAGGCTGCAAGGAAGAGGATGACACTGAATTTGTTGGGGGCCGAGGGAGGAAAAGGGAGGAACTTT
CAAGTACAAGAAAGTGTAGGATTTGGAGGGCTTATGGAGAATATGAGTTATTTTCACCAACAAGGGGCCATGGGAGGAAGGCACACATGGGACACTTGA
mRNA sequenceShow/hide mRNA sequence
CTAAGATTAGGGGCAAATGTTAGGAACCTATATTATAAATTCTCTAATTGCAGGCCATAGAAAAAACCCTTTTTTATTTATTTATTTTATTTTCTTCTAATTCAGCCATG
AAAGCTTCCTCTCCACCAAAATGAAAGGGGAAAATCCTGCAATTTTCCTTTCTCCTCTAAACCCTAATCGCAAAAGCCCCCAAAAATCTCTCTGCCTTCTTAATTTCCAT
TTCTTTACAAATAGAAGACAATAATTTCCATTTCTTCTTTTTTTGTGTGCATTTTTCAATGTCCAAAATGTCAGGGTGTTCTGATGATGGTGAGAAATTAGGGAATGAAT
TCAGCCATGGAAGCTCTCAATCTCATCAGATCATTTCAGTCCCAAAGAAGAAAAGGAACTTACCGGGAACACCAGGCCCAGATGCTGAAGTGGTAGCGCTATCTCCAGAA
ACCCTAATGGCGAAAAATAGGTTTGTTTGTGAAATATGTAACAAGGGATTTCAGAGAGACCAGAACTTGCAGCTGCATAGGCGAGGGCACAACCTGCCGTGGAAGTTGAA
GCAGCGAACGAGCGGGAGCGAGACGAAGAGGAGGGTGTACGTGTGCCCCGAAACATCGTGCGTCCACCACAATCCCGCGAGAGCCCTCGGAGACCTCACCGGAATCAAGA
AGCATTTTAGTAGAAAACATGGGGAGAAGAAGTGGAAATGTGAGAAATGCTCAAAGAAGTATGCAGTTCAATCCGACTTGAAAGCTCACTCCAAAACCTGTGGCTCCAAA
GAATACAAATGCGACTGTGGCACAATCTTTTCCAGGAGAGATAGCTTCATCACTCACAGAGCATTCTGCGATGCCTTAGCAGAAGAAAACAGCAAAATGGCTTCCTTTCA
ACCTGCCCTTCCCCACCACCTCATGCCAATCCCTCCCAACCCCTTCAACCTCTCTGCTCGATCCATGTTTTCGTCTTCCCTTCCCGACAACTCCGCCCCGTCCCATCGAT
CAAACGACCTCCATTTTCCCTCCTCCTCACTCACGTCCGCCACCGCCCTTCTACAGAAAGCGGCACAGATGGGAGCTGTACTGAGTTCGCCTATCATCCATCATAAGAAC
GGTTTCATTACGACAATGGCTCCTTCTTCCTTCGGTGGGCTGTTGCCTTCTAATGGGGGAGTAGAGATGATCGACGACATTTCGATGTCGAACGAGATGTTCGAGGCTGC
AAGGAAGAGGATGACACTGAATTTGTTGGGGGCCGAGGGAGGAAAAGGGAGGAACTTTCAAGTACAAGAAAGTGTAGGATTTGGAGGGCTTATGGAGAATATGAGTTATT
TTCACCAACAAGGGGCCATGGGAGGAAGGCACACATGGGACACTTGATGGTTCGTTTGAGAGTCCAAGTCCTTTGCAAATAACAATTATGAATGGATTTAGAATACAATT
TTATATTGTTTTTGTTAGTTTTTTTCTGACTCCTTATTTATATTTTATTTCAAAGTTGTTTAGATTGCATCTTTTTCTGCCTGTATATAATATTGCATTAGTTCTTAGGC
TTCCCTTGAGAAAAATGTAGGGTAGGGGCATTTACGTCCAATTTTAATGAATTTGTT
Protein sequenceShow/hide protein sequence
MSKMSGCSDDGEKLGNEFSHGSSQSHQIISVPKKKRNLPGTPGPDAEVVALSPETLMAKNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRTSGSETKRRVYVCPETS
CVHHNPARALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDLKAHSKTCGSKEYKCDCGTIFSRRDSFITHRAFCDALAEENSKMASFQPALPHHLMPIPPNPFNLSA
RSMFSSSLPDNSAPSHRSNDLHFPSSSLTSATALLQKAAQMGAVLSSPIIHHKNGFITTMAPSSFGGLLPSNGGVEMIDDISMSNEMFEAARKRMTLNLLGAEGGKGRNF
QVQESVGFGGLMENMSYFHQQGAMGGRHTWDT