; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000995 (gene) of Snake gourd v1 genome

Gene IDTan0000995
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein indeterminate-domain 2-like
Genome locationLG01:114873037..114876716
RNA-Seq ExpressionTan0000995
SyntenyTan0000995
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037846.1 Protein indeterminate-domain 1 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-25790.6Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        MVDLENSSPV VS DAGLSSSGYIEPV VVAP KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         V NSEGKES TSKAA+ SPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVS  AATTCLTSA+VNSTTNASAN SSSN++FGN TVFAP  SIS TT
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
        QISPTLPPPSLP SIVPYDC STRP  SA+NPTSLSLSTSLYLS+KGSSLFAP DQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS

Query:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG
        S+GQD+S+T QWSRN+NR+HRNG+SL AG+GL+LPSGP GSG+MMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGY VA+S ACGG
Subjt:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG

Query:  SGDGSSSGDAWDRDHEKDEKH
        SG+GS SG+AWD    +DEKH
Subjt:  SGDGSSSGDAWDRDHEKDEKH

XP_022941213.1 protein indeterminate-domain 2-like [Cucurbita moschata]1.7e-25890.98Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        MVDLENSSPV VS DAGLSSSGYIEPV VVAP KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         V NSEGKES TSKAA+ SPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVS  AATTCLTSA+VNSTTNASAN SSSN++FGN TVFAP ASIS TT
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
        QISPTLPPPSLP SIVPYDC STRP  SA+NPTSLSLSTSLYLS+KGSSLFAP DQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS

Query:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG
        S+GQD+S+T QWSRN+NR+HRNG+SL AG+GL+LPSGP GSG+MMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGY VA+S ACGG
Subjt:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG

Query:  SGDGSSSGDAWDRDHEKDEKH
        SG+GS SGDAWD    +DEKH
Subjt:  SGDGSSSGDAWDRDHEKDEKH

XP_022975944.1 protein indeterminate-domain 1-like [Cucurbita maxima]9.1e-20477.82Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        M D+EN SPVAVSGDA LSSS YIEPVT VA PKKKRNLPGMPDP AEVIALS ESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRT NEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEP+CVHHN ARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAV+SDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEES+RAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         VP S+ KES T      SPPPPPLTP+TTVVSPALSIHSSELAD  IR+ S+               ST +A+A +SSSNDLFGNSTVFAP AS     
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA-
          S T+   SL ++IV YDCPSTRPPVS +NPTSLSLST LYLS KG SSLF  PDQDRLQYT STQPAAMSATALLQKAAEMGATASNPSLFRG GMA 
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA-

Query:  TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVAC
        T+SS  D S+TA    NVNR+HRNGASL AGL LELPSGPAGSG+MMAGGPFCSFGSQPMTRDLLGLG+ GGGAS SRFSALIASM GG  Y        
Subjt:  TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVAC

Query:  GGSGDGSSSGDAWDRDHEKDEKH
            D SS+ DAWDRD E+D K+
Subjt:  GGSGDGSSSGDAWDRDHEKDEKH

XP_022981927.1 protein indeterminate-domain 2-like [Cucurbita maxima]4.1e-25789.83Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        MVDLE SSPV VS +AGLSSSGYIEPV VVAPPKKKRNLPGMPDPAAEVIALSP SLLATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         VPNSEGKES TSKAA+ SPPPPPLTPSTTV+SPALSIHSSELADNPIRLPSVS  AATTCLTSA+VNSTTNASAN SSSN++FGN TVFAP ASIS TT
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
        QISPTLPPPSLP SIVPYDC STRP  SA+NPTSLSLSTSLYLS+KGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS

Query:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG
        S+GQD+S+T QWSRN+NR+HRNG+SL AG+GL+LPSGP GSG+M++GGPFCSFGSQPMTRDLLGLGLGGGGAS+SRFSALIASMGGGSGY VA+S ACGG
Subjt:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG

Query:  SGDGSSSGDAWDRDHEKDEKH
        SG+GS SGDAWD    + EKH
Subjt:  SGDGSSSGDAWDRDHEKDEKH

XP_023535867.1 protein indeterminate-domain 1-like [Cucurbita pepo subsp. pepo]7.4e-20678.59Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        MVD+EN SPVAVSGDA LSSS YIEPVT VA PKKKRNLPGMPDP AEVIALS ESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRT NEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEP+CVHHN ARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAV+SDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         VP S+ KES T      SPP PPLTP+TTVVSPALSIHSSELAD  IR+ S+               ST +A+A +SSSNDLFGNSTVFAP AS     
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA-
          SPT+   +LPTSIV YDCPSTRP VS +NPTSLSLST LYLSTKG SSLF  PDQDRLQYT STQPAAMSATALLQKAAEMGATASNPSLFRG GMA 
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA-

Query:  TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVAC
        T+SS  D S+TA    NVNR+HRNGASL AGL LELPSGPAGSG+MMAGGPFCSFGSQPMTRDLLGLG+ GGGAS SRFSALIASM GG  Y        
Subjt:  TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVAC

Query:  GGSGDGSSSGDAWDRDHEKDEKH
            D SS+ DAWDRD E+D K+
Subjt:  GGSGDGSSSGDAWDRDHEKDEKH

TrEMBL top hitse value%identityAlignment
A0A6J1F9H6 protein indeterminate-domain 2-like1.9e-19977.1Show/hide
Query:  MVDLENSSPVAVS-GDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEV
        MVD+EN SPVAVS GDA LSSS YIEPVT VA PKKKRNLPGMPDP AEVIALS ESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRT NEV
Subjt:  MVDLENSSPVAVS-GDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEV

Query:  RKRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT
        RKRVYVCPEP+CVHHN ARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAV+SDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT
Subjt:  RKRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT

Query:  LVVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVT
        L +P S  KES T      SPPPPPLTP+TTVVSPALSIHSSELAD  IR+ S+                T + +A +SSSNDLFGNSTVFA  AS    
Subjt:  LVVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVT

Query:  TQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA
           SPT+   +LPTSIV YDCPST P VS +NPTSLSLST LYLSTKG SSLF  PDQDRLQYT STQPAAMSATALLQKAAEMGATASNPSLFRG GMA
Subjt:  TQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA

Query:  -TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVA
         T+SS  D S+TA    NVNR+H NGASL  GL LELPSG AGSG+MMAGGPFCSFGSQPMTRDLLGLG+ GGGAS SRFSALIASM GG  Y       
Subjt:  -TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVA

Query:  CGGSGDGSSSGDAWDRDHEKDEKH
             D SS+ DAWDRD E+D K+
Subjt:  CGGSGDGSSSGDAWDRDHEKDEKH

A0A6J1FRH8 protein indeterminate-domain 2-like8.2e-25990.98Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        MVDLENSSPV VS DAGLSSSGYIEPV VVAP KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         V NSEGKES TSKAA+ SPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVS  AATTCLTSA+VNSTTNASAN SSSN++FGN TVFAP ASIS TT
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
        QISPTLPPPSLP SIVPYDC STRP  SA+NPTSLSLSTSLYLS+KGSSLFAP DQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS

Query:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG
        S+GQD+S+T QWSRN+NR+HRNG+SL AG+GL+LPSGP GSG+MMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGY VA+S ACGG
Subjt:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG

Query:  SGDGSSSGDAWDRDHEKDEKH
        SG+GS SGDAWD    +DEKH
Subjt:  SGDGSSSGDAWDRDHEKDEKH

A0A6J1IM25 protein indeterminate-domain 1-like4.4e-20477.82Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        M D+EN SPVAVSGDA LSSS YIEPVT VA PKKKRNLPGMPDP AEVIALS ESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRT NEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEP+CVHHN ARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAV+SDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEES+RAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         VP S+ KES T      SPPPPPLTP+TTVVSPALSIHSSELAD  IR+ S+               ST +A+A +SSSNDLFGNSTVFAP AS     
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA-
          S T+   SL ++IV YDCPSTRPPVS +NPTSLSLST LYLS KG SSLF  PDQDRLQYT STQPAAMSATALLQKAAEMGATASNPSLFRG GMA 
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKG-SSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMA-

Query:  TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVAC
        T+SS  D S+TA    NVNR+HRNGASL AGL LELPSGPAGSG+MMAGGPFCSFGSQPMTRDLLGLG+ GGGAS SRFSALIASM GG  Y        
Subjt:  TSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVAC

Query:  GGSGDGSSSGDAWDRDHEKDEKH
            D SS+ DAWDRD E+D K+
Subjt:  GGSGDGSSSGDAWDRDHEKDEKH

A0A6J1IVB0 protein indeterminate-domain 2-like2.0e-25789.83Show/hide
Query:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
        MVDLE SSPV VS +AGLSSSGYIEPV VVAPPKKKRNLPGMPDPAAEVIALSP SLLATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR
Subjt:  MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVR

Query:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
Subjt:  KRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Query:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT
         VPNSEGKES TSKAA+ SPPPPPLTPSTTV+SPALSIHSSELADNPIRLPSVS  AATTCLTSA+VNSTTNASAN SSSN++FGN TVFAP ASIS TT
Subjt:  VVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTT

Query:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
        QISPTLPPPSLP SIVPYDC STRP  SA+NPTSLSLSTSLYLS+KGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS
Subjt:  QISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATS

Query:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG
        S+GQD+S+T QWSRN+NR+HRNG+SL AG+GL+LPSGP GSG+M++GGPFCSFGSQPMTRDLLGLGLGGGGAS+SRFSALIASMGGGSGY VA+S ACGG
Subjt:  SSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGG

Query:  SGDGSSSGDAWDRDHEKDEKH
        SG+GS SGDAWD    + EKH
Subjt:  SGDGSSSGDAWDRDHEKDEKH

A0A6P3ZWE0 protein indeterminate-domain 2-like9.9e-15661.1Show/hide
Query:  MVDLENSSPVAV---SGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSN
        M + ENSSP+ V   S +A +SSSGY      VAPPKKKRNLPGMPDP AEV+ALSP++LLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTS 
Subjt:  MVDLENSSPVAV---SGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSN

Query:  EVRKRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARA
        E+RKRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCD LAEESA+A
Subjt:  EVRKRVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARA

Query:  QTLVVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHS--------SELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTV
        QTL   NSE      +KA +ASPPPPP TP   +V   ++  S        +EL +NPI L  +++ A  TCLT+   + + ++S+N +++N+   +S+V
Subjt:  QTLVVPNSEGKESATSKAAMASPPPPPLTPSTTVVSPALSIHS--------SELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTV

Query:  FAPPASISVTTQISPTLPPPSLPTS--------IVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQD-RLQYTWSTQPAAMSATALLQKAA
        F   ASI   +  +P   PP   TS        +   DC ++ P +S I PTSLSLSTSLYLS  GSSLF  PDQD   QYT S QPAAMSATALLQKAA
Subjt:  FAPPASISVTTQISPTLPPPSLPTS--------IVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQD-RLQYTWSTQPAAMSATALLQKAA

Query:  EMGATASNPSLFRGFGMATSSS--GQDSSSTAQWSRNV-NREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRF
        ++GA ASN SL RGFG+ATSSS  GQD+S+  QW+ N  ++   N +S+AAGLGL LPS    +   +  GP  +FGSQPMTRDLLGL +G GGAS    
Subjt:  EMGATASNPSLFRGFGMATSSS--GQDSSSTAQWSRNV-NREHRNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRF

Query:  SALIASMGGGSGYE-----VASSVACGGSGDGSSSGDAWDRDHEK
        SAL+ S  GG+G++      A++ + GG G G S  + W+   E+
Subjt:  SALIASMGGGSGYE-----VASSVACGGSGDGSSSGDAWDRDHEK

SwissProt top hitse value%identityAlignment
Q700D2 Zinc finger protein JACKDAW1.2e-7848.91Show/hide
Query:  KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEV-RKRVYVCPEPSCVHHNPARALGDLTGIKKHFCR
        KKKRN PG PDP A+VIALSP +L+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL+QR+  EV +K+VY+CP  +CVHH+ +RALGDLTGIKKH+ R
Subjt:  KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEV-RKRVYVCPEPSCVHHNPARALGDLTGIKKHFCR

Query:  KHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL----VVPNSEGKESATSKAAMASPPPPPLTPS
        KHGEKKWKCE+CSKKYAVQSDWKAH KTCGTREYKCDCGTLFSR+DSFITHRAFCD L EE AR  +L     V ++           M +P  P     
Subjt:  KHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL----VVPNSEGKESATSKAAMASPPPPPLTPS

Query:  TTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVS
          V  P ++   S+         S   A   + +   A          S+ ++ LF +S+   P  S     QI  T   PSL  S              
Subjt:  TTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVS

Query:  AINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASN----PSLFRGFGM----ATSSSGQDSSSTA-------QWS
          + TS   S SL   T   S F+P      +   +   + MSATALLQKAA+MG+T SN    PS F G  M    AT+S    SSS          ++
Subjt:  AINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASN----PSLFRGFGM----ATSSSGQDSSSTA-------QWS

Query:  RNVNREHRNGA
         NV RE+ N A
Subjt:  RNVNREHRNGA

Q8RWX7 Protein indeterminate-domain 6, chloroplastic3.1e-8276.88Show/hide
Query:  TVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIK
        +V  PPKK+RN PG P+P AEVIALSP++++ATNRF+CE+CNKGFQR+QNLQLHRRGHNLPWKL+Q+++ EVR++VY+CPEPSCVHH+PARALGDLTGIK
Subjt:  TVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIK

Query:  KHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KH+ RKHGEKKWKC++CSK+YAVQSDWKAH KTCGT+EY+CDCGT+FSRRDS+ITHRAFCD L +ESAR  T+
Subjt:  KHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

Q9LVQ7 Zinc finger protein ENHYDROUS6.9e-10649.24Show/hide
Query:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK
        VDL+NSS   VSGDA +SS+G  + +T  +  KKKRNLPGMPDP AEVIALSP++L+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQR++ EVRK
Subjt:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK

Query:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT--
        +VYVCP   CVHH+P+RALGDLTGIKKHFCRKHGEKKWKCE+CSKKYAVQSDWKAH K CGT+EYKCDCGTLFSRRDSFITHRAFCD LAEESA+  T  
Subjt:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT--

Query:  ------LVVPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSE---LADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFG-NS
               V   +   E  +  A  +SP  PP +P +  +  +PA+S+ +     ++ + + + +  ++          +   +     + SS+DL   +S
Subjt:  ------LVVPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSE---LADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFG-NS

Query:  TVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQP-AAMSATALLQKAAEMGATA
              A + V++  SP+L   S  +       PS   P S++ P SL LST+        SLF P  +D   +     P  AMSATALLQKAA+MG+T 
Subjt:  TVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQP-AAMSATALLQKAAEMGATA

Query:  SNPSLFRGFGMATSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP--SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALI
        S  SL RG G+ +++S            ++   + +  SLA GLGL LP  SG +GSG+  +  G    FG +  T D LGLG  +G GG +    SAL+
Subjt:  SNPSLFRGFGMATSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP--SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALI

Query:  ASMGGGSGYEVASSVACGGSGDGSSS
         S+GGG G ++  S    G   G SS
Subjt:  ASMGGGSGYEVASSVACGGSGDGSSS

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 17.0e-9848.81Show/hide
Query:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK
        VDL+NSS V+      +SS+G   P+   +  KKKRNLPGMPDP +EVIALSP++LLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQ+++ EV+K
Subjt:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK

Query:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLV
        +VYVCPE SCVHH+P+RALGDLTGIKKHFCRKHGEKKWKC++CSKKYAVQSDWKAH K CGT+EYKCDCGTLFSRRDSFITHRAFCD LAEE+AR+    
Subjt:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLV

Query:  VPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVT
            +  E  T K  + +P P P+   +  +  S  L+I  SE    P  +  V +A   T L     N        SSS++             SI  T
Subjt:  VPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVT

Query:  TQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMAT
        +  S +L   S                 S+I P SL LSTS   S  GS+ F              QP AMSATALLQKAA+MGA +S  SL  G G+ +
Subjt:  TQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMAT

Query:  SSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP-SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALIASMGGGSGYEVASS
        S+S                     A +  GLGL LP  G + SG+  +  G    FG +  T D LGLG  +G G   ++  S L+   GGG+G ++A++
Subjt:  SSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP-SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALIASMGGGSGYEVASS

Query:  VACG
           G
Subjt:  VACG

Q9ZWA6 Zinc finger protein MAGPIE1.6e-8145.15Show/hide
Query:  PP--KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIKKH
        PP  KKKRNLPG PDP AEVIALSP++L+ATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRTS EVRKRVYVCPE SCVHH+P RALGDLTGIKKH
Subjt:  PP--KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIKKH

Query:  FCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLVVPNSEGKESATS-------KAAMASP--
        FCRKHGEKKWKCE+C+K+YAVQSDWKAH KTCGTREY+CDCGT+FSRRDSFITHRAFCD LAEE+AR        S    + ++          + SP  
Subjt:  FCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLVVPNSEGKESATS-------KAAMASP--

Query:  PPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAA-VNSTTNASANSSSS----------NDLFGNSTVFAPPASISVTTQISPTLPPP
        P PP  P           H   +  N      V K A+T  L S   +N     +     +          N +FGN+          +TT         
Subjt:  PPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAA-VNSTTNASANSSSS----------NDLFGNSTVFAPPASISVTTQISPTLPPP

Query:  SLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQ-DRLQYTWSTQPAAMSATALLQKAAEMGATAS----------NPSLFRGFGMA
            S++ +D        + IN      + +   S    SLF+  DQ  +     S   A MSATALLQKAA+MGAT+S            +  + F   
Subjt:  SLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQ-DRLQYTWSTQPAAMSATALLQKAAEMGATAS----------NPSLFRGFGMA

Query:  TSSSGQD-----------SSSTAQWSRNVNREH-----RNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGL
        ++   +D           S+S    S N N  H     RNG ++ +G+G EL + P     +  G    + G    TRD LG+G+
Subjt:  TSSSGQD-----------SSSTAQWSRNVNREH-----RNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGL

Arabidopsis top hitse value%identityAlignment
AT1G03840.1 C2H2 and C2HC zinc fingers superfamily protein1.1e-8245.15Show/hide
Query:  PP--KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIKKH
        PP  KKKRNLPG PDP AEVIALSP++L+ATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRTS EVRKRVYVCPE SCVHH+P RALGDLTGIKKH
Subjt:  PP--KKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIKKH

Query:  FCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLVVPNSEGKESATS-------KAAMASP--
        FCRKHGEKKWKCE+C+K+YAVQSDWKAH KTCGTREY+CDCGT+FSRRDSFITHRAFCD LAEE+AR        S    + ++          + SP  
Subjt:  FCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLVVPNSEGKESATS-------KAAMASP--

Query:  PPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAA-VNSTTNASANSSSS----------NDLFGNSTVFAPPASISVTTQISPTLPPP
        P PP  P           H   +  N      V K A+T  L S   +N     +     +          N +FGN+          +TT         
Subjt:  PPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAA-VNSTTNASANSSSS----------NDLFGNSTVFAPPASISVTTQISPTLPPP

Query:  SLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQ-DRLQYTWSTQPAAMSATALLQKAAEMGATAS----------NPSLFRGFGMA
            S++ +D        + IN      + +   S    SLF+  DQ  +     S   A MSATALLQKAA+MGAT+S            +  + F   
Subjt:  SLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQ-DRLQYTWSTQPAAMSATALLQKAAEMGATAS----------NPSLFRGFGMA

Query:  TSSSGQD-----------SSSTAQWSRNVNREH-----RNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGL
        ++   +D           S+S    S N N  H     RNG ++ +G+G EL + P     +  G    + G    TRD LG+G+
Subjt:  TSSSGQD-----------SSSTAQWSRNVNREH-----RNGASLAAGLGLELPSGPAGSGMMMAGGPFCSFGSQPMTRDLLGLGL

AT1G14580.1 C2H2-like zinc finger protein2.2e-8376.88Show/hide
Query:  TVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIK
        +V  PPKK+RN PG P+P AEVIALSP++++ATNRF+CE+CNKGFQR+QNLQLHRRGHNLPWKL+Q+++ EVR++VY+CPEPSCVHH+PARALGDLTGIK
Subjt:  TVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIK

Query:  KHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KH+ RKHGEKKWKC++CSK+YAVQSDWKAH KTCGT+EY+CDCGT+FSRRDS+ITHRAFCD L +ESAR  T+
Subjt:  KHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

AT1G14580.2 C2H2-like zinc finger protein2.2e-8376.88Show/hide
Query:  TVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIK
        +V  PPKK+RN PG P+P AEVIALSP++++ATNRF+CE+CNKGFQR+QNLQLHRRGHNLPWKL+Q+++ EVR++VY+CPEPSCVHH+PARALGDLTGIK
Subjt:  TVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPSCVHHNPARALGDLTGIK

Query:  KHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL
        KH+ RKHGEKKWKC++CSK+YAVQSDWKAH KTCGT+EY+CDCGT+FSRRDS+ITHRAFCD L +ESAR  T+
Subjt:  KHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTL

AT3G50700.1 indeterminate(ID)-domain 24.9e-9948.81Show/hide
Query:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK
        VDL+NSS V+      +SS+G   P+   +  KKKRNLPGMPDP +EVIALSP++LLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQ+++ EV+K
Subjt:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK

Query:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLV
        +VYVCPE SCVHH+P+RALGDLTGIKKHFCRKHGEKKWKC++CSKKYAVQSDWKAH K CGT+EYKCDCGTLFSRRDSFITHRAFCD LAEE+AR+    
Subjt:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLV

Query:  VPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVT
            +  E  T K  + +P P P+   +  +  S  L+I  SE    P  +  V +A   T L     N        SSS++             SI  T
Subjt:  VPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVT

Query:  TQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMAT
        +  S +L   S                 S+I P SL LSTS   S  GS+ F              QP AMSATALLQKAA+MGA +S  SL  G G+ +
Subjt:  TQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMAT

Query:  SSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP-SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALIASMGGGSGYEVASS
        S+S                     A +  GLGL LP  G + SG+  +  G    FG +  T D LGLG  +G G   ++  S L+   GGG+G ++A++
Subjt:  SSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP-SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALIASMGGGSGYEVASS

Query:  VACG
           G
Subjt:  VACG

AT5G66730.1 C2H2-like zinc finger protein4.9e-10749.24Show/hide
Query:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK
        VDL+NSS   VSGDA +SS+G  + +T  +  KKKRNLPGMPDP AEVIALSP++L+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQR++ EVRK
Subjt:  VDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRK

Query:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT--
        +VYVCP   CVHH+P+RALGDLTGIKKHFCRKHGEKKWKCE+CSKKYAVQSDWKAH K CGT+EYKCDCGTLFSRRDSFITHRAFCD LAEESA+  T  
Subjt:  RVYVCPEPSCVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQT--

Query:  ------LVVPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSE---LADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFG-NS
               V   +   E  +  A  +SP  PP +P +  +  +PA+S+ +     ++ + + + +  ++          +   +     + SS+DL   +S
Subjt:  ------LVVPNSEGKESATSKAAMASPPPPPLTPSTTVV--SPALSIHSSE---LADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFG-NS

Query:  TVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQP-AAMSATALLQKAAEMGATA
              A + V++  SP+L   S  +       PS   P S++ P SL LST+        SLF P  +D   +     P  AMSATALLQKAA+MG+T 
Subjt:  TVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVSAINPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQP-AAMSATALLQKAAEMGATA

Query:  SNPSLFRGFGMATSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP--SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALI
        S  SL RG G+ +++S            ++   + +  SLA GLGL LP  SG +GSG+  +  G    FG +  T D LGLG  +G GG +    SAL+
Subjt:  SNPSLFRGFGMATSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELP--SGPAGSGMM-MAGGPFCSFGSQPMTRDLLGLG--LGGGGASASRFSALI

Query:  ASMGGGSGYEVASSVACGGSGDGSSS
         S+GGG G ++  S    G   G SS
Subjt:  ASMGGGSGYEVASSVACGGSGDGSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGATTTGGAAAATTCTTCACCGGTTGCGGTTTCTGGAGATGCTGGTTTATCTTCGTCAGGTTATATTGAACCGGTGACGGTGGTGGCTCCGCCGAAGAAGAAACG
GAACCTGCCTGGAATGCCAGATCCGGCGGCGGAGGTGATTGCTTTATCGCCAGAGAGTTTACTGGCCACCAACCGGTTCGTGTGCGAGATCTGCAACAAAGGTTTCCAGC
GGGACCAGAACCTGCAACTTCACCGGCGAGGCCACAACCTTCCGTGGAAGCTCCGGCAGCGGACCAGCAATGAGGTGCGTAAGCGCGTCTATGTCTGCCCGGAGCCTTCC
TGCGTGCACCATAATCCGGCAAGAGCGCTCGGCGATCTCACTGGAATAAAGAAGCACTTCTGTAGAAAGCACGGGGAGAAGAAGTGGAAGTGCGAGAGATGCTCGAAGAA
ATACGCCGTACAATCCGATTGGAAAGCGCATATGAAAACTTGCGGAACTCGAGAGTACAAATGCGATTGTGGCACTTTATTTTCAAGGAGGGATAGCTTTATTACACATC
GAGCCTTTTGCGATGTATTGGCGGAGGAAAGTGCTCGTGCTCAAACCCTAGTAGTTCCAAACTCTGAAGGAAAGGAATCGGCCACTTCGAAAGCCGCCATGGCTTCGCCT
CCTCCGCCGCCGCTAACTCCGTCGACCACGGTGGTTTCTCCAGCGTTGTCAATTCACAGCTCAGAGCTAGCTGATAATCCGATAAGACTTCCATCAGTATCAAAGGCCGC
AGCCACGACATGCTTGACCTCCGCGGCTGTCAATAGTACCACAAATGCAAGCGCCAACAGCAGTAGCTCAAACGATTTGTTTGGAAATAGTACTGTTTTTGCTCCTCCTG
CGTCAATTTCAGTGACGACCCAAATATCCCCAACATTACCACCACCCAGTTTACCGACCTCAATAGTACCCTACGATTGCCCTTCTACTCGCCCTCCCGTTTCCGCCATT
AATCCCACATCGCTTTCTCTCTCCACATCTCTCTATCTTTCCACTAAAGGATCCTCGCTCTTTGCCCCACCCGACCAAGACCGTCTGCAGTACACGTGGTCCACTCAACC
GGCGGCCATGTCCGCCACCGCTTTGCTGCAAAAGGCAGCGGAAATGGGGGCAACGGCGTCTAATCCGTCGTTGTTCCGTGGCTTTGGGATGGCTACTTCTTCTTCTGGTC
AAGATAGCTCTTCCACTGCTCAATGGAGTAGGAATGTGAACCGAGAACACCGGAATGGCGCTTCGTTAGCGGCTGGTTTAGGGCTTGAACTTCCCTCCGGACCCGCCGGT
TCTGGTATGATGATGGCGGGTGGTCCATTTTGCTCGTTTGGAAGTCAGCCCATGACGAGAGATCTTCTTGGCCTTGGCTTAGGCGGCGGTGGGGCCTCTGCAAGTAGATT
TTCAGCCTTGATTGCTTCCATGGGTGGTGGGTCTGGTTACGAAGTTGCATCTTCAGTCGCCTGCGGTGGAAGTGGCGACGGAAGCTCATCCGGCGACGCTTGGGATCGAG
ATCACGAAAAAGATGAAAAACATTGA
mRNA sequenceShow/hide mRNA sequence
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCATTTTGGCTCTTTTCTATTCCGGTAAGGGCAAACTGAGAAACCGCCGCCTTAGGCGTACTTGTTTCCCGGCCAACTTC
CGACGAAAACCGCCATGTATATTTTATTTTTCACTTGAACATTCACTGAGCAACCCGTTCGATTACACCTAATGGACTGTAAATTCAAACCGTCTCAAGGTTTTTAGGTT
CACAGTTCATCATTTCAATTTTCCTAACGATCACAATTTTCACAACGCTTAGTTGTGGAGAAGTATTAAAGTTGCTGTATTGATTATTCGATAGAGCGTGGTGTTTTTGT
TTGGATTTATTCAGTATTGATAGTTTCGTCAATCGACTGTGTGGCTGAGCTTTCGGTGAAGCTAAAAGCCTAAAACTATCTTTTCGTTAAGGTTATATGCGGAATGGTTG
ATTTGGAAAATTCTTCACCGGTTGCGGTTTCTGGAGATGCTGGTTTATCTTCGTCAGGTTATATTGAACCGGTGACGGTGGTGGCTCCGCCGAAGAAGAAACGGAACCTG
CCTGGAATGCCAGATCCGGCGGCGGAGGTGATTGCTTTATCGCCAGAGAGTTTACTGGCCACCAACCGGTTCGTGTGCGAGATCTGCAACAAAGGTTTCCAGCGGGACCA
GAACCTGCAACTTCACCGGCGAGGCCACAACCTTCCGTGGAAGCTCCGGCAGCGGACCAGCAATGAGGTGCGTAAGCGCGTCTATGTCTGCCCGGAGCCTTCCTGCGTGC
ACCATAATCCGGCAAGAGCGCTCGGCGATCTCACTGGAATAAAGAAGCACTTCTGTAGAAAGCACGGGGAGAAGAAGTGGAAGTGCGAGAGATGCTCGAAGAAATACGCC
GTACAATCCGATTGGAAAGCGCATATGAAAACTTGCGGAACTCGAGAGTACAAATGCGATTGTGGCACTTTATTTTCAAGGAGGGATAGCTTTATTACACATCGAGCCTT
TTGCGATGTATTGGCGGAGGAAAGTGCTCGTGCTCAAACCCTAGTAGTTCCAAACTCTGAAGGAAAGGAATCGGCCACTTCGAAAGCCGCCATGGCTTCGCCTCCTCCGC
CGCCGCTAACTCCGTCGACCACGGTGGTTTCTCCAGCGTTGTCAATTCACAGCTCAGAGCTAGCTGATAATCCGATAAGACTTCCATCAGTATCAAAGGCCGCAGCCACG
ACATGCTTGACCTCCGCGGCTGTCAATAGTACCACAAATGCAAGCGCCAACAGCAGTAGCTCAAACGATTTGTTTGGAAATAGTACTGTTTTTGCTCCTCCTGCGTCAAT
TTCAGTGACGACCCAAATATCCCCAACATTACCACCACCCAGTTTACCGACCTCAATAGTACCCTACGATTGCCCTTCTACTCGCCCTCCCGTTTCCGCCATTAATCCCA
CATCGCTTTCTCTCTCCACATCTCTCTATCTTTCCACTAAAGGATCCTCGCTCTTTGCCCCACCCGACCAAGACCGTCTGCAGTACACGTGGTCCACTCAACCGGCGGCC
ATGTCCGCCACCGCTTTGCTGCAAAAGGCAGCGGAAATGGGGGCAACGGCGTCTAATCCGTCGTTGTTCCGTGGCTTTGGGATGGCTACTTCTTCTTCTGGTCAAGATAG
CTCTTCCACTGCTCAATGGAGTAGGAATGTGAACCGAGAACACCGGAATGGCGCTTCGTTAGCGGCTGGTTTAGGGCTTGAACTTCCCTCCGGACCCGCCGGTTCTGGTA
TGATGATGGCGGGTGGTCCATTTTGCTCGTTTGGAAGTCAGCCCATGACGAGAGATCTTCTTGGCCTTGGCTTAGGCGGCGGTGGGGCCTCTGCAAGTAGATTTTCAGCC
TTGATTGCTTCCATGGGTGGTGGGTCTGGTTACGAAGTTGCATCTTCAGTCGCCTGCGGTGGAAGTGGCGACGGAAGCTCATCCGGCGACGCTTGGGATCGAGATCACGA
AAAAGATGAAAAACATTGAAAGTTGAAACCCAACCGTTGATTGCTGGCTGGAGCTGCTCGTATCGGCTGCTTATATGGAGAGATTGTCTCCGTGGAAGGTGACATTGGGG
CTTCAATGGCGGTGAGGGGTTTGAACAAATTGGTACTTGGAAAATTAATGTTCATTATGGTTCTGTTCATTTGACGTGTCGATTCCCGGTGGTCCAAGTTCCGATCTGGT
CCATTAGTCAGACCGTCAAACTAAATTTAATGAGATGAGTTGGCTGAGGTGGTCCTCCACCGACTTCTCGCCTCGTAGGTCCGTTCATCTCGACGGTCGCCGCCCGCCGA
ACTGTCACTGTTTGTGGATATTTCAACAGTGACACAACTGGATAATTACAAAATCAATTATTTCTTATGTGTTGAATTTGAATATATGCTTCTAACTTTGTCCTTATCAT
TTTTAATTTACTCCACATGATTTAGGCTAGGCTTGATTGCTTTTATTGTTATTATCTACAAAATGTTTATACATAAGTTGGAACATTTTCTATTTATCGAATGAAATGGA
AGATTCTTTTTGGTGAGG
Protein sequenceShow/hide protein sequence
MVDLENSSPVAVSGDAGLSSSGYIEPVTVVAPPKKKRNLPGMPDPAAEVIALSPESLLATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLRQRTSNEVRKRVYVCPEPS
CVHHNPARALGDLTGIKKHFCRKHGEKKWKCERCSKKYAVQSDWKAHMKTCGTREYKCDCGTLFSRRDSFITHRAFCDVLAEESARAQTLVVPNSEGKESATSKAAMASP
PPPPLTPSTTVVSPALSIHSSELADNPIRLPSVSKAAATTCLTSAAVNSTTNASANSSSSNDLFGNSTVFAPPASISVTTQISPTLPPPSLPTSIVPYDCPSTRPPVSAI
NPTSLSLSTSLYLSTKGSSLFAPPDQDRLQYTWSTQPAAMSATALLQKAAEMGATASNPSLFRGFGMATSSSGQDSSSTAQWSRNVNREHRNGASLAAGLGLELPSGPAG
SGMMMAGGPFCSFGSQPMTRDLLGLGLGGGGASASRFSALIASMGGGSGYEVASSVACGGSGDGSSSGDAWDRDHEKDEKH