; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019660 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019660
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein indeterminate-domain 9-like
Genome locationscaffold729:1346402..1348657
RNA-Seq ExpressionMS019660
SyntenyMS019660
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR022755 - Zinc finger, double-stranded RNA binding
IPR033243 - Zinc finger protein JACKDAW-like
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137968.1 zinc finger protein BALDIBIS [Cucumis sativus]4.3e-21183.46Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F   QDANP+PN KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQSL+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
        +NN+IS+FFG+SSSSSNLFGSI E   +GLSMLPV++KE+VENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTRS NNNN+
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE
         LFG+G FGVM  SSSSS SSSSSSNAV  LNSLNK+RSLTM DS+QM+GS SDLSSNCLSQ L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKF  INSTMGLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

XP_008442674.1 PREDICTED: protein indeterminate-domain 9 [Cucumis melo]1.3e-21083.85Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS
        NNNNIS+FFG+SSSSSNLFGSI E   +GLSMLPV++KE+VENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS

Query:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE
         LFG+G FGVM  SSSSS SSSSSSNAV  LNS NK+RSLTMADS+QM+GS SDLSSNCLSQ L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKF  INSTMGLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

XP_022145765.1 protein indeterminate-domain 9-like [Momordica charantia]2.8e-26698.81Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISS
        ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAI NNNNISS
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISS

Query:  FFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSS
        FFGSSSSSSNLFGSINESGISGLS+LPVIDKE+VENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSS
Subjt:  FFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSS

Query:  SS-SSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQF
        SS SSSSSSSSSNAV LNSLNKTRSLTMADSMQMVGSSDLSSNCLSQ LLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQF
Subjt:  SS-SSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQF

Query:  AANH
        AANH
Subjt:  AANH

XP_022934107.1 protein indeterminate-domain 9-like [Cucurbita moschata]6.9e-20981.96Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSS PT FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNIS
        ITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWL         NN+NNIS
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNIS

Query:  SFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGV
        +F+G+SSSSSNLFGSI E   +GLSMLPV +KE+VE K SL KAT  AAALLSGQSS   V SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FGV
Subjt:  SFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGV

Query:  MSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQFL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST
        M   +SSS SSSSSSNAV LNSLNK+RS++MADS+QMVG +SDLSSN LSQ L  L+NGNN M+SN QTRDFLGVGG+GEAPRPPFLPPELAKF AINST
Subjt:  MSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQFL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST

Query:  MGLSQFAANH
        MGLSQFAANH
Subjt:  MGLSQFAANH

XP_038903191.1 zinc finger protein BALDIBIS-like [Benincasa hispida]1.2e-21685.16Show/hide
Query:  MSSNPFSLLSSAPTAFAPQ-QDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  TAFA Q  DANP+PN KPKPSAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQ-QDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI----N
        RITTVSATNILNNLRNDS ILLHQQD  QSL+D H+N         N LQSLGD+SGLSQF+HSD FLRD ED QHKNRSPLSLWLNQASAETA+    N
Subjt:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI----N

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKA--SLSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSS-
        NNNNIS+ FG+SSSSSNLFGSI E   +GLSMLPVI+KE+VENK   +LSKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTRSNNNN+S 
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKA--SLSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSS-

Query:  LFGAGGFGVMSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQFLL-SNGNNGM-RSNGQTRDFLGVGGSGEAPRPPFLPPEL
        LFGAG FGVM    SSSSSSSSSSNAV LNSLNK+RSLTMADS+QM+G +SDLSSNCLSQ L+  NGNN M RS+GQTRDFLGVGG GEAPRPPFLPPEL
Subjt:  LFGAGGFGVMSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQFLL-SNGNNGM-RSNGQTRDFLGVGGSGEAPRPPFLPPEL

Query:  AKFAAINSTMGLSQFAANH
        AKFA INST+GLSQFAANH
Subjt:  AKFAAINSTMGLSQFAANH

TrEMBL top hitse value%identityAlignment
A0A0A0LDZ1 C2H2-type domain-containing protein2.1e-21183.46Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F   QDANP+PN KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQSL+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
        +NN+IS+FFG+SSSSSNLFGSI E   +GLSMLPV++KE+VENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTRS NNNN+
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE
         LFG+G FGVM  SSSSS SSSSSSNAV  LNSLNK+RSLTM DS+QM+GS SDLSSNCLSQ L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKF  INSTMGLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

A0A1S3B706 protein indeterminate-domain 96.1e-21183.85Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS
        NNNNIS+FFG+SSSSSNLFGSI E   +GLSMLPV++KE+VENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS

Query:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE
         LFG+G FGVM  SSSSS SSSSSSNAV  LNS NK+RSLTMADS+QM+GS SDLSSNCLSQ L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKF  INSTMGLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

A0A5D3DNM0 Protein indeterminate-domain 96.1e-21183.85Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS
        NNNNIS+FFG+SSSSSNLFGSI E   +GLSMLPV++KE+VENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS

Query:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE
         LFG+G FGVM  SSSSS SSSSSSNAV  LNS NK+RSLTMADS+QM+GS SDLSSNCLSQ L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  SLFGAGGFGVMSSSSSSSSSSSSSSNAV-GLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQFLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKF  INSTMGLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

A0A6J1CVG4 protein indeterminate-domain 9-like1.3e-26698.81Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISS
        ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAI NNNNISS
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISS

Query:  FFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSS
        FFGSSSSSSNLFGSINESGISGLS+LPVIDKE+VENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSS
Subjt:  FFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSS

Query:  SS-SSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQF
        SS SSSSSSSSSNAV LNSLNKTRSLTMADSMQMVGSSDLSSNCLSQ LLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQF
Subjt:  SS-SSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQF

Query:  AANH
        AANH
Subjt:  AANH

A0A6J1F1R6 protein indeterminate-domain 9-like3.4e-20981.96Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSS PT FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNIS
        ITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWL         NN+NNIS
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNIS

Query:  SFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGV
        +F+G+SSSSSNLFGSI E   +GLSMLPV +KE+VE K SL KAT  AAALLSGQSS   V SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FGV
Subjt:  SFFGSSSSSSNLFGSINESGISGLSMLPVIDKEEVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGV

Query:  MSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQFL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST
        M   +SSS SSSSSSNAV LNSLNK+RS++MADS+QMVG +SDLSSN LSQ L  L+NGNN M+SN QTRDFLGVGG+GEAPRPPFLPPELAKF AINST
Subjt:  MSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQFL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST

Query:  MGLSQFAANH
        MGLSQFAANH
Subjt:  MGLSQFAANH

SwissProt top hitse value%identityAlignment
Q700D2 Zinc finger protein JACKDAW3.5e-9449.19Show/hide
Query:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP
        ++NP+PN+KP  S++AKKKRN PG PDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+ +E IKKKVYICP KTCVHHD 
Subjt:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP

Query:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL
        SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAFCDAL EE AR+++       +S TN+  N 
Subjt:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL

Query:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFG
         N+S +++  +       PH   +   +H   N       + +SQF    F  DL     +  S +         + A   N+++   F SSSSS   F 
Subjt:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFG

Query:  SINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSS
          ++  I   S  P +           S +     L   S   + SSS         SPMSATALLQKAA MGSTRSN++ +  F AG   + SSS+++S
Subjt:  SINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSS

Query:  SSSSSSSNAVGLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI
            SSS  +    LN   +  + ++       +  V +S + +N        +G N  +  G TRDFLGV       +  R PFLP ELA+FA +
Subjt:  SSSSSSSNAVGLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI

Q8H1F5 Protein indeterminate-domain 71.7e-8552.54Show/hide
Query:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PGNPDP+AEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD
        KHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR            + N  ++     PH     
Subjt:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLP
                 +H+H   Q++G   S  +  S+S+    ++  E Q H    P   WL              ISS    + ++ NLF  +  S  +G S  P
Subjt:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLP

Query:  VIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS
                                         S  MSATALLQKAA MGST+S
Subjt:  VIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS

Q944L3 Zinc finger protein BALDIBIS6.1e-9153.23Show/hide
Query:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR
        NP+PN  P  S +AK+KRNLPGNPDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCVHHDP+R
Subjt:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR

Query:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---
        ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN  +  +   
Subjt:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---

Query:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNNISSFFGSSSSSSNLFGSI
         ++Q    + L    S  +    N N NN+  LG     + F+ S         +  + S  +LW    Q+S +  +N NNN ++             +I
Subjt:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNNISSFFGSSSSSSNLFGSI

Query:  NESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSSSSSSSSSN
         + GIS        ++EE E K  +S  +  +  +  ++ +   +    + MSATALLQKAA MGS RS++++S+   +  FG+M+S  ++  + +  + 
Subjt:  NESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSSSSSSSSSN

Query:  AV
         V
Subjt:  AV

Q9LRW7 Protein indeterminate-domain 111.0e-8545.59Show/hide
Query:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PGNPDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H      
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP

Query:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKE
                 H+H+  Q   +VS  S  SH+          H   + L    N  +   + N+NN++ +F       SN         I    + P     
Subjt:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKE

Query:  EVENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSSSSSSSSSS------
              +L+ +       G    S+ S +SP MSATALLQKAA MGST++              NNN ++   A    +M+S S   SS++++       
Subjt:  EVENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSSSSSSSSSS------

Query:  NAVGLNSLNKTRSLTMADSM-QMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS
        NA G +  N  R     D+    + ++++++   S+    +G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  NAVGLNSLNKTRSLTMADSM-QMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS

Q9ZWA6 Zinc finger protein MAGPIE4.0e-8245.96Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR
        KKKRNLPGNPDP+AEVIALSPK+LMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRT+KE ++K+VY+CPEK+CVHH P+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD
        KHGEKKWKC+KC+K+YAVQSDWKAHSKTCGTREY+CDCGT+FSR+DSFITHRAFCDALAEE+AR+   S     A    +NL N   L+    P  SL  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW------------LNQASAETAINNNNNISSFFGSSSSSSNLFGSIN
        P S         +H+H+           + F H D +        K  S LSLW            +    A    +   + +  FG++++   L  + +
Subjt:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW------------LNQASAETAINNNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSL----------FGAGGFGVMSSSSSS
        +S I+  + + ++  +E  N A+     +      Q +Q   ++S   + MSATALLQKAA MG+T S +  +++          F +    ++    S 
Subjt:  ESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSL----------FGAGGFGVMSSSSSS

Query:  SSSSSSSSNAVGLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVG
           +S  SN+V L S N      + +    V    G  +L +    +  +  GN G    GQTRDFLGVG
Subjt:  SSSSSSSSNAVGLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVG

Arabidopsis top hitse value%identityAlignment
AT1G03840.1 C2H2 and C2HC zinc fingers superfamily protein2.8e-8345.96Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR
        KKKRNLPGNPDP+AEVIALSPK+LMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRT+KE ++K+VY+CPEK+CVHH P+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD
        KHGEKKWKC+KC+K+YAVQSDWKAHSKTCGTREY+CDCGT+FSR+DSFITHRAFCDALAEE+AR+   S     A    +NL N   L+    P  SL  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW------------LNQASAETAINNNNNISSFFGSSSSSSNLFGSIN
        P S         +H+H+           + F H D +        K  S LSLW            +    A    +   + +  FG++++   L  + +
Subjt:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW------------LNQASAETAINNNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSL----------FGAGGFGVMSSSSSS
        +S I+  + + ++  +E  N A+     +      Q +Q   ++S   + MSATALLQKAA MG+T S +  +++          F +    ++    S 
Subjt:  ESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSL----------FGAGGFGVMSSSSSS

Query:  SSSSSSSSNAVGLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVG
           +S  SN+V L S N      + +    V    G  +L +    +  +  GN G    GQTRDFLGVG
Subjt:  SSSSSSSSNAVGLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVG

AT1G55110.1 indeterminate(ID)-domain 71.2e-8652.54Show/hide
Query:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PGNPDP+AEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD
        KHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR            + N  ++     PH     
Subjt:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLP
                 +H+H   Q++G   S  +  S+S+    ++  E Q H    P   WL              ISS    + ++ NLF  +  S  +G S  P
Subjt:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLP

Query:  VIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS
                                         S  MSATALLQKAA MGST+S
Subjt:  VIDKEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS

AT3G13810.1 indeterminate(ID)-domain 117.2e-8745.59Show/hide
Query:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PGNPDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H      
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP

Query:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKE
                 H+H+  Q   +VS  S  SH+          H   + L    N  +   + N+NN++ +F       SN         I    + P     
Subjt:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVIDKE

Query:  EVENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSSSSSSSSSS------
              +L+ +       G    S+ S +SP MSATALLQKAA MGST++              NNN ++   A    +M+S S   SS++++       
Subjt:  EVENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSSSSSSSSSS------

Query:  NAVGLNSLNKTRSLTMADSM-QMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS
        NA G +  N  R     D+    + ++++++   S+    +G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  NAVGLNSLNKTRSLTMADSM-QMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS

AT3G45260.1 C2H2-like zinc finger protein4.3e-9253.23Show/hide
Query:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR
        NP+PN  P  S +AK+KRNLPGNPDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCVHHDP+R
Subjt:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR

Query:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---
        ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN  +  +   
Subjt:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---

Query:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNNISSFFGSSSSSSNLFGSI
         ++Q    + L    S  +    N N NN+  LG     + F+ S         +  + S  +LW    Q+S +  +N NNN ++             +I
Subjt:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNNISSFFGSSSSSSNLFGSI

Query:  NESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSSSSSSSSSN
         + GIS        ++EE E K  +S  +  +  +  ++ +   +    + MSATALLQKAA MGS RS++++S+   +  FG+M+S  ++  + +  + 
Subjt:  NESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSSSSSSSSSN

Query:  AV
         V
Subjt:  AV

AT5G03150.1 C2H2-like zinc finger protein2.5e-9549.19Show/hide
Query:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP
        ++NP+PN+KP  S++AKKKRN PG PDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+ +E IKKKVYICP KTCVHHD 
Subjt:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP

Query:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL
        SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAFCDAL EE AR+++       +S TN+  N 
Subjt:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL

Query:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFG
         N+S +++  +       PH   +   +H   N       + +SQF    F  DL     +  S +         + A   N+++   F SSSSS   F 
Subjt:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFG

Query:  SINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSS
          ++  I   S  P +           S +     L   S   + SSS         SPMSATALLQKAA MGSTRSN++ +  F AG   + SSS+++S
Subjt:  SINESGISGLSMLPVIDKEEVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSS

Query:  SSSSSSSNAVGLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI
            SSS  +    LN   +  + ++       +  V +S + +N        +G N  +  G TRDFLGV       +  R PFLP ELA+FA +
Subjt:  SSSSSSSNAVGLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQFLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCGGCTCCGACCGCGTTCGCTCCCCAACAAGATGCAAACCCTGACCCTAACTCTAAACCGAAACCCTCCGCTGCGGCGAA
GAAGAAGAGAAACCTCCCCGGAAACCCAGATCCGGATGCGGAGGTCATCGCTCTATCGCCGAAGTCTCTCATGGCGACGAACAGATTCATCTGCGAAATATGCAACAAGG
GGTTTCAGAGAGATCAGAACCTGCAGCTTCACCGACGAGGGCACAACCTGCCGTGGAAGCTGCGGCAGCGGACAAACAAGGAGCCGATCAAGAAGAAGGTGTATATCTGC
CCGGAGAAGACGTGCGTCCACCACGACCCGTCGCGGGCCCTCGGCGACCTCACTGGAATAAAGAAACACTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATGCGATAA
GTGTTCCAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGCGATTGTGGAACACTTTTTTCCAGGAAAGATAGCT
TCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCTGCAACAAATATTCTCAATAATCTCAGAAACGATTCAATTCTTCTT
CATCAACAAGATCCTCACCAATCTCTCGTTGATCCTCACTCCAATAATAATAACAATAATAATCATAATCATAATAATCTTCAATCTCTTGGAGACGTTTCAGGGCTTTC
CCAATTCAGCCATTCAGATTTTCTGAGAGATTTAGAAGATCAGCAACACAAGAACAGATCCCCTTTGTCCCTCTGGTTGAACCAAGCTTCTGCTGAAACTGCCATCAACA
ACAACAACAACATTTCTAGCTTTTTCGGGTCGTCGTCTTCTTCTTCCAATCTTTTTGGATCGATCAACGAGAGTGGGATATCGGGGCTCTCAATGCTCCCAGTGATTGAC
AAGGAAGAGGTTGAGAACAAGGCGAGTTTGTCGAAAGCTACCGCGGCAGCGTTGTTATCGGGTCAATCTTCTCAGTCTGTCGTTTCTTCTTCTTCTCCGATGTCGGCCAC
TGCCCTTCTGCAGAAAGCTGCCCTTATGGGCTCAACGAGGAGCAACAACAATAACTCATCGCTCTTCGGAGCAGGCGGTTTTGGAGTAATGAGCTCGTCGTCGTCATCTT
CTTCGTCGTCATCGTCATCTTCTAATGCGGTGGGCTTGAACTCTCTTAATAAAACTAGGAGCCTGACAATGGCCGACTCAATGCAGATGGTCGGGAGCTCCGACTTGAGC
TCAAATTGCCTCAGCCAATTTTTACTGTCCAACGGTAATAACGGAATGAGAAGTAACGGTCAAACGCGAGACTTCCTCGGCGTGGGAGGATCAGGAGAAGCTCCCCGGCC
ACCATTCCTCCCACCGGAGCTGGCAAAGTTCGCCGCCATAAACTCAACCATGGGGCTAAGCCAATTCGCCGCCAACCAC
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCGGCTCCGACCGCGTTCGCTCCCCAACAAGATGCAAACCCTGACCCTAACTCTAAACCGAAACCCTCCGCTGCGGCGAA
GAAGAAGAGAAACCTCCCCGGAAACCCAGATCCGGATGCGGAGGTCATCGCTCTATCGCCGAAGTCTCTCATGGCGACGAACAGATTCATCTGCGAAATATGCAACAAGG
GGTTTCAGAGAGATCAGAACCTGCAGCTTCACCGACGAGGGCACAACCTGCCGTGGAAGCTGCGGCAGCGGACAAACAAGGAGCCGATCAAGAAGAAGGTGTATATCTGC
CCGGAGAAGACGTGCGTCCACCACGACCCGTCGCGGGCCCTCGGCGACCTCACTGGAATAAAGAAACACTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATGCGATAA
GTGTTCCAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGCGATTGTGGAACACTTTTTTCCAGGAAAGATAGCT
TCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCTGCAACAAATATTCTCAATAATCTCAGAAACGATTCAATTCTTCTT
CATCAACAAGATCCTCACCAATCTCTCGTTGATCCTCACTCCAATAATAATAACAATAATAATCATAATCATAATAATCTTCAATCTCTTGGAGACGTTTCAGGGCTTTC
CCAATTCAGCCATTCAGATTTTCTGAGAGATTTAGAAGATCAGCAACACAAGAACAGATCCCCTTTGTCCCTCTGGTTGAACCAAGCTTCTGCTGAAACTGCCATCAACA
ACAACAACAACATTTCTAGCTTTTTCGGGTCGTCGTCTTCTTCTTCCAATCTTTTTGGATCGATCAACGAGAGTGGGATATCGGGGCTCTCAATGCTCCCAGTGATTGAC
AAGGAAGAGGTTGAGAACAAGGCGAGTTTGTCGAAAGCTACCGCGGCAGCGTTGTTATCGGGTCAATCTTCTCAGTCTGTCGTTTCTTCTTCTTCTCCGATGTCGGCCAC
TGCCCTTCTGCAGAAAGCTGCCCTTATGGGCTCAACGAGGAGCAACAACAATAACTCATCGCTCTTCGGAGCAGGCGGTTTTGGAGTAATGAGCTCGTCGTCGTCATCTT
CTTCGTCGTCATCGTCATCTTCTAATGCGGTGGGCTTGAACTCTCTTAATAAAACTAGGAGCCTGACAATGGCCGACTCAATGCAGATGGTCGGGAGCTCCGACTTGAGC
TCAAATTGCCTCAGCCAATTTTTACTGTCCAACGGTAATAACGGAATGAGAAGTAACGGTCAAACGCGAGACTTCCTCGGCGTGGGAGGATCAGGAGAAGCTCCCCGGCC
ACCATTCCTCCCACCGGAGCTGGCAAAGTTCGCCGCCATAAACTCAACCATGGGGCTAAGCCAATTCGCCGCCAACCAC
Protein sequenceShow/hide protein sequence
MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYIC
PEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILL
HQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNNISSFFGSSSSSSNLFGSINESGISGLSMLPVID
KEEVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSSSSSSSSSSNAVGLNSLNKTRSLTMADSMQMVGSSDLS
SNCLSQFLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFAANH