; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g31470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g31470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein indeterminate-domain 9-like
Genome locationchr5:23617045..23619288
RNA-Seq ExpressionMoc05g31470
SyntenyMoc05g31470
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR022755 - Zinc finger, double-stranded RNA binding
IPR033243 - Zinc finger protein JACKDAW-like
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137968.1 zinc finger protein BALDIBIS [Cucumis sativus]2.8e-21083.49Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F   QDANP+PN KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQSL+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  -NNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
         NN+IS+FFG+SSSSSNLFGSI E   +GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTRS NNNN+
Subjt:  -NNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS    SSSSSSNAV SLNSLNK+RSLTM DS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

XP_008442674.1 PREDICTED: protein indeterminate-domain 9 [Cucumis melo]3.1e-20983.69Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AI 
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS
        NNNNIS+FFG+SSSSSNLFGSI E   +GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS    SSSSSSNAV SLNS NK+RSLTMADS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

XP_022145765.1 protein indeterminate-domain 9-like [Momordica charantia]3.7e-271100Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
        ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF

Query:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
        FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
Subjt:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS

Query:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
        SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
Subjt:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA

Query:  ANH
        ANH
Subjt:  ANH

XP_022934107.1 protein indeterminate-domain 9-like [Cucurbita moschata]3.1e-20982.16Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSS PT FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS
        ITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWLN        N+NNIS+
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS

Query:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM
        F+G+SSSSSNLFGSI E   +GLS+LPV +KEDVE K SL KAT  AAALLSGQSS   V SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FGVM
Subjt:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM

Query:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST
        +S    SS SSSSSSNAVSLNSLNK+RS++MADS+QMVG +SDLSSN LSQ+L  L+NGNN M+SN QTRDFLGVGG+GEAPRPPFLPPELAKF AINST
Subjt:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST

Query:  MGLSQFAANH
        MGLSQFAANH
Subjt:  MGLSQFAANH

XP_038903191.1 zinc finger protein BALDIBIS-like [Benincasa hispida]2.6e-21685Show/hide
Query:  MSSNPFSLLSSAPTAFAPQ-QDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  TAFA Q  DANP+PN KPKPSAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQ-QDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-----
        RITTVSATNILNNLRNDS ILLHQQD  QSL+D H+N         N LQSLGD+SGLSQF+HSD FLRD ED QHKNRSPLSLWLNQASAETA+     
Subjt:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-----

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKA--SLSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSS-
        NNNNIS+ FG+SSSSSNLFGSI E   +GLS+LPVI+KEDVENK   +LSKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTRSNNNN+S 
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKA--SLSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSS-

Query:  LFGAGGFGVMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQVLL-SNGNNGM-RSNGQTRDFLGVGGSGEAPRPPFLPPE
        LFGAG FGVMS     SSSSSSSSSNAVSLNSLNK+RSLTMADS+QM+G +SDLSSNCLSQ+L+  NGNN M RS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  LFGAGGFGVMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQVLL-SNGNNGM-RSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKFA INST+GLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

TrEMBL top hitse value%identityAlignment
A0A0A0LDZ1 C2H2-type domain-containing protein1.4e-21083.49Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F   QDANP+PN KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQSL+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  -NNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
         NN+IS+FFG+SSSSSNLFGSI E   +GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTRS NNNN+
Subjt:  -NNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS    SSSSSSNAV SLNSLNK+RSLTM DS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

A0A1S3B706 protein indeterminate-domain 91.5e-20983.69Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AI 
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS
        NNNNIS+FFG+SSSSSNLFGSI E   +GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS    SSSSSSNAV SLNS NK+RSLTMADS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

A0A5D3DNM0 Protein indeterminate-domain 91.5e-20983.69Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPS AAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPS-AAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF +HSD FLRD ED Q KNRSPLSLWLNQASAE AI 
Subjt:  RITTVSATNILNNLRNDS---ILLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQF-SHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAI-

Query:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS
        NNNNIS+FFG+SSSSSNLFGSI E   +GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS
Subjt:  NNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTR-SNNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS    SSSSSSNAV SLNS NK+RSLTMADS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAV-SLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

A0A6J1CVG4 protein indeterminate-domain 9-like1.8e-271100Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
        ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF

Query:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
        FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
Subjt:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS

Query:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
        SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
Subjt:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA

Query:  ANH
        ANH
Subjt:  ANH

A0A6J1F1R6 protein indeterminate-domain 9-like1.5e-20982.16Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSS PT FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS
        ITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWLN        N+NNIS+
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS

Query:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM
        F+G+SSSSSNLFGSI E   +GLS+LPV +KEDVE K SL KAT  AAALLSGQSS   V SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FGVM
Subjt:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM

Query:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST
        +S    SS SSSSSSNAVSLNSLNK+RS++MADS+QMVG +SDLSSN LSQ+L  L+NGNN M+SN QTRDFLGVGG+GEAPRPPFLPPELAKF AINST
Subjt:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVG-SSDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST

Query:  MGLSQFAANH
        MGLSQFAANH
Subjt:  MGLSQFAANH

SwissProt top hitse value%identityAlignment
Q700D2 Zinc finger protein JACKDAW1.2e-9449.4Show/hide
Query:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP
        ++NP+PN+KP  S++AKKKRN PG PDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+ +E IKKKVYICP KTCVHHD 
Subjt:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP

Query:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL
        SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAFCDAL EE AR+++       +S TN+  N 
Subjt:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL

Query:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS
         N+S +++  +       PH   +   +H   N       + +SQF    F  DL     +  S +         + A   N+    F SSSSS   F  
Subjt:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS

Query:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS
         ++  I   S  P +           S +     L   S   + SSS         SPMSATALLQKAA MGSTRSN++ +  F AG    M+SSS+ +S
Subjt:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS

Query:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI
            SSS  +    LN   +  + ++       +  V +S + +N        +G N  +  G TRDFLGV       +  R PFLP ELA+FA +
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI

Q8H1F5 Protein indeterminate-domain 71.7e-8552.69Show/hide
Query:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PGNPDP+AEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD
        KHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR            + N  ++     PH     
Subjt:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV
                 +H+H   Q++G   S  +  S+S+    ++  E Q H    P   WL             ISS    + ++ NLF  +  S  +G S  P 
Subjt:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV

Query:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS
                                        S  MSATALLQKAA MGST+S
Subjt:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS

Q944L3 Zinc finger protein BALDIBIS8.5e-9354.92Show/hide
Query:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR
        NP+PN  P  S +AK+KRNLPGNPDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCVHHDP+R
Subjt:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR

Query:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---
        ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN  +  +   
Subjt:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---

Query:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN
         ++Q    + L    S  +    N N NN+  LG     + F+ S         +  + S  +LW    Q+S +  +N NN            N   +I 
Subjt:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS
        + GIS        ++E+ E K  +S  +  +  +  ++ +   +    + MSATALLQKAA MGS RS++++S+   +  FG+M+S
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS

Q9LRW7 Protein indeterminate-domain 111.2e-8345.04Show/hide
Query:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PGNPDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H      
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP

Query:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED
                 H+H+  Q   +VS  S  SH+          H   + L    N  +   + N+NN    F       +    +N           +I    
Subjt:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED

Query:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN
             +L+ +       G    S+ S +SP MSATALLQKAA MGST++              NNN ++   A    +M+S S   SS++++       N
Subjt:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN

Query:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS
        +    N  R     D+    + ++++++   S+    +G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS

Q9ZWA6 Zinc finger protein MAGPIE1.3e-8045.63Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR
        KKKRNLPGNPDP+AEVIALSPK+LMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRT+KE ++K+VY+CPEK+CVHH P+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD
        KHGEKKWKC+KC+K+YAVQSDWKAHSKTCGTREY+CDCGT+FSR+DSFITHRAFCDALAEE+AR+   S     A    +NL N   L+    P  SL  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN
        P S         +H+H+           + F H D +        K  S LSLW              ++ + +      + +  FG++++   L  + +
Subjt:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------
               ++  V  KE+     SLS  +  + +    Q + +   + + MSATALLQKAA MG+T S +  +++       + S +S ++          
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------

Query:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG
          +S  SN+V L S N      + +    V    G  +L +    +  +  GN G    GQTRDFLGVG
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG

Arabidopsis top hitse value%identityAlignment
AT1G03840.1 C2H2 and C2HC zinc fingers superfamily protein9.1e-8245.63Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR
        KKKRNLPGNPDP+AEVIALSPK+LMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRT+KE ++K+VY+CPEK+CVHH P+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD
        KHGEKKWKC+KC+K+YAVQSDWKAHSKTCGTREY+CDCGT+FSR+DSFITHRAFCDALAEE+AR+   S     A    +NL N   L+    P  SL  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN
        P S         +H+H+           + F H D +        K  S LSLW              ++ + +      + +  FG++++   L  + +
Subjt:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------
               ++  V  KE+     SLS  +  + +    Q + +   + + MSATALLQKAA MG+T S +  +++       + S +S ++          
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------

Query:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG
          +S  SN+V L S N      + +    V    G  +L +    +  +  GN G    GQTRDFLGVG
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG

AT1G55110.1 indeterminate(ID)-domain 71.2e-8652.69Show/hide
Query:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PGNPDP+AEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD
        KHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR            + N  ++     PH     
Subjt:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV
                 +H+H   Q++G   S  +  S+S+    ++  E Q H    P   WL             ISS    + ++ NLF  +  S  +G S  P 
Subjt:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV

Query:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS
                                        S  MSATALLQKAA MGST+S
Subjt:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS

AT3G13810.1 indeterminate(ID)-domain 118.7e-8545.04Show/hide
Query:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PGNPDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H      
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP

Query:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED
                 H+H+  Q   +VS  S  SH+          H   + L    N  +   + N+NN    F       +    +N           +I    
Subjt:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED

Query:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN
             +L+ +       G    S+ S +SP MSATALLQKAA MGST++              NNN ++   A    +M+S S   SS++++       N
Subjt:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN

Query:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS
        +    N  R     D+    + ++++++   S+    +G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS

AT3G45260.1 C2H2-like zinc finger protein6.0e-9454.92Show/hide
Query:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR
        NP+PN  P  S +AK+KRNLPGNPDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCVHHDP+R
Subjt:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR

Query:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---
        ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN  +  +   
Subjt:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---

Query:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN
         ++Q    + L    S  +    N N NN+  LG     + F+ S         +  + S  +LW    Q+S +  +N NN            N   +I 
Subjt:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS
        + GIS        ++E+ E K  +S  +  +  +  ++ +   +    + MSATALLQKAA MGS RS++++S+   +  FG+M+S
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS

AT5G03150.1 C2H2-like zinc finger protein8.4e-9649.4Show/hide
Query:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP
        ++NP+PN+KP  S++AKKKRN PG PDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+ +E IKKKVYICP KTCVHHD 
Subjt:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP

Query:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL
        SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAFCDAL EE AR+++       +S TN+  N 
Subjt:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL

Query:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS
         N+S +++  +       PH   +   +H   N       + +SQF    F  DL     +  S +         + A   N+    F SSSSS   F  
Subjt:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS

Query:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS
         ++  I   S  P +           S +     L   S   + SSS         SPMSATALLQKAA MGSTRSN++ +  F AG    M+SSS+ +S
Subjt:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS

Query:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI
            SSS  +    LN   +  + ++       +  V +S + +N        +G N  +  G TRDFLGV       +  R PFLP ELA+FA +
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCGGCACCGACCGCGTTCGCTCCACAACAAGATGCAAACCCTGACCCTAACTCTAAACCGAAACCCTCCGCTGCGGCGAA
GAAGAAGAGAAACCTCCCCGGAAACCCAGATCCGGATGCGGAGGTCATCGCTCTATCGCCTAAGTCTCTCATGGCGACGAACAGATTCATCTGCGAAATATGCAACAAGG
GGTTTCAGAGAGATCAGAACCTGCAGCTTCACCGACGAGGGCACAACCTGCCGTGGAAGCTGCGGCAGCGGACAAACAAGGAGCCGATCAAGAAGAAGGTGTATATCTGC
CCGGAGAAGACGTGCGTCCACCACGACCCGTCGCGGGCCCTTGGCGACCTCACCGGAATAAAGAAACACTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATGCGATAA
GTGTTCCAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGCGATTGTGGAACACTTTTTTCCAGGAAAGATAGCT
TCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCTGCAACAAATATTCTCAATAATCTCAGAAACGATTCAATTCTTCTT
CATCAACAAGATCCTCACCAATCTCTCGTTGATCCTCACTCCAATAATAATAACAATAATAATCATAATCATAATAATCTTCAATCTCTTGGAGACGTTTCAGGGCTTTC
CCAATTCAGCCATTCAGATTTTCTGAGAGATTTAGAAGATCAGCAACACAAGAACAGATCCCCTTTGTCCCTCTGGTTGAACCAAGCTTCTGCTGAAACTGCGATCAACA
ACAACAACATTTCTAGCTTTTTCGGGTCGTCGTCTTCTTCTTCCAATCTTTTTGGATCGATCAACGAGAGCGGGATATCGGGGCTCTCAGTGCTCCCGGTGATTGACAAG
GAAGATGTTGAGAACAAGGCGAGTTTGTCGAAAGCTACCGCGGCAGCGTTATTATCGGGTCAATCTTCTCAGTCCGTCGTTTCTTCTTCTTCTCCGATGTCGGCCACTGC
CCTTCTGCAGAAAGCTGCCCTTATGGGCTCAACGAGGAGCAACAACAATAACTCATCGCTCTTCGGAGCAGGCGGTTTTGGAGTAATGAGCTCGTCGTCGTCGGCATCTT
CTTCGTCGTCATCGTCATCTTCTAATGCGGTGAGCTTGAACTCTCTCAACAAAACTAGGAGCCTGACAATGGCCGACTCAATGCAGATGGTCGGGAGCTCCGACTTGAGT
TCAAATTGCCTCAGCCAAGTTTTACTGTCCAACGGTAATAACGGAATGAGAAGTAATGGTCAAACGCGAGACTTCCTCGGCGTGGGAGGATCAGGGGAAGCTCCCCGGCC
ACCGTTCCTCCCACCGGAGCTGGCAAAGTTCGCCGCCATAAACTCAACCATGGGGCTAAGCCAATTCGCCGCCAACCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCGGCACCGACCGCGTTCGCTCCACAACAAGATGCAAACCCTGACCCTAACTCTAAACCGAAACCCTCCGCTGCGGCGAA
GAAGAAGAGAAACCTCCCCGGAAACCCAGATCCGGATGCGGAGGTCATCGCTCTATCGCCTAAGTCTCTCATGGCGACGAACAGATTCATCTGCGAAATATGCAACAAGG
GGTTTCAGAGAGATCAGAACCTGCAGCTTCACCGACGAGGGCACAACCTGCCGTGGAAGCTGCGGCAGCGGACAAACAAGGAGCCGATCAAGAAGAAGGTGTATATCTGC
CCGGAGAAGACGTGCGTCCACCACGACCCGTCGCGGGCCCTTGGCGACCTCACCGGAATAAAGAAACACTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATGCGATAA
GTGTTCCAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGCGATTGTGGAACACTTTTTTCCAGGAAAGATAGCT
TCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCTGCAACAAATATTCTCAATAATCTCAGAAACGATTCAATTCTTCTT
CATCAACAAGATCCTCACCAATCTCTCGTTGATCCTCACTCCAATAATAATAACAATAATAATCATAATCATAATAATCTTCAATCTCTTGGAGACGTTTCAGGGCTTTC
CCAATTCAGCCATTCAGATTTTCTGAGAGATTTAGAAGATCAGCAACACAAGAACAGATCCCCTTTGTCCCTCTGGTTGAACCAAGCTTCTGCTGAAACTGCGATCAACA
ACAACAACATTTCTAGCTTTTTCGGGTCGTCGTCTTCTTCTTCCAATCTTTTTGGATCGATCAACGAGAGCGGGATATCGGGGCTCTCAGTGCTCCCGGTGATTGACAAG
GAAGATGTTGAGAACAAGGCGAGTTTGTCGAAAGCTACCGCGGCAGCGTTATTATCGGGTCAATCTTCTCAGTCCGTCGTTTCTTCTTCTTCTCCGATGTCGGCCACTGC
CCTTCTGCAGAAAGCTGCCCTTATGGGCTCAACGAGGAGCAACAACAATAACTCATCGCTCTTCGGAGCAGGCGGTTTTGGAGTAATGAGCTCGTCGTCGTCGGCATCTT
CTTCGTCGTCATCGTCATCTTCTAATGCGGTGAGCTTGAACTCTCTCAACAAAACTAGGAGCCTGACAATGGCCGACTCAATGCAGATGGTCGGGAGCTCCGACTTGAGT
TCAAATTGCCTCAGCCAAGTTTTACTGTCCAACGGTAATAACGGAATGAGAAGTAATGGTCAAACGCGAGACTTCCTCGGCGTGGGAGGATCAGGGGAAGCTCCCCGGCC
ACCGTTCCTCCCACCGGAGCTGGCAAAGTTCGCCGCCATAAACTCAACCATGGGGCTAAGCCAATTCGCCGCCAACCACTGA
Protein sequenceShow/hide protein sequence
MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYIC
PEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILL
HQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDK
EDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLS
SNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFAANH