; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g1399 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g1399
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein indeterminate-domain 9-like
Genome locationMC05:18158748..18160988
RNA-Seq ExpressionMC05g1399
SyntenyMC05g1399
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR022755 - Zinc finger, double-stranded RNA binding
IPR033243 - Zinc finger protein JACKDAW-like
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137968.1 zinc finger protein BALDIBIS [Cucumis sativus]8.03e-26583.49Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F   QDANP+PN KPKPSAAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQSL+D H              QSLGD+SGLSQF+ HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
        +NN IS+FFG+SSSSSNLFGSI E+   GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVVSSS PMSATALLQKAALMGSTRS NNNN+
Subjt:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS SSSSSS   NAVS LNSLNK+RSLTM DS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

XP_022145765.1 protein indeterminate-domain 9-like [Momordica charantia]0.0100Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
        ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF

Query:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
        FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
Subjt:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS

Query:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
        SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
Subjt:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA

Query:  ANH
        ANH
Subjt:  ANH

XP_022934107.1 protein indeterminate-domain 9-like [Cucurbita moschata]9.67e-26482.16Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSS PT FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS
        ITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWLN        N+NNIS+
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS

Query:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM
        F+G+SSSSSNLFGSI E+   GLS+LPV +KEDVE K SL KAT  AAALLSGQSS  V  SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FGVM
Subjt:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM

Query:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST
        +SSS     SSSSSSNAVSLNSLNK+RS++MADS+QMVG+ SDLSSN LSQ+L  L+NGNN M+SN QTRDFLGVGG+GEAPRPPFLPPELAKF AINST
Subjt:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST

Query:  MGLSQFAANH
        MGLSQFAANH
Subjt:  MGLSQFAANH

XP_023526465.1 protein indeterminate-domain 9-like [Cucurbita pepo subsp. pepo]2.10e-26382.23Show/hide
Query:  MSSNPFSLLSSAPTA--FAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSS PT+  FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSAPTA--FAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKEPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNI
        ARITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWLN        NNNNI
Subjt:  ARITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNI

Query:  SSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFG
        S+F+G SSSSSNLFGSI E+   GLS+LPV +KEDVE K SLSKAT  AAALLSGQSS  V  SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FG
Subjt:  SSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFG

Query:  VMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAIN
        VM+SSS     SSSSSSNAVSLNSLNK+RS+TMADS+QMVG+ SDLSSN LSQ+L  L+NGNN M+SN QTRDFLGVGG+GEAP+PPFLPPELAKFAAIN
Subjt:  VMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAIN

Query:  STMGLSQFAANH
        STMGLSQFAANH
Subjt:  STMGLSQFAANH

XP_038903191.1 zinc finger protein BALDIBIS-like [Benincasa hispida]9.83e-27385Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQD-ANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  TAFA Q D ANP+PN KPKPSAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQD-ANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINN---
        RITTVSATNILNNLRNDS ILLHQQD  QSL+D H+NN          LQSLGD+SGLSQF+HSD FLRD ED QHKNRSPLSLWLNQASAETA+NN   
Subjt:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINN---

Query:  --NNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKA--SLSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSS-
          NNIS+ FG+SSSSSNLFGSI E+   GLS+LPVI+KEDVENK   +LSKAT   AAALLSGQSSQSVVSSS PMSATALLQKAALMGSTRSNNNN+S 
Subjt:  --NNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKA--SLSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSS-

Query:  LFGAGGFGVMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGM-RSNGQTRDFLGVGGSGEAPRPPFLPPE
        LFGAG FGVMSSSSS     SSSSSNAVSLNSLNK+RSLTMADS+QM+G+ SDLSSNCLSQ+L+  NGNN M RS+GQTRDFLGVGG GEAPRPPFLPPE
Subjt:  LFGAGGFGVMSSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLL-SNGNNGM-RSNGQTRDFLGVGGSGEAPRPPFLPPE

Query:  LAKFAAINSTMGLSQFAANH
        LAKFA INST+GLSQFAANH
Subjt:  LAKFAAINSTMGLSQFAANH

TrEMBL top hitse value%identityAlignment
A0A0A0LDZ1 C2H2-type domain-containing protein3.89e-26583.49Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F   QDANP+PN KPKPSAAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQSL+D H              QSLGD+SGLSQF+ HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
        +NN IS+FFG+SSSSSNLFGSI E+   GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVVSSS PMSATALLQKAALMGSTRS NNNN+
Subjt:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS SSSSSS   NAVS LNSLNK+RSLTM DS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

A0A1S3B706 protein indeterminate-domain 91.20e-26383.69Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPSAAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF+ HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
        NNN IS+FFG+SSSSSNLFGSI E+   GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVVSSS PMSATALLQKAALMGSTRS NNNNS
Subjt:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS SSSSSS   NAVS LNS NK+RSLTMADS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

A0A5D3DNM0 Protein indeterminate-domain 91.20e-26383.69Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSS  T+F  QQDANP+P  KPKPSAAA KKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAA-KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN
        RITTVSATNILNNLRNDS    LLHQQ D HQ L+D H              QSLGD+SGLSQF+ HSD FLRD ED Q KNRSPLSLWLNQASAE AIN
Subjt:  RITTVSATNILNNLRNDSI---LLHQQ-DPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFS-HSD-FLRDLEDQQHKNRSPLSLWLNQASAETAIN

Query:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS
        NNN IS+FFG+SSSSSNLFGSI E+   GLS+LPV++KEDVENK S    SKAT   AAALLSGQSSQSVVSSS PMSATALLQKAALMGSTRS NNNNS
Subjt:  NNN-ISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKAS---LSKAT---AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS-NNNNS

Query:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP
         LFG+G FGVMSSSSS SSSSSS   NAVS LNS NK+RSLTMADS+QM+GS SDLSSNCLSQ+L+  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPP
Subjt:  SLFGAGGFGVMSSSSSASSSSSSSSSNAVS-LNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVLLS-NGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPP

Query:  ELAKFAAINSTMGLSQFAANH
        ELAKF  INSTMGLSQFAANH
Subjt:  ELAKFAAINSTMGLSQFAANH

A0A6J1CVG4 protein indeterminate-domain 9-like0.0100Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
        ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSF

Query:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
        FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS
Subjt:  FGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSS

Query:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
        SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA
Subjt:  SASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFA

Query:  ANH
        ANH
Subjt:  ANH

A0A6J1F1R6 protein indeterminate-domain 9-like4.68e-26482.16Show/hide
Query:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
        MSSNPFSLLSS PT FA   DANP PN KPKPSAAAKKKRNLPG PDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK
Subjt:  MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNK

Query:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
        EPIKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR
Subjt:  EPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR

Query:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS
        ITTVSATNILNNLRNDS+LLHQQD  QSL+D          H  NNLQ+LGDV  LSQF+HSD FLRD ED QHKNRSP SLWLN        N+NNIS+
Subjt:  ITTVSATNILNNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSD-FLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISS

Query:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM
        F+G+SSSSSNLFGSI E+   GLS+LPV +KEDVE K SL KAT  AAALLSGQSS  V  SSSPMSATALLQKAALMGSTRS+ NNNS L  AG FGVM
Subjt:  FFGSSSSSSNLFGSINESGISGLSVLPVIDKEDVENKASLSKAT--AAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSN-NNNSSLFGAGGFGVM

Query:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST
        +SSS     SSSSSSNAVSLNSLNK+RS++MADS+QMVG+ SDLSSN LSQ+L  L+NGNN M+SN QTRDFLGVGG+GEAPRPPFLPPELAKF AINST
Subjt:  SSSSSASSSSSSSSSNAVSLNSLNKTRSLTMADSMQMVGS-SDLSSNCLSQVL--LSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINST

Query:  MGLSQFAANH
        MGLSQFAANH
Subjt:  MGLSQFAANH

SwissProt top hitse value%identityAlignment
Q700D2 Zinc finger protein JACKDAW1.2e-9449.4Show/hide
Query:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP
        ++NP+PN+KP  S++AKKKRN PG PDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+ +E IKKKVYICP KTCVHHD 
Subjt:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP

Query:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL
        SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAFCDAL EE AR+++       +S TN+  N 
Subjt:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL

Query:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS
         N+S +++  +       PH   +   +H   N       + +SQF    F  DL     +  S +         + A   N+    F SSSSS   F  
Subjt:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS

Query:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS
         ++  I   S  P +           S +     L   S   + SSS         SPMSATALLQKAA MGSTRSN++ +  F AG    M+SSS+ +S
Subjt:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS

Query:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI
            SSS  +    LN   +  + ++       +  V +S + +N        +G N  +  G TRDFLGV       +  R PFLP ELA+FA +
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI

Q8H1F5 Protein indeterminate-domain 71.7e-8552.69Show/hide
Query:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PGNPDP+AEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD
        KHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR            + N  ++     PH     
Subjt:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV
                 +H+H   Q++G   S  +  S+S+    ++  E Q H    P   WL             ISS    + ++ NLF  +  S  +G S  P 
Subjt:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV

Query:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS
                                        S  MSATALLQKAA MGST+S
Subjt:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS

Q944L3 Zinc finger protein BALDIBIS8.5e-9354.92Show/hide
Query:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR
        NP+PN  P  S +AK+KRNLPGNPDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCVHHDP+R
Subjt:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR

Query:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---
        ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN  +  +   
Subjt:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---

Query:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN
         ++Q    + L    S  +    N N NN+  LG     + F+ S         +  + S  +LW    Q+S +  +N NN            N   +I 
Subjt:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS
        + GIS        ++E+ E K  +S  +  +  +  ++ +   +    + MSATALLQKAA MGS RS++++S+   +  FG+M+S
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS

Q9LRW7 Protein indeterminate-domain 111.2e-8345.04Show/hide
Query:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PGNPDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H      
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP

Query:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED
                 H+H+  Q   +VS  S  SH+          H   + L    N  +   + N+NN    F       +    +N           +I    
Subjt:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED

Query:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN
             +L+ +       G    S+ S +SP MSATALLQKAA MGST++              NNN ++   A    +M+S S   SS++++       N
Subjt:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN

Query:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS
        +    N  R     D+    + ++++++   S+    +G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS

Q9ZWA6 Zinc finger protein MAGPIE1.3e-8045.63Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR
        KKKRNLPGNPDP+AEVIALSPK+LMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRT+KE ++K+VY+CPEK+CVHH P+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD
        KHGEKKWKC+KC+K+YAVQSDWKAHSKTCGTREY+CDCGT+FSR+DSFITHRAFCDALAEE+AR+   S     A    +NL N   L+    P  SL  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN
        P S         +H+H+           + F H D +        K  S LSLW              ++ + +      + +  FG++++   L  + +
Subjt:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------
               ++  V  KE+     SLS  +  + +    Q + +   + + MSATALLQKAA MG+T S +  +++       + S +S ++          
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------

Query:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG
          +S  SN+V L S N      + +    V    G  +L +    +  +  GN G    GQTRDFLGVG
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG

Arabidopsis top hitse value%identityAlignment
AT1G03840.1 C2H2 and C2HC zinc fingers superfamily protein9.1e-8245.63Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR
        KKKRNLPGNPDP+AEVIALSPK+LMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKL+QRT+KE ++K+VY+CPEK+CVHH P+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD
        KHGEKKWKC+KC+K+YAVQSDWKAHSKTCGTREY+CDCGT+FSR+DSFITHRAFCDALAEE+AR+   S     A    +NL N   L+    P  SL  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVS-----ATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN
        P S         +H+H+           + F H D +        K  S LSLW              ++ + +      + +  FG++++   L  + +
Subjt:  PHS---NNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWL-------------NQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------
               ++  V  KE+     SLS  +  + +    Q + +   + + MSATALLQKAA MG+T S +  +++       + S +S ++          
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLS--GQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS---------

Query:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG
          +S  SN+V L S N      + +    V    G  +L +    +  +  GN G    GQTRDFLGVG
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADSMQMV----GSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVG

AT1G55110.1 indeterminate(ID)-domain 71.2e-8652.69Show/hide
Query:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PGNPDP+AEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD
        KHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR            + N  ++     PH     
Subjt:  KHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVD

Query:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV
                 +H+H   Q++G   S  +  S+S+    ++  E Q H    P   WL             ISS    + ++ NLF  +  S  +G S  P 
Subjt:  PHSNNNNNNNHNHNNLQSLG-DVSGLSQFSHSDF---LRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPV

Query:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS
                                        S  MSATALLQKAA MGST+S
Subjt:  IDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRS

AT3G13810.1 indeterminate(ID)-domain 118.7e-8545.04Show/hide
Query:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PGNPDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H      
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLVDP

Query:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED
                 H+H+  Q   +VS  S  SH+          H   + L    N  +   + N+NN    F       +    +N           +I    
Subjt:  HSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSINESGISGLSVLPVIDKED

Query:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN
             +L+ +       G    S+ S +SP MSATALLQKAA MGST++              NNN ++   A    +M+S S   SS++++       N
Subjt:  VENKASLSKATAAALLSGQSSQSVVSSSSP-MSATALLQKAALMGSTRS--------------NNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLN

Query:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS
        +    N  R     D+    + ++++++   S+    +G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  SL---NKTRSLTMADSM-QMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLS

AT3G45260.1 C2H2-like zinc finger protein6.0e-9454.92Show/hide
Query:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR
        NP+PN  P  S +AK+KRNLPGNPDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCVHHDP+R
Subjt:  NPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDPSR

Query:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---
        ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN  +  +   
Subjt:  ALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLRNDSI---

Query:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN
         ++Q    + L    S  +    N N NN+  LG     + F+ S         +  + S  +LW    Q+S +  +N NN            N   +I 
Subjt:  LLHQQDPHQSLVDPHSN-NNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLW--LNQASAETAINNNNISSFFGSSSSSSNLFGSIN

Query:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS
        + GIS        ++E+ E K  +S  +  +  +  ++ +   +    + MSATALLQKAA MGS RS++++S+   +  FG+M+S
Subjt:  ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSS

AT5G03150.1 C2H2-like zinc finger protein8.4e-9649.4Show/hide
Query:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP
        ++NP+PN+KP  S++AKKKRN PG PDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+ +E IKKKVYICP KTCVHHD 
Subjt:  DANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKKKVYICPEKTCVHHDP

Query:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL
        SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAFCDAL EE AR+++       +S TN+  N 
Subjt:  SRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITT-------VSATNILNNL

Query:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS
         N+S +++  +       PH   +   +H   N       + +SQF    F  DL     +  S +         + A   N+    F SSSSS   F  
Subjt:  RNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGS

Query:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS
         ++  I   S  P +           S +     L   S   + SSS         SPMSATALLQKAA MGSTRSN++ +  F AG    M+SSS+ +S
Subjt:  INESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSS---------SPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASS

Query:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI
            SSS  +    LN   +  + ++       +  V +S + +N        +G N  +  G TRDFLGV       +  R PFLP ELA+FA +
Subjt:  SSSSSSSNAVSLNSLNKTRSLTMADS-------MQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGS---GEAPRPPFLPPELAKFAAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCGGCACCGACCGCGTTCGCTCCACAACAAGATGCAAACCCTGACCCTAACTCTAAACCGAAACCCTCCGCTGCG
GCGAAGAAGAAGAGAAACCTCCCCGGAAACCCAGATCCGGATGCGGAGGTCATCGCTCTATCGCCTAAGTCTCTCATGGCGACGAACAGATTCATCTGCGAAATA
TGCAACAAGGGGTTTCAGAGAGATCAGAACCTGCAGCTTCACCGACGAGGGCACAACCTGCCGTGGAAGCTGCGGCAGCGGACAAACAAGGAGCCGATCAAGAAG
AAGGTGTATATCTGCCCGGAGAAGACGTGCGTCCACCACGACCCGTCGCGGGCCCTTGGCGACCTCACCGGAATAAAGAAACACTTCAGCCGGAAACACGGCGAG
AAGAAGTGGAAATGCGATAAGTGTTCCAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGCGATTGTGGA
ACACTTTTTTCCAGGAAAGATAGCTTCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCTGCAACAAATATTCTC
AATAATCTCAGAAACGATTCAATTCTTCTTCATCAACAAGATCCTCACCAATCTCTCGTTGATCCTCACTCCAATAATAATAACAATAATAATCATAATCATAAT
AATCTTCAATCTCTTGGAGACGTTTCAGGGCTTTCCCAATTCAGCCATTCAGATTTTCTGAGAGATTTAGAAGATCAGCAACACAAGAACAGATCCCCTTTGTCC
CTCTGGTTGAACCAAGCTTCTGCTGAAACTGCGATCAACAACAACAACATTTCTAGCTTTTTCGGGTCGTCGTCTTCTTCTTCCAATCTTTTTGGATCGATCAAC
GAGAGCGGGATATCGGGGCTCTCAGTGCTCCCGGTGATTGACAAGGAAGATGTTGAGAACAAGGCGAGTTTGTCGAAAGCTACCGCGGCAGCGTTATTATCGGGT
CAATCTTCTCAGTCCGTCGTTTCTTCTTCTTCTCCGATGTCGGCCACTGCCCTTCTGCAGAAAGCTGCCCTTATGGGCTCAACGAGGAGCAACAACAATAACTCA
TCGCTCTTCGGAGCAGGCGGTTTTGGAGTAATGAGCTCGTCGTCGTCGGCATCTTCTTCGTCGTCATCGTCATCTTCTAATGCGGTGAGCTTGAACTCTCTCAAC
AAAACTAGGAGCCTGACAATGGCCGACTCAATGCAGATGGTCGGGAGCTCCGACTTGAGTTCAAATTGCCTCAGCCAAGTTTTACTGTCCAACGGTAATAACGGA
ATGAGAAGTAATGGTCAAACGCGAGACTTCCTCGGCGTGGGAGGATCAGGGGAAGCTCCCCGGCCACCGTTCCTCCCACCGGAGCTGGCAAAGTTCGCCGCCATA
AACTCAACCATGGGGCTAAGCCAATTCGCCGCCAACCAC
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCGGCACCGACCGCGTTCGCTCCACAACAAGATGCAAACCCTGACCCTAACTCTAAACCGAAACCCTCCGCTGCG
GCGAAGAAGAAGAGAAACCTCCCCGGAAACCCAGATCCGGATGCGGAGGTCATCGCTCTATCGCCTAAGTCTCTCATGGCGACGAACAGATTCATCTGCGAAATA
TGCAACAAGGGGTTTCAGAGAGATCAGAACCTGCAGCTTCACCGACGAGGGCACAACCTGCCGTGGAAGCTGCGGCAGCGGACAAACAAGGAGCCGATCAAGAAG
AAGGTGTATATCTGCCCGGAGAAGACGTGCGTCCACCACGACCCGTCGCGGGCCCTTGGCGACCTCACCGGAATAAAGAAACACTTCAGCCGGAAACACGGCGAG
AAGAAGTGGAAATGCGATAAGTGTTCCAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGCGATTGTGGA
ACACTTTTTTCCAGGAAAGATAGCTTCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCTGCAACAAATATTCTC
AATAATCTCAGAAACGATTCAATTCTTCTTCATCAACAAGATCCTCACCAATCTCTCGTTGATCCTCACTCCAATAATAATAACAATAATAATCATAATCATAAT
AATCTTCAATCTCTTGGAGACGTTTCAGGGCTTTCCCAATTCAGCCATTCAGATTTTCTGAGAGATTTAGAAGATCAGCAACACAAGAACAGATCCCCTTTGTCC
CTCTGGTTGAACCAAGCTTCTGCTGAAACTGCGATCAACAACAACAACATTTCTAGCTTTTTCGGGTCGTCGTCTTCTTCTTCCAATCTTTTTGGATCGATCAAC
GAGAGCGGGATATCGGGGCTCTCAGTGCTCCCGGTGATTGACAAGGAAGATGTTGAGAACAAGGCGAGTTTGTCGAAAGCTACCGCGGCAGCGTTATTATCGGGT
CAATCTTCTCAGTCCGTCGTTTCTTCTTCTTCTCCGATGTCGGCCACTGCCCTTCTGCAGAAAGCTGCCCTTATGGGCTCAACGAGGAGCAACAACAATAACTCA
TCGCTCTTCGGAGCAGGCGGTTTTGGAGTAATGAGCTCGTCGTCGTCGGCATCTTCTTCGTCGTCATCGTCATCTTCTAATGCGGTGAGCTTGAACTCTCTCAAC
AAAACTAGGAGCCTGACAATGGCCGACTCAATGCAGATGGTCGGGAGCTCCGACTTGAGTTCAAATTGCCTCAGCCAAGTTTTACTGTCCAACGGTAATAACGGA
ATGAGAAGTAATGGTCAAACGCGAGACTTCCTCGGCGTGGGAGGATCAGGGGAAGCTCCCCGGCCACCGTTCCTCCCACCGGAGCTGGCAAAGTTCGCCGCCATA
AACTCAACCATGGGGCTAAGCCAATTCGCCGCCAACCAC
Protein sequenceShow/hide protein sequence
MSSNPFSLLSSAPTAFAPQQDANPDPNSKPKPSAAAKKKRNLPGNPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEPIKK
KVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNIL
NNLRNDSILLHQQDPHQSLVDPHSNNNNNNNHNHNNLQSLGDVSGLSQFSHSDFLRDLEDQQHKNRSPLSLWLNQASAETAINNNNISSFFGSSSSSSNLFGSIN
ESGISGLSVLPVIDKEDVENKASLSKATAAALLSGQSSQSVVSSSSPMSATALLQKAALMGSTRSNNNNSSLFGAGGFGVMSSSSSASSSSSSSSSNAVSLNSLN
KTRSLTMADSMQMVGSSDLSSNCLSQVLLSNGNNGMRSNGQTRDFLGVGGSGEAPRPPFLPPELAKFAAINSTMGLSQFAANH