; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033377 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033377
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionC2H2-type domain-containing protein
Genome locationscaffold5:2632744..2635507
RNA-Seq ExpressionSpg033377
SyntenySpg033377
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR022755 - Zinc finger, double-stranded RNA binding
IPR033243 - Zinc finger protein JACKDAW-like
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137968.1 zinc finger protein BALDIBIS [Cucumis sativus]5.5e-22787.01Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPS-AAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQR
        MS+NPFSLLSST T+F     QDANPNPNPKPKPS AAAKKKRNLPGTPDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQR
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPS-AAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQR

Query:  TNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEE
        TNKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEE
Subjt:  TNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEE

Query:  SARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNIS
        SARITTVSATNILNNLRNDS    LLH Q D  QSLIDH     QSLGD+SGLSQF +HSDHFLRDFED QQKNRSPLSLWLNQASA  A+  N+NN+IS
Subjt:  SARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNIS

Query:  NFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGV
        NFFGASSSSSNLFGSI E G+SMLPVMEK++VENKGS    SKAT     ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS NNNN+PLFG+GAFGV
Subjt:  NFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGV

Query:  MSSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMG
        MSSSSS SSS S+NAV SLNSLNKS SLTM DSVQM+GS+SDLSSNCLSQLL+PPN NN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMG
Subjt:  MSSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMG

Query:  LSQFAANH
        LSQFAANH
Subjt:  LSQFAANH

XP_008442674.1 PREDICTED: protein indeterminate-domain 9 [Cucumis melo]3.6e-22686.98Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST T+F    QQDANPNP PKP  +AAAKKKRNLPGTPDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISN
        ARITTVSATNILNNLRNDS    LLH Q D  Q LIDH     QSLGD+SGLSQF +HSDHFLRDFED QQKNRSPLSLWLNQASA  A+  NNNNNISN
Subjt:  ARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISN

Query:  FFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVM
        FFGASSSSSNLFGSI E G+SMLPVMEK++VENKGS    SKAT     ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFG+GAFGVM
Subjt:  FFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVM

Query:  SSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGL
        SSSSS SSS S+NAV SLNS NKS SLTM DSVQM+GS+SDLSSNCLSQLL+PPN NN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGL
Subjt:  SSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGL

Query:  SQFAANH
        SQFAANH
Subjt:  SQFAANH

XP_022145765.1 protein indeterminate-domain 9-like [Momordica charantia]3.0e-22586.94Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MSSNPFSLLSS PTAFAP  QQDANP+PN KPKPSAAAKKKRNLPG PDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLID----------HQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNN
        ARITTVSATNILNNLRNDSILL HQQDP QSL+D          H  NNLQSLGDVSGLSQFSHSD FLRD EDQQ KNRSPLSLWLNQASA TA+   N
Subjt:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLID----------HQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNN

Query:  NNNISNFFGASSSSSNLFGSINETGI---SMLPVMEKDEVENKGSLSKAT--ALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGA
        NNNIS+FFG+SSSSSNLFGSINE+GI   S+LPV++K++VENK SLSKAT  ALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS LFGAG 
Subjt:  NNNISNFFGASSSSSNLFGSINETGI---SMLPVMEKDEVENKGSLSKAT--ALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGA

Query:  FGVMSSS---SSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATI
        FGVMSSS   SSSSSS S+NAVSLNSLNK+ SLTM DS+QMVG SSDLSSNCLSQ+LL  N NNGMRS+GQTRDFLGVGG GEAPRPPFLPPELAKFA I
Subjt:  FGVMSSS---SSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATI

Query:  NSTMGLSQFAANH
        NSTMGLSQFAANH
Subjt:  NSTMGLSQFAANH

XP_023526465.1 protein indeterminate-domain 9-like [Cucurbita pepo subsp. pepo]2.9e-22084.57Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSSTPT+       DANP+PNPKPKPSAAAKKKRNLPGTPDP+AEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGA
        ARITTVSATNILNNLRNDS+LLH Q    QSLIDHQ NNLQ+LGDV  LSQF+HSDHFLRDFED Q KNRSP SLWL           NNNNNISNF+G 
Subjt:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGA

Query:  SSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKAT----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSSSSSSSS
        SSSSSNLFGSI ETG+SMLPV EK++VE KGSLSKAT    ALLSGQSS SVVSSSPMSATALLQKAALMGSTRSS NNNSPL  AGAFGVM+SSS SSS
Subjt:  SSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKAT----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSSSSSSSS

Query:  SPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLP-PNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH
        S S+NAVSLNSLNKS S+TM DSVQMVG++SDLSSN LSQLL+P  N NN M+S+ QTRDFLGVGG GEAP+PPFLPPELAKFA INSTMGLSQFAANH
Subjt:  SPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLP-PNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH

XP_038903191.1 zinc finger protein BALDIBIS-like [Benincasa hispida]1.2e-23288.91Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST TAFA  Q  DANPNPNPKPKPSAA KKKRNLPGTPDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLIDHQQNN-LQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMN--NNNNNNISNF
        ARITTVSATNILNNLRNDS +L HQQD  QSLIDH  NN LQSLGD+SGLSQF+HSDHFLRDFED Q KNRSPLSLWLNQASA TAMN  NNNNNNISN 
Subjt:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLIDHQQNN-LQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMN--NNNNNNISNF

Query:  FGASSSSSNLFGSINETGISMLPVMEKDEVENKG--SLSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSS
        FGASSSSSNLFGSI E G+SMLPV+EK++VENKG  +LSKAT     ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS+NNNNSPLFGAGAFGVMSS
Subjt:  FGASSSSSNLFGSINETGISMLPVMEKDEVENKG--SLSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSS

Query:  SSSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGM-RSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQ
        SSSSSS  S+NAVSLNSLNKS SLTM DSVQM+G++SDLSSNCLSQLL+P N NN M RSSGQTRDFLGV GGGEAPRPPFLPPELAKFATINST+GLSQ
Subjt:  SSSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGM-RSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQ

Query:  FAANH
        FAANH
Subjt:  FAANH

TrEMBL top hitse value%identityAlignment
A0A0A0LDZ1 C2H2-type domain-containing protein2.7e-22787.01Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPS-AAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQR
        MS+NPFSLLSST T+F     QDANPNPNPKPKPS AAAKKKRNLPGTPDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQR
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPS-AAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQR

Query:  TNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEE
        TNKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEE
Subjt:  TNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEE

Query:  SARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNIS
        SARITTVSATNILNNLRNDS    LLH Q D  QSLIDH     QSLGD+SGLSQF +HSDHFLRDFED QQKNRSPLSLWLNQASA  A+  N+NN+IS
Subjt:  SARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNIS

Query:  NFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGV
        NFFGASSSSSNLFGSI E G+SMLPVMEK++VENKGS    SKAT     ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS NNNN+PLFG+GAFGV
Subjt:  NFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGV

Query:  MSSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMG
        MSSSSS SSS S+NAV SLNSLNKS SLTM DSVQM+GS+SDLSSNCLSQLL+PPN NN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMG
Subjt:  MSSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMG

Query:  LSQFAANH
        LSQFAANH
Subjt:  LSQFAANH

A0A1S3B706 protein indeterminate-domain 91.7e-22686.98Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST T+F    QQDANPNP PKP  +AAAKKKRNLPGTPDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISN
        ARITTVSATNILNNLRNDS    LLH Q D  Q LIDH     QSLGD+SGLSQF +HSDHFLRDFED QQKNRSPLSLWLNQASA  A+  NNNNNISN
Subjt:  ARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISN

Query:  FFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVM
        FFGASSSSSNLFGSI E G+SMLPVMEK++VENKGS    SKAT     ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFG+GAFGVM
Subjt:  FFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVM

Query:  SSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGL
        SSSSS SSS S+NAV SLNS NKS SLTM DSVQM+GS+SDLSSNCLSQLL+PPN NN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGL
Subjt:  SSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGL

Query:  SQFAANH
        SQFAANH
Subjt:  SQFAANH

A0A5D3DNM0 Protein indeterminate-domain 91.7e-22686.98Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST T+F    QQDANPNP PKP  +AAAKKKRNLPGTPDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISN
        ARITTVSATNILNNLRNDS    LLH Q D  Q LIDH     QSLGD+SGLSQF +HSDHFLRDFED QQKNRSPLSLWLNQASA  A+  NNNNNISN
Subjt:  ARITTVSATNILNNLRNDS---ILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQF-SHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISN

Query:  FFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVM
        FFGASSSSSNLFGSI E G+SMLPVMEK++VENKGS    SKAT     ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFG+GAFGVM
Subjt:  FFGASSSSSNLFGSINETGISMLPVMEKDEVENKGS---LSKAT-----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVM

Query:  SSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGL
        SSSSS SSS S+NAV SLNS NKS SLTM DSVQM+GS+SDLSSNCLSQLL+PPN NN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGL
Subjt:  SSSSSSSSSPSNNAV-SLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGL

Query:  SQFAANH
        SQFAANH
Subjt:  SQFAANH

A0A6J1CVG4 protein indeterminate-domain 9-like1.5e-22586.94Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MSSNPFSLLSS PTAFAP  QQDANP+PN KPKPSAAAKKKRNLPG PDP+AEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLID----------HQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNN
        ARITTVSATNILNNLRNDSILL HQQDP QSL+D          H  NNLQSLGDVSGLSQFSHSD FLRD EDQQ KNRSPLSLWLNQASA TA+   N
Subjt:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLID----------HQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNN

Query:  NNNISNFFGASSSSSNLFGSINETGI---SMLPVMEKDEVENKGSLSKAT--ALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGA
        NNNIS+FFG+SSSSSNLFGSINE+GI   S+LPV++K++VENK SLSKAT  ALLSGQSSQSVV SSSPMSATALLQKAALMGSTR SNNNNS LFGAG 
Subjt:  NNNISNFFGASSSSSNLFGSINETGI---SMLPVMEKDEVENKGSLSKAT--ALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGA

Query:  FGVMSSS---SSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATI
        FGVMSSS   SSSSSS S+NAVSLNSLNK+ SLTM DS+QMVG SSDLSSNCLSQ+LL  N NNGMRS+GQTRDFLGVGG GEAPRPPFLPPELAKFA I
Subjt:  FGVMSSS---SSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATI

Query:  NSTMGLSQFAANH
        NSTMGLSQFAANH
Subjt:  NSTMGLSQFAANH

A0A6J1F1R6 protein indeterminate-domain 9-like1.9e-22084.77Show/hide
Query:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MSSNPFSLLSSTPT FA     DANP+PNPKPKPSAAAKKKRNLPGTPDP+AEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKC+KCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGA
        ARITTVSATNILNNLRNDS+LLH Q    QSLIDHQ NNLQ+LGDV  LSQF+HSDHFLRDFED Q KNRSP SLWL           NN+NNISNF+GA
Subjt:  ARITTVSATNILNNLRNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGA

Query:  SSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKAT----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSSSSSSSS
        SSSSSNLFGSI ETG+SMLPV EK++VE KGSL KAT    ALLSGQSS SVVSSSPMSATALLQKAALMGSTRSS NNNSPL  AGAFGVM+SSS SSS
Subjt:  SSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKAT----ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSSSSSSSS

Query:  SPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLP-PNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH
        S S+NAVSLNSLNKS S++M DSVQMVG++SDLSSN LSQLL+P  N NN M+S+ QTRDFLGVGG GEAPRPPFLPPELAKF  INSTMGLSQFAANH
Subjt:  SPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLP-PNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH

SwissProt top hitse value%identityAlignment
Q700D2 Zinc finger protein JACKDAW3.3e-9748.76Show/hide
Query:  NPFSLLSS--------TPTAFAPQQQQDANP--NPNPKPKP-SAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNL
        +PFS+ SS        T      QQ  D NP  NPNP  KP S++AKKKRN PGTPDP+A+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNL
Subjt:  NPFSLLSS--------TPTAFAPQQQQDANP--NPNPKPKP-SAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNL

Query:  PWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAF
        PWKL+QR+ +E IKKKVYICP KTCVHHD SRALGDLTGIKKH+SRKHGEKKWKCEKCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAF
Subjt:  PWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAF

Query:  CDALAEESARITT-------VSATNILNNLRNDSILLHHQQDPQQSL---IDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASA
        CDAL EE AR+++       +S TN+  N  N+S ++++   P   +   + H   N       + +SQF      L    D    +   LS  +  AS 
Subjt:  CDALAEESARITT-------VSATNILNNLRNDSILLHHQQDPQQSL---IDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASA

Query:  ATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKATALLSGQ----SSQSVVSS-----------SPMSATALLQKAALMG
                     + F +SSSS   F   ++  I M        + +  +  + +A L  Q    SS S + S           SPMSATALLQKAA MG
Subjt:  ATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKATALLSGQ----SSQSVVSS-----------SPMSATALLQKAALMG

Query:  STRSSNNNNSPLFGAGAFGVMSSSSSS----SSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVN--NGMRSSGQTRDFLGVGG
        STR SN++ +P F AG     SS+++S    SSSP      LN+ N   ++   +  +     S +S++ +       N +  N  +  G TRDFLGV  
Subjt:  STRSSNNNNSPLFGAGAFGVMSSSSSS----SSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVN--NGMRSSGQTRDFLGVGG

Query:  ---GGEAPRPPFLPPELAKFATI
             +  R PFLP ELA+FA +
Subjt:  ---GGEAPRPPFLPPELAKFATI

Q8H1F5 Protein indeterminate-domain 78.1e-8855.16Show/hide
Query:  PSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PG PDPEAEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHHQQDPQQSLI
        KHF RKHGEKKWKCEKCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR    +  N +    ++S   HH Q  Q    
Subjt:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHHQQDPQQSLI

Query:  DHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSL
             N+ S  ++ G  +   S H  ++           +  WL       + N N N N  N F   +SS N       TG S  P             
Subjt:  DHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSL

Query:  SKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSS
                         S  MSATALLQKAA MGST+S+
Subjt:  SKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSS

Q944L3 Zinc finger protein BALDIBIS1.9e-9755.25Show/hide
Query:  STPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
        S P+    Q+    NPNPNP P  S +AK+KRNLPG PDP+AEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVY
Subjt:  STPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY

Query:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SAT
        ICPEKTCVHHDP+RALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A 
Subjt:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SAT

Query:  NILNN-----LRNDSILLHHQQ---DPQQSLIDH-----QQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLW--LNQASAATAMNNNNNNNI
          LNN     + + +I  +HQQ   +   S +D       +NN+  LG     + F+ S          +  + S  +LW    Q+S    +N NNNNN 
Subjt:  NILNN-----LRNDSILLHHQQ---DPQQSLIDH-----QQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLW--LNQASAATAMNNNNNNNI

Query:  SNFFGASSSSSNLFGSINETGISMLPVMEKDEVEN---KGSLSKATALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSS
                       +I + GIS     E+ E++N    GSL  + A  +  + +Q+    + MSATALLQKAA MGS RSS+++++    +  FG+M+S
Subjt:  SNFFGASSSSSNLFGSINETGISMLPVMEKDEVEN---KGSLSKATALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSS

Q9LRW7 Protein indeterminate-domain 113.1e-8746.23Show/hide
Query:  FAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEK
        F   QQQ            S   KK+RN PG PDPE+EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE 
Subjt:  FAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEK

Query:  TCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNL
        +CVHHDPSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN 
Subjt:  TCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNL

Query:  RNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETG
        + + +L+H     Q +   H  +  Q   +VS  S  SH+ + +              SL  +  +  T  +NN+NN++  F       SN    I    
Subjt:  RNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETG

Query:  ISMLPVMEKDEVENKGSLSKATALLSGQSSQSVVSSSP-MSATALLQKAALMGST------------RSSNNNNSPLFGAGAFGVMSSSSSSSSSPSNNA
         S++P     +     S +   +   G        +SP MSATALLQKAA MGST            RS++NNN     A    +M+S S   SS +NN 
Subjt:  ISMLPVMEKDEVENKGSLSKATALLSGQSSQSVVSSSP-MSATALLQKAALMGST------------RSSNNNNSPLFGAGAFGVMSSSSSSSSSPSNNA

Query:  VSLNSLNKSGSLTMTDSVQMVGSSSD-----LSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        V     N SG     D+     +  D     L +N ++               G TRDFLG+       RP     E+  FA + S +  S
Subjt:  VSLNSLNKSGSLTMTDSVQMVGSSSD-----LSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Q9ZUL3 Protein indeterminate-domain 5, chloroplastic4.9e-8543.54Show/hide
Query:  LLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKK
        LL    +A AP      +  P P P  +   KKKRN P TP+ +AEVIALSPK+LMATNRFICE+CNKGFQR+QNLQLHRRGHNLPWKL+Q++ KE +K+
Subjt:  LLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKK

Query:  KVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----I
        KVY+CPE +CVHHDPSRALGDLTGIKKH+ RKHGEKKWKC+KCSK+YAVQSDWKAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFCDALA+ESAR    +
Subjt:  KVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----I

Query:  TTVSA--------TNILNNLRNDSIL-LHHQQDPQQSLIDHQQNNLQSLGDVSG-----------LSQFSHSDHFLRD----FEDQQQKN----------
        T++ +        TN  NN  +  IL L H   PQ   +DHQ  ++  LG   G           L   + S +F+++    F DQQ  +          
Subjt:  TTVSA--------TNILNNLRNDSIL-LHHQQDPQQSLIDHQQNNLQSLGDVSG-----------LSQFSHSDHFLRD----FEDQQQKN----------

Query:  -----RSPLSLWLNQASAATAMNNNNNNNISNF-------------------FGASSSSSNLF-------------GSINETGISMLPVMEKDEVENKGS
             +SP+S   N    +   +N+  +N+ N                      A+ SS NL              G    TG+    +M   +  + GS
Subjt:  -----RSPLSLWLNQASAATAMNNNNNNNISNF-------------------FGASSSSSNLF-------------GSINETGISMLPVMEKDEVENKGS

Query:  LSKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPL------------FGAGAFGVMSSSSSSSSSPSNNAVSLNSLNKSGSLTMTDSV
        +      L   S QS  S+  MSATALLQKAA MGST S+NNN S              FG+G +G          + SN    +NS +  G+    + V
Subjt:  LSKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPL------------FGAGAFGVMSSSSSSSSSPSNNAVSLNSLNKSGSLTMTDSV

Query:  QMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQ--TRDFLGVG
             S                VN G+ +  Q  TRDFLGVG
Subjt:  QMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQ--TRDFLGVG

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 75.7e-8955.16Show/hide
Query:  PSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIK
        P ++ K+KRN PG PDPEAEV+ALSPK+LMATNRFICE+CNKGFQRDQNLQLH+RGHNLPWKL+QR+NK+ ++KKVY+CPE  CVHH PSRALGDLTGIK
Subjt:  PSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIK

Query:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHHQQDPQQSLI
        KHF RKHGEKKWKCEKCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESAR    +  N +    ++S   HH Q  Q    
Subjt:  KHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHHQQDPQQSLI

Query:  DHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSL
             N+ S  ++ G  +   S H  ++           +  WL       + N N N N  N F   +SS N       TG S  P             
Subjt:  DHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSL

Query:  SKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSS
                         S  MSATALLQKAA MGST+S+
Subjt:  SKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSS

AT2G02070.1 indeterminate(ID)-domain 53.5e-8643.54Show/hide
Query:  LLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKK
        LL    +A AP      +  P P P  +   KKKRN P TP+ +AEVIALSPK+LMATNRFICE+CNKGFQR+QNLQLHRRGHNLPWKL+Q++ KE +K+
Subjt:  LLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKK

Query:  KVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----I
        KVY+CPE +CVHHDPSRALGDLTGIKKH+ RKHGEKKWKC+KCSK+YAVQSDWKAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFCDALA+ESAR    +
Subjt:  KVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----I

Query:  TTVSA--------TNILNNLRNDSIL-LHHQQDPQQSLIDHQQNNLQSLGDVSG-----------LSQFSHSDHFLRD----FEDQQQKN----------
        T++ +        TN  NN  +  IL L H   PQ   +DHQ  ++  LG   G           L   + S +F+++    F DQQ  +          
Subjt:  TTVSA--------TNILNNLRNDSIL-LHHQQDPQQSLIDHQQNNLQSLGDVSG-----------LSQFSHSDHFLRD----FEDQQQKN----------

Query:  -----RSPLSLWLNQASAATAMNNNNNNNISNF-------------------FGASSSSSNLF-------------GSINETGISMLPVMEKDEVENKGS
             +SP+S   N    +   +N+  +N+ N                      A+ SS NL              G    TG+    +M   +  + GS
Subjt:  -----RSPLSLWLNQASAATAMNNNNNNNISNF-------------------FGASSSSSNLF-------------GSINETGISMLPVMEKDEVENKGS

Query:  LSKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPL------------FGAGAFGVMSSSSSSSSSPSNNAVSLNSLNKSGSLTMTDSV
        +      L   S QS  S+  MSATALLQKAA MGST S+NNN S              FG+G +G          + SN    +NS +  G+    + V
Subjt:  LSKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPL------------FGAGAFGVMSSSSSSSSSPSNNAVSLNSLNKSGSLTMTDSV

Query:  QMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQ--TRDFLGVG
             S                VN G+ +  Q  TRDFLGVG
Subjt:  QMVGSSSDLSSNCLSQLLLPPNVNNGMRSSGQ--TRDFLGVG

AT3G13810.1 indeterminate(ID)-domain 112.2e-8846.23Show/hide
Query:  FAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEK
        F   QQQ            S   KK+RN PG PDPE+EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE 
Subjt:  FAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEK

Query:  TCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNL
        +CVHHDPSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN 
Subjt:  TCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNL

Query:  RNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETG
        + + +L+H     Q +   H  +  Q   +VS  S  SH+ + +              SL  +  +  T  +NN+NN++  F       SN    I    
Subjt:  RNDSILLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETG

Query:  ISMLPVMEKDEVENKGSLSKATALLSGQSSQSVVSSSP-MSATALLQKAALMGST------------RSSNNNNSPLFGAGAFGVMSSSSSSSSSPSNNA
         S++P     +     S +   +   G        +SP MSATALLQKAA MGST            RS++NNN     A    +M+S S   SS +NN 
Subjt:  ISMLPVMEKDEVENKGSLSKATALLSGQSSQSVVSSSP-MSATALLQKAALMGST------------RSSNNNNSPLFGAGAFGVMSSSSSSSSSPSNNA

Query:  VSLNSLNKSGSLTMTDSVQMVGSSSD-----LSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        V     N SG     D+     +  D     L +N ++               G TRDFLG+       RP     E+  FA + S +  S
Subjt:  VSLNSLNKSGSLTMTDSVQMVGSSSD-----LSSNCLSQLLLPPNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

AT3G45260.1 C2H2-like zinc finger protein1.4e-9855.25Show/hide
Query:  STPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
        S P+    Q+    NPNPNP P  S +AK+KRNLPG PDP+AEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVY
Subjt:  STPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY

Query:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SAT
        ICPEKTCVHHDP+RALGDLTGIKKHFSRKHGEKKWKC+KCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A 
Subjt:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SAT

Query:  NILNN-----LRNDSILLHHQQ---DPQQSLIDH-----QQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLW--LNQASAATAMNNNNNNNI
          LNN     + + +I  +HQQ   +   S +D       +NN+  LG     + F+ S          +  + S  +LW    Q+S    +N NNNNN 
Subjt:  NILNN-----LRNDSILLHHQQ---DPQQSLIDH-----QQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLW--LNQASAATAMNNNNNNNI

Query:  SNFFGASSSSSNLFGSINETGISMLPVMEKDEVEN---KGSLSKATALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSS
                       +I + GIS     E+ E++N    GSL  + A  +  + +Q+    + MSATALLQKAA MGS RSS+++++    +  FG+M+S
Subjt:  SNFFGASSSSSNLFGSINETGISMLPVMEKDEVEN---KGSLSKATALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSS

AT5G03150.1 C2H2-like zinc finger protein2.3e-9848.76Show/hide
Query:  NPFSLLSS--------TPTAFAPQQQQDANP--NPNPKPKP-SAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNL
        +PFS+ SS        T      QQ  D NP  NPNP  KP S++AKKKRN PGTPDP+A+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGHNL
Subjt:  NPFSLLSS--------TPTAFAPQQQQDANP--NPNPKPKP-SAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNL

Query:  PWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAF
        PWKL+QR+ +E IKKKVYICP KTCVHHD SRALGDLTGIKKH+SRKHGEKKWKCEKCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHRAF
Subjt:  PWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAF

Query:  CDALAEESARITT-------VSATNILNNLRNDSILLHHQQDPQQSL---IDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASA
        CDAL EE AR+++       +S TN+  N  N+S ++++   P   +   + H   N       + +SQF      L    D    +   LS  +  AS 
Subjt:  CDALAEESARITT-------VSATNILNNLRNDSILLHHQQDPQQSL---IDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASA

Query:  ATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKATALLSGQ----SSQSVVSS-----------SPMSATALLQKAALMG
                     + F +SSSS   F   ++  I M        + +  +  + +A L  Q    SS S + S           SPMSATALLQKAA MG
Subjt:  ATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENKGSLSKATALLSGQ----SSQSVVSS-----------SPMSATALLQKAALMG

Query:  STRSSNNNNSPLFGAGAFGVMSSSSSS----SSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVN--NGMRSSGQTRDFLGVGG
        STR SN++ +P F AG     SS+++S    SSSP      LN+ N   ++   +  +     S +S++ +       N +  N  +  G TRDFLGV  
Subjt:  STRSSNNNNSPLFGAGAFGVMSSSSSS----SSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLPPNVN--NGMRSSGQTRDFLGVGG

Query:  ---GGEAPRPPFLPPELAKFATI
             +  R PFLP ELA+FA +
Subjt:  ---GGEAPRPPFLPPELAKFATI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCCACTCCCACTGCCTTTGCTCCACAGCAGCAGCAAGATGCTAACCCTAACCCTAACCCTAAACCGAAACCCTCCGCTGC
GGCAAAGAAGAAGAGAAACCTCCCCGGAACCCCAGATCCAGAGGCAGAGGTTATTGCTTTGTCGCCGAAATCGCTCATGGCGACGAATAGATTCATATGCGAAATTTGCA
ACAAGGGGTTTCAGAGAGATCAGAACCTGCAACTTCACCGACGAGGACATAACCTACCGTGGAAGCTACGACAGCGGACGAACAAGGAGGCAATCAAGAAGAAGGTGTAT
ATTTGCCCGGAGAAGACGTGCGTCCACCACGATCCGTCGCGAGCTCTCGGTGACCTCACCGGAATAAAGAAACATTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATG
TGAGAAGTGTTCTAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGTGATTGTGGAACCCTTTTTTCCAGGAAAG
ACAGCTTCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCAGCCACAAACATTCTCAATAATCTCAGAAATGATTCAATT
CTTCTTCATCATCAACAAGATCCACAGCAATCTTTGATCGATCATCAGCAAAATAATCTTCAATCTCTTGGTGATGTTTCTGGGCTTTCCCAATTCAGTCATTCAGATCA
TTTTTTGAGAGATTTTGAAGATCAGCAGCAGAAGAACAGATCTCCATTGTCACTTTGGCTGAACCAAGCTTCTGCTGCAACTGCAATGAACAATAACAACAACAACAATA
TTTCCAACTTTTTTGGAGCTTCTTCTTCCTCGTCCAATCTTTTCGGATCGATAAACGAAACCGGGATCTCGATGTTGCCGGTGATGGAGAAGGACGAGGTTGAGAATAAG
GGAAGCTTGTCGAAAGCTACCGCACTGTTGTCGGGTCAATCTTCTCAGTCTGTTGTTTCGTCTTCTCCGATGTCGGCGACTGCCCTTCTGCAAAAAGCTGCTCTTATGGG
CTCAACTAGAAGCAGCAACAACAATAATTCTCCGCTCTTCGGAGCGGGCGCTTTCGGAGTAATGAGCTCTTCGTCTTCGTCGTCGTCTTCACCTTCGAATAATGCAGTGA
GTTTGAACTCTCTGAATAAATCTGGAAGCCTGACAATGACGGACTCGGTGCAGATGGTCGGTAGCAGCTCTGACTTGAGCTCGAATTGTCTCAGCCAGCTTCTGCTGCCA
CCAAACGTTAACAACGGTATGAGAAGTAGCGGTCAAACGAGGGACTTCCTCGGTGTCGGGGGAGGAGGAGAAGCGCCTCGGCCTCCGTTCCTCCCACCGGAGCTAGCGAA
ATTCGCCACCATAAACTCAACAATGGGACTAAGCCAATTTGCAGCCAACCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGTAATCCCTTTTCTCTTCTCTCTTCCACTCCCACTGCCTTTGCTCCACAGCAGCAGCAAGATGCTAACCCTAACCCTAACCCTAAACCGAAACCCTCCGCTGC
GGCAAAGAAGAAGAGAAACCTCCCCGGAACCCCAGATCCAGAGGCAGAGGTTATTGCTTTGTCGCCGAAATCGCTCATGGCGACGAATAGATTCATATGCGAAATTTGCA
ACAAGGGGTTTCAGAGAGATCAGAACCTGCAACTTCACCGACGAGGACATAACCTACCGTGGAAGCTACGACAGCGGACGAACAAGGAGGCAATCAAGAAGAAGGTGTAT
ATTTGCCCGGAGAAGACGTGCGTCCACCACGATCCGTCGCGAGCTCTCGGTGACCTCACCGGAATAAAGAAACATTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATG
TGAGAAGTGTTCTAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCCAAAACTTGTGGGACAAGAGAATATAAGTGTGATTGTGGAACCCTTTTTTCCAGGAAAG
ACAGCTTCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCAGCCACAAACATTCTCAATAATCTCAGAAATGATTCAATT
CTTCTTCATCATCAACAAGATCCACAGCAATCTTTGATCGATCATCAGCAAAATAATCTTCAATCTCTTGGTGATGTTTCTGGGCTTTCCCAATTCAGTCATTCAGATCA
TTTTTTGAGAGATTTTGAAGATCAGCAGCAGAAGAACAGATCTCCATTGTCACTTTGGCTGAACCAAGCTTCTGCTGCAACTGCAATGAACAATAACAACAACAACAATA
TTTCCAACTTTTTTGGAGCTTCTTCTTCCTCGTCCAATCTTTTCGGATCGATAAACGAAACCGGGATCTCGATGTTGCCGGTGATGGAGAAGGACGAGGTTGAGAATAAG
GGAAGCTTGTCGAAAGCTACCGCACTGTTGTCGGGTCAATCTTCTCAGTCTGTTGTTTCGTCTTCTCCGATGTCGGCGACTGCCCTTCTGCAAAAAGCTGCTCTTATGGG
CTCAACTAGAAGCAGCAACAACAATAATTCTCCGCTCTTCGGAGCGGGCGCTTTCGGAGTAATGAGCTCTTCGTCTTCGTCGTCGTCTTCACCTTCGAATAATGCAGTGA
GTTTGAACTCTCTGAATAAATCTGGAAGCCTGACAATGACGGACTCGGTGCAGATGGTCGGTAGCAGCTCTGACTTGAGCTCGAATTGTCTCAGCCAGCTTCTGCTGCCA
CCAAACGTTAACAACGGTATGAGAAGTAGCGGTCAAACGAGGGACTTCCTCGGTGTCGGGGGAGGAGGAGAAGCGCCTCGGCCTCCGTTCCTCCCACCGGAGCTAGCGAA
ATTCGCCACCATAAACTCAACAATGGGACTAAGCCAATTTGCAGCCAACCACTAA
Protein sequenceShow/hide protein sequence
MSSNPFSLLSSTPTAFAPQQQQDANPNPNPKPKPSAAAKKKRNLPGTPDPEAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCEKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSI
LLHHQQDPQQSLIDHQQNNLQSLGDVSGLSQFSHSDHFLRDFEDQQQKNRSPLSLWLNQASAATAMNNNNNNNISNFFGASSSSSNLFGSINETGISMLPVMEKDEVENK
GSLSKATALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSSNNNNSPLFGAGAFGVMSSSSSSSSSPSNNAVSLNSLNKSGSLTMTDSVQMVGSSSDLSSNCLSQLLLP
PNVNNGMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH