CuGenDBv2

Tan0020787 (gene) of Snake gourd v1 genome

Gene ID: Tan0020787
Organism: Trichosanthes anguina (Snake gourd v1)
Description: C2H2-type domain-containing protein
Genome location: LG08:71233370..71236644
RNA-Seq Expression: Tan0020787
Synteny: Tan0020787
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains:
  IPR013087 - Zinc finger C2H2-type
  IPR022755 - Zinc finger, double-stranded RNA binding
  IPR033243 - Zinc finger protein JACKDAW-like
  IPR036236 - Zinc finger C2H2 superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004137968.1 zinc finger protein BALDIBIS [Cucumis sativus] (e value: 2.0e-232; % identity: 89.33)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSSTTT F   QD ANPNPNPKPKPS AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF
        ARITTVSATNILNNLRNDS    LLHQQ D HQSLIDH     QSLGD+SGLSQFT HSDHFLRDFED QQKNRSPLSLWLNQASA+NAI N+NN++SNF
Subjt:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF

Query:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS
        FGASSSSSNLFGSITE+GLS+LPV+EKEDVENKGS    SKAT  +A ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS NNNN+PLFG+GAFGVMS
Subjt:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS

Query:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        SSS  SSSSSSNAV +LNSLNKSRS TM DSVQM+GS+S+LSSNCLSQLL+PPNGNN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGLS
Subjt:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Query:  QFAANH
        QFAANH
Subjt:  QFAANH

XP_008442674.1 PREDICTED: protein indeterminate-domain 9 [Cucumis melo] (e value: 2.2e-231; % identity: 89.13)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST T F  QQD    NPNPKPKPS AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF
        ARITTVSATNILNNLRNDS    LLHQQ D HQ LIDH     QSLGD+SGLSQFT HSDHFLRDFED QQKNRSPLSLWLNQASA+NAI NNNNN+SNF
Subjt:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF

Query:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS
        FGASSSSSNLFGSITE+GLS+LPV+EKEDVENKGS    SKAT  +A ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS+NNNNSPLFG+GAFGVMS
Subjt:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS

Query:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        SSS  SSSSSSNAV +LNS NKSRS TMADSVQM+GS+S+LSSNCLSQLL+PPNGNN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGLS
Subjt:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Query:  QFAANH
        QFAANH
Subjt:  QFAANH

XP_022145765.1 protein indeterminate-domain 9-like [Momordica charantia] (e value: 1.6e-229; % identity: 87.87)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MSSNPFSLLSS  T FAPQQD ANP+PN KPKPSAAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSILLHQQDPHQSLID----------HQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNN
        RITTVSATNILNNLRNDSILLHQQDPHQSL+D          H  NNLQSLGDVSGLSQF+HSD FLRD EDQQ KNRSPLSLWLNQASA+ AI  NNNN
Subjt:  RITTVSATNILNNLRNDSILLHQQDPHQSLID----------HQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNN

Query:  LSNFFGASSSSSNLFGSITE---SGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFG
        +S+FFG+SSSSSNLFGSI E   SGLS+LPVI+KEDVENK SLSKATAA ALLSGQSSQSVV SSSPMSATALLQKAALMGSTRSNNNN+S LFGAG FG
Subjt:  LSNFFGASSSSSNLFGSITE---SGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFG

Query:  VMS-----SSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINS
        VMS     SSSSSSSSSNAV+LNSLNK+RS TMADS+QMVG SS+LSSNCLSQ+LL  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPELAKFA INS
Subjt:  VMS-----SSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINS

Query:  TMGLSQFAANH
        TMGLSQFAANH
Subjt:  TMGLSQFAANH

XP_022934107.1 protein indeterminate-domain 9-like [Cucurbita moschata] (e value: 6.7e-225; % identity: 86.67)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MSSNPFSLLSST T FA   D ANP+PNPKPKPSAAAKKKRNLPGTPDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KE IKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSILLHQQDPHQSLIDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSS
        RITTVSATNILNNLRNDS+LLHQQD  QSLIDHQ NNLQ+LGDV  LSQFTHSDHFLRDFED Q KNRSP SLWL          NN+NN+SNF+GASSS
Subjt:  RITTVSATNILNNLRNDSILLHQQDPHQSLIDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSS

Query:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKAT-AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSS-SSSSSS
        SSNLFGSI E+GLS+LPV EKEDVE KGSL KAT +A ALLSGQSS SVVSSSPMSATALLQKAALMGSTRS+ NNNSPL  AGAFGVM+SSS SSSSSS
Subjt:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKAT-AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSS-SSSSSS

Query:  NAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLP-PNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH
        NAV+LNSLNKSRS +MADSVQMVG++S+LSSN LSQLL+P  NGNN M+S+ QTRDFLGVGG GEAPRPPFLPPELAKF  INSTMGLSQFAANH
Subjt:  NAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLP-PNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH

XP_038903191.1 zinc finger protein BALDIBIS-like [Benincasa hispida] (e value: 1.5e-240; % identity: 91.04)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MS+NPFSLLSSTTT FA Q D ANPNPNPKPKPSAA KKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLIDHQQNN-LQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAI---NNNNNNLSNFF
        RITTVSATNILNNLRNDS ILLHQQD  QSLIDH  NN LQSLGD+SGLSQF HSDHFLRDFED Q KNRSPLSLWLNQASA+ A+   NNNNNN+SN F
Subjt:  RITTVSATNILNNLRNDS-ILLHQQDPHQSLIDHQQNN-LQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAI---NNNNNNLSNFF

Query:  GASSSSSNLFGSITESGLSILPVIEKEDVENKG--SLSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSS
        GASSSSSNLFGSITE+GLS+LPVIEKEDVENKG  +LSKAT  +A ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSS
Subjt:  GASSSSSNLFGSITESGLSILPVIEKEDVENKG--SLSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSS

Query:  SSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVM-RSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAA
        SSSSSSSNAV+LNSLNKSRS TMADSVQM+G++S+LSSNCLSQLL+P NGNNVM RSSGQTRDFLGV GGGEAPRPPFLPPELAKFATINST+GLSQFAA
Subjt:  SSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVM-RSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAA

Query:  NH
        NH
Subjt:  NH

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LDZ1 C2H2-type domain-containing protein (e value: 9.5e-233; % identity: 89.33)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSSTTT F   QD ANPNPNPKPKPS AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF
        ARITTVSATNILNNLRNDS    LLHQQ D HQSLIDH     QSLGD+SGLSQFT HSDHFLRDFED QQKNRSPLSLWLNQASA+NAI N+NN++SNF
Subjt:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF

Query:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS
        FGASSSSSNLFGSITE+GLS+LPV+EKEDVENKGS    SKAT  +A ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS NNNN+PLFG+GAFGVMS
Subjt:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS

Query:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        SSS  SSSSSSNAV +LNSLNKSRS TM DSVQM+GS+S+LSSNCLSQLL+PPNGNN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGLS
Subjt:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Query:  QFAANH
        QFAANH
Subjt:  QFAANH

A0A1S3B706 protein indeterminate-domain 9 (e value: 1.0e-231; % identity: 89.13)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST T F  QQD    NPNPKPKPS AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF
        ARITTVSATNILNNLRNDS    LLHQQ D HQ LIDH     QSLGD+SGLSQFT HSDHFLRDFED QQKNRSPLSLWLNQASA+NAI NNNNN+SNF
Subjt:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF

Query:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS
        FGASSSSSNLFGSITE+GLS+LPV+EKEDVENKGS    SKAT  +A ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS+NNNNSPLFG+GAFGVMS
Subjt:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS

Query:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        SSS  SSSSSSNAV +LNS NKSRS TMADSVQM+GS+S+LSSNCLSQLL+PPNGNN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGLS
Subjt:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Query:  QFAANH
        QFAANH
Subjt:  QFAANH

A0A5D3DNM0 Protein indeterminate-domain 9 (e value: 1.0e-231; % identity: 89.13)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
        MS+NPFSLLSST T F  QQD    NPNPKPKPS AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPS-AAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRT

Query:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
        NKE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES
Subjt:  NKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEES

Query:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF
        ARITTVSATNILNNLRNDS    LLHQQ D HQ LIDH     QSLGD+SGLSQFT HSDHFLRDFED QQKNRSPLSLWLNQASA+NAI NNNNN+SNF
Subjt:  ARITTVSATNILNNLRNDS---ILLHQQ-DPHQSLIDHQQNNLQSLGDVSGLSQFT-HSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNF

Query:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS
        FGASSSSSNLFGSITE+GLS+LPV+EKEDVENKGS    SKAT  +A ALLSGQSSQSVVSSSPMSATALLQKAALMGSTRS+NNNNSPLFG+GAFGVMS
Subjt:  FGASSSSSNLFGSITESGLSILPVIEKEDVENKGS---LSKAT--AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMS

Query:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
        SSS  SSSSSSNAV +LNS NKSRS TMADSVQM+GS+S+LSSNCLSQLL+PPNGNN MRSSGQTRDFLGV GGGEAPRPPFLPPELAKF TINSTMGLS
Subjt:  SSS--SSSSSSNAV-NLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Query:  QFAANH
        QFAANH
Subjt:  QFAANH

A0A6J1CVG4 protein indeterminate-domain 9-like (e value: 7.5e-230; % identity: 87.87)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MSSNPFSLLSS  T FAPQQD ANP+PN KPKPSAAAKKKRNLPG PDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KE IKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSILLHQQDPHQSLID----------HQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNN
        RITTVSATNILNNLRNDSILLHQQDPHQSL+D          H  NNLQSLGDVSGLSQF+HSD FLRD EDQQ KNRSPLSLWLNQASA+ AI  NNNN
Subjt:  RITTVSATNILNNLRNDSILLHQQDPHQSLID----------HQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNN

Query:  LSNFFGASSSSSNLFGSITE---SGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFG
        +S+FFG+SSSSSNLFGSI E   SGLS+LPVI+KEDVENK SLSKATAA ALLSGQSSQSVV SSSPMSATALLQKAALMGSTRSNNNN+S LFGAG FG
Subjt:  LSNFFGASSSSSNLFGSITE---SGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVV-SSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFG

Query:  VMS-----SSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINS
        VMS     SSSSSSSSSNAV+LNSLNK+RS TMADS+QMVG SS+LSSNCLSQ+LL  NGNN MRS+GQTRDFLGVGG GEAPRPPFLPPELAKFA INS
Subjt:  VMS-----SSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINS

Query:  TMGLSQFAANH
        TMGLSQFAANH
Subjt:  TMGLSQFAANH

A0A6J1F1R6 protein indeterminate-domain 9-like (e value: 3.3e-225; % identity: 86.67)
Query:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
        MSSNPFSLLSST T FA   D ANP+PNPKPKPSAAAKKKRNLPGTPDPDAEV+ALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN
Subjt:  MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTN

Query:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
        KE IKKKVYICPEKTCVHHDPSRALGDLTG+KKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA
Subjt:  KEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESA

Query:  RITTVSATNILNNLRNDSILLHQQDPHQSLIDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSS
        RITTVSATNILNNLRNDS+LLHQQD  QSLIDHQ NNLQ+LGDV  LSQFTHSDHFLRDFED Q KNRSP SLWL          NN+NN+SNF+GASSS
Subjt:  RITTVSATNILNNLRNDSILLHQQDPHQSLIDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSS

Query:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKAT-AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSS-SSSSSS
        SSNLFGSI E+GLS+LPV EKEDVE KGSL KAT +A ALLSGQSS SVVSSSPMSATALLQKAALMGSTRS+ NNNSPL  AGAFGVM+SSS SSSSSS
Subjt:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKAT-AAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSS-SSSSSS

Query:  NAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLP-PNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH
        NAV+LNSLNKSRS +MADSVQMVG++S+LSSN LSQLL+P  NGNN M+S+ QTRDFLGVGG GEAPRPPFLPPELAKF  INSTMGLSQFAANH
Subjt:  NAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLP-PNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH

SwissProt top hits (columns: e value, % identity, alignment)
Q700D2 Zinc finger protein JACKDAW (e value: 6.6e-98; % identity: 48.54)
Query:  NPFSLLSSTTTGFAPQQDH--------------ANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGH
        +PFS +SS+  GF  Q+ H              +NPNPN KP  S++AKKKRN PGTPDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGH
Subjt:  NPFSLLSSTTTGFAPQQDH--------------ANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGH

Query:  NLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHR
        NLPWKL+QR+ +E IKKKVYICP KTCVHHD SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHR
Subjt:  NLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHR

Query:  AFCDALAEESARITT-------VSATNILNNLRNDSILLHQQD-PHQSL---IDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQA
        AFCDAL EE AR+++       +S TN+  N  N+S +++  + PH  +   + H   N       + +SQF      L    D    +   LS  +  A
Subjt:  AFCDALAEESARITT-------VSATNILNNLRNDSILLHQQD-PHQSL---IDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQA

Query:  SADN--AINNNNNNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSNNNN
        S  N     +++++L +F G       +  +     LS     ++     +    K ++   L S  S ++     SPMSATALLQKAA MGSTRS N++
Subjt:  SADN--AINNNNNNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSNNNN

Query:  NSPLFGAGAFGVMSSSSSS---SSSSNAVNLNSLNKSRSFTMADSVQMVGSS-SNLSSNCLSQLLLPPN--GNNVMRSSGQTRDFLGVGG---GGEAPRP
         +P F AG     SS+++S    SSS  +    LN   +  + ++        S +S++ +       N  G N  +  G TRDFLGV       +  R 
Subjt:  NSPLFGAGAFGVMSSSSSS---SSSSNAVNLNSLNKSRSFTMADSVQMVGSS-SNLSSNCLSQLLLPPN--GNNVMRSSGQTRDFLGVGG---GGEAPRP

Query:  PFLPPELAKFATI
        PFLP ELA+FA +
Subjt:  PFLPPELAKFATI

Q944L3 Zinc finger protein BALDIBIS (e value: 1.8e-95; % identity: 54.14)
Query:  QDHANPNPNPKPKP--SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCV
        Q+H  PNPNP P P  S +AK+KRNLPG PDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCV
Subjt:  QDHANPNPNPKPKP--SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCV

Query:  HHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLR-
        HHDP+RALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN   
Subjt:  HHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLR-

Query:  --------NDSILLHQQDPHQSLIDH-----QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLW--LNQASADNAINNNNNNLSNFFGASSS
                N +    Q +   S +D       +NN+  LG     + F  S          +  + S  +LW    Q+S    +N NNNN +N      S
Subjt:  --------NDSILLHQQDPHQSLIDH-----QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLW--LNQASADNAINNNNNNLSNFFGASSS

Query:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSN
         +                 E ++V + GSL  + A     +   +   ++S  MSATALLQKAA MGS RS++++++    +  FG+M+S  ++  + N
Subjt:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSN

Q9LRW7 Protein indeterminate-domain 11 (e value: 4.7e-88; % identity: 46.72)
Query:  SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PG PDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLIDH
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H     H
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLIDH

Query:  QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSSSSNLFGSITESGLSILP--VIEKEDVENKGSLS
          +  Q   +VS  S  +H+ + +           + L    N  + +N+ NN+NN+L  F       SN    I     SI+P  +  +       + +
Subjt:  QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSSSSNLFGSITESGLSILP--VIEKEDVENKGSLS

Query:  KATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGST------------RSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKS------RSFT
         +        G  S   ++S  MSATALLQKAA MGST            RS +NNN     A A     S   SS+++N V     N S      R   
Subjt:  KATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGST------------RSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKS------RSFT

Query:  MADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
          D+      ++ +++   S+      G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  MADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

Q9LVQ7 Zinc finger protein ENHYDROUS (e value: 5.2e-87; % identity: 47.13)
Query:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
        SST +G A      N N  PK    +  KKKRNLPG PDPDAEVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKLRQR+ KE ++KKVY
Subjt:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY

Query:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATN
        +CP   CVHHDPSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAHSK CGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESA+  T S   
Subjt:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATN

Query:  ILNNLRNDSILLHQQDP----HQSLIDHQQNNLQSLGDVSGLSQFTHSDHF-------LRDFEDQQQKNRSP--------LSLWLNQASAD--NAINNNN
            +   +  + Q+ P        +        ++     +S  T S          +++  + Q+ N  P         ++  N +S+D  N  +NNN
Subjt:  ILNNLRNDSILLHQQDP----HQSLIDHQQNNLQSLGDVSGLSQFTHSDHF-------LRDFEDQQQKNRSP--------LSLWLNQASAD--NAINNNN

Query:  NNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSP-MSATALLQKAALMGSTRSNNNNNSPLFGAGAFGV
           +  F +S++S +L+ S T S     P    E +    S + +     +       + +   P MSATALLQKAA MGST S     S L G    G+
Subjt:  NNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSP-MSATALLQKAALMGSTRSNNNNNSPLFGAGAFGV

Query:  MSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVG
        +S++SSS   SN    ++L+ +    +       GS S L         L    ++V      T DFLG+G
Subjt:  MSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVG

Q9ZUL3 Protein indeterminate-domain 5, chloroplastic (e value: 8.3e-85; % identity: 45.79)
Query:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
        SS      P   H  P P   P  +   KKKRN P TP+ DAEVIALSPK+LMATNRFICE+CNKGFQR+QNLQLHRRGHNLPWKL+Q++ KE +K+KVY
Subjt:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY

Query:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----ITTV
        +CPE +CVHHDPSRALGDLTGIKKH+ RKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFCDALA+ESAR    +T++
Subjt:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----ITTV

Query:  SA--------TNILNNLRNDSI--LLHQQDPHQSLIDHQQNNLQSLGDVSG-----------LSQFTHSDHFLRD----FEDQQQKN-------------
         +        TN  NN  +  I  L H   P    +DHQ  ++  LG   G           L     S +F+++    F DQQ  +             
Subjt:  SA--------TNILNNLRNDSI--LLHQQDPHQSLIDHQQNNLQSLGDVSG-----------LSQFTHSDHFLRD----FEDQQQKN-------------

Query:  --RSPLSLWLN--QASADN---AINN--NNNNLSNFFGASSSSSN----LFGSITESGLSILPVIEKEDVENKGS-----------LSKA----TAAVAL
          +SP+S   N  Q S DN   A +N  N + LS   G +S++SN       +++   L I    + E+    G            +S A    + +V  
Subjt:  --RSPLSLWLN--QASADN---AINN--NNNNLSNFFGASSSSSN----LFGSITESGLSILPVIEKEDVENKGS-----------LSKA----TAAVAL

Query:  LSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPP
        L   S QS  S+  MSATALLQKAA MGST SNNNN S           ++++SS   S    +   N+S    + +S    G++ N+  N +       
Subjt:  LSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPP

Query:  NGNNVMRSS---GQTRDFLGVG
         G N   S+     TRDFLGVG
Subjt:  NGNNVMRSS---GQTRDFLGVG

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G02070.1 indeterminate(ID)-domain 5 (e value: 5.9e-86; % identity: 45.79)
Query:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
        SS      P   H  P P   P  +   KKKRN P TP+ DAEVIALSPK+LMATNRFICE+CNKGFQR+QNLQLHRRGHNLPWKL+Q++ KE +K+KVY
Subjt:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY

Query:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----ITTV
        +CPE +CVHHDPSRALGDLTGIKKH+ RKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFCDALA+ESAR    +T++
Subjt:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESAR----ITTV

Query:  SA--------TNILNNLRNDSI--LLHQQDPHQSLIDHQQNNLQSLGDVSG-----------LSQFTHSDHFLRD----FEDQQQKN-------------
         +        TN  NN  +  I  L H   P    +DHQ  ++  LG   G           L     S +F+++    F DQQ  +             
Subjt:  SA--------TNILNNLRNDSI--LLHQQDPHQSLIDHQQNNLQSLGDVSG-----------LSQFTHSDHFLRD----FEDQQQKN-------------

Query:  --RSPLSLWLN--QASADN---AINN--NNNNLSNFFGASSSSSN----LFGSITESGLSILPVIEKEDVENKGS-----------LSKA----TAAVAL
          +SP+S   N  Q S DN   A +N  N + LS   G +S++SN       +++   L I    + E+    G            +S A    + +V  
Subjt:  --RSPLSLWLN--QASADN---AINN--NNNNLSNFFGASSSSSN----LFGSITESGLSILPVIEKEDVENKGS-----------LSKA----TAAVAL

Query:  LSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPP
        L   S QS  S+  MSATALLQKAA MGST SNNNN S           ++++SS   S    +   N+S    + +S    G++ N+  N +       
Subjt:  LSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPP

Query:  NGNNVMRSS---GQTRDFLGVG
         G N   S+     TRDFLGVG
Subjt:  NGNNVMRSS---GQTRDFLGVG

AT3G13810.1 indeterminate(ID)-domain 11 (e value: 3.4e-89; % identity: 46.72)
Query:  SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKK
        S   KK+RN PG PDP++EVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKL+QR+NKE I+KKVY+CPE +CVHHDPSRALGDLTGIKK
Subjt:  SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKK

Query:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLIDH
        HF RKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDCGTLFSR+DSFITHRAFC+ALAEE+AR   +      NN + + +L+HQ   H     H
Subjt:  HFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSILLHQQDPHQSLIDH

Query:  QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSSSSNLFGSITESGLSILP--VIEKEDVENKGSLS
          +  Q   +VS  S  +H+ + +           + L    N  + +N+ NN+NN+L  F       SN    I     SI+P  +  +       + +
Subjt:  QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSSSSNLFGSITESGLSILP--VIEKEDVENKGSLS

Query:  KATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGST------------RSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKS------RSFT
         +        G  S   ++S  MSATALLQKAA MGST            RS +NNN     A A     S   SS+++N V     N S      R   
Subjt:  KATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGST------------RSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKS------RSFT

Query:  MADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS
          D+      ++ +++   S+      G       G TRDFLG+       RP     E+  FA + S +  S
Subjt:  MADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLS

AT3G45260.1 C2H2-like zinc finger protein (e value: 1.3e-96; % identity: 54.14)
Query:  QDHANPNPNPKPKP--SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCV
        Q+H  PNPNP P P  S +AK+KRNLPG PDPDAEVIALSP SLM TNRFICE+CNKGF+RDQNLQLHRRGHNLPWKL+QRTNKE +KKKVYICPEKTCV
Subjt:  QDHANPNPNPKPKP--SAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYICPEKTCV

Query:  HHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLR-
        HHDP+RALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAV SDWKAHSK CGT+EY+CDCGTLFSRKDSFITHRAFCDALAEESAR  +V  A   LNN   
Subjt:  HHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTV-SATNILNNLR-

Query:  --------NDSILLHQQDPHQSLIDH-----QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLW--LNQASADNAINNNNNNLSNFFGASSS
                N +    Q +   S +D       +NN+  LG     + F  S          +  + S  +LW    Q+S    +N NNNN +N      S
Subjt:  --------NDSILLHQQDPHQSLIDH-----QQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLW--LNQASADNAINNNNNNLSNFFGASSS

Query:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSN
         +                 E ++V + GSL  + A     +   +   ++S  MSATALLQKAA MGS RS++++++    +  FG+M+S  ++  + N
Subjt:  SSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSN

AT5G03150.1 C2H2-like zinc finger protein (e value: 4.7e-99; % identity: 48.54)
Query:  NPFSLLSSTTTGFAPQQDH--------------ANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGH
        +PFS +SS+  GF  Q+ H              +NPNPN KP  S++AKKKRN PGTPDPDA+VIALSP +LMATNRF+CEICNKGFQRDQNLQLHRRGH
Subjt:  NPFSLLSSTTTGFAPQQDH--------------ANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGH

Query:  NLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHR
        NLPWKL+QR+ +E IKKKVYICP KTCVHHD SRALGDLTGIKKH+SRKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSRKDSFITHR
Subjt:  NLPWKLRQRTNKEAIKKKVYICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHR

Query:  AFCDALAEESARITT-------VSATNILNNLRNDSILLHQQD-PHQSL---IDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQA
        AFCDAL EE AR+++       +S TN+  N  N+S +++  + PH  +   + H   N       + +SQF      L    D    +   LS  +  A
Subjt:  AFCDALAEESARITT-------VSATNILNNLRNDSILLHQQD-PHQSL---IDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQA

Query:  SADN--AINNNNNNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSNNNN
        S  N     +++++L +F G       +  +     LS     ++     +    K ++   L S  S ++     SPMSATALLQKAA MGSTRS N++
Subjt:  SADN--AINNNNNNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQS-SQSVVSSSPMSATALLQKAALMGSTRSNNNN

Query:  NSPLFGAGAFGVMSSSSSS---SSSSNAVNLNSLNKSRSFTMADSVQMVGSS-SNLSSNCLSQLLLPPN--GNNVMRSSGQTRDFLGVGG---GGEAPRP
         +P F AG     SS+++S    SSS  +    LN   +  + ++        S +S++ +       N  G N  +  G TRDFLGV       +  R 
Subjt:  NSPLFGAGAFGVMSSSSSS---SSSSNAVNLNSLNKSRSFTMADSVQMVGSS-SNLSSNCLSQLLLPPN--GNNVMRSSGQTRDFLGVGG---GGEAPRP

Query:  PFLPPELAKFATI
        PFLP ELA+FA +
Subjt:  PFLPPELAKFATI

AT5G66730.1 C2H2-like zinc finger protein  e value: 3.7e-88  %identity: 47.13
Query:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY
        SST +G A      N N  PK    +  KKKRNLPG PDPDAEVIALSPK+LMATNRF+CEICNKGFQRDQNLQLHRRGHNLPWKLRQR+ KE ++KKVY
Subjt:  SSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVY

Query:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATN
        +CP   CVHHDPSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAHSK CGT+EYKCDCGTLFSR+DSFITHRAFCDALAEESA+  T S   
Subjt:  ICPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATN

Query:  ILNNLRNDSILLHQQDP----HQSLIDHQQNNLQSLGDVSGLSQFTHSDHF-------LRDFEDQQQKNRSP--------LSLWLNQASAD--NAINNNN
            +   +  + Q+ P        +        ++     +S  T S          +++  + Q+ N  P         ++  N +S+D  N  +NNN
Subjt:  ILNNLRNDSILLHQQDP----HQSLIDHQQNNLQSLGDVSGLSQFTHSDHF-------LRDFEDQQQKNRSP--------LSLWLNQASAD--NAINNNN

Query:  NNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSP-MSATALLQKAALMGSTRSNNNNNSPLFGAGAFGV
           +  F +S++S +L+ S T S     P    E +    S + +     +       + +   P MSATALLQKAA MGST S     S L G    G+
Subjt:  NNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSLSKATAAVALLSGQSSQSVVSSSP-MSATALLQKAALMGSTRSNNNNNSPLFGAGAFGV

Query:  MSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVG
        +S++SSS   SN    ++L+ +    +       GS S L         L    ++V      T DFLG+G
Subjt:  MSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPNGNNVMRSSGQTRDFLGVG


Sequences
CDS sequence
ATGTCAAGTAATCCTTTTTCTCTTCTCTCTTCCACTACCACTGGCTTTGCTCCACAACAAGATCATGCTAACCCTAATCCTAACCCTAAACCGAAACCCTCCGCCGCAGC
GAAGAAGAAGAGAAACCTCCCCGGAACCCCAGATCCAGATGCGGAGGTTATTGCTTTGTCGCCGAAATCGCTCATGGCGACGAATAGATTCATATGCGAAATTTGTAACA
AGGGGTTTCAGAGAGACCAAAACCTGCAGCTTCACCGACGAGGGCACAACCTCCCGTGGAAGCTGCGACAACGGACGAACAAGGAGGCGATCAAGAAGAAGGTGTATATT
TGCCCGGAGAAGACATGTGTCCACCACGATCCGTCGCGAGCCCTCGGCGACCTCACCGGAATAAAGAAACATTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATGTGA
CAAGTGTTCGAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCACTCTAAAACTTGTGGGACTAGAGAATATAAGTGTGATTGTGGAACCCTTTTTTCCAGGAAAGACA
GCTTCATAACCCACAGAGCATTTTGCGATGCCTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCAGCAACAAATATTCTCAATAATCTCAGAAATGATTCAATTCTT
CTTCATCAACAAGATCCTCACCAATCTTTGATTGATCATCAACAAAATAATCTTCAATCTCTTGGAGATGTTTCTGGGCTTTCCCAATTCACTCATTCAGATCATTTTTT
GAGAGATTTTGAAGATCAACAACAGAAGAACAGATCTCCATTGTCACTTTGGTTGAACCAAGCTTCTGCTGATAATGCAATCAACAACAACAATAATAATCTTTCCAACT
TTTTTGGAGCTTCTTCTTCCTCTTCCAATCTTTTCGGATCGATAACCGAAAGCGGGCTTTCGATCTTGCCAGTGATTGAGAAGGAAGATGTCGAGAATAAGGGAAGTTTG
TCGAAAGCTACGGCGGCGGTGGCGCTGTTGTCGGGTCAATCTTCTCAGTCTGTTGTCTCTTCTTCTCCGATGTCGGCCACTGCCCTTCTGCAAAAAGCTGCTCTTATGGG
CTCTACTAGAAGCAACAACAATAATAATTCTCCGCTCTTCGGAGCGGGTGCTTTCGGAGTAATGAGCTCTTCGTCTTCGTCGTCGTCGTCTTCAAATGCAGTGAACTTGA
ACTCTCTTAATAAATCTAGAAGCTTCACAATGGCTGACTCGGTGCAAATGGTGGGTAGTAGCTCTAACTTGAGCTCGAATTGTCTCAGCCAACTTCTGCTACCACCCAAC
GGTAATAATGTTATGAGAAGTAGCGGTCAAACGAGAGACTTCCTCGGAGTCGGGGGAGGAGGAGAAGCACCTCGGCCACCGTTCCTCCCACCGGAGCTAGCGAAATTTGC
CACCATAAACTCAACAATGGGACTAAGCCAATTCGCCGCCAATCACTAA
mRNA sequence
TGGAGTTTGAGCTCAATACACATTCTAGCTAGTTACAAAGCATATGTATATAGAAGAGAGAGAGAGAGAGAGAGAGAGAGAAAAAGAAATATTCAATATAAAAAAAGAGT
AAAATAAAAAGGAAGAGATATTCTGTTTTAAGAGCAAAACATGTTGGTTTTTGGTGTTTTCTTAAAAGCAAACAAAAGAGGGAAAAACCCAGATCTTCATCTGGAAAGCT
CTATTTTTCTTGTTTGATCATATATAAATTTTTATTTTCTCTCTTCAAGGTCTTTTGTTCAGATTCAAAGTCTTCAAAACTTTGGTTTTTACTGTCTTTTGGCAGAACCC
TAGCTAGTCTCAAACAGATCATCATTCTTCATCATCATCATCATCCTTTTACATAACAAAAAACATCCAAGGTTTTTTTAGCTGTTTGTCTGCAAAAAGAAAAAATTAAA
AATCTTTGGTTTTTTCTTGTTTGTTCTTGTCTAAACCCATCTTCAATCACCTCCAAAATAAGACCCCAAATAAGAAATCATGTCAAGTAATCCTTTTTCTCTTCTCTCTT
CCACTACCACTGGCTTTGCTCCACAACAAGATCATGCTAACCCTAATCCTAACCCTAAACCGAAACCCTCCGCCGCAGCGAAGAAGAAGAGAAACCTCCCCGGAACCCCA
GATCCAGATGCGGAGGTTATTGCTTTGTCGCCGAAATCGCTCATGGCGACGAATAGATTCATATGCGAAATTTGTAACAAGGGGTTTCAGAGAGACCAAAACCTGCAGCT
TCACCGACGAGGGCACAACCTCCCGTGGAAGCTGCGACAACGGACGAACAAGGAGGCGATCAAGAAGAAGGTGTATATTTGCCCGGAGAAGACATGTGTCCACCACGATC
CGTCGCGAGCCCTCGGCGACCTCACCGGAATAAAGAAACATTTCAGCCGGAAACACGGCGAGAAGAAGTGGAAATGTGACAAGTGTTCGAAGAAATATGCTGTTCAATCT
GATTGGAAAGCTCACTCTAAAACTTGTGGGACTAGAGAATATAAGTGTGATTGTGGAACCCTTTTTTCCAGGAAAGACAGCTTCATAACCCACAGAGCATTTTGCGATGC
CTTAGCTGAAGAAAGTGCAAGAATCACAACAGTTTCAGCAACAAATATTCTCAATAATCTCAGAAATGATTCAATTCTTCTTCATCAACAAGATCCTCACCAATCTTTGA
TTGATCATCAACAAAATAATCTTCAATCTCTTGGAGATGTTTCTGGGCTTTCCCAATTCACTCATTCAGATCATTTTTTGAGAGATTTTGAAGATCAACAACAGAAGAAC
AGATCTCCATTGTCACTTTGGTTGAACCAAGCTTCTGCTGATAATGCAATCAACAACAACAATAATAATCTTTCCAACTTTTTTGGAGCTTCTTCTTCCTCTTCCAATCT
TTTCGGATCGATAACCGAAAGCGGGCTTTCGATCTTGCCAGTGATTGAGAAGGAAGATGTCGAGAATAAGGGAAGTTTGTCGAAAGCTACGGCGGCGGTGGCGCTGTTGT
CGGGTCAATCTTCTCAGTCTGTTGTCTCTTCTTCTCCGATGTCGGCCACTGCCCTTCTGCAAAAAGCTGCTCTTATGGGCTCTACTAGAAGCAACAACAATAATAATTCT
CCGCTCTTCGGAGCGGGTGCTTTCGGAGTAATGAGCTCTTCGTCTTCGTCGTCGTCGTCTTCAAATGCAGTGAACTTGAACTCTCTTAATAAATCTAGAAGCTTCACAAT
GGCTGACTCGGTGCAAATGGTGGGTAGTAGCTCTAACTTGAGCTCGAATTGTCTCAGCCAACTTCTGCTACCACCCAACGGTAATAATGTTATGAGAAGTAGCGGTCAAA
CGAGAGACTTCCTCGGAGTCGGGGGAGGAGGAGAAGCACCTCGGCCACCGTTCCTCCCACCGGAGCTAGCGAAATTTGCCACCATAAACTCAACAATGGGACTAAGCCAA
TTCGCCGCCAATCACTAAAGTCTAAAATCCAATCCTCACCCCATCAAAATTGAAGTGCAGGGCGGCGGAAAACGGTCAGAAGGGGCCGGCGGCGGGGCCACCGGTGAATG
TTTAAAGTTGACAGTTTTCAGAGAGTAGTAAAAAGGGCCATGCATGTAACTTTTTTTTTCACTTCCTCTTGTATTTTATATTTTGTCGTTTGAACTTTTTTTTTCACCCT
TTTCGTAATTTCGCTTTCAAATTATTGCATTTGCCAGAGAAAACTACTTCTATATATATATATATATATATAAATATAAATATAAATATATATTTGACTAGTGGTCAAAT
TAATTTGGCCGGCAGCCACCACCAGGTCAACG
Protein sequence
MSSNPFSLLSSTTTGFAPQQDHANPNPNPKPKPSAAAKKKRNLPGTPDPDAEVIALSPKSLMATNRFICEICNKGFQRDQNLQLHRRGHNLPWKLRQRTNKEAIKKKVYI
CPEKTCVHHDPSRALGDLTGIKKHFSRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRKDSFITHRAFCDALAEESARITTVSATNILNNLRNDSIL
LHQQDPHQSLIDHQQNNLQSLGDVSGLSQFTHSDHFLRDFEDQQQKNRSPLSLWLNQASADNAINNNNNNLSNFFGASSSSSNLFGSITESGLSILPVIEKEDVENKGSL
SKATAAVALLSGQSSQSVVSSSPMSATALLQKAALMGSTRSNNNNNSPLFGAGAFGVMSSSSSSSSSSNAVNLNSLNKSRSFTMADSVQMVGSSSNLSSNCLSQLLLPPN
GNNVMRSSGQTRDFLGVGGGGEAPRPPFLPPELAKFATINSTMGLSQFAANH