; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014953 (gene) of Snake gourd v1 genome

Gene IDTan0014953
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein indeterminate-domain 7-like
Genome locationLG06:1498890..1503050
RNA-Seq ExpressionTan0014953
SyntenyTan0014953
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141684.2 protein indeterminate-domain 7 [Cucumis sativus]2.2e-21376.18Show/hide
Query:  MMMKGNFLS---QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        MMMKGNFLS   QQQ+VVM+ENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
Subjt:  MMMKGNFLS---QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
        GFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD

Query:  CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSH
        CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNS    +   NN+ ALKRDF + N++NNN +LR EIP WLQ +SD    + VGS  
Subjt:  CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSH

Query:  DQPQQQQHHQHHQTLNPNP--NPSGVGCG------------PSNLLPP-PAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTG
              Q   + +T+NPNP  N S  GCG            P+N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTG
Subjt:  DQPQQQQHHQHHQTLNPNP--NPSGVGCG------------PSNLLPP-PAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTG

Query:  NFGQIGLSSREGEMGR--------GRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAAF
        NFG+IGL S + E+GR        G GGAVSCSSSSCTDYGNKAAA  SASA ASASASA +TFLHD+I NSLSSP   S SHP  L   +SSF D  AF
Subjt:  NFGQIGLSSREGEMGR--------GRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAAF

Query:  AAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
        AAMHHH     V        ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NL  QIQKPWQG
Subjt:  AAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

XP_022954034.1 protein indeterminate-domain 7-like [Cucurbita moschata]5.0e-21376.79Show/hide
Query:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLSQQQ      V+M+ENLSNLTSASGEATASVSSA      N YFAPQ+Q      PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE+SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFV
        YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPL+SSYN          NNNNNN F +KRDFSN N      N+RAEIP WL SN DLR EIF+
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFV

Query:  GSSHDQPQQQQHHQHHQTLNPN------PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQ
        GS  D    +  HQ+H+TLNPN       +  G GCG S  LPPP +YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++GQ
Subjt:  GSSHDQPQQQQHHQHHQTLNPN------PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQ

Query:  IGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATAT
        IG            GGAVSC SSSCTDYG+KAAAV         SAS P+TFLHDMINSLS   SASASHPFL D SSFNDV AF AMHHH    TAT T
Subjt:  IGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATAT

Query:  VTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
        VTA      ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NLQSQIQKPWQG
Subjt:  VTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

XP_022992237.1 protein indeterminate-domain 7-like [Cucurbita maxima]6.1e-21176.3Show/hide
Query:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVC
        MMMKGNFLSQQQ      V+M+ENLSNLTSASGEATASVSSA      N Y APQ+Q       PPPPKKKRNLPGNPDPDAEVIALSP+TLMATNRFVC
Subjt:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVC

Query:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
        EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE+SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
Subjt:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK

Query:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIF
        EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNP++SSY           NNNNNN F +KRDFSN+N      N+RAEIP WL SN DLR EIF
Subjt:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIF

Query:  VGSSHDQPQQQQHHQHHQTLNPN--PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGL
        +GS  D    +  HQ+H+TLNPN   +  G GCG S  LPPP +YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++G IG 
Subjt:  VGSSHDQPQQQQHHQHHQTLNPN--PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGL

Query:  SSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATATVTA
                   GGAVSC SSSCTDYG+KAAAV         SAS P+TFLHDMINSLS   SASASHPFL D SSFNDV AF AMHHH    TAT TVTA
Subjt:  SSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATATVTA

Query:  ANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
              ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NLQSQIQKPWQG
Subjt:  ANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

XP_023547451.1 protein indeterminate-domain 7-like [Cucurbita pepo subsp. pepo]1.7e-21376.48Show/hide
Query:  MMMKGNFLSQQQ----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ---------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF
        MMMKGNFLSQQQ     V+M+ENLSNLTSASGEATASVSSA      N YF PQ+Q          PPPPKKKRNLPGNPDPDAEVIALSP+TLMATNRF
Subjt:  MMMKGNFLSQQQ----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ---------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE+SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAE
        TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPL+SSYN+NNNN            F +KRDFSN N      N+RAEIP WL SN DLR E
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAE

Query:  IFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGL
        IF+GS  D    +  HQ+H+TLNPN +  G GCG S  LPPP +YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++GQIG 
Subjt:  IFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGL

Query:  SSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATATVTA
                   GGAVSC SSSCTDYG+KAAAV         SAS P+TFLHDMINSLS   SASASHPFL D SSFNDV AF AMHHH    TAT TVTA
Subjt:  SSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATATVTA

Query:  ANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
              ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NLQSQIQKPWQG
Subjt:  ANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

XP_038898698.1 protein indeterminate-domain 7-like [Benincasa hispida]1.1e-23683.09Show/hide
Query:  MMMKGNFLS---QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ--QPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
        MMMKGNFLS   QQQ+VVM+ENLSNLTSASGEATASVSSANKTEFPNQYFAPQ+Q   PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
Subjt:  MMMKGNFLS---QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ--QPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
        FQRDQNLQLHRRGHNLPWKLKQR+NKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC

Query:  GTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-HNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSH
        GTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN +NNNNSQEF NNNN   FALKRDF+NNN+N    NLRAEIP WLQ  SDLRAEI +GS H
Subjt:  GTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-HNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSH

Query:  DQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGLSSREGEMG
        ++        +H+TLNPNPNPSG GCGP++ LPPPAYQS    SPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKL+HVSTGN+G++GL SRE EMG
Subjt:  DQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGLSSREGEMG

Query:  RGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFND----VAAFAAMHHHTATATVTAANVNVNAS
        RG GGAVSCSSSSCTDYGNKA     A+ANASASASA +TFLHDMINSLSSP   S SHPFL  +SSFND     AAF+AMHHHTA    T A   V  S
Subjt:  RGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFND----VAAFAAMHHHTATATVTAANVNVNAS

Query:  GGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVP-NSANLQSQIQKPWQG
        G RSDGLTRDFLGLRPLSHGDILS+TGFGNCIVP NS+NLQ+QIQKPWQG
Subjt:  GGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVP-NSANLQSQIQKPWQG

TrEMBL top hitse value%identityAlignment
A0A0A0K7W2 C2H2-type domain-containing protein1.1e-21376.18Show/hide
Query:  MMMKGNFLS---QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        MMMKGNFLS   QQQ+VVM+ENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
Subjt:  MMMKGNFLS---QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
        GFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD

Query:  CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSH
        CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNS    +   NN+ ALKRDF + N++NNN +LR EIP WLQ +SD    + VGS  
Subjt:  CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSH

Query:  DQPQQQQHHQHHQTLNPNP--NPSGVGCG------------PSNLLPP-PAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTG
              Q   + +T+NPNP  N S  GCG            P+N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTG
Subjt:  DQPQQQQHHQHHQTLNPNP--NPSGVGCG------------PSNLLPP-PAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTG

Query:  NFGQIGLSSREGEMGR--------GRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAAF
        NFG+IGL S + E+GR        G GGAVSCSSSSCTDYGNKAAA  SASA ASASASA +TFLHD+I NSLSSP   S SHP  L   +SSF D  AF
Subjt:  NFGQIGLSSREGEMGR--------GRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAAF

Query:  AAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
        AAMHHH     V        ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NL  QIQKPWQG
Subjt:  AAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

A0A1S3CGT8 protein indeterminate-domain 74.7e-20974.91Show/hide
Query:  MMMKGNFLS------QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS      QQQ+VVM+ENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS------QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNN--NNIFALKRDFSN----NNSNNNNTNLRAEIPSWLQSNSDLR
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN+   S ++   NN+ ALKRDF N    NN+NNNN +LR EIP WLQ +SD  
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNN--NNIFALKRDFSN----NNSNNNNTNLRAEIPSWLQSNSDLR

Query:  AEIFVGSSHDQPQQQQHHQHHQTLNPNPNPS------------GVGCG-PSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHK
          + VGS        Q   + +T+NPNP+ S            GVG G P+N  P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH 
Subjt:  AEIFVGSSHDQPQQQQHHQHHQTLNPNPNPS------------GVGCG-PSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHK

Query:  LVHVSTGNFGQIGLSSREGEMGR-GRGGAVSCSSSSCTDYGNKAAAV-VSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAA
        L+HVSTGNFG++GL S + E+GR G GGAVSCSSSSCTDYGNKAAA   SASA+ASASASA +TFLHD+I NSLSSP   S+SHP  L   +SSF D  A
Subjt:  LVHVSTGNFGQIGLSSREGEMGR-GRGGAVSCSSSSCTDYGNKAAAV-VSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAA

Query:  FAAM------HHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
        FAA+      HHH  T   T A     ASGGR+DGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NL  QIQKPWQG
Subjt:  FAAM------HHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

A0A6J1GRP3 protein indeterminate-domain 7-like2.4e-21376.79Show/hide
Query:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLSQQQ      V+M+ENLSNLTSASGEATASVSSA      N YFAPQ+Q      PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE+SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFV
        YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPL+SSYN          NNNNNN F +KRDFSN N      N+RAEIP WL SN DLR EIF+
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFV

Query:  GSSHDQPQQQQHHQHHQTLNPN------PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQ
        GS  D    +  HQ+H+TLNPN       +  G GCG S  LPPP +YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++GQ
Subjt:  GSSHDQPQQQQHHQHHQTLNPN------PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQ

Query:  IGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATAT
        IG            GGAVSC SSSCTDYG+KAAAV         SAS P+TFLHDMINSLS   SASASHPFL D SSFNDV AF AMHHH    TAT T
Subjt:  IGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATAT

Query:  VTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
        VTA      ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NLQSQIQKPWQG
Subjt:  VTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

A0A6J1JP83 protein indeterminate-domain 7-like2.9e-21176.3Show/hide
Query:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVC
        MMMKGNFLSQQQ      V+M+ENLSNLTSASGEATASVSSA      N Y APQ+Q       PPPPKKKRNLPGNPDPDAEVIALSP+TLMATNRFVC
Subjt:  MMMKGNFLSQQQ-----VVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVC

Query:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
        EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE+SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
Subjt:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK

Query:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIF
        EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNP++SSY           NNNNNN F +KRDFSN+N      N+RAEIP WL SN DLR EIF
Subjt:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIF

Query:  VGSSHDQPQQQQHHQHHQTLNPN--PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGL
        +GS  D    +  HQ+H+TLNPN   +  G GCG S  LPPP +YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++G IG 
Subjt:  VGSSHDQPQQQQHHQHHQTLNPN--PNPSGVGCGPSNLLPPP-AYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGL

Query:  SSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATATVTA
                   GGAVSC SSSCTDYG+KAAAV         SAS P+TFLHDMINSLS   SASASHPFL D SSFNDV AF AMHHH    TAT TVTA
Subjt:  SSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHH----TATATVTA

Query:  ANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
              ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NLQSQIQKPWQG
Subjt:  ANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

E5GCB4 Nucleic acid binding protein8.0e-20974.91Show/hide
Query:  MMMKGNFLS------QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS      QQQ+VVM+ENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS------QQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNN--NNIFALKRDFSN----NNSNNNNTNLRAEIPSWLQSNSDLR
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN+   S ++   NN+ ALKRDF N    NN+NNNN +LR EIP WLQ +SD  
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNSQEFSNNNN--NNIFALKRDFSN----NNSNNNNTNLRAEIPSWLQSNSDLR

Query:  AEIFVGSSHDQPQQQQHHQHHQTLNPNPNPS------------GVGCG-PSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHK
          + VGS        Q   + +T+NPNP+ S            GVG G P+N  P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH 
Subjt:  AEIFVGSSHDQPQQQQHHQHHQTLNPNPNPS------------GVGCG-PSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHK

Query:  LVHVSTGNFGQIGLSSREGEMGR-GRGGAVSCSSSSCTDYGNKAAAV-VSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAA
        L+HVSTGNFG++GL S + E+GR G GGAVSCSSSSCTDYGNKAAA   SASA+ASASASA +TFLHD+I NSLSSP   S SHP  L   +SSF D  A
Subjt:  LVHVSTGNFGQIGLSSREGEMGR-GRGGAVSCSSSSCTDYGNKAAAV-VSASANASASASAPSTFLHDMI-NSLSSPNSASASHPFLLD--SSSFNDVAA

Query:  FAAM------HHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG
        FAA+      HHH  T   T A     ASGGR+DGLTRDFLGLRPLSHGDILS+TGFGNCIVPNS+NL  QIQKPWQG
Subjt:  FAAM------HHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG

SwissProt top hitse value%identityAlignment
Q8H1F5 Protein indeterminate-domain 71.0e-9958.42Show/hide
Query:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFP----NQYFAPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
        MM +     QQQ   MEEN+SNLTSASG+  ASVSS N+TE      NQ+   Q   P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+CNKG
Subjt:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFP----NQYFAPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
        FQRDQNLQLH+RGHNLPWKLKQRSNK+V++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGTKEY+CDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC

Query:  GTLFSRRDSFITHRAFCDALADESARSAMALNPLL----SSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG
        GTLFSRRDSFITHRAFCDALA+ESAR+    NP++    +S +H+++ +Q+       NI      FS+++ N             + SNS+L       
Subjt:  GTLFSRRDSFITHRAFCDALADESARSAMALNPLL----SSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG

Query:  SSHDQPQQQQHHQHHQTLNP-----NPNPSGVGCGPSNLLPPPA------YQSSSLSSPHISATALLQKAAQMGATMSST
          H   +Q++   H+Q + P     NPNP+G      NL PP A        S    SP +SATALLQKAAQMG+T S+T
Subjt:  SSHDQPQQQQHHQHHQTLNP-----NPNPSGVGCGPSNLLPPPA------YQSSSLSSPHISATALLQKAAQMGATMSST

Q944L3 Zinc finger protein BALDIBIS1.1e-8553.13Show/hide
Query:  QYFAPQTQQPPPP------KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVH
        ++ AP     P P      K+KRNLPGNPDPDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQR+NKE +KKKVY+CPE +CVH
Subjt:  QYFAPQTQQPPPP------KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVH

Query:  HDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESAR-------SAMALNPLLS
        HDP+RALGDLTGIKKHF RKHGEKKWKCDKCSKKYAV SDWKAHSKICGTKEYRCDCGTLFSR+DSFITHRAFCDALA+ESAR        A   N L  
Subjt:  HDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESAR-------SAMALNPLLS

Query:  SYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQ-----------QQHHQHHQTLNPNPNPS----
          NH N N        N     L +   N N NN      A +   L +N      +F  SS   P+            Q    H   LN N N +    
Subjt:  SYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQ-----------QQHHQHHQTLNPNPNPS----

Query:  --GVGCGPSNLLPPPAYQSSSLSSPH-----------------ISATALLQKAAQMGATMSSTTTTS
          G+              + SL S                   +SATALLQKAAQMG+  SS+++++
Subjt:  --GVGCGPSNLLPPPAYQSSSLSSPH-----------------ISATALLQKAAQMGATMSSTTTTS

Q9LRW7 Protein indeterminate-domain 111.7e-10748Show/hide
Query:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF
        MM K   L Q Q    +EN+SNLTSASG+  ASVSS N TE     + P  QQ                 KK+RN PGNPDP++EVIALSPKTLMATNRF
Subjt:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNN
        TKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H+        N +S   S++N+N I +L  D +N N+NN+N
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNN

Query:  TNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSM
         N    + ++         +  +   H         Q H   + NPNPS  G G  +L         SL+SP +SATALLQKAAQMG+T +     + + 
Subjt:  TNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSM

Query:  PRPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDV
         R       ST N                                       + +   +A  ++PS F+    N+  L    +AS       + +  +  
Subjt:  PRPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDV

Query:  AAFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG
          F   +  TA     AA    +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NS+       KPWQG
Subjt:  AAFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG

Q9LVQ7 Zinc finger protein ENHYDROUS6.3e-8648.69Show/hide
Query:  MEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL
        M  +L N ++ SG+  ASVSS       NQ   P++      KKKRNLPG PDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL
Subjt:  MEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL

Query:  KQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDAL
        +QRS KEV +KKVYVCP + CVHHDPSRALGDLTGIKKHFCRKHGEKKWKC+KCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDAL
Subjt:  KQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDAL

Query:  ADESARS---------------------------------------AMALNP------------LLSSYNHNNNNSQEFSNNNNNNIFALKR--------
        A+ESA++                                       ++A+ P            ++SS      NS E   NNN+    ++         
Subjt:  ADESARS---------------------------------------AMALNP------------LLSSYNHNNNNSQEFSNNNNNNIFALKR--------

Query:  ----DFSNNNSNNNNTNLRAEIPSW----LQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQ
            D SN++SNNN       + S     L ++S     +F  SS  +P          +L  + NPS  G     +  PP + +     P +SATALLQ
Subjt:  ----DFSNNNSNNNNTNLRAEIPSW----LQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQ

Query:  KAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGLSSREG-EMGRGRGGAVSCSS
        KAAQMG+T S      GS+ R   +V  ++ +   + LS+ +   +  G G  + CSS
Subjt:  KAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGLSSREG-EMGRGRGGAVSCSS

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 11.2e-8755.01Show/hide
Query:  MEENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        M  +L N ++ SGEA+ S+SS  N+   PN             KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
Subjt:  MEENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
        L+Q+SNKEV KKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDA
Subjt:  LKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA

Query:  LADESARS----AMALNP-LLSSYNHNNNN-----SQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG---SSHDQPQQQQ
        LA+E+ARS    +   NP +L+  N   N        E +   +++   +K+    + S      +  E P     N      +F G   SS   P    
Subjt:  LADESARS----AMALNP-LLSSYNHNNNN-----SQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG---SSHDQPQQQQ

Query:  HHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSS------PHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL
             ++L      S     P +L    ++ SS L S      P +SATALLQKAAQMGA  S           S+T+TS     PH L
Subjt:  HHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSS------PHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 77.1e-10158.42Show/hide
Query:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFP----NQYFAPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
        MM +     QQQ   MEEN+SNLTSASG+  ASVSS N+TE      NQ+   Q   P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+CNKG
Subjt:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFP----NQYFAPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
        FQRDQNLQLH+RGHNLPWKLKQRSNK+V++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGTKEY+CDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC

Query:  GTLFSRRDSFITHRAFCDALADESARSAMALNPLL----SSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG
        GTLFSRRDSFITHRAFCDALA+ESAR+    NP++    +S +H+++ +Q+       NI      FS+++ N             + SNS+L       
Subjt:  GTLFSRRDSFITHRAFCDALADESARSAMALNPLL----SSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG

Query:  SSHDQPQQQQHHQHHQTLNP-----NPNPSGVGCGPSNLLPPPA------YQSSSLSSPHISATALLQKAAQMGATMSST
          H   +Q++   H+Q + P     NPNP+G      NL PP A        S    SP +SATALLQKAAQMG+T S+T
Subjt:  SSHDQPQQQQHHQHHQTLNP-----NPNPSGVGCGPSNLLPPPA------YQSSSLSSPHISATALLQKAAQMGATMSST

AT3G13810.1 indeterminate(ID)-domain 111.2e-10848Show/hide
Query:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF
        MM K   L Q Q    +EN+SNLTSASG+  ASVSS N TE     + P  QQ                 KK+RN PGNPDP++EVIALSPKTLMATNRF
Subjt:  MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNN
        TKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H+        N +S   S++N+N I +L  D +N N+NN+N
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNN

Query:  TNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSM
         N    + ++         +  +   H         Q H   + NPNPS  G G  +L         SL+SP +SATALLQKAAQMG+T +     + + 
Subjt:  TNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSM

Query:  PRPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDV
         R       ST N                                       + +   +A  ++PS F+    N+  L    +AS       + +  +  
Subjt:  PRPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDV

Query:  AAFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG
          F   +  TA     AA    +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NS+       KPWQG
Subjt:  AAFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG

AT3G13810.2 indeterminate(ID)-domain 116.0e-10045.99Show/hide
Query:  LSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFV
        L Q Q    +EN+SNLTSASG+  ASVSS N TE     + P  QQ    ++++    +                     P++EVIALSPKTLMATNRFV
Subjt:  LSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFV

Query:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT
        CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGT
Subjt:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT

Query:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNNT
        KEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H+        N +S   S++N+N I +L  D +N N+NN+N 
Subjt:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNNT

Query:  NLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMP
        N    + ++         +  +   H         Q H   + NPNPS  G G  +L         SL+SP +SATALLQKAAQMG+T +     + +  
Subjt:  NLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMP

Query:  RPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDVA
        R       ST N                                       + +   +A  ++PS F+    N+  L    +AS       + +  +   
Subjt:  RPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDVA

Query:  AFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG
         F   +  TA     AA    +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NS+       KPWQG
Subjt:  AFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG

AT3G13810.3 indeterminate(ID)-domain 114.3e-9846.09Show/hide
Query:  LSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        +SNLTSASG+  ASVSS N TE     + P  QQ    ++++    +                     P++EVIALSPKTLMATNRFVCEICNKGFQRDQ
Subjt:  LSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS
        NLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGTKEYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS

Query:  RRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQS
        RRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H+        N +S   S++N+N I +L  D +N N+NN+N N    + ++   
Subjt:  RRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNHN--------NNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQS

Query:  NSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN
              +  +   H         Q H   + NPNPS  G G  +L         SL+SP +SATALLQKAAQMG+T +     + +  R       ST N
Subjt:  NSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN

Query:  FGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDVAAFAAMHHHTATA
                                               + +   +A  ++PS F+    N+  L    +AS       + +  +    F   +  TA  
Subjt:  FGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMINS--LSSPNSASASHPFLLDSSSFNDVAAFAAMHHHTATA

Query:  TVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG
           AA    +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NS+       KPWQG
Subjt:  TVTAANVNVNASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSANLQSQIQKPWQG

AT3G50700.1 indeterminate(ID)-domain 28.2e-8955.01Show/hide
Query:  MEENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        M  +L N ++ SGEA+ S+SS  N+   PN             KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
Subjt:  MEENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
        L+Q+SNKEV KKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDA
Subjt:  LKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA

Query:  LADESARS----AMALNP-LLSSYNHNNNN-----SQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG---SSHDQPQQQQ
        LA+E+ARS    +   NP +L+  N   N        E +   +++   +K+    + S      +  E P     N      +F G   SS   P    
Subjt:  LADESARS----AMALNP-LLSSYNHNNNN-----SQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVG---SSHDQPQQQQ

Query:  HHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSS------PHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL
             ++L      S     P +L    ++ SS L S      P +SATALLQKAAQMGA  S           S+T+TS     PH L
Subjt:  HHQHHQTLNPNPNPSGVGCGPSNLLPPPAYQSSSLSS------PHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGAAAGGCAATTTTTTGTCTCAACAACAAGTAGTAGTCATGGAGGAAAATTTGTCCAATTTGACTTCTGCTTCTGGTGAAGCTACTGCTAGTGTTTCTTCTGC
CAATAAAACTGAATTCCCCAATCAATATTTTGCTCCTCAAACCCAACAACCACCTCCTCCCAAGAAGAAGCGTAATCTCCCAGGAAATCCCGACCCAGATGCGGAAGTGA
TAGCGTTATCGCCGAAAACACTGATGGCGACGAATCGATTCGTGTGCGAGATCTGCAACAAGGGGTTTCAGAGAGATCAAAATCTTCAACTACATAGAAGAGGACACAAT
CTTCCATGGAAGCTGAAGCAGAGATCAAACAAAGAAGTGATAAAGAAGAAGGTATATGTTTGTCCAGAAGCAAGCTGTGTCCACCATGATCCATCAAGAGCACTTGGAGA
CCTTACAGGAATAAAGAAGCACTTCTGCAGAAAGCACGGTGAAAAGAAGTGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGAAAGCTCATTCCA
AGATCTGTGGCACTAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGAAGAGATAGTTTCATTACACACAGAGCCTTCTGTGATGCATTAGCAGATGAAAGTGCA
AGATCAGCCATGGCATTAAACCCTCTTCTTTCTTCTTACAACCACAACAATAATAATTCACAAGAATTCTCAAATAATAATAATAATAATATTTTCGCTCTCAAACGAGA
CTTCAGCAACAACAACAGCAACAACAACAACACTAATTTGAGAGCAGAAATCCCGTCATGGCTGCAATCGAATTCTGATCTCCGAGCGGAAATCTTCGTCGGGAGCAGCC
ACGACCAGCCTCAACAACAACAACATCATCAACATCATCAAACCCTAAACCCTAACCCTAATCCTAGCGGAGTCGGGTGTGGGCCCAGTAATCTTCTTCCACCTCCTGCT
TATCAATCGTCTTCTCTTTCATCTCCTCATATTTCAGCCACTGCGCTGCTGCAGAAGGCAGCCCAAATGGGTGCGACGATGAGTAGTACCACTACCACGAGTGGCTCTAT
GCCAAGGCCCCACAAGCTGGTTCACGTGTCTACAGGTAATTTTGGACAGATAGGTTTAAGCTCACGTGAAGGTGAAATGGGCAGAGGAAGAGGAGGGGCCGTCAGTTGTA
GTAGTAGTAGTTGTACTGATTATGGGAATAAAGCTGCTGCTGTTGTTTCTGCTTCTGCTAATGCTTCTGCTTCTGCTTCTGCACCAAGCACTTTTCTTCATGACATGATA
AATTCCCTCTCTTCTCCTAATTCTGCTTCTGCTTCTCATCCCTTCCTCCTAGATTCCTCTTCCTTTAACGACGTCGCAGCGTTCGCCGCTATGCATCATCACACCGCTAC
CGCTACCGTTACCGCTGCCAACGTCAACGTCAATGCTTCGGGCGGTCGAAGCGACGGTTTAACGAGAGATTTCTTGGGGCTTCGCCCTCTCTCCCATGGAGATATTCTAA
GCATTACTGGTTTTGGAAATTGTATTGTTCCTAATTCTGCCAATCTTCAGAGCCAAATCCAAAAGCCATGGCAGGGTTAG
mRNA sequenceShow/hide mRNA sequence
TTTGTGATTACATTACACAGGGTTTCTTCAATTCTCATCCATATCTCTCTCTTTCTTCTTTTTCTCTCTCTAGAATTTCAAGAACTGGGTTTCAAATTTTCGGGTTCATA
TATTATTTTTGAGTGGCTATTATATAGTTTGAAGATTTTAATTTTCTCTTTTCTGGGTTATCATATGAATTTATTTTTTAAAAAAAGTTCAGTTTTTGAGTTCTCTCTCT
CTTTCTTAAACAGCAGGCTTGCTTGTTTCTTTTGAGCTTTCAAAAAAAACTAAAAGTTGTCTCTTTGGTTGGAGCCAAATCAAAGATTGAATGCAGTAAAGAGCACAAAT
ATATCAAATTTTGTTTTGTTTTTCTTTTTCTTTTTTTTTTTTCCTCTTCTCTATTTTGTCTTGAATTAGATCTTGGAAAGTTTTGTTAATTACTTCAAATATTCAAGATT
GTTTTGTTTCTTCTTGTTCTTGTTCTTGTTCTTTGTCTCTGTTAGATCTGTTGTTGTCTCATGTGGAAACCTTAGGAATTTCAAACTACTGTAAGAGGGTTATTTGTTCA
TAAAGATGATGATGAAAGGCAATTTTTTGTCTCAACAACAAGTAGTAGTCATGGAGGAAAATTTGTCCAATTTGACTTCTGCTTCTGGTGAAGCTACTGCTAGTGTTTCT
TCTGCCAATAAAACTGAATTCCCCAATCAATATTTTGCTCCTCAAACCCAACAACCACCTCCTCCCAAGAAGAAGCGTAATCTCCCAGGAAATCCCGACCCAGATGCGGA
AGTGATAGCGTTATCGCCGAAAACACTGATGGCGACGAATCGATTCGTGTGCGAGATCTGCAACAAGGGGTTTCAGAGAGATCAAAATCTTCAACTACATAGAAGAGGAC
ACAATCTTCCATGGAAGCTGAAGCAGAGATCAAACAAAGAAGTGATAAAGAAGAAGGTATATGTTTGTCCAGAAGCAAGCTGTGTCCACCATGATCCATCAAGAGCACTT
GGAGACCTTACAGGAATAAAGAAGCACTTCTGCAGAAAGCACGGTGAAAAGAAGTGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGAAAGCTCA
TTCCAAGATCTGTGGCACTAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGAAGAGATAGTTTCATTACACACAGAGCCTTCTGTGATGCATTAGCAGATGAAA
GTGCAAGATCAGCCATGGCATTAAACCCTCTTCTTTCTTCTTACAACCACAACAATAATAATTCACAAGAATTCTCAAATAATAATAATAATAATATTTTCGCTCTCAAA
CGAGACTTCAGCAACAACAACAGCAACAACAACAACACTAATTTGAGAGCAGAAATCCCGTCATGGCTGCAATCGAATTCTGATCTCCGAGCGGAAATCTTCGTCGGGAG
CAGCCACGACCAGCCTCAACAACAACAACATCATCAACATCATCAAACCCTAAACCCTAACCCTAATCCTAGCGGAGTCGGGTGTGGGCCCAGTAATCTTCTTCCACCTC
CTGCTTATCAATCGTCTTCTCTTTCATCTCCTCATATTTCAGCCACTGCGCTGCTGCAGAAGGCAGCCCAAATGGGTGCGACGATGAGTAGTACCACTACCACGAGTGGC
TCTATGCCAAGGCCCCACAAGCTGGTTCACGTGTCTACAGGTAATTTTGGACAGATAGGTTTAAGCTCACGTGAAGGTGAAATGGGCAGAGGAAGAGGAGGGGCCGTCAG
TTGTAGTAGTAGTAGTTGTACTGATTATGGGAATAAAGCTGCTGCTGTTGTTTCTGCTTCTGCTAATGCTTCTGCTTCTGCTTCTGCACCAAGCACTTTTCTTCATGACA
TGATAAATTCCCTCTCTTCTCCTAATTCTGCTTCTGCTTCTCATCCCTTCCTCCTAGATTCCTCTTCCTTTAACGACGTCGCAGCGTTCGCCGCTATGCATCATCACACC
GCTACCGCTACCGTTACCGCTGCCAACGTCAACGTCAATGCTTCGGGCGGTCGAAGCGACGGTTTAACGAGAGATTTCTTGGGGCTTCGCCCTCTCTCCCATGGAGATAT
TCTAAGCATTACTGGTTTTGGAAATTGTATTGTTCCTAATTCTGCCAATCTTCAGAGCCAAATCCAAAAGCCATGGCAGGGTTAGATTCGTAAATTTGCCAACTTTTTTA
CTTTCTTTTTTTCTTTTTTTCTCTAAGAATTGGGAAGAAGAAGATTGTATTAATTCTCATTATAAGAGATCATGAAGAACCATTTTAATTTTCTCACCTCTTTTTCCACG
TAAAAAACTTTATGGAATGTTCATCATTTGTAAGCTTAATTATATATATGACTCTTTTTATATGGGAAGAATATTTGAATATATTGAAATACAATACATGTGCTTGATGG
CTGATCA
Protein sequenceShow/hide protein sequence
MMMKGNFLSQQQVVVMEENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHN
LPWKLKQRSNKEVIKKKVYVCPEASCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESA
RSAMALNPLLSSYNHNNNNSQEFSNNNNNNIFALKRDFSNNNSNNNNTNLRAEIPSWLQSNSDLRAEIFVGSSHDQPQQQQHHQHHQTLNPNPNPSGVGCGPSNLLPPPA
YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQIGLSSREGEMGRGRGGAVSCSSSSCTDYGNKAAAVVSASANASASASAPSTFLHDMI
NSLSSPNSASASHPFLLDSSSFNDVAAFAAMHHHTATATVTAANVNVNASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSANLQSQIQKPWQG