; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G00740 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G00740
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein indeterminate-domain 7
Genome locationClcChr09:583081..585921
RNA-Seq ExpressionClc09G00740
SyntenyClc09G00740
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34113.1 nucleic acid binding protein [Cucumis melo subsp. melo]5.1e-21876.8Show/hide
Query:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR
        MMMKGNFLS    QQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNR
Subjt:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR

Query:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
        FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
Subjt:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC

Query:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW
        GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNN+    NN+N       NN ALKRDF+N+++S+NNNN    LR EIPPW
Subjt:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW

Query:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP
        LQPS     +++GS G+DEN     N ET+NPNP+     GCGA+  SV      P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRP
Subjt:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP

Query:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP
        H LLHVSTGNFGE+GLWS +VE+GR   GGGGGAVSCSSSSCTDYGNKA        ASASASASA TTFLHD+I NSLSSPS SHP F    NSSF D 
Subjt:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP

Query:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        A FAA+        HHHT  P      TTA  SGGR+DGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

KAE8646512.1 hypothetical protein Csa_016174 [Cucumis sativus]4.0e-21578.53Show/hide
Query:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRR
        MDENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   Q PPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRR
Subjt:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRR

Query:  GHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFIT
        GHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFIT
Subjt:  GHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFIT

Query:  HRAFCDALADESARSAMALNPLLSSYN-NNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPHNNHNHETLNP
        HRAFCDALADESARSAMALNPLLSSYN NNNNS    +   NN ALKRDF++ N+S+NNN+LR EIPPWLQPS     +++GSG        +N ET+NP
Subjt:  HRAFCDALADESARSAMALNPLLSSYN-NNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPHNNHNHETLNP

Query:  NPNP-----GCGA--------TSVPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGR----G
        NP+      GCGA           P      C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGEIGLWS +VE+GR    G
Subjt:  NPNP-----GCGA--------TSVPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGR----G

Query:  GGGGGGGAVSCSSSSCTDYGNKA-----ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAMHHHTAAPAPAINTTTAATSGGRS
        GGGGGGGAVSCSSSSCTDYGNKA     A+ASASASA TTFLHD+I NSLSSPS SHP F    NSSF D A FAAMHHH       I  TTA  SGGRS
Subjt:  GGGGGGGAVSCSSSSCTDYGNKA-----ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAMHHHTAAPAPAINTTTAATSGGRS

Query:  DGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        DGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  DGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

XP_004141684.2 protein indeterminate-domain 7 [Cucumis sativus]1.2e-22279.04Show/hide
Query:  MMMKGNFLS-QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVC
        MMMKGNFLS QQQQQIVVMDENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   Q PPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNRFVC
Subjt:  MMMKGNFLS-QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVC

Query:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
        EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
Subjt:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK

Query:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMG
        EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNNS    +   NN ALKRDF++ N+S+NNN+LR EIPPWLQPS     +++G
Subjt:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMG

Query:  SGEDENPHNNHNHETLNPNPNP-----GCGA--------TSVPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFG
        SG        +N ET+NPNP+      GCGA           P      C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFG
Subjt:  SGEDENPHNNHNHETLNPNPNP-----GCGA--------TSVPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFG

Query:  EIGLWSREVEMGR----GGGGGGGGAVSCSSSSCTDYGNKA-----ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAMHHHTA
        EIGLWS +VE+GR    GGGGGGGGAVSCSSSSCTDYGNKA     A+ASASASA TTFLHD+I NSLSSPS SHP F    NSSF D A FAAMHHH  
Subjt:  EIGLWSREVEMGR----GGGGGGGGAVSCSSSSCTDYGNKA-----ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAMHHHTA

Query:  APAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
             I  TTA  SGGRSDGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  APAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

XP_008462349.1 PREDICTED: protein indeterminate-domain 7 [Cucumis melo]3.0e-21876.8Show/hide
Query:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR
        MMMKGNFLS    QQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNR
Subjt:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR

Query:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
        FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
Subjt:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC

Query:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW
        GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNN+    NN+N       NN ALKRDF+N+++S+NNNN    LR EIPPW
Subjt:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW

Query:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP
        LQPS     +++GS G+DEN     N ET+NPNP+     GCGA+  SV      P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRP
Subjt:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP

Query:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP
        H LLHVSTGNFGE+GLWS +VE+GR   GGGGGAVSCSSSSCTDYGNKA        ASASASASA TTFLHD+I NSLSSPS+SHP F    NSSF D 
Subjt:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP

Query:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        A FAA+        HHHT  P      TTA  SGGR+DGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

XP_038898698.1 protein indeterminate-domain 7-like [Benincasa hispida]2.1e-24886.84Show/hide
Query:  MMMKGNFLS-QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ--PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLS QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQ+Q  PPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLS-QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ--PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLKQR+NKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSY--NNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMG
        YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSY  NNNNNSQEFL  NNNNFALKRDFNNN    NNNNLRAEIPPWLQPSDLRAEILMG
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSY--NNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMG

Query:  SGEDENPHNNHNHETLNPNPNP---GCGATS-VPPPAYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEM
        SG     H  HNHETLNPNPNP   GCG TS +PPPAYQSC   SPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGN+GE+GLWSREVEM
Subjt:  SGEDENPHNNHNHETLNPNPNP---GCGATS-VPPPAYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEM

Query:  GRGGGGGGGGAVSCSSSSCTDYGNKA--ASASASASAPTTFLHDMINSLSSPSASHPFFHDNSSFND----PATFAAMHHHTAAPAPAINTTTAATSGGR
        GR     GGGAVSCSSSSCTDYGNKA  A+ASASASA TTFLHDMINSLSSPS SHPF   NSSFND     A F+AMHHHT AP PA    +A  SG R
Subjt:  GRGGGGGGGGAVSCSSSSCTDYGNKA--ASASASASAPTTFLHDMINSLSSPSASHPFFHDNSSFND----PATFAAMHHHTAAPAPAINTTTAATSGGR

Query:  SDGLTRDFLGLRPLSHGDILSLTGFGSCIVP-NSSNLHSQIQKPWQG
        SDGLTRDFLGLRPLSHGDILSLTGFG+CIVP NSSNL +QIQKPWQG
Subjt:  SDGLTRDFLGLRPLSHGDILSLTGFGSCIVP-NSSNLHSQIQKPWQG

TrEMBL top hitse value%identityAlignment
A0A0A0K7W2 C2H2-type domain-containing protein5.6e-22379.04Show/hide
Query:  MMMKGNFLS-QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVC
        MMMKGNFLS QQQQQIVVMDENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   Q PPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNRFVC
Subjt:  MMMKGNFLS-QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVC

Query:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
        EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK
Subjt:  EICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTK

Query:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMG
        EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNNS    +   NN ALKRDF++ N+S+NNN+LR EIPPWLQPS     +++G
Subjt:  EYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMG

Query:  SGEDENPHNNHNHETLNPNPNP-----GCGA--------TSVPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFG
        SG        +N ET+NPNP+      GCGA           P      C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFG
Subjt:  SGEDENPHNNHNHETLNPNPNP-----GCGA--------TSVPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFG

Query:  EIGLWSREVEMGR----GGGGGGGGAVSCSSSSCTDYGNKA-----ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAMHHHTA
        EIGLWS +VE+GR    GGGGGGGGAVSCSSSSCTDYGNKA     A+ASASASA TTFLHD+I NSLSSPS SHP F    NSSF D A FAAMHHH  
Subjt:  EIGLWSREVEMGR----GGGGGGGGAVSCSSSSCTDYGNKA-----ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAMHHHTA

Query:  APAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
             I  TTA  SGGRSDGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  APAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

A0A1S3CGT8 protein indeterminate-domain 71.4e-21876.8Show/hide
Query:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR
        MMMKGNFLS    QQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNR
Subjt:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR

Query:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
        FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
Subjt:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC

Query:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW
        GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNN+    NN+N       NN ALKRDF+N+++S+NNNN    LR EIPPW
Subjt:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW

Query:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP
        LQPS     +++GS G+DEN     N ET+NPNP+     GCGA+  SV      P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRP
Subjt:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP

Query:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP
        H LLHVSTGNFGE+GLWS +VE+GR   GGGGGAVSCSSSSCTDYGNKA        ASASASASA TTFLHD+I NSLSSPS+SHP F    NSSF D 
Subjt:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP

Query:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        A FAA+        HHHT  P      TTA  SGGR+DGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

A0A5A7UZ06 Protein indeterminate-domain 71.0e-21176.96Show/hide
Query:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRR
        MDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRR
Subjt:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRR

Query:  GHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFIT
        GHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFIT
Subjt:  GHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFIT

Query:  HRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPWLQPSDLRAEILMGS-GEDENP
        HRAFCDALADESARSAMALNPLLSSYN NNN     NNNN       NN ALKRDF+N+++S+NNNN    LR EIPPWLQPS     +++GS G+DEN 
Subjt:  HRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPWLQPSDLRAEILMGS-GEDENP

Query:  HNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREV
            N ET+NPNP+     GCGA+  SV      P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+GLWS +V
Subjt:  HNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREV

Query:  EMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAM-------HHHTAAPA
        E+GR   GGGGGAVSCSSSSCTDYGNKA        ASASASASA TTFLHD+I NSLSSPS SHP F    NSSF D A FAA+       HHHT  P 
Subjt:  EMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDPATFAAM-------HHHTAAPA

Query:  PAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
             TTA  SGGR+DGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  PAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

A0A6J1GRP3 protein indeterminate-domain 7-like3.1e-21377.09Show/hide
Query:  MMMKGNFLSQQQQQ---IVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-----PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATN
        MMMKGNFLSQQQQQ    V+MDENLSNLTSASGEATASVSSA      N YFAPQ+Q     PPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATN
Subjt:  MMMKGNFLSQQQQQ---IVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-----PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATN

Query:  RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKI
        RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKI
Subjt:  RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKI

Query:  CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEI
        CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPL+SSYNN        NNNNNNF +KRDF      SN NN+RAEIPPWL  +DLR EI
Subjt:  CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEI

Query:  LMGSGEDENPHNNHNHETLNPNPN---------PGCGAT-SVPPP-AYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGE
         +GS +DE+PH   NHETLNPN +          GCGA+  +PPP +YQ  SVSSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK++HVSTG++G+
Subjt:  LMGSGEDENPHNNHNHETLNPNPN---------PGCGAT-SVPPP-AYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGE

Query:  IGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHDNSSFNDPATFAAMHHHTAAPAPAINTTTAAT
        I                GGGAVSC SSSCTDYG+KA  A+ SAS P TFLHDMINSLSS SASHPF  D SSFND   F AMHHH   PA A  T TAA 
Subjt:  IGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHDNSSFNDPATFAAMHHHTAAPAPAINTTTAAT

Query:  SGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        SGGRSDGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNL SQIQKPWQG
Subjt:  SGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

E5GCB4 Nucleic acid binding protein2.4e-21876.8Show/hide
Query:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR
        MMMKGNFLS    QQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNP+     PDAEVIALSPKTLMATNR
Subjt:  MMMKGNFLS----QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNR

Query:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
        FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC
Subjt:  FVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKIC

Query:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW
        GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNN+    NN+N       NN ALKRDF+N+++S+NNNN    LR EIPPW
Subjt:  GTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNN-------NNFALKRDFNNNNSSSNNNN----LRAEIPPW

Query:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP
        LQPS     +++GS G+DEN     N ET+NPNP+     GCGA+  SV      P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRP
Subjt:  LQPSDLRAEILMGS-GEDENPHNNHNHETLNPNPNP----GCGAT--SV-----PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRP

Query:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP
        H LLHVSTGNFGE+GLWS +VE+GR   GGGGGAVSCSSSSCTDYGNKA        ASASASASA TTFLHD+I NSLSSPS SHP F    NSSF D 
Subjt:  HKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKA--------ASASASASAPTTFLHDMI-NSLSSPSASHPFF--HDNSSFNDP

Query:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        A FAA+        HHHT  P      TTA  SGGR+DGLTRDFLGLRPLSHGDILSLTGFG+CIVPNSSNLH QIQKPWQG
Subjt:  ATFAAM--------HHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

SwissProt top hitse value%identityAlignment
O22759 Protein indeterminate-domain 122.1e-8159.22Show/hide
Query:  PKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGI
        PKKKR LPGNP+     PDAEVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQ++ KE  KKKVYVCPE +C HH PSRALGDLTGI
Subjt:  PKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGI

Query:  KKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFA
        KKHFCRKHGEKKWKC+KCSK YAVQSDWKAH+KICGT++YRCDCGTLFSR+D+FITHRAFCDALA+ESAR        L S +++N     L N N NF 
Subjt:  KKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFA

Query:  LKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPHNNHNHETLNPNPNPGCGATSVPPPAYQSCSVSSPHISATALLQKAAQMGATMSST
            F  N SSS    L    P +++PS   A                              A S PP A          +SATALLQKA     ++SST
Subjt:  LKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPHNNHNHETLNPNPNPGCGATSVPPPAYQSCSVSSPHISATALLQKAAQMGATMSST

Query:  TTTSGSMPR
        T   G   R
Subjt:  TTTSGSMPR

Q8H1F5 Protein indeterminate-domain 76.6e-9658.33Show/hide
Query:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFV
        MMM  + L  QQQQ   M+EN+SNLTSASG+  ASVSS N+TE      NQ+   Q   P    K+KRN PGNP+     P+AEV+ALSPKTLMATNRF+
Subjt:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFV

Query:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT
        CE+CNKGFQRDQNLQLH+RGHNLPWKLKQRSNK+V++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGT
Subjt:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT

Query:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAE-------IPPWLQPSDL
        KEY+CDCGTLFSRRDSFITHRAFCDALA+ESAR+    NP++   +N+ +        N  F+       +NS+ +    + E       IPPWL  S  
Subjt:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAE-------IPPWLQPSDL

Query:  RAEILMGSGEDENPHNNHNHETLNP--NPNPGCGATSVPPPAYQSCSVSSPHISATALLQKAAQMGATMSST
                    NP+ N N+  L P    +   G +S P P        SP +SATALLQKAAQMG+T S+T
Subjt:  RAEILMGSGEDENPHNNHNHETLNP--NPNPGCGATSVPPPAYQSCSVSSPHISATALLQKAAQMGATMSST

Q944L3 Zinc finger protein BALDIBIS5.3e-8552.89Show/hide
Query:  QYFAPQTQPPPPP------KKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPE
        ++ AP   P P P      K+KRNLPGNP+     PDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQR+NKE +KKKVY+CPE
Subjt:  QYFAPQTQPPPPP------KKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPE

Query:  VSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSY
         +CVHHDP+RALGDLTGIKKHF RKHGEKKWKCDKCSKKYAV SDWKAHSKICGTKEYRCDCGTLFSR+DSFITHRAFCDALA+ESAR      P   +Y
Subjt:  VSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSY

Query:  NNNNNSQEFLNNNNNNFALKRDFNNNNS-------SSNNNNL-------------RAEIPPWLQPSDLRAEILMGSGE-------DENPHNNHN------
         NN    E  + N N    +R  N  +S       ++N NN+              +  P     SD    +    G+       +EN +NN+N      
Subjt:  NNNNNSQEFLNNNNNNFALKRDFNNNNS-------SSNNNNL-------------RAEIPPWLQPSDLRAEILMGSGE-------DENPHNNHN------

Query:  ------HETLNPNPNPGCGATSVPPPA--YQSCSVSSPHISATALLQKAAQMGATMSSTTTTS
              HE  N   N    ++        Y         +SATALLQKAAQMG+  SS+++++
Subjt:  ------HETLNPNPNPGCGATSVPPPA--YQSCSVSSPHISATALLQKAAQMGATMSSTTTTS

Q9LRW7 Protein indeterminate-domain 112.1e-10247.5Show/hide
Query:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKT
        MM K   L Q QQ     DEN+SNLTSASG+  ASVSS N TE     + P  Q                  KK+RN PGNP+     P++EVIALSPKT
Subjt:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKT

Query:  LMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWK
        LMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD K
Subjt:  LMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWK

Query:  AHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNN
        AHSK CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL     S  ++++ +Q  +N       ++N+N      F+ NN
Subjt:  AHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNN

Query:  SSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSST
         ++NN+N           ++      M   +  N H  N++H  + P   P P    +S P P+          S++SP +SATALLQKAAQMG+T +  
Subjt:  SSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSST

Query:  TTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDP
           + +  R       ST N                                    N   + +A  ++P+ F       +SS + +H  F D N+S  D 
Subjt:  TTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDP

Query:  ATFAAMHHHTAAPAPAINTTTAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
                 T       N  TAA      T  G  +GLTRDFLGLRPL SH +ILS  G GSCI  NSS       KPWQG
Subjt:  ATFAAMHHHTAAPAPAINTTTAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 12.4e-8552.45Show/hide
Query:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGH
        M  +L N ++ SGEA+ S+SS  N+   PN             KKKRNLPG P+     P++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGH
Subjt:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGH

Query:  NLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHR
        NLPWKL+Q+SNKEV KKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHR
Subjt:  NLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHR

Query:  AFCDALADESARS----AMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWL-QPSDLRAEILMG--SGEDENPHNNHNHE
        AFCDALA+E+ARS    +   NP + +  N   +      +  +  +K         S +     EI     +P+ L      G  +G  E+   + +  
Subjt:  AFCDALADESARS----AMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWL-QPSDLRAEILMG--SGEDENPHNNHNHE

Query:  TLNPNPNPGCGATSVPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGL
        T + +      ++S   P     S S              P +SATALLQKAAQMGA  S           S+T+TS     PH L L +  G     GL
Subjt:  TLNPNPNPGCGATSVPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGL

Query:  WSREVEMG
          +E+ MG
Subjt:  WSREVEMG

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 74.7e-9758.33Show/hide
Query:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFV
        MMM  + L  QQQQ   M+EN+SNLTSASG+  ASVSS N+TE      NQ+   Q   P    K+KRN PGNP+     P+AEV+ALSPKTLMATNRF+
Subjt:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFV

Query:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT
        CE+CNKGFQRDQNLQLH+RGHNLPWKLKQRSNK+V++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGT
Subjt:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT

Query:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAE-------IPPWLQPSDL
        KEY+CDCGTLFSRRDSFITHRAFCDALA+ESAR+    NP++   +N+ +        N  F+       +NS+ +    + E       IPPWL  S  
Subjt:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAE-------IPPWLQPSDL

Query:  RAEILMGSGEDENPHNNHNHETLNP--NPNPGCGATSVPPPAYQSCSVSSPHISATALLQKAAQMGATMSST
                    NP+ N N+  L P    +   G +S P P        SP +SATALLQKAAQMG+T S+T
Subjt:  RAEILMGSGEDENPHNNHNHETLNP--NPNPGCGATSVPPPAYQSCSVSSPHISATALLQKAAQMGATMSST

AT3G13810.1 indeterminate(ID)-domain 111.5e-10347.5Show/hide
Query:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKT
        MM K   L Q QQ     DEN+SNLTSASG+  ASVSS N TE     + P  Q                  KK+RN PGNP+     P++EVIALSPKT
Subjt:  MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKT

Query:  LMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWK
        LMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD K
Subjt:  LMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWK

Query:  AHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNN
        AHSK CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL     S  ++++ +Q  +N       ++N+N      F+ NN
Subjt:  AHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNN

Query:  SSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSST
         ++NN+N           ++      M   +  N H  N++H  + P   P P    +S P P+          S++SP +SATALLQKAAQMG+T +  
Subjt:  SSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSST

Query:  TTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDP
           + +  R       ST N                                    N   + +A  ++P+ F       +SS + +H  F D N+S  D 
Subjt:  TTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDP

Query:  ATFAAMHHHTAAPAPAINTTTAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
                 T       N  TAA      T  G  +GLTRDFLGLRPL SH +ILS  G GSCI  NSS       KPWQG
Subjt:  ATFAAMHHHTAAPAPAINTTTAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

AT3G13810.2 indeterminate(ID)-domain 111.5e-9846.6Show/hide
Query:  QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKR-------------NLPGNPEFMN-YNPDAEVIALSPKTLMATNRFV
        QQ QQ    DEN+SNLTSASG+  ASVSS N TE     + P  Q     ++++              L     F+    P++EVIALSPKTLMATNRFV
Subjt:  QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKR-------------NLPGNPEFMN-YNPDAEVIALSPKTLMATNRFV

Query:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT
        CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGT
Subjt:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGT

Query:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNNSSSNNNNL
        KEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL     S  ++++ +Q  +N       ++N+N      F+ NN ++NN+N 
Subjt:  KEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNNSSSNNNNL

Query:  RAEIPPWLQPSDLRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMP
                  ++      M   +  N H  N++H  + P   P P    +S P P+          S++SP +SATALLQKAAQMG+T +     + +  
Subjt:  RAEIPPWLQPSDLRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMP

Query:  RPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDPATFAAMHH
        R       ST N                                    N   + +A  ++P+ F       +SS + +H  F D N+S  D         
Subjt:  RPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDPATFAAMHH

Query:  HTAAPAPAINTTTAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
         T       N  TAA      T  G  +GLTRDFLGLRPL SH +ILS  G GSCI  NSS       KPWQG
Subjt:  HTAAPAPAINTTTAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

AT3G13810.3 indeterminate(ID)-domain 111.8e-9646.35Show/hide
Query:  LSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKR-------------NLPGNPEFMN-YNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        +SNLTSASG+  ASVSS N TE     + P  Q     ++++              L     F+    P++EVIALSPKTLMATNRFVCEICNKGFQRDQ
Subjt:  LSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKR-------------NLPGNPEFMN-YNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS
        NLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGTKEYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS

Query:  RRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSD
        RRDSFITHRAFC+ALA+E+AR  +          NPLL     S  ++++ +Q  +N       ++N+N      F+ NN ++NN+N           ++
Subjt:  RRDSFITHRAFCDALADESARSAM--------ALNPLL-----SSYNNNNNSQEFLN-------NNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSD

Query:  LRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGN
              M   +  N H  N++H  + P   P P    +S P P+          S++SP +SATALLQKAAQMG+T +     + +  R       ST N
Subjt:  LRAEILMGSGEDENPH-NNHNHETLNP--NPNPGCGATSVPPPAY------QSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGN

Query:  FGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDPATFAAMHHHTAAPAPAINTT
                                            N   + +A  ++P+ F       +SS + +H  F D N+S  D          T       N  
Subjt:  FGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSSPSASHPFFHD-NSSFNDPATFAAMHHHTAAPAPAINTT

Query:  TAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG
        TAA      T  G  +GLTRDFLGLRPL SH +ILS  G GSCI  NSS       KPWQG
Subjt:  TAA------TSGGRSDGLTRDFLGLRPL-SHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG

AT3G50700.1 indeterminate(ID)-domain 21.7e-8652.45Show/hide
Query:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGH
        M  +L N ++ SGEA+ S+SS  N+   PN             KKKRNLPG P+     P++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGH
Subjt:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGH

Query:  NLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHR
        NLPWKL+Q+SNKEV KKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHR
Subjt:  NLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHR

Query:  AFCDALADESARS----AMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWL-QPSDLRAEILMG--SGEDENPHNNHNHE
        AFCDALA+E+ARS    +   NP + +  N   +      +  +  +K         S +     EI     +P+ L      G  +G  E+   + +  
Subjt:  AFCDALADESARS----AMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWL-QPSDLRAEILMG--SGEDENPHNNHNHE

Query:  TLNPNPNPGCGATSVPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGL
        T + +      ++S   P     S S              P +SATALLQKAAQMGA  S           S+T+TS     PH L L +  G     GL
Subjt:  TLNPNPNPGCGATSVPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGL

Query:  WSREVEMG
          +E+ MG
Subjt:  WSREVEMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAAATTGTAGTGATGGATGAAAATTTGTCCAATTTGACTTCTGCATCTGGTGAAGCTACTGCTAGTGTTTC
TTCTGCCAATAAAACTGAATTCCCCAATCAATATTTTGCTCCACAAACCCAACCACCACCTCCTCCTAAGAAGAAGCGCAACCTCCCCGGAAATCCCGAGTTTATGAATT
ATAACCCAGATGCGGAAGTGATAGCGTTATCGCCGAAGACGTTGATGGCGACAAATAGATTCGTATGCGAGATATGCAACAAAGGGTTTCAAAGAGATCAGAATCTTCAA
CTACATAGAAGAGGACACAATTTGCCATGGAAATTAAAGCAAAGATCAAACAAAGAAGTAATAAAGAAGAAGGTATATGTTTGTCCAGAAGTGAGTTGCGTACACCATGA
TCCATCAAGAGCACTTGGAGACCTAACAGGAATAAAGAAGCACTTCTGCAGAAAGCATGGTGAAAAGAAATGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAAT
CTGATTGGAAAGCTCACTCCAAGATCTGTGGCACTAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGAAGAGATAGTTTCATTACACATAGAGCTTTCTGTGAT
GCATTAGCAGATGAAAGTGCAAGATCAGCCATGGCATTAAACCCTCTTCTCTCTTCTTACAACAACAATAATAATTCACAAGAATTCCTCAACAATAACAATAATAATTT
TGCTCTCAAACGAGATTTCAACAATAATAACAGCAGCAGCAACAACAACAATTTGAGAGCGGAGATTCCGCCGTGGCTACAGCCATCGGATCTCCGAGCGGAGATCCTGA
TGGGAAGTGGGGAGGATGAGAATCCTCATAATAATCATAATCATGAAACCCTAAACCCTAACCCTAATCCGGGGTGCGGGGCCACAAGTGTTCCTCCTCCGGCGTACCAA
TCGTGTTCTGTTTCTTCTCCTCATATCTCAGCGACTGCACTGCTGCAGAAGGCAGCTCAGATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTATGCCAAG
GCCCCACAAGCTTCTTCACGTGTCTACAGGTAATTTTGGGGAAATAGGATTATGGTCACGTGAAGTTGAAATGGGTAGAGGAGGAGGAGGAGGAGGAGGAGGGGCTGTGA
GTTGTAGTAGTAGTAGTTGTACTGATTATGGGAATAAAGCTGCTTCTGCTTCTGCTTCTGCTTCTGCACCCACCACTTTTCTTCATGATATGATCAATTCCCTCTCTTCT
CCTTCTGCTTCTCATCCTTTCTTCCATGATAATTCCTCTTTCAACGACCCCGCCACTTTCGCCGCTATGCATCATCACACCGCCGCTCCCGCACCCGCCATCAACACCAC
CACCGCTGCCACTTCCGGCGGTCGTAGCGACGGTTTAACCAGAGATTTCTTGGGACTTCGCCCTCTTTCTCATGGTGATATTCTAAGCCTTACTGGATTTGGAAGTTGCA
TTGTTCCTAATTCCTCCAATCTTCATTCTCAAATCCAGAAGCCATGGCAGGGTTAG
mRNA sequenceShow/hide mRNA sequence
AAACTTTTGTTAATTACTCAAAATACAAATAATTGTTTCTTCTTCTTCTTCTTCTTCTTCTTCGCCTTTGTTAGATCTGTTTTTTTGTTGTCTCATGAAACCACTATAAG
AGGGTTTCTTTTTTCTTTTCTCTTTTGTAATTTTATAAAACTAAAGATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAAATTGTAGTGATGGATGAAAATT
TGTCCAATTTGACTTCTGCATCTGGTGAAGCTACTGCTAGTGTTTCTTCTGCCAATAAAACTGAATTCCCCAATCAATATTTTGCTCCACAAACCCAACCACCACCTCCT
CCTAAGAAGAAGCGCAACCTCCCCGGAAATCCCGAGTTTATGAATTATAACCCAGATGCGGAAGTGATAGCGTTATCGCCGAAGACGTTGATGGCGACAAATAGATTCGT
ATGCGAGATATGCAACAAAGGGTTTCAAAGAGATCAGAATCTTCAACTACATAGAAGAGGACACAATTTGCCATGGAAATTAAAGCAAAGATCAAACAAAGAAGTAATAA
AGAAGAAGGTATATGTTTGTCCAGAAGTGAGTTGCGTACACCATGATCCATCAAGAGCACTTGGAGACCTAACAGGAATAAAGAAGCACTTCTGCAGAAAGCATGGTGAA
AAGAAATGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGAAAGCTCACTCCAAGATCTGTGGCACTAAGGAGTACAGATGTGACTGTGGAACTCT
CTTTTCAAGAAGAGATAGTTTCATTACACATAGAGCTTTCTGTGATGCATTAGCAGATGAAAGTGCAAGATCAGCCATGGCATTAAACCCTCTTCTCTCTTCTTACAACA
ACAATAATAATTCACAAGAATTCCTCAACAATAACAATAATAATTTTGCTCTCAAACGAGATTTCAACAATAATAACAGCAGCAGCAACAACAACAATTTGAGAGCGGAG
ATTCCGCCGTGGCTACAGCCATCGGATCTCCGAGCGGAGATCCTGATGGGAAGTGGGGAGGATGAGAATCCTCATAATAATCATAATCATGAAACCCTAAACCCTAACCC
TAATCCGGGGTGCGGGGCCACAAGTGTTCCTCCTCCGGCGTACCAATCGTGTTCTGTTTCTTCTCCTCATATCTCAGCGACTGCACTGCTGCAGAAGGCAGCTCAGATGG
GTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTATGCCAAGGCCCCACAAGCTTCTTCACGTGTCTACAGGTAATTTTGGGGAAATAGGATTATGGTCACGTGAA
GTTGAAATGGGTAGAGGAGGAGGAGGAGGAGGAGGAGGGGCTGTGAGTTGTAGTAGTAGTAGTTGTACTGATTATGGGAATAAAGCTGCTTCTGCTTCTGCTTCTGCTTC
TGCACCCACCACTTTTCTTCATGATATGATCAATTCCCTCTCTTCTCCTTCTGCTTCTCATCCTTTCTTCCATGATAATTCCTCTTTCAACGACCCCGCCACTTTCGCCG
CTATGCATCATCACACCGCCGCTCCCGCACCCGCCATCAACACCACCACCGCTGCCACTTCCGGCGGTCGTAGCGACGGTTTAACCAGAGATTTCTTGGGACTTCGCCCT
CTTTCTCATGGTGATATTCTAAGCCTTACTGGATTTGGAAGTTGCATTGTTCCTAATTCCTCCAATCTTCATTCTCAAATCCAGAAGCCATGGCAGGGTTAG
Protein sequenceShow/hide protein sequence
MMMKGNFLSQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPEFMNYNPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQ
LHRRGHNLPWKLKQRSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCD
ALADESARSAMALNPLLSSYNNNNNSQEFLNNNNNNFALKRDFNNNNSSSNNNNLRAEIPPWLQPSDLRAEILMGSGEDENPHNNHNHETLNPNPNPGCGATSVPPPAYQ
SCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGGGGAVSCSSSSCTDYGNKAASASASASAPTTFLHDMINSLSS
PSASHPFFHDNSSFNDPATFAAMHHHTAAPAPAINTTTAATSGGRSDGLTRDFLGLRPLSHGDILSLTGFGSCIVPNSSNLHSQIQKPWQG