; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023472 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023472
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein indeterminate-domain 7
Genome locationChr05:34519833..34523233
RNA-Seq ExpressionHG10023472
SyntenyHG10023472
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34113.1 nucleic acid binding protein [Cucumis melo subsp. melo]3.8e-22679Show/hide
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNNNNS        NN  LKRDF        NNNNN +LR EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE

Query:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI
        +M+GSGGQDE++ +T+NPNP + S  GCGA+          P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+
Subjt:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI

Query:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---
        GLWS +VE+GRGGGGGAVSCSSSSCTDYGNK          +ASASASA TTFLHD+I NSLSSPSPS+ PF+Q  N SF D A   FA +HH   P   
Subjt:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---

Query:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
            +PTT A A GGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

KAA0059426.1 protein indeterminate-domain 7 [Cucumis melo var. makuwa]3.4e-21978.69Show/hide
Query:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        MDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
Subjt:  WKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNP-N
        DALADESARSAMALNPLLSSYN NNNNN+        NN  LKRDF        NNNNN +LR EIPPWLQPS     +M+GSGGQDE++ +T+NPNP +
Subjt:  DALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNP-N

Query:  PSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSS
         S  GCGA+          P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+GLWS +VE+GRGGGGGAVSCSS
Subjt:  PSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSS

Query:  SSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGL
        SSCTDYGNK          +ASASASA TTFLHD+I NSLSSPSPS+ PF+Q  N SF D A A     HHH     V  T A A GGRNDGLTRDFLGL
Subjt:  SSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGL

Query:  RPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        RPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  RPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

XP_004141684.2 protein indeterminate-domain 7 [Cucumis sativus]2.9e-22679.25Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
        MMMKGNFLS QQQQQQIVVMDENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN

Query:  KGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
        KGFQRDQNLQLHRRGHNLPWKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
Subjt:  KGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC

Query:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF----NNNNNDNLRAEIPPWLQPSDLRAEMMMGSG
        DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN+NSQ+      NN  LKRDF    N+NNN++LR EIPPWLQPS     +M+GSG
Subjt:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF----NNNNNDNLRAEIPPWLQPSDLRAEMMMGSG

Query:  GQDEHSHQTLNPNP--NPSGHGCGA--------TSLPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWS
        GQ E++ +T+NPNP  N S  GCGA           P      C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGEIGLWS
Subjt:  GQDEHSHQTLNPNP--NPSGHGCGA--------TSLPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWS

Query:  REVEMGR-------GGGGGAVSCSSSSCTDYGNKV-----TNASASASAPTTFLHDMI-NSLSSPSPSNP-FIQD-NPSFNDPAAATFATMHHHT--API
         +VE+GR       GGGGGAVSCSSSSCTDYGNK        ASASASA TTFLHD+I NSLSSPSPS+P F+Q  N SF D A   FA MHHH     +
Subjt:  REVEMGR-------GGGGGAVSCSSSSCTDYGNKV-----TNASASASAPTTFLHDMI-NSLSSPSPSNP-FIQD-NPSFNDPAAATFATMHHHT--API

Query:  PVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        P+  T A A GGR+DGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  PVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

XP_008462349.1 PREDICTED: protein indeterminate-domain 7 [Cucumis melo]3.2e-22578.83Show/hide
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNNNNS        NN  LKRDF        NNNNN +LR EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE

Query:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI
        +M+GSGGQDE++ +T+NPNP + S  GCGA+          P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+
Subjt:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI

Query:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---
        GLWS +VE+GRGGGGGAVSCSSSSCTDYGNK          +ASASASA TTFLHD+I NSLSSPS S+ PF+Q  N SF D A   FA +HH   P   
Subjt:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---

Query:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
            +PTT A A GGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

XP_038898698.1 protein indeterminate-domain 7-like [Benincasa hispida]3.4e-25989.08Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ--PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        MMMKGNFLS +QQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQ+Q  PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ--PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
        GFQRDQNLQLHRRGHNLPWKLK R+NKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD

Query:  CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHS
        CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN+NNNNNSQEFL  NNNNF LKRDFNNNNN+NLRAEIPPWLQPSDLRAE++MGS G +EH+
Subjt:  CGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHS

Query:  HQTLNPNPNPSGHGCGATS-LPPPAYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCS
        H+TLNPNPNPSGHGCG TS LPPPAYQSC   SPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGN+GE+GLWSREVEMGR  GGGAVSCS
Subjt:  HQTLNPNPNPSGHGCGATS-LPPPAYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCS

Query:  SSSCTDYGNKV--TNASASASAPTTFLHDMINSLSSPSPSNPFIQDNPSFNDP--AAATFATMHHHTAPIPVPTT-AATALGGRNDGLTRDFLGLRPLSH
        SSSCTDYGNK    NASASASA TTFLHDMINSLSSPSPS+PF+Q N SFND   AAA F+ MHHHTAP+P  T  +A   G R+DGLTRDFLGLRPLSH
Subjt:  SSSCTDYGNKV--TNASASASAPTTFLHDMINSLSSPSPSNPFIQDNPSFNDP--AAATFATMHHHTAPIPVPTT-AATALGGRNDGLTRDFLGLRPLSH

Query:  GDILSLTGFGNCIVP-NSSNLHNQIQKPWQG
        GDILSLTGFGNCIVP NSSNL  QIQKPWQG
Subjt:  GDILSLTGFGNCIVP-NSSNLHNQIQKPWQG

TrEMBL top hitse value%identityAlignment
A0A0A0K7W2 C2H2-type domain-containing protein1.4e-22679.25Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
        MMMKGNFLS QQQQQQIVVMDENLSNLTSASGEATASVSSANK+EFPNQYFAPQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN

Query:  KGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
        KGFQRDQNLQLHRRGHNLPWKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
Subjt:  KGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC

Query:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF----NNNNNDNLRAEIPPWLQPSDLRAEMMMGSG
        DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN+NSQ+      NN  LKRDF    N+NNN++LR EIPPWLQPS     +M+GSG
Subjt:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF----NNNNNDNLRAEIPPWLQPSDLRAEMMMGSG

Query:  GQDEHSHQTLNPNP--NPSGHGCGA--------TSLPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWS
        GQ E++ +T+NPNP  N S  GCGA           P      C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGEIGLWS
Subjt:  GQDEHSHQTLNPNP--NPSGHGCGA--------TSLPPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWS

Query:  REVEMGR-------GGGGGAVSCSSSSCTDYGNKV-----TNASASASAPTTFLHDMI-NSLSSPSPSNP-FIQD-NPSFNDPAAATFATMHHHT--API
         +VE+GR       GGGGGAVSCSSSSCTDYGNK        ASASASA TTFLHD+I NSLSSPSPS+P F+Q  N SF D A   FA MHHH     +
Subjt:  REVEMGR-------GGGGGAVSCSSSSCTDYGNKV-----TNASASASAPTTFLHDMI-NSLSSPSPSNP-FIQD-NPSFNDPAAATFATMHHHT--API

Query:  PVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        P+  T A A GGR+DGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  PVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

A0A1S3CGT8 protein indeterminate-domain 71.6e-22578.83Show/hide
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNNNNS        NN  LKRDF        NNNNN +LR EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE

Query:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI
        +M+GSGGQDE++ +T+NPNP + S  GCGA+          P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+
Subjt:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI

Query:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---
        GLWS +VE+GRGGGGGAVSCSSSSCTDYGNK          +ASASASA TTFLHD+I NSLSSPS S+ PF+Q  N SF D A   FA +HH   P   
Subjt:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---

Query:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
            +PTT A A GGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

A0A5A7UZ06 Protein indeterminate-domain 71.7e-21978.69Show/hide
Query:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        MDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
Subjt:  WKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNP-N
        DALADESARSAMALNPLLSSYN NNNNN+        NN  LKRDF        NNNNN +LR EIPPWLQPS     +M+GSGGQDE++ +T+NPNP +
Subjt:  DALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNP-N

Query:  PSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSS
         S  GCGA+          P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+GLWS +VE+GRGGGGGAVSCSS
Subjt:  PSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSS

Query:  SSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGL
        SSCTDYGNK          +ASASASA TTFLHD+I NSLSSPSPS+ PF+Q  N SF D A A     HHH     V  T A A GGRNDGLTRDFLGL
Subjt:  SSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGL

Query:  RPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        RPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  RPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

A0A6J1GRP3 protein indeterminate-domain 7-like1.4e-21377.37Show/hide
Query:  MMMKGNFLSQQQQQ-QQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-----PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLSQQQQQ Q  V+MDENLSNLTSASGEATASVSSA      N YFAPQ+Q     PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLSQQQQQ-QQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-----PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLK RSNKEV+KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQ
        YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPL+SSYNNN          NNNNNF +KRDF+N N  N+RAEIPPWL  +DLR E+ +GS  Q
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQ

Query:  DEHSHQ---TLNPN------PNPSGHGCGAT-SLPPP-AYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREV
        DEH HQ   TLNPN       +  GHGCGA+  LPPP +YQ  SVSSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK++HVSTG++G+I       
Subjt:  DEHSHQ---TLNPN------PNPSGHGCGAT-SLPPP-AYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREV

Query:  EMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQDNPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRD
              GGGAVSC SSSCTDYG+K   A+ SAS P TFLHDMINSLSS S S+PF+QD+ SFND  A  F  MHHH  P    TT   A GGR+DGLTRD
Subjt:  EMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQDNPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRD

Query:  FLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        FLGLRPLSHGDILSLTGFGNCIVPNSSNL +QIQKPWQG
Subjt:  FLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

E5GCB4 Nucleic acid binding protein1.8e-22679Show/hide
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANK+EF NQYFAPQT   QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQT---QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLK RSNKE+IKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNNNNS        NN  LKRDF        NNNNN +LR EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN-NNNNNNSQEFLINNNNNFGLKRDF--------NNNNNDNLRAEIPPWLQPSDLRAE

Query:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI
        +M+GSGGQDE++ +T+NPNP + S  GCGA+          P     C +  SS HISATALLQKAAQMGATMSSTTTTSGS PRPH LLHVSTGNFGE+
Subjt:  MMMGSGGQDEHSHQTLNPNP-NPSGHGCGATSL-------PPPAYQSCSV--SSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEI

Query:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---
        GLWS +VE+GRGGGGGAVSCSSSSCTDYGNK          +ASASASA TTFLHD+I NSLSSPSPS+ PF+Q  N SF D A   FA +HH   P   
Subjt:  GLWSREVEMGRGGGGGAVSCSSSSCTDYGNKV--------TNASASASAPTTFLHDMI-NSLSSPSPSN-PFIQD-NPSFNDPAAATFATMHHHTAP---

Query:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
            +PTT A A GGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLH QIQKPWQG
Subjt:  --IPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

SwissProt top hitse value%identityAlignment
Q8H1F5 Protein indeterminate-domain 72.6e-10057.82Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMM  + L  QQQQQQ   M+EN+SNLTSASG+  ASVSS N+TE      NQ+   Q   P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLH+RGHNLPWKLK RSNK+V++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNN-------------NNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSD
        +CDCGTLFSRRDSFITHRAFCDALA+ESAR+    NP++   +N+              +++SQ  + N+N +  +K++ + ++  N    IPPWL    
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNN-------------NNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSD

Query:  LRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVS-------SPHISATALLQKAAQMGATMSST
                           ++ NPNP+G+     +L PP   S +         SP +SATALLQKAAQMG+T S+T
Subjt:  LRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVS-------SPHISATALLQKAAQMGATMSST

Q944L3 Zinc finger protein BALDIBIS1.0e-8553.78Show/hide
Query:  QYFAPQTQPPPPP------KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVH
        ++ AP   P P P      K+KRNLPGNPDPDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLK R+NKE +KKKVY+CPE +CVH
Subjt:  QYFAPQTQPPPPP------KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVH

Query:  HDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNN---
        HDP+RALGDLTGIKKHF RKHGEKKWKCDKCSKKYAV SDWKAHSKICGTKEYRCDCGTLFSR+DSFITHRAFCDALA+ESAR  +++ P  +  NN   
Subjt:  HDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNN---

Query:  ------NNNNNSQEFLINNNNNFGLKRDFNNNNND--NLRAEIPPWLQPSDLRAEMMMGSGG-------QDEHSHQTL-----NPNPNPSGHGCGATSLP
              N N N Q+  +N  ++   +  FN N N+   L   +P  +  S         S         Q + SHQ L     N N N    G       
Subjt:  ------NNNNNSQEFLINNNNNFGLKRDFNNNNND--NLRAEIPPWLQPSDLRAEMMMGSGG-------QDEHSHQTL-----NPNPNPSGHGCGATSLP

Query:  PPAYQSCSVSS--------------------PHISATALLQKAAQMGATMSSTTTTS
               S  S                      +SATALLQKAAQMG+  SS+++++
Subjt:  PPAYQSCSVSS--------------------PHISATALLQKAAQMGATMSSTTTTS

Q9LRW7 Protein indeterminate-domain 113.9e-10947.59Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPDPDAEVIALSPKTLMA
        MM K   L Q QQ QQ    DEN+SNLTSASG+  ASVSS N TE     + P  Q                  KK+RN PGNPDP++EVIALSPKTLMA
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPDPDAEVIALSPKTLMA

Query:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS
        TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLK RSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHS
Subjt:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS

Query:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN-
        K CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL                        SS +N+N  NS  F  NN N 
Subjt:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN-

Query:  -----------NFGLKRDFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQK
                    F +K++  +N++  +   + IPPWL P                  H   + NPNPS  G G  SL        S++SP +SATALLQK
Subjt:  -----------NFGLKRDFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQK

Query:  AAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQD
        AAQMG+T +     + +  R       ST N                                 N  T  +A  ++P+ F+    N+       N    D
Subjt:  AAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQD

Query:  NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        N    +    TF              +  +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

Q9LVQ7 Zinc finger protein ENHYDROUS1.0e-8547.13Show/hide
Query:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL
        M  +L N ++ SG+  ASVSS       NQ   P++      KKKRNLPG PDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL
Subjt:  MDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL

Query:  KARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDAL
        + RS KEV +KKVYVCP   CVHHDPSRALGDLTGIKKHFCRKHGEKKWKC+KCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDAL
Subjt:  KARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDAL

Query:  ADESARS-----------------------------------------------------------AMALNPLLSSYNNNNNNNSQEFLIN--------N
        A+ESA++                                                           + ++ P+ +S  +  NNN  E +I         N
Subjt:  ADESARS-----------------------------------------------------------AMALNPLLSSYNNNNNNNSQEFLIN--------N

Query:  NNNFGLKRDFNNNNNDN----LRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQKAAQMGAT
         ++  L  D +NNN       + +   P L  S   +  +       E     L+ NP+      G T   PP + +     P +SATALLQKAAQMG+T
Subjt:  NNNFGLKRDFNNNNNDN----LRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQKAAQMGAT

Query:  MSS---------TTTTSGSMP-RPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVT
         S           +TTS SM    H  L ++ G    +GL       G G G   +   +SS   +G K T
Subjt:  MSS---------TTTTSGSMP-RPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVT

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 11.2e-8952.11Show/hide
Query:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        M  +L N ++ SGEA+ S+SS  N+   PN             KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
Subjt:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
        L+ +SNKEV KKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDA
Subjt:  LKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA

Query:  LADESARS----AMALNPLLSSYNNNNNN------NSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNP
        LA+E+ARS    +   NP + +  N   N      +++   I +++   +K+  +      +  E P     + + +  +     +   +  ++    + 
Subjt:  LADESARS----AMALNPLLSSYNNNNNN------NSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNP

Query:  SGHGCGATSLPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGLWSREV
        S     ++S   P     S S              P +SATALLQKAAQMGA  S           S+T+TS     PH L L +  G     GL  +E+
Subjt:  SGHGCGATSLPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGLWSREV

Query:  EMG
         MG
Subjt:  EMG

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 71.8e-10157.82Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMM  + L  QQQQQQ   M+EN+SNLTSASG+  ASVSS N+TE      NQ+   Q   P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFP----NQYFAPQT-QPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLH+RGHNLPWKLK RSNK+V++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNN-------------NNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSD
        +CDCGTLFSRRDSFITHRAFCDALA+ESAR+    NP++   +N+              +++SQ  + N+N +  +K++ + ++  N    IPPWL    
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNNNN-------------NNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSD

Query:  LRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVS-------SPHISATALLQKAAQMGATMSST
                           ++ NPNP+G+     +L PP   S +         SP +SATALLQKAAQMG+T S+T
Subjt:  LRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVS-------SPHISATALLQKAAQMGATMSST

AT3G13810.1 indeterminate(ID)-domain 112.8e-11047.59Show/hide
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPDPDAEVIALSPKTLMA
        MM K   L Q QQ QQ    DEN+SNLTSASG+  ASVSS N TE     + P  Q                  KK+RN PGNPDP++EVIALSPKTLMA
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQ-------------PPPPPKKKRNLPGNPDPDAEVIALSPKTLMA

Query:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS
        TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLK RSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHS
Subjt:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS

Query:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN-
        K CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL                        SS +N+N  NS  F  NN N 
Subjt:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN-

Query:  -----------NFGLKRDFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQK
                    F +K++  +N++  +   + IPPWL P                  H   + NPNPS  G G  SL        S++SP +SATALLQK
Subjt:  -----------NFGLKRDFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQK

Query:  AAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQD
        AAQMG+T +     + +  R       ST N                                 N  T  +A  ++P+ F+    N+       N    D
Subjt:  AAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQD

Query:  NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
        N    +    TF              +  +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  NPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

AT3G13810.2 indeterminate(ID)-domain 111.1e-10145.61Show/hide
Query:  LSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMAT
        L Q QQ QQ    DEN+SNLTSASG+  ASVSS N TE     + P  Q     ++++    +                     P++EVIALSPKTLMAT
Subjt:  LSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMAT

Query:  NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSK
        NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLK RSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK
Subjt:  NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSK

Query:  ICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN--
         CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL                        SS +N+N  NS  F  NN N  
Subjt:  ICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN--

Query:  ----------NFGLKRDFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQKA
                   F +K++  +N++  +   + IPPWL P                  H   + NPNPS  G G  SL        S++SP +SATALLQKA
Subjt:  ----------NFGLKRDFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQKA

Query:  AQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQDN
        AQMG+T +     + +  R       ST N                                 N  T  +A  ++P+ F+    N+       N    DN
Subjt:  AQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQDN

Query:  PSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
            +    TF              +  +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  PSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

AT3G13810.3 indeterminate(ID)-domain 112.9e-9945.31Show/hide
Query:  LSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        +SNLTSASG+  ASVSS N TE     + P  Q     ++++    +                     P++EVIALSPKTLMATNRFVCEICNKGFQRDQ
Subjt:  LSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS
        NLQLHRRGHNLPWKLK RSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGTKEYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS

Query:  RRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN------------NFGLKR
        RRDSFITHRAFC+ALA+E+AR  +          NPLL                        SS +N+N  NS  F  NN N             F +K+
Subjt:  RRDSFITHRAFCDALADESARSAM--------ALNPLL------------------------SSYNNNNNNNSQEFLINNNN------------NFGLKR

Query:  DFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGS
        +  +N++  +   + IPPWL P                  H   + NPNPS  G G  SL        S++SP +SATALLQKAAQMG+T +     + +
Subjt:  DFNNNNN--DNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSSPHISATALLQKAAQMGATMSSTTTTSGS

Query:  MPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQDNPSFNDPAAATFATMHH
          R       ST N                                 N  T  +A  ++P+ F+    N+       N    DN    +    TF     
Subjt:  MPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQDNPSFNDPAAATFATMHH

Query:  HTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG
                 +  +   G  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  HTAPIPVPTTAATALGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG

AT3G50700.1 indeterminate(ID)-domain 28.5e-9152.11Show/hide
Query:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        M  +L N ++ SGEA+ S+SS  N+   PN             KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
Subjt:  MDENLSNLTSASGEATASVSS-ANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
        L+ +SNKEV KKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDA
Subjt:  LKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA

Query:  LADESARS----AMALNPLLSSYNNNNNN------NSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNP
        LA+E+ARS    +   NP + +  N   N      +++   I +++   +K+  +      +  E P     + + +  +     +   +  ++    + 
Subjt:  LADESARS----AMALNPLLSSYNNNNNN------NSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNP

Query:  SGHGCGATSLPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGLWSREV
        S     ++S   P     S S              P +SATALLQKAAQMGA  S           S+T+TS     PH L L +  G     GL  +E+
Subjt:  SGHGCGATSLPPPAYQSCSVS-------------SPHISATALLQKAAQMGATMS-----------STTTTSGSMPRPHKL-LHVSTGNFGEIGLWSREV

Query:  EMG
         MG
Subjt:  EMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAACAACAAATTGTAGTGATGGATGAAAATTTGTCCAATTTGACTTCTGCATCTGGTGAAGCTACTGCCAG
TGTTTCTTCTGCCAATAAAACTGAATTCCCCAATCAATATTTTGCCCCTCAAACCCAACCACCACCTCCTCCAAAGAAGAAGCGCAACCTTCCTGGAAATCCCGACCCAG
ATGCGGAAGTGATAGCATTATCGCCGAAGACGTTGATGGCGACGAATAGATTCGTGTGCGAGATATGTAACAAAGGGTTTCAAAGAGATCAGAATCTTCAATTACATAGA
AGAGGACACAATTTACCATGGAAATTAAAGGCAAGATCAAATAAAGAAGTAATAAAGAAGAAGGTATATGTTTGTCCAGAAGTGAGTTGTGTTCATCATGATCCATCAAG
AGCACTTGGAGATCTAACAGGGATAAAGAAGCACTTCTGTAGAAAGCATGGTGAAAAGAAGTGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGA
AAGCTCACTCCAAGATCTGTGGCACTAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGAAGAGATAGTTTCATTACACATAGAGCCTTCTGTGATGCATTAGCA
GATGAAAGTGCAAGATCAGCCATGGCATTAAACCCTCTTCTCTCTTCTTACAACAATAATAATAATAATAATTCACAAGAATTCCTTATTAATAATAATAATAATTTTGG
TCTTAAACGAGATTTCAACAATAATAACAACGACAATTTGAGAGCGGAGATTCCGCCGTGGCTACAACCATCAGATCTCCGAGCGGAGATGATGATGGGGAGTGGTGGTC
AGGACGAGCACTCTCATCAAACCCTAAACCCTAACCCTAACCCAAGCGGACATGGGTGCGGGGCCACTAGCCTTCCTCCTCCGGCATATCAATCGTGTTCTGTTTCTTCT
CCTCATATATCAGCGACTGCACTGCTGCAGAAGGCAGCTCAGATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTATGCCAAGGCCCCACAAGCTTCTTCA
CGTGTCTACAGGCAATTTTGGAGAGATAGGATTATGGTCACGTGAAGTTGAGATGGGTAGAGGAGGAGGAGGAGGAGCTGTGAGTTGTAGTAGTAGTAGTTGTACTGATT
ATGGGAATAAAGTAACCAATGCTTCTGCTTCTGCTTCTGCACCAACCACTTTTCTTCATGACATGATCAATTCCCTCTCTTCTCCTTCTCCTTCTAATCCTTTCATCCAA
GATAATCCCTCCTTCAACGACCCCGCCGCCGCCACTTTCGCCACTATGCATCATCACACCGCCCCCATCCCCGTTCCCACCACGGCTGCCACCGCTCTGGGCGGTCGAAA
CGATGGTTTAACGAGAGATTTCTTGGGACTTCGCCCTCTTTCTCATGGAGATATTCTAAGCCTTACTGGTTTTGGAAATTGCATTGTTCCTAATTCCTCCAATCTTCACA
ATCAAATCCAGAAGCCATGGCAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAACAACAAATTGTAGTGATGGATGAAAATTTGTCCAATTTGACTTCTGCATCTGGTGAAGCTACTGCCAG
TGTTTCTTCTGCCAATAAAACTGAATTCCCCAATCAATATTTTGCCCCTCAAACCCAACCACCACCTCCTCCAAAGAAGAAGCGCAACCTTCCTGGAAATCCCGACCCAG
ATGCGGAAGTGATAGCATTATCGCCGAAGACGTTGATGGCGACGAATAGATTCGTGTGCGAGATATGTAACAAAGGGTTTCAAAGAGATCAGAATCTTCAATTACATAGA
AGAGGACACAATTTACCATGGAAATTAAAGGCAAGATCAAATAAAGAAGTAATAAAGAAGAAGGTATATGTTTGTCCAGAAGTGAGTTGTGTTCATCATGATCCATCAAG
AGCACTTGGAGATCTAACAGGGATAAAGAAGCACTTCTGTAGAAAGCATGGTGAAAAGAAGTGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGA
AAGCTCACTCCAAGATCTGTGGCACTAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGAAGAGATAGTTTCATTACACATAGAGCCTTCTGTGATGCATTAGCA
GATGAAAGTGCAAGATCAGCCATGGCATTAAACCCTCTTCTCTCTTCTTACAACAATAATAATAATAATAATTCACAAGAATTCCTTATTAATAATAATAATAATTTTGG
TCTTAAACGAGATTTCAACAATAATAACAACGACAATTTGAGAGCGGAGATTCCGCCGTGGCTACAACCATCAGATCTCCGAGCGGAGATGATGATGGGGAGTGGTGGTC
AGGACGAGCACTCTCATCAAACCCTAAACCCTAACCCTAACCCAAGCGGACATGGGTGCGGGGCCACTAGCCTTCCTCCTCCGGCATATCAATCGTGTTCTGTTTCTTCT
CCTCATATATCAGCGACTGCACTGCTGCAGAAGGCAGCTCAGATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTATGCCAAGGCCCCACAAGCTTCTTCA
CGTGTCTACAGGCAATTTTGGAGAGATAGGATTATGGTCACGTGAAGTTGAGATGGGTAGAGGAGGAGGAGGAGGAGCTGTGAGTTGTAGTAGTAGTAGTTGTACTGATT
ATGGGAATAAAGTAACCAATGCTTCTGCTTCTGCTTCTGCACCAACCACTTTTCTTCATGACATGATCAATTCCCTCTCTTCTCCTTCTCCTTCTAATCCTTTCATCCAA
GATAATCCCTCCTTCAACGACCCCGCCGCCGCCACTTTCGCCACTATGCATCATCACACCGCCCCCATCCCCGTTCCCACCACGGCTGCCACCGCTCTGGGCGGTCGAAA
CGATGGTTTAACGAGAGATTTCTTGGGACTTCGCCCTCTTTCTCATGGAGATATTCTAAGCCTTACTGGTTTTGGAAATTGCATTGTTCCTAATTCCTCCAATCTTCACA
ATCAAATCCAGAAGCCATGGCAAGGTTAG
Protein sequenceShow/hide protein sequence
MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKTEFPNQYFAPQTQPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHR
RGHNLPWKLKARSNKEVIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALA
DESARSAMALNPLLSSYNNNNNNNSQEFLINNNNNFGLKRDFNNNNNDNLRAEIPPWLQPSDLRAEMMMGSGGQDEHSHQTLNPNPNPSGHGCGATSLPPPAYQSCSVSS
PHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLLHVSTGNFGEIGLWSREVEMGRGGGGGAVSCSSSSCTDYGNKVTNASASASAPTTFLHDMINSLSSPSPSNPFIQ
DNPSFNDPAAATFATMHHHTAPIPVPTTAATALGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHNQIQKPWQG