; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G005770 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G005770
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein indeterminate-domain 7-like
Genome locationchr07:6115656..6118899
RNA-Seq ExpressionLsi07G005770
SyntenyLsi07G005770
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465570.1 PREDICTED: protein indeterminate-domain 7-like, partial [Cucumis melo]3.9e-19975.8Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL
        MIKS LFQ Q+QAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQ+FST PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC KGFQRDQNL
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL

Query:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR
        QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFSRR
Subjt:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR

Query:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TPISH--LNFQITQQTHFNPP-------LDHFSLKKEHQLI-
        DSFITHRAFCDALAEESARAITSN PILITNNNN         N NQNHLLPPLSS  TP  H  LNFQITQQTHFN P        ++ SLKKEH+ + 
Subjt:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TPISH--LNFQITQQTHFNPP-------LDHFSLKKEHQLI-

Query:  ------NNNIPPWLGCP---NSSSN-NNNHPIINPNHNH--NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN
              NNNIPPWL  P   NS+SN +N+H IINPNHN+  NL PTSLHLI SASPSS HMSATALLQKAAQMG+TMS+N          +NNNNNNNNN
Subjt:  ------NNNIPPWLGCP---NSSSN-NNNHPIINPNHNH--NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN

Query:  NNNNNNNN----NTNCNFGLNLSS------SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDS
        NNN    +    +TNCNFGLNLSS      +SSSSRDIH  QN IL              AAGLSHALPFYRN   NF+G G SFELDQFGGVFKK  D 
Subjt:  NNNNNNNN----NTNCNFGLNLSS------SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDS

Query:  SDQQ---AGLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQST
           Q   AGLSTRDFLGLRAISHTEFL+NIAAAG +++CINN +N  AAQ PQ TQIQNQST
Subjt:  SDQQ---AGLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQST

XP_011652348.1 protein indeterminate-domain 7 [Cucumis sativus]6.1e-20076.84Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL
        MIKS L Q Q+QAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQ+FST PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC KGFQRDQNL
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL

Query:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR
        QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFSRR
Subjt:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR

Query:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TP--ISHLNFQITQQTHFNPP-------LDHFSLKKE-HQL-
        DSFITHRAFCDALAEESARAITSN PILI       NNNNNNY  NQNHLLPPLSS  TP   S LNFQITQQTHFN P        ++ SLKKE HQL 
Subjt:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TP--ISHLNFQITQQTHFNPP-------LDHFSLKKE-HQL-

Query:  ------INNNIPPWLGCP---NSSSNN-NNHPIINPNHNH-NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN
               NNNIPPWL  P   NS+SNN N+H IINPNHNH NL PTSLHLI SASPSS HMSATALLQKAAQMG+TMS+N        + +NNNNNNNN 
Subjt:  ------INNNIPPWLGCP---NSSSNN-NNHPIINPNHNH-NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN

Query:  NNNNNNNNNTNCNFGLNLSS--SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDSSDQQ---A
           +    +TNCNFGLNLSS  +SSSSRDIH  QNQIL              AAGLSHALPFYRN   +F+G G SFELDQFGGVFKK  D        A
Subjt:  NNNNNNNNNTNCNFGLNLSS--SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDSSDQQ---A

Query:  GLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQSTWQG
        GLSTRDFLGLRAISHTEFL+NIAAAG +++CINN++N  AAQ PQ TQIQNQSTWQG
Subjt:  GLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQSTWQG

XP_038904337.1 protein indeterminate-domain 7-like isoform X1 [Benincasa hispida]2.4e-23384.39Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF-STPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
        MIKS LFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF +T PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF-STPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR
        LQLHRRGHNLPWKLKQR NKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREY+CDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP-ISHLNFQITQQTHFN-PPLDHF-------SLKKEHQLINNN
        RDSFITHRAFCDALAEESARAITSNPIL+T          NN N NQNHLLPPLSSTP ISHLNFQITQQTHFN PPLD+        SLKKE    N N
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP-ISHLNFQITQQTHFN-PPLDHF-------SLKKEHQLINNN

Query:  IPPWLGCPNSSS---NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNC
        IPPWLG PNS+S   NNN+H IINPNHN NL PTSLHLI SASPSS HMSATALLQKAAQMGATMS+NTEPPHTIT            +N+NNNNNNTNC
Subjt:  IPPWLGCPNSSS---NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNC

Query:  NFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN--NFQGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFL
        NFGL+LSS+SSSSRDIHQQQNQ LM+M ST+N  STTEAAGLSHALPFYRN  NF+GGASFEL+QFGGVFKK   + D QAGLSTRDFLGLRAISHTEFL
Subjt:  NFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN--NFQGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFL

Query:  TNIAAAGYNNCI-NNHNNNVAAQTP-QTQIQNQSTWQG
        +NIAAAGY+NCI NN+NNNVAAQTP QTQIQNQSTWQG
Subjt:  TNIAAAGYNNCI-NNHNNNVAAQTP-QTQIQNQSTWQG

XP_038904341.1 protein indeterminate-domain 7-like isoform X2 [Benincasa hispida]2.4e-23384.39Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF-STPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
        MIKS LFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF +T PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF-STPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR
        LQLHRRGHNLPWKLKQR NKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREY+CDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP-ISHLNFQITQQTHFN-PPLDHF-------SLKKEHQLINNN
        RDSFITHRAFCDALAEESARAITSNPIL+T          NN N NQNHLLPPLSSTP ISHLNFQITQQTHFN PPLD+        SLKKE    N N
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP-ISHLNFQITQQTHFN-PPLDHF-------SLKKEHQLINNN

Query:  IPPWLGCPNSSS---NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNC
        IPPWLG PNS+S   NNN+H IINPNHN NL PTSLHLI SASPSS HMSATALLQKAAQMGATMS+NTEPPHTIT            +N+NNNNNNTNC
Subjt:  IPPWLGCPNSSS---NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNC

Query:  NFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN--NFQGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFL
        NFGL+LSS+SSSSRDIHQQQNQ LM+M ST+N  STTEAAGLSHALPFYRN  NF+GGASFEL+QFGGVFKK   + D QAGLSTRDFLGLRAISHTEFL
Subjt:  NFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN--NFQGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFL

Query:  TNIAAAGYNNCI-NNHNNNVAAQTP-QTQIQNQSTWQG
        +NIAAAGY+NCI NN+NNNVAAQTP QTQIQNQSTWQG
Subjt:  TNIAAAGYNNCI-NNHNNNVAAQTP-QTQIQNQSTWQG

XP_038904345.1 protein indeterminate-domain 7-like isoform X3 [Benincasa hispida]2.4e-23384.39Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF-STPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
        MIKS LFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF +T PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYF-STPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR
        LQLHRRGHNLPWKLKQR NKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREY+CDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP-ISHLNFQITQQTHFN-PPLDHF-------SLKKEHQLINNN
        RDSFITHRAFCDALAEESARAITSNPIL+T          NN N NQNHLLPPLSSTP ISHLNFQITQQTHFN PPLD+        SLKKE    N N
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP-ISHLNFQITQQTHFN-PPLDHF-------SLKKEHQLINNN

Query:  IPPWLGCPNSSS---NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNC
        IPPWLG PNS+S   NNN+H IINPNHN NL PTSLHLI SASPSS HMSATALLQKAAQMGATMS+NTEPPHTIT            +N+NNNNNNTNC
Subjt:  IPPWLGCPNSSS---NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNC

Query:  NFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN--NFQGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFL
        NFGL+LSS+SSSSRDIHQQQNQ LM+M ST+N  STTEAAGLSHALPFYRN  NF+GGASFEL+QFGGVFKK   + D QAGLSTRDFLGLRAISHTEFL
Subjt:  NFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN--NFQGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFL

Query:  TNIAAAGYNNCI-NNHNNNVAAQTP-QTQIQNQSTWQG
        +NIAAAGY+NCI NN+NNNVAAQTP QTQIQNQSTWQG
Subjt:  TNIAAAGYNNCI-NNHNNNVAAQTP-QTQIQNQSTWQG

TrEMBL top hitse value%identityAlignment
A0A0A0LID9 C2H2-type domain-containing protein2.9e-20076.84Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL
        MIKS L Q Q+QAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQ+FST PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC KGFQRDQNL
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL

Query:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR
        QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFSRR
Subjt:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR

Query:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TP--ISHLNFQITQQTHFNPP-------LDHFSLKKE-HQL-
        DSFITHRAFCDALAEESARAITSN PILI       NNNNNNY  NQNHLLPPLSS  TP   S LNFQITQQTHFN P        ++ SLKKE HQL 
Subjt:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TP--ISHLNFQITQQTHFNPP-------LDHFSLKKE-HQL-

Query:  ------INNNIPPWLGCP---NSSSNN-NNHPIINPNHNH-NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN
               NNNIPPWL  P   NS+SNN N+H IINPNHNH NL PTSLHLI SASPSS HMSATALLQKAAQMG+TMS+N        + +NNNNNNNN 
Subjt:  ------INNNIPPWLGCP---NSSSNN-NNHPIINPNHNH-NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN

Query:  NNNNNNNNNTNCNFGLNLSS--SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDSSDQQ---A
           +    +TNCNFGLNLSS  +SSSSRDIH  QNQIL              AAGLSHALPFYRN   +F+G G SFELDQFGGVFKK  D        A
Subjt:  NNNNNNNNNTNCNFGLNLSS--SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDSSDQQ---A

Query:  GLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQSTWQG
        GLSTRDFLGLRAISHTEFL+NIAAAG +++CINN++N  AAQ PQ TQIQNQSTWQG
Subjt:  GLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQSTWQG

A0A1S3CQM6 protein indeterminate-domain 7-like1.9e-19975.8Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL
        MIKS LFQ Q+QAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQ+FST PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC KGFQRDQNL
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNL

Query:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR
        QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFSRR
Subjt:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRR

Query:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TPISH--LNFQITQQTHFNPP-------LDHFSLKKEHQLI-
        DSFITHRAFCDALAEESARAITSN PILITNNNN         N NQNHLLPPLSS  TP  H  LNFQITQQTHFN P        ++ SLKKEH+ + 
Subjt:  DSFITHRAFCDALAEESARAITSN-PILITNNNNYNNNNNNNYNNNQNHLLPPLSS--TPISH--LNFQITQQTHFNPP-------LDHFSLKKEHQLI-

Query:  ------NNNIPPWLGCP---NSSSN-NNNHPIINPNHNH--NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN
              NNNIPPWL  P   NS+SN +N+H IINPNHN+  NL PTSLHLI SASPSS HMSATALLQKAAQMG+TMS+N          +NNNNNNNNN
Subjt:  ------NNNIPPWLGCP---NSSSN-NNNHPIINPNHNH--NLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNN

Query:  NNNNNNNN----NTNCNFGLNLSS------SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDS
        NNN    +    +TNCNFGLNLSS      +SSSSRDIH  QN IL              AAGLSHALPFYRN   NF+G G SFELDQFGGVFKK  D 
Subjt:  NNNNNNNN----NTNCNFGLNLSS------SSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN---NFQG-GASFELDQFGGVFKKTTDS

Query:  SDQQ---AGLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQST
           Q   AGLSTRDFLGLRAISHTEFL+NIAAAG +++CINN +N  AAQ PQ TQIQNQST
Subjt:  SDQQ---AGLSTRDFLGLRAISHTEFLTNIAAAG-YNNCINNHNNNVAAQTPQ-TQIQNQST

A0A6J1GC33 protein indeterminate-domain 7-like1.6e-16966.61Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYS-GQYFST-PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQ
        MIKS LF  QAQAMEENLSNLTSASGEAS+CSGN SDQIPTNYS G YFS  PPPPPK+KRNLPGNPDPDAEV+ALSPKTLMATNRFVCEIC KGFQRDQ
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYS-GQYFST-PPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFS
        NLQLHRRGHNLPWKLK RANKE IRKKVYVCPETSCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFS

Query:  RRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP---ISHLNFQITQQTHFNP-PLDHFSLKKEH---QLINNNI
        RRDSFITHRAFCDALAEESAR+IT+NP+L+TNNN                  PPL   P   ISHLNFQ   QTHFNP  ++ F+LKKEH   Q   N I
Subjt:  RRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP---ISHLNFQITQQTHFNP-PLDHFSLKKEH---QLINNNI

Query:  PPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMS--NNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNF
        PPWL    + + + N  IINPNH   L  TSL+L  ++S  S HMSATALLQKAAQMGATMS  NN E PHTI      ++ +++ ++++   NN  CNF
Subjt:  PPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMS--NNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNF

Query:  GLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNN--------------------FQGGASFELDQFGGVFKKTTDSSDQQA--G
        GLNLSSSSSS       QNQ+LM          T    GLS ALP YRN                     F+ GASFE+D+FGG+ KK    +D  A  G
Subjt:  GLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNN--------------------FQGGASFELDQFGGVFKKTTDSSDQQA--G

Query:  LSTRDFLGLRAISHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG
        LSTRDFLGLRA+SHTEFL+NIAAAGY NCIN      A QTPQTQI+NQ +WQG
Subjt:  LSTRDFLGLRAISHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG

A0A6J1IU27 protein indeterminate-domain 7-like8.7e-16064.37Show/hide
Query:  AQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLP
        A  ME+ +SNLTSASGE SACSGN SDQ+P NYSGQYFSTPPPPPKKKRNLPGNPDPDAEV+ALSPKTLMATNRFVCEIC KGFQRDQNLQLH+RGHNLP
Subjt:  AQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFC
        WKLKQRANKEVIRKKVYVCPETSCVHHDP RALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFC

Query:  DALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNP-PLDHFSLKKEHQLI----NNNIPPWLGCPNSSS---
        DALAEESARAIT+                +N NNNQN    P+SS+ ISHLNFQ       NP  ++ FSLKKEHQ I    N NIPPW+GCP+S S   
Subjt:  DALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNP-PLDHFSLKKEHQLI----NNNIPPWLGCPNSSS---

Query:  ---------NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNL
                 NN++  I+NPN+        LHLIPS+SP S HMSATALLQKAAQMGATMS                        N+NNN N         
Subjt:  ---------NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNL

Query:  SSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN-----NFQGGASFELDQFGGVFKKT---TDSSDQQAGLSTRDFLGLRAISHTEFL
         SSSSSSRD H   +QILM         + +E  GL HALP + N     N   G  FEL++FGG F+K       SD+  GLSTRDFLGLR ISHTEFL
Subjt:  SSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRN-----NFQGGASFELDQFGGVFKKT---TDSSDQQAGLSTRDFLGLRAISHTEFL

Query:  TNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG
         NIAA GY+NCIN      + QTP+TQ  NQ  WQG
Subjt:  TNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG

A0A6J1KE03 protein indeterminate-domain 7-like3.5e-16967.33Show/hide
Query:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYS-GQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN
        MIKS LF  QAQAMEENLSNLTSASGEAS+CSGN SDQIPTNYS G YFS PPP PK+KR+LPGNPDPDAEV+ALSPKTLMATNRFVCEIC KGFQRDQN
Subjt:  MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYS-GQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR
        LQLHRRGHNLPWKLK RANKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREY+CDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP---ISHLNFQITQQTHFNP-PLDHFSLKKEH---QLINNNIP
        RDSFITHRAFCDALAEESAR IT+NPIL+TNNN                  PPL   P   ISHLNFQ   QTHFNP  ++ F+LKKEH   Q   N IP
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTP---ISHLNFQITQQTHFNP-PLDHFSLKKEH---QLINNNIP

Query:  PWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPS--SSHMSATALLQKAAQMGATMS--NNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCN
        PWL    + S + N  IINPNH   L  TSL+L P+AS S  S HMSATALLQKAAQMGATMS  NN E PHTI      ++ +++ ++++   NN  CN
Subjt:  PWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPS--SSHMSATALLQKAAQMGATMS--NNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCN

Query:  FGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNN-------------------FQGGASFELDQFGGVFKKTTDSSDQQA--G
        FGLNLSSSSSS       QNQ+LM          T    GLS ALP YRN                    F+ GASFE+D+FGG+ KK    +D  A  G
Subjt:  FGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNN-------------------FQGGASFELDQFGGVFKKTTDSSDQQA--G

Query:  LSTRDFLGLRAISHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG
        LSTRDFLGLRA+SHTEFL+NIAAAGY NCIN      A QTPQTQIQNQ +WQG
Subjt:  LSTRDFLGLRAISHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG

SwissProt top hitse value%identityAlignment
O22759 Protein indeterminate-domain 121.8e-8565.35Show/hide
Query:  LTSASGEASACSGNHSDQIPTNYSGQY-------FSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKL
        L+S S EASA SGN++      +SG +         T    PKKKR LPGNPDPDAEVIALSPKTL+ATNRFVCEIC KGFQRDQNLQLHRRGHNLPWKL
Subjt:  LTSASGEASACSGNHSDQIPTNYSGQY-------FSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKL

Query:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDAL
        KQ+  KE  +KKVYVCPET+C HH PSRALGDLTGIKKHFCRKHGEKKWKC+KCSK YAVQSDWKAH+K CGTR+Y+CDCGTLFSR+D+FITHRAFCDAL
Subjt:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDAL

Query:  AEESARAITSNPILITNNN--------NYNNNNNNNYNNNQNHLLPPLSSTPIS
        AEESAR  +++   +TN N         +N +++  + ++   + P LS+  +S
Subjt:  AEESARAITSNPILITNNN--------NYNNNNNNNYNNNQNHLLPPLSSTPIS

Q700D2 Zinc finger protein JACKDAW2.1e-8655.1Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR
        KKKRN PG PDPDA+VIALSP TLMATNRFVCEIC KGFQRDQNLQLHRRGHNLPWKLKQR+ +EVI+KKVY+CP  +CVHHD SRALGDLTGIKKH+ R
Subjt:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAIT---SNPILITNNNNYNNNNNNNYNNNQ------
        KHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSR+DSFITHRAFCDAL EE AR  +   +NP++ T N N+ N +N   N N       
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAIT---SNPILITNNNNYNNNNNNNYNNNQ------

Query:  -------------------NHLLPPLSSTPISHL-NFQITQQTHFNP----PLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHPIINPNHNHNLAPTS
                            H L  + +  +S +     T   H  P     L  FS   + Q+   +  P L   +SS++      +    +  L  +S
Subjt:  -------------------NHLLPPLSSTPISHL-NFQITQQTHFNP----PLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHPIINPNHNHNLAPTS

Query:  LHLIPSASPSS------SHMSATALLQKAAQMGATMSNNTEPP
           + S+S  +      S MSATALLQKAAQMG+T SN++  P
Subjt:  LHLIPSASPSS------SHMSATALLQKAAQMGATMSNNTEPP

Q8H1F5 Protein indeterminate-domain 71.2e-9448.7Show/hide
Query:  QQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNYSGQYFS---TPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLH
        QQQ Q MEEN+SNLTSASG +AS  SGN ++   +N +  +      P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+C KGFQRDQNLQLH
Subjt:  QQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNYSGQYFS---TPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLH

Query:  RRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSF
        +RGHNLPWKLKQR+NK+V+RKKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSRRDSF
Subjt:  RRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSF

Query:  ITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEH-QLINNNIPPWLGCPNSSS
        ITHRAFCDALAEESARA+  NPI+I  +N+ +++++    N                + F  + Q   +    H  +K+E  Q    NIPPWL   N + 
Subjt:  ITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEH-QLINNNIPPWLGCPNSSS

Query:  NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRD
        N NN  +  P  +     T     P  SP+   MSATALLQKAAQMG+T S           TT      ++ ++ NN    T      +        +D
Subjt:  NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRD

Query:  IHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFKKTTDSSD--QQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCIN
         +   +Q                                GG     + F G F    + +D     G  TRDFLGLR++ SH E L+   A    NC+N
Subjt:  IHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFKKTTDSSD--QQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCIN

Q9LRW7 Protein indeterminate-domain 112.0e-9246.64Show/hide
Query:  LFQQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNY-----------SGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKG
        L Q Q    +EN+SNLTSASG +AS  SGN ++   +NY             Q         KK+RN PGNPDP++EVIALSPKTLMATNRFVCEIC KG
Subjt:  LFQQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNY-----------SGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDC
        FQRDQNLQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDC

Query:  GTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNN--------NNNYNNNQNHLLPPLSSTPISHLNFQITQQTHF-------------N
        GTLFSRRDSFITHRAFC+ALAEE+AR      ++I  N N N  N        ++ ++++Q      +SS+  S  N  I    HF             N
Subjt:  GTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNN--------NNNYNNNQNHLLPPLSSTPISHLNFQITQQTHF-------------N

Query:  PPLDHFSLKKEHQ----LINNN---IPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTT
          L  F +KKE Q    ++N +   IPPWL  P   +  +++P  NP++      +   L   ASP+   MSATALLQKAAQMG+T +    P      +
Subjt:  PPLDHFSLKKEHQ----LINNN---IPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTT

Query:  TNNNN---------NNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFK
        T+NNN          + +   ++NNNN+      +     ++S  D H ++            +   T AAG                           +
Subjt:  TNNNN---------NNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFK

Query:  KTTDSSDQQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG
        K+T S   + GL TRDFLGLR + SH E L   + AG  +CIN+  ++     P         WQG
Subjt:  KTTDSSDQQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 14.4e-8454.82Show/hide
Query:  MEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKL
        M  +L N ++ SGEAS    +  +Q P          P    KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEIC KGFQRDQNLQLHRRGHNLPWKL
Subjt:  MEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKL

Query:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDAL
        +Q++NKEV +KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSK CGT+EYKCDCGTLFSRRDSFITHRAFCDAL
Subjt:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDAL

Query:  AEESARAITS-----NPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHP
        AEE+AR+  S     NP ++T  N   N      +     +    SST    L  + ++     P +   + K     +  +   + G   SSS + +  
Subjt:  AEESARAITS-----NPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHP

Query:  IINPNHNHNLAPTSLHLIPSASPSSSH---------------MSATALLQKAAQMGATMSNNT
          + +     A +S     S   S+SH               MSATALLQKAAQMGA  S  +
Subjt:  IINPNHNHNLAPTSLHLIPSASPSSSH---------------MSATALLQKAAQMGATMSNNT

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 78.7e-9648.7Show/hide
Query:  QQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNYSGQYFS---TPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLH
        QQQ Q MEEN+SNLTSASG +AS  SGN ++   +N +  +      P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+C KGFQRDQNLQLH
Subjt:  QQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNYSGQYFS---TPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLH

Query:  RRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSF
        +RGHNLPWKLKQR+NK+V+RKKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+KTCGT+EYKCDCGTLFSRRDSF
Subjt:  RRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSF

Query:  ITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEH-QLINNNIPPWLGCPNSSS
        ITHRAFCDALAEESARA+  NPI+I  +N+ +++++    N                + F  + Q   +    H  +K+E  Q    NIPPWL   N + 
Subjt:  ITHRAFCDALAEESARAITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEH-QLINNNIPPWLGCPNSSS

Query:  NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRD
        N NN  +  P  +     T     P  SP+   MSATALLQKAAQMG+T S           TT      ++ ++ NN    T      +        +D
Subjt:  NNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRD

Query:  IHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFKKTTDSSD--QQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCIN
         +   +Q                                GG     + F G F    + +D     G  TRDFLGLR++ SH E L+   A    NC+N
Subjt:  IHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFKKTTDSSD--QQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCIN

AT3G13810.1 indeterminate(ID)-domain 111.4e-9346.64Show/hide
Query:  LFQQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNY-----------SGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKG
        L Q Q    +EN+SNLTSASG +AS  SGN ++   +NY             Q         KK+RN PGNPDP++EVIALSPKTLMATNRFVCEIC KG
Subjt:  LFQQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNY-----------SGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDC
        FQRDQNLQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+EY+CDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDC

Query:  GTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNN--------NNNYNNNQNHLLPPLSSTPISHLNFQITQQTHF-------------N
        GTLFSRRDSFITHRAFC+ALAEE+AR      ++I  N N N  N        ++ ++++Q      +SS+  S  N  I    HF             N
Subjt:  GTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNN--------NNNYNNNQNHLLPPLSSTPISHLNFQITQQTHF-------------N

Query:  PPLDHFSLKKEHQ----LINNN---IPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTT
          L  F +KKE Q    ++N +   IPPWL  P   +  +++P  NP++      +   L   ASP+   MSATALLQKAAQMG+T +    P      +
Subjt:  PPLDHFSLKKEHQ----LINNN---IPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPPHTITTT

Query:  TNNNN---------NNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFK
        T+NNN          + +   ++NNNN+      +     ++S  D H ++            +   T AAG                           +
Subjt:  TNNNN---------NNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQFGGVFK

Query:  KTTDSSDQQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG
        K+T S   + GL TRDFLGLR + SH E L   + AG  +CIN+  ++     P         WQG
Subjt:  KTTDSSDQQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG

AT3G13810.2 indeterminate(ID)-domain 118.2e-8644.58Show/hide
Query:  LFQQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFVC
        L Q Q    +EN+SNLTSASG +AS  SGN ++   +NY   +        ++ + L  +                   P++EVIALSPKTLMATNRFVC
Subjt:  LFQQQAQAMEENLSNLTSASG-EASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFVC

Query:  EICGKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTR
        EIC KGFQRDQNLQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSKTCGT+
Subjt:  EICGKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTR

Query:  EYKCDCGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNN--------NNNYNNNQNHLLPPLSSTPISHLNFQITQQTHF--------
        EY+CDCGTLFSRRDSFITHRAFC+ALAEE+AR      ++I  N N N  N        ++ ++++Q      +SS+  S  N  I    HF        
Subjt:  EYKCDCGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNYNNNN--------NNNYNNNQNHLLPPLSSTPISHLNFQITQQTHF--------

Query:  -----NPPLDHFSLKKEHQ----LINNN---IPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPP
             N  L  F +KKE Q    ++N +   IPPWL  P   +  +++P  NP++      +   L   ASP+   MSATALLQKAAQMG+T +    P 
Subjt:  -----NPPLDHFSLKKEHQ----LINNN---IPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSASPSSSHMSATALLQKAAQMGATMSNNTEPP

Query:  HTITTTTNNNN---------NNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQ
             +T+NNN          + +   ++NNNN+      +     ++S  D H ++            +   T AAG                      
Subjt:  HTITTTTNNNN---------NNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNFQGGASFELDQ

Query:  FGGVFKKTTDSSDQQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG
             +K+T S   + GL TRDFLGLR + SH E L   + AG  +CIN+  ++     P         WQG
Subjt:  FGGVFKKTTDSSDQQAGLSTRDFLGLRAI-SHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG

AT3G50700.1 indeterminate(ID)-domain 23.1e-8554.82Show/hide
Query:  MEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKL
        M  +L N ++ SGEAS    +  +Q P          P    KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEIC KGFQRDQNLQLHRRGHNLPWKL
Subjt:  MEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKL

Query:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDAL
        +Q++NKEV +KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSK CGT+EYKCDCGTLFSRRDSFITHRAFCDAL
Subjt:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDAL

Query:  AEESARAITS-----NPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHP
        AEE+AR+  S     NP ++T  N   N      +     +    SST    L  + ++     P +   + K     +  +   + G   SSS + +  
Subjt:  AEESARAITS-----NPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHP

Query:  IINPNHNHNLAPTSLHLIPSASPSSSH---------------MSATALLQKAAQMGATMSNNT
          + +     A +S     S   S+SH               MSATALLQKAAQMGA  S  +
Subjt:  IINPNHNHNLAPTSLHLIPSASPSSSH---------------MSATALLQKAAQMGATMSNNT

AT5G03150.1 C2H2-like zinc finger protein1.5e-8755.1Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR
        KKKRN PG PDPDA+VIALSP TLMATNRFVCEIC KGFQRDQNLQLHRRGHNLPWKLKQR+ +EVI+KKVY+CP  +CVHHD SRALGDLTGIKKH+ R
Subjt:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAIT---SNPILITNNNNYNNNNNNNYNNNQ------
        KHGEKKWKC+KCSKKYAVQSDWKAH+KTCGTREYKCDCGTLFSR+DSFITHRAFCDAL EE AR  +   +NP++ T N N+ N +N   N N       
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARAIT---SNPILITNNNNYNNNNNNNYNNNQ------

Query:  -------------------NHLLPPLSSTPISHL-NFQITQQTHFNP----PLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHPIINPNHNHNLAPTS
                            H L  + +  +S +     T   H  P     L  FS   + Q+   +  P L   +SS++      +    +  L  +S
Subjt:  -------------------NHLLPPLSSTPISHL-NFQITQQTHFNP----PLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHPIINPNHNHNLAPTS

Query:  LHLIPSASPSS------SHMSATALLQKAAQMGATMSNNTEPP
           + S+S  +      S MSATALLQKAAQMG+T SN++  P
Subjt:  LHLIPSASPSS------SHMSATALLQKAAQMGATMSNNTEPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAAAAGTTCGTTGTTTCAACAACAAGCTCAAGCTATGGAGGAAAATTTGTCGAATTTAACTTCAGCTTCTGGTGAAGCTAGTGCCTGCTCCGGCAACCATTCCGA
TCAGATTCCGACCAACTATTCCGGCCAGTATTTTTCTACTCCACCACCACCACCAAAAAAGAAGAGAAACCTCCCCGGAAATCCAGACCCAGATGCGGAAGTGATAGCTT
TATCGCCGAAAACGCTTATGGCGACGAATAGATTTGTATGCGAGATTTGTGGGAAAGGGTTTCAGAGAGATCAGAATCTTCAACTTCATAGAAGAGGACACAATCTTCCA
TGGAAGTTAAAGCAAAGAGCAAACAAAGAAGTAATAAGGAAGAAAGTTTATGTGTGTCCAGAAACAAGTTGTGTTCATCATGATCCATCAAGGGCATTAGGGGATTTGAC
AGGAATCAAGAAGCACTTTTGTAGAAAACATGGTGAAAAGAAATGGAAATGTGATAAGTGCTCTAAGAAGTATGCAGTTCAATCAGATTGGAAAGCTCATTCTAAGACTT
GTGGCACTAGAGAGTACAAATGTGACTGTGGAACCCTTTTCTCAAGGAGGGATAGTTTCATCACCCACAGAGCATTTTGTGATGCTTTAGCAGAGGAAAGTGCAAGAGCC
ATTACATCAAACCCAATATTAATTACCAATAATAATAATTATAATAATAATAATAATAATAATTATAATAATAATCAAAACCACCTTCTTCCTCCACTTTCTTCCACTCC
CATTTCTCACTTAAACTTCCAAATCACACAACAAACCCATTTCAACCCTCCCTTAGATCATTTTTCTCTAAAAAAAGAACACCAATTAATAAATAATAATATTCCCCCAT
GGTTAGGCTGTCCTAATTCAAGCTCAAATAATAATAATCATCCAATTATAAACCCTAATCACAATCATAATCTTGCTCCCACTTCCCTTCATCTAATTCCAAGTGCTTCC
CCTTCTTCTTCACACATGTCAGCCACAGCACTTCTTCAGAAAGCAGCTCAAATGGGAGCTACCATGAGCAATAATACTGAGCCACCACACACAATCACCACCACCACCAA
TAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATACAAATTGTAATTTTGGCCTTAATTTGTCCTCCTCATCCTCCTCCTCACGTGACATTC
ATCAGCAGCAGAATCAGATTTTGATGATAATGGATAGTACTACTAATAGTTGTAGTACTACTGAAGCTGCAGGGCTTTCTCATGCACTGCCATTCTACAGGAATAATTTT
CAAGGAGGGGCTTCTTTTGAATTAGACCAATTTGGAGGGGTTTTCAAGAAAACAACAGATTCAAGTGATCAACAAGCTGGGCTTAGTACAAGAGATTTCTTGGGGTTAAG
AGCTATTTCTCATACTGAGTTTTTGACTAATATTGCAGCTGCTGGTTATAATAATTGCATCAATAATCATAATAATAATGTTGCTGCTCAAACTCCTCAAACTCAAATTC
AAAATCAATCCACCTGGCAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
TATTAATTATTCTACTTCATTGGTATTATCAAGATTTATAATATTTACTTCACCAAAAACCAAACCCAGGATTTTGATACTCTCAGAATTTTGTTTATTCATCTCTCTCT
CTCTCTCTCTCTTTCTTCTTCTCTTCTCTATTTCATGCATATTTAAAAAAAAAAAATTTAAAAAAGTGTGAAAATTGTCTCTTGGGTGAGTGCCAAATTCAACACAAGAT
TCAAATTCAAGAGCTTTGGTGAGTTTTAAGGATGATAAAAAGTTCGTTGTTTCAACAACAAGCTCAAGCTATGGAGGAAAATTTGTCGAATTTAACTTCAGCTTCTGGTG
AAGCTAGTGCCTGCTCCGGCAACCATTCCGATCAGATTCCGACCAACTATTCCGGCCAGTATTTTTCTACTCCACCACCACCACCAAAAAAGAAGAGAAACCTCCCCGGA
AATCCAGACCCAGATGCGGAAGTGATAGCTTTATCGCCGAAAACGCTTATGGCGACGAATAGATTTGTATGCGAGATTTGTGGGAAAGGGTTTCAGAGAGATCAGAATCT
TCAACTTCATAGAAGAGGACACAATCTTCCATGGAAGTTAAAGCAAAGAGCAAACAAAGAAGTAATAAGGAAGAAAGTTTATGTGTGTCCAGAAACAAGTTGTGTTCATC
ATGATCCATCAAGGGCATTAGGGGATTTGACAGGAATCAAGAAGCACTTTTGTAGAAAACATGGTGAAAAGAAATGGAAATGTGATAAGTGCTCTAAGAAGTATGCAGTT
CAATCAGATTGGAAAGCTCATTCTAAGACTTGTGGCACTAGAGAGTACAAATGTGACTGTGGAACCCTTTTCTCAAGGAGGGATAGTTTCATCACCCACAGAGCATTTTG
TGATGCTTTAGCAGAGGAAAGTGCAAGAGCCATTACATCAAACCCAATATTAATTACCAATAATAATAATTATAATAATAATAATAATAATAATTATAATAATAATCAAA
ACCACCTTCTTCCTCCACTTTCTTCCACTCCCATTTCTCACTTAAACTTCCAAATCACACAACAAACCCATTTCAACCCTCCCTTAGATCATTTTTCTCTAAAAAAAGAA
CACCAATTAATAAATAATAATATTCCCCCATGGTTAGGCTGTCCTAATTCAAGCTCAAATAATAATAATCATCCAATTATAAACCCTAATCACAATCATAATCTTGCTCC
CACTTCCCTTCATCTAATTCCAAGTGCTTCCCCTTCTTCTTCACACATGTCAGCCACAGCACTTCTTCAGAAAGCAGCTCAAATGGGAGCTACCATGAGCAATAATACTG
AGCCACCACACACAATCACCACCACCACCAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATAATACAAATTGTAATTTTGGCCTTAATTTG
TCCTCCTCATCCTCCTCCTCACGTGACATTCATCAGCAGCAGAATCAGATTTTGATGATAATGGATAGTACTACTAATAGTTGTAGTACTACTGAAGCTGCAGGGCTTTC
TCATGCACTGCCATTCTACAGGAATAATTTTCAAGGAGGGGCTTCTTTTGAATTAGACCAATTTGGAGGGGTTTTCAAGAAAACAACAGATTCAAGTGATCAACAAGCTG
GGCTTAGTACAAGAGATTTCTTGGGGTTAAGAGCTATTTCTCATACTGAGTTTTTGACTAATATTGCAGCTGCTGGTTATAATAATTGCATCAATAATCATAATAATAAT
GTTGCTGCTCAAACTCCTCAAACTCAAATTCAAAATCAATCCACCTGGCAAGGTTAGAACAACAAAATTTCAAAAGTTCCCCCTTTTTTTTTGTTTTTTGTTTTTTTTAT
CCCTTTTGGGGTTTAGATTTATATATGAAATATATTTTTGAATTTTGTTGTATTTACAAATGGTGTCTCTTTTTTTCCCTAAGGGGGAAGGACCACCACCTTCTTGAATT
AGGAATTTTTGGATCATAATATTCTATATGCTTTTGTATTTATGAATATATATCCATATCTATATATATATATATATTTCAATGTATAATACTTAATTAGTGAATTTCTT
GGTATGAATGAAATTATAAAGTTAAACCC
Protein sequenceShow/hide protein sequence
MIKSSLFQQQAQAMEENLSNLTSASGEASACSGNHSDQIPTNYSGQYFSTPPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICGKGFQRDQNLQLHRRGHNLP
WKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEESARA
ITSNPILITNNNNYNNNNNNNYNNNQNHLLPPLSSTPISHLNFQITQQTHFNPPLDHFSLKKEHQLINNNIPPWLGCPNSSSNNNNHPIINPNHNHNLAPTSLHLIPSAS
PSSSHMSATALLQKAAQMGATMSNNTEPPHTITTTTNNNNNNNNNNNNNNNNNNTNCNFGLNLSSSSSSSRDIHQQQNQILMIMDSTTNSCSTTEAAGLSHALPFYRNNF
QGGASFELDQFGGVFKKTTDSSDQQAGLSTRDFLGLRAISHTEFLTNIAAAGYNNCINNHNNNVAAQTPQTQIQNQSTWQG