; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023529 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023529
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein indeterminate-domain 7-like
Genome locationscaffold13:1423410..1426985
RNA-Seq ExpressionSpg023529
SyntenySpg023529
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606715.1 Protein indeterminate-domain 11, partial [Cucurbita argyrosperma subsp. sororia]2.2e-19772.56Show/hide
Query:  MEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        MEENLSNLTSASGEAS+CSGNRSDQIP NYSGG YFSA PPPPK+KRNLPGNPDPDAEV+ALSPKTLMATNRFVCEIC+KGFQRDQNLQLHRRGHNLPWK
Subjt:  MEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDA
        LK RANKEV+RKKVYVCPETSCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDA
Subjt:  LKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDA

Query:  LAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQN---LI
        LAEESAR IT+NPI++TNN      NPPL+  PISSI H NFQ QTHF    NPLDINSF+LKKEH     +P ++ NFIPPW     L NH+Q+   LI
Subjt:  LAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQN---LI

Query:  LNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---SSSRDN-NQILMGS
        +NPN+ LG TSL+    S+SPS PHMSATALLQKAAQMGATMSS++N            ++ HHV DSSA TNN +CNFGLNL   SSSRDN NQ+LMG+
Subjt:  LNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---SSSRDN-NQILMGS

Query:  E-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAITTSEGLSTRDFLG
        E G GLS ALP YRNK+                        L+Q+TF++N+N+ ++TTFSPSAF GASFEID FGG+LKKD   +    SEGLSTRDFLG
Subjt:  E-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAITTSEGLSTRDFLG

Query:  LRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG
        LRA+SH+EFLSN+ AAAGYGNC+N GAGQ+PQ+QI+ QPSWQG
Subjt:  LRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG

XP_022949460.1 protein indeterminate-domain 7-like [Cucurbita moschata]1.3e-20072.89Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA--PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        MIKSLLF  QAQAMEENLSNLTSASGEAS+CSGNRSDQIP NYSGG YFSA  PPPPK+KRNLPGNPDPDAEV+ALSPKTLMATNRFVCEIC+KGFQRDQ
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA--PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS
        NLQLHRRGHNLPWKLK RANKE IRKKVYVCPETSCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS

Query:  RRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQ
        RRDSFITHRAFCDALAEESAR+IT+NP+L+TNN      NPPL+  PISSI HLNFQ QTHF    NPLDINSF+LKKEH     +P ++ NFIPPWL+ 
Subjt:  RRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQ

Query:  DHLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---
            NH+Q+   LI+NPN+ LG TSL+    S+SPS PHMSATALLQKAAQMGATMSS++N            ++ HHV DSSA TNN +CNFGLNL   
Subjt:  DHLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---

Query:  SSSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SA
        SSSRDN NQ+LMG+E G GLS ALP YRNK+                        L+Q+TF+   N+N++TTFSPSAF GASFEID FGG+LKKD   + 
Subjt:  SSSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SA

Query:  ITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG
           SEGLSTRDFLGLRA+SH+EFLSN+ AAAGYGNC+N GAGQ+PQ+QI+ QPSWQG
Subjt:  ITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG

XP_022998314.1 protein indeterminate-domain 7-like [Cucurbita maxima]3.7e-20073.02Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPP-PKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQN
        MIKSLLF  QAQAMEENLSNLTSASGEAS+CSGNRSDQIP NYSGG YFSAPPP PK+KR+LPGNPDPDAEV+ALSPKTLMATNRFVCEICNKGFQRDQN
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPP-PKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
        LQLHRRGHNLPWKLK RANKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD
        RDSFITHRAFCDALAEESAR IT+NPIL+TNN      NPPL+  PISSI HLNFQ QTHF    NPLDINSF+LKKEH     +P ++ NFIPPWL+  
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD

Query:  HLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---S
           NH Q+   LI+NPN+ LG TSL+    S+S   PHMSATALLQKAAQMGATMSS++N            ++ HHV DSSA TNN +CNFGLNL   S
Subjt:  HLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---S

Query:  SSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAI
        SSRDN NQ+LMG+E G GLS ALP YRNK+                        LLQ+TFI    +N++TTFSPSAF GASFEID FGG+LKKD   +  
Subjt:  SSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAI

Query:  TTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG
          +EGLSTRDFLGLRA+SH+EFLSN+ AAAGYGNC+N GAGQ+PQ+QIQ QPSWQG
Subjt:  TTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG

XP_023525781.1 protein indeterminate-domain 7-like [Cucurbita pepo subsp. pepo]9.7e-20173.24Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQN
        MIKSLLF  QAQAMEENLSNLTSASGEAS+CSGNRSDQIP NYSGG YFSA PPPPK+KRNLPGNPDPDAEV+ALSPKTLMATNRFVCEICNKGFQRDQN
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
        LQLHRRGHNLPWKLK RANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD
        RDSFITHRAFCDALAEESAR+IT+NPIL+TNN      NPPL+  PISSI H NFQ QTHF    NPLDINSF+LKKEH     +P ++ NFIPPWL+  
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD

Query:  HLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---SSSR
        + + +H  LI+NPN+ LG TSL+    S+SPS PHMSATALLQKAAQMGATMSS++N            ++ HHV DSSA TNN +CNFGLNL   SSSR
Subjt:  HLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---SSSR

Query:  DN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAITTS
        DN NQ+LMG+E G GLS ALP YRNK+                        L+Q+TF+   N+N++TTFSPSAF GASFEID FGG+LKKD   +    S
Subjt:  DN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAITTS

Query:  EGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG
        EGLSTRDFLGLRA+SH+EFLSN+ AAAGYGNC+N GAGQ+P +QI+ QPSWQG
Subjt:  EGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG

XP_038904341.1 protein indeterminate-domain 7-like isoform X2 [Benincasa hispida]1.0e-18168.11Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA--PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        MIKSLLFQQQAQAMEENLSNLTSASGEASACSGN SDQIP NYS GQYF+   PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC KGFQRDQ
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA--PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS
        NLQLHRRGHNLPWKLKQR NKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSKTCGTREYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS

Query:  RRDSFITHRAFCDALAEESARAITSNPILITNN-NNNQNQ-NPPLISTPISSIPHLNFQ--TQTHFNNN--NNPLDINSF-SLKKEHHHQQNSPNNNNNF
        RRDSFITHRAFCDALAEESARAITSNPIL+TNN N NQN   PPL STP  +I HLNFQ   QTHFN+   +N   INSF SLKKE           N  
Subjt:  RRDSFITHRAFCDALAEESARAITSNPILITNN-NNNQNQ-NPPLISTPISSIPHLNFQ--TQTHFNNN--NNPLDINSF-SLKKEHHHQQNSPNNNNNF

Query:  IPPWL------SQDHLNNHHQNLILNPN-NSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNL
        IPPWL      S DH NNHHQ  I+NPN N+LGPTSL H +QSASPS PHMSATALLQKAAQMGATMSS++   + I  Q H  D++  NN +CNFGL+L
Subjt:  IPPWL------SQDHLNNHHQNLILNPN-NSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNL

Query:  ----SSSRDNNQ------ILMGS--------EGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFE
            SSSRD +Q      ++MGS        E AGLSHALP YRNK+N +                                            GGASFE
Subjt:  ----SSSRDNNQ------ILMGS--------EGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFE

Query:  IDAFGGVLKKDNSAITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMNG------AGQSP-QSQIQKQPSWQG
        ++ FGGV KK+   I    GLSTRDFLGLRAISH+EFLSN+ AAAGY NC+N       A Q+P Q+QIQ Q +WQG
Subjt:  IDAFGGVLKKDNSAITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMNG------AGQSP-QSQIQKQPSWQG

TrEMBL top hitse value%identityAlignment
A0A0A0LID9 C2H2-type domain-containing protein9.2e-18166.43Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNL
        MIKSLL Q Q+QAMEENLSNLTSASGEASACSGN SDQIP NYS GQ+FS PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC+KGFQRDQNL
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNL

Query:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRR
        QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRR
Subjt:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRR

Query:  DSFITHRAFCDALAEESARAITSNP--ILITNNNNNQNQN---PPLISTPISSI-PHLNFQ--TQTHFNNNN--NPLDINSFSLKKEHHH-QQNSPNNNN
        DSFITHRAFCDALAEESARAITSNP  ++  NNNNN NQN   PPL S    +I   LNFQ   QTHFNN    +    N+ SLKKE+H  Q N+ NN+N
Subjt:  DSFITHRAFCDALAEESARAITSNP--ILITNNNNNQNQN---PPLISTPISSI-PHLNFQ--TQTHFNNNN--NPLDINSFSLKKEHHH-QQNSPNNNN

Query:  NFIPPWL-----SQDHLNNHHQNLILNPNN---SLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFG
        N IPPWL     +    NNH+ + I+NPN+   +LGPTSL H +QSASPS PHMSATALLQKAAQMG+TMSS+SN+NNN    +  P  +   + +CNFG
Subjt:  NFIPPWL-----SQDHLNNHHQNLILNPNN---SLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFG

Query:  LNL------SSSRD--NNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGV
        LNL      SSSRD   NQIL     AGLSHALP YRNK  + +                                            G SFE+D FGGV
Subjt:  LNL------SSSRD--NNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGV

Query:  LKKDNS---AITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-----GAGQSPQ-SQIQKQPSWQG
         KK+N        + GLSTRDFLGLRAISH+EFLSN+AAA  + +C+N     GA Q+PQ +QIQ Q +WQG
Subjt:  LKKDNS---AITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-----GAGQSPQ-SQIQKQPSWQG

A0A1S3CQM6 protein indeterminate-domain 7-like2.0e-18066.78Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNL
        MIKSLLFQ Q+QAMEENLSNLTSASGEASACSGN SDQIP NYS GQ+FS PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEIC+KGFQRDQNL
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNL

Query:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRR
        QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRR
Subjt:  QLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRR

Query:  DSFITHRAFCDALAEESARAITSN-PILITNNNNNQNQN---PPLISTPISSI-PHLNFQ--TQTHFNNNN--NPLDINSFSLKKEHHHQQNSPNNNNNF
        DSFITHRAFCDALAEESARAITSN PILITNNNNN NQN   PPL S    +I   LNFQ   QTHFNN    +    N+ SLKKEH   QN+ ++NNN 
Subjt:  DSFITHRAFCDALAEESARAITSN-PILITNNNNNQNQN---PPLISTPISSI-PHLNFQ--TQTHFNNNN--NPLDINSFSLKKEHHHQQNSPNNNNNF

Query:  IPPWL---------SQDHLNNHHQNLILNPNN----SLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHH--VPDSSATNNP
        IPPWL         S DH  NHHQ  I+NPN+    +LGPTSL H +QSASPS PHMSATALLQKAAQMG+TMSS+SNNNNN    ++   P  +   + 
Subjt:  IPPWL---------SQDHLNNHHQNLILNPNN----SLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHH--VPDSSATNNP

Query:  SCNFGLNL----------SSSRDNNQ-ILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASF
        +CNFGLNL          SSSRD +Q  ++    AGLSHALP YRNK  N +                                            G SF
Subjt:  SCNFGLNL----------SSSRDNNQ-ILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASF

Query:  EIDAFGGVLKKDNS---AITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-----GAGQSPQ-SQIQKQ
        E+D FGGV KK+N        + GLSTRDFLGLRAISH+EFLSN+AAA  + +C+N     GA Q+PQ +QIQ Q
Subjt:  EIDAFGGVLKKDNS---AITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-----GAGQSPQ-SQIQKQ

A0A6J1GC33 protein indeterminate-domain 7-like6.1e-20172.89Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA--PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        MIKSLLF  QAQAMEENLSNLTSASGEAS+CSGNRSDQIP NYSGG YFSA  PPPPK+KRNLPGNPDPDAEV+ALSPKTLMATNRFVCEIC+KGFQRDQ
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSA--PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS
        NLQLHRRGHNLPWKLK RANKE IRKKVYVCPETSCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFS

Query:  RRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQ
        RRDSFITHRAFCDALAEESAR+IT+NP+L+TNN      NPPL+  PISSI HLNFQ QTHF    NPLDINSF+LKKEH     +P ++ NFIPPWL+ 
Subjt:  RRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQ

Query:  DHLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---
            NH+Q+   LI+NPN+ LG TSL+    S+SPS PHMSATALLQKAAQMGATMSS++N            ++ HHV DSSA TNN +CNFGLNL   
Subjt:  DHLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---

Query:  SSSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SA
        SSSRDN NQ+LMG+E G GLS ALP YRNK+                        L+Q+TF+   N+N++TTFSPSAF GASFEID FGG+LKKD   + 
Subjt:  SSSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SA

Query:  ITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG
           SEGLSTRDFLGLRA+SH+EFLSN+ AAAGYGNC+N GAGQ+PQ+QI+ QPSWQG
Subjt:  ITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG

A0A6J1IU27 protein indeterminate-domain 7-like1.4e-16062.68Show/hide
Query:  AQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        A  ME+ +SNLTSASGE SACSGNRSDQ+PANYSG  + + PPPPKKKRNLPGNPDPDAEV+ALSPKTLMATNRFVCEIC+KGFQRDQNLQLH+RGHNLP
Subjt:  AQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFC
        WKLKQRANKEVIRKKVYVCPETSCVHHDP RALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFC

Query:  DALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLS---------QDH-
        DALAEESARAIT+       +N N NQN P+     SSI HLNFQ         NPLDINSFSLKKEH   Q  P  NN  IPPW+           DH 
Subjt:  DALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLS---------QDH-

Query:  ----LNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDN-NQILM
            +NN H   I+NPNN L      H + S+SPS PHMSATALLQKAAQMGATMS+S+NN N  +                      SSSRDN +QILM
Subjt:  ----LNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDN-NQILM

Query:  -GSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTS-EGLSTRDFL
         GSEG GL HALP + NKSNN ++                                         F G  FE++ FGG  +KD      S EGLSTRDFL
Subjt:  -GSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTS-EGLSTRDFL

Query:  GLRAISHSEFLSNMAAAAGYGNCMNGAG-QSPQSQIQKQPSWQG
        GLR ISH+EFL+N+ AA GY NC+NG   Q+P++Q   QP WQG
Subjt:  GLRAISHSEFLSNMAAAAGYGNCMNGAG-QSPQSQIQKQPSWQG

A0A6J1KE03 protein indeterminate-domain 7-like1.8e-20073.02Show/hide
Query:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPP-PKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQN
        MIKSLLF  QAQAMEENLSNLTSASGEAS+CSGNRSDQIP NYSGG YFSAPPP PK+KR+LPGNPDPDAEV+ALSPKTLMATNRFVCEICNKGFQRDQN
Subjt:  MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPP-PKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
        LQLHRRGHNLPWKLK RANKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD
        RDSFITHRAFCDALAEESAR IT+NPIL+TNN      NPPL+  PISSI HLNFQ QTHF    NPLDINSF+LKKEH     +P ++ NFIPPWL+  
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD

Query:  HLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---S
           NH Q+   LI+NPN+ LG TSL+    S+S   PHMSATALLQKAAQMGATMSS++N            ++ HHV DSSA TNN +CNFGLNL   S
Subjt:  HLNNHHQN---LILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNN-------NNNIAEQHHVPDSSA-TNNPSCNFGLNL---S

Query:  SSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAI
        SSRDN NQ+LMG+E G GLS ALP YRNK+                        LLQ+TFI    +N++TTFSPSAF GASFEID FGG+LKKD   +  
Subjt:  SSRDN-NQILMGSE-GAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDN--SAI

Query:  TTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG
          +EGLSTRDFLGLRA+SH+EFLSN+ AAAGYGNC+N GAGQ+PQ+QIQ QPSWQG
Subjt:  TTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMN-GAGQSPQSQIQKQPSWQG

SwissProt top hitse value%identityAlignment
Q8H1F5 Protein indeterminate-domain 74.8e-10248.04Show/hide
Query:  MIKSLLF-QQQAQAMEENLSNLTSASG-EASACSGNRSDQIPAN---YSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQ
        M + +LF QQQ Q MEEN+SNLTSASG +AS  SGNR++   +N   +   Q F      K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+CNKGFQ
Subjt:  MIKSLLF-QQQAQAMEENLSNLTSASG-EASACSGNRSDQIPAN---YSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQ

Query:  RDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGT
        RDQNLQLH+RGHNLPWKLKQR+NK+V+RKKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSK+YAVQSDWKAH+KTCGT+EY+CDCGT
Subjt:  RDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGT

Query:  LFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPW
        LFSRRDSFITHRAFCDALAEESARA+  NPI+I  +N               S  H + QTQ +   +++  +I S S       Q+ S ++  N IPPW
Subjt:  LFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPW

Query:  LSQDHLNNHHQNLILNP----NNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNN
        L   + N +  N  L P    + + G +S  H      PS P MSATALLQKAAQMG+T S++                               SSR + 
Subjt:  LSQDHLNNHHQNLILNP----NNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNN

Query:  QILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTSEGLSTRD
          L+ +  A +  + P                G G             Q  ++ N   +          GG +F      G  K D   +    G  TRD
Subjt:  QILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTSEGLSTRD

Query:  FLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQ
        FLGLR++ SH+E LS    A   GNC+N +    Q Q
Subjt:  FLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQ

Q944L3 Zinc finger protein BALDIBIS6.9e-9355.46Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR
        K+KRNLPGNPDPDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQR NKE ++KKVY+CPE +CVHHDP+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR

Query:  KHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARAITSNP----------ILITNNNNNQNQNPPLISTP
        KHGEKKWKCDKCSK+YAV SDWKAHSK CGT+EYRCDCGTLFSR+DSFITHRAFCDALAEESAR ++  P          + + + N NQN     ++T 
Subjt:  KHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARAITSNP----------ILITNNNNNQNQNPPLISTP

Query:  ISSIPHLNFQTQTH---FNNNNNPLDI-----------------NSFSLKKEHHHQ---QNSPNNNNNFIPPWLSQDHLNNHHQNLILNPN--NSLGPTS
         S +    F T  +   F     P ++                 N + L+ +  HQ     + NNNNN +   +S++   +  +N+I N +  +S    +
Subjt:  ISSIPHLNFQTQTH---FNNNNNPLDI-----------------NSFSLKKEHHHQ---QNSPNNNNNFIPPWLSQDHLNNHHQNLILNPN--NSLGPTS

Query:  LHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNN
         +++ Q+    +  MSATALLQKAAQMG+  SSSS++N+
Subjt:  LHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNN

Q9LRW7 Protein indeterminate-domain 114.5e-10848.17Show/hide
Query:  LLFQQQAQAMEENLSNLTSASG-EASACSGNRSDQIPANY---------SGGQYFSAP--PPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        LL Q Q    +EN+SNLTSASG +AS  SGN ++   +NY            Q    P     KK+RN PGNPDP++EVIALSPKTLMATNRFVCEICNK
Subjt:  LLFQQQAQAMEENLSNLTSASG-EASACSGNRSDQIPANY---------SGGQYFSAP--PPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCD
        GFQRDQNLQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSD KAHSKTCGT+EYRCD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCD

Query:  CGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF--------NNNN
        CGTLFSRRDSFITHRAFC+ALAEE+AR +      +   N N NQ  PL+    +S PH + QTQ                     HF        N+NN
Subjt:  CGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF--------NNNN

Query:  NPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQ
        +   +++F +KKE     +  N +++ IPPWL+       H     NPN S G          ASP+   MSATALLQKAAQMG+T +            
Subjt:  NPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQ

Query:  HHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG--
          +P ++A    + N  L  + +      +M S    +S              NNNN+V              L Q    S  +++         FGG  
Subjt:  HHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG--

Query:  ASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG
         + E+ A  G  K   S     EGL TRDFLGLR + SH+E LS     AG G+C+N    S   Q+  +P WQG
Subjt:  ASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 16.1e-8947.23Show/hide
Query:  MEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL
        M  +L N ++ SGEAS    +  +Q P   S G         KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL
Subjt:  MEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKL

Query:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDAL
        +Q++NKEV +KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSDWKAHSK CGT+EY+CDCGTLFSRRDSFITHRAFCDAL
Subjt:  KQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDAL

Query:  AEESARAITS-----NPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD-----HLNN
        AEE+AR+  S     NP ++T  N   N  P  + T  + I   +  T     +   P +I   + K       N   +N  F   + S       +  +
Subjt:  AEESARAITS-----NPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQD-----HLNN

Query:  HHQNLILNPNNSLGPTSLHHHLQSASPSL--------PHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSC---NFGLNLSSSRDNNQ
             +   ++S+ P SL       S  L        P MSATALLQKAAQMGA  S  S     +     +  S++T+  +      GL L        
Subjt:  HHQNLILNPNNSLGPTSLHHHLQSASPSL--------PHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSC---NFGLNLSSSRDNNQ

Query:  ILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG
           G   +GL   +    +         + +G+G  VG+ N  S  L +     +  + +TTF    F G
Subjt:  ILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG

Q9ZWA6 Zinc finger protein MAGPIE2.3e-8848.55Show/hide
Query:  TSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEV
        T +S      S + +D +  ++        PP  KKKRNLPGNPDP+AEVIALSPKTLMATNRF+CEIC KGFQRDQNLQLHRRGHNLPWKLKQR +KEV
Subjt:  TSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEV

Query:  IRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARA-
         RK+VYVCPE SCVHH P+RALGDLTGIKKHFCRKHGEKKWKC+KC+KRYAVQSDWKAHSKTCGTREYRCDCGT+FSRRDSFITHRAFCDALAEE+AR  
Subjt:  IRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARA-

Query:  ----ITSNPILITNNNNNQNQNPPLISTP-ISSIPHLNF--QTQTHFNNNNNPLDINSF---------------SLKKEHHHQQ-----------NSP--
            + S      +N N       LI +P +   P   F      H +++  P+  N+F               S    +HHQQ           +SP  
Subjt:  ----ITSNPILITNNNNNQNQNPPLISTP-ISSIPHLNF--QTQTHFNNNNNPLDINSF---------------SLKKEHHHQQ-----------NSP--

Query:  ------NNNNNFIPPWLSQDHLNNHHQ--NLILNPNNSLGPTSLH------------HHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQH
               N NN      + D L  H    N++ +  N+ G TSL                 +AS ++ +MSATALLQKAAQMGAT S+S        +  
Subjt:  ------NNNNNFIPPWLSQDHLNNHHQ--NLILNPNNSLGPTSLH------------HHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQH

Query:  HVPDSSATNNPSCNFGLN---LSSSRDNNQILMGSEGAGLSHALPHYRN
        ++   ++ +N     G +    +S   N+  LM +   GL H + + RN
Subjt:  HVPDSSATNNPSCNFGLN---LSSSRDNNQILMGSEGAGLSHALPHYRN

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 73.4e-10348.04Show/hide
Query:  MIKSLLF-QQQAQAMEENLSNLTSASG-EASACSGNRSDQIPAN---YSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQ
        M + +LF QQQ Q MEEN+SNLTSASG +AS  SGNR++   +N   +   Q F      K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+CNKGFQ
Subjt:  MIKSLLF-QQQAQAMEENLSNLTSASG-EASACSGNRSDQIPAN---YSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQ

Query:  RDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGT
        RDQNLQLH+RGHNLPWKLKQR+NK+V+RKKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSK+YAVQSDWKAH+KTCGT+EY+CDCGT
Subjt:  RDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGT

Query:  LFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPW
        LFSRRDSFITHRAFCDALAEESARA+  NPI+I  +N               S  H + QTQ +   +++  +I S S       Q+ S ++  N IPPW
Subjt:  LFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPW

Query:  LSQDHLNNHHQNLILNP----NNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNN
        L   + N +  N  L P    + + G +S  H      PS P MSATALLQKAAQMG+T S++                               SSR + 
Subjt:  LSQDHLNNHHQNLILNP----NNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNN

Query:  QILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTSEGLSTRD
          L+ +  A +  + P                G G             Q  ++ N   +          GG +F      G  K D   +    G  TRD
Subjt:  QILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTSEGLSTRD

Query:  FLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQ
        FLGLR++ SH+E LS    A   GNC+N +    Q Q
Subjt:  FLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQ

AT3G13810.1 indeterminate(ID)-domain 113.2e-10948.17Show/hide
Query:  LLFQQQAQAMEENLSNLTSASG-EASACSGNRSDQIPANY---------SGGQYFSAP--PPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        LL Q Q    +EN+SNLTSASG +AS  SGN ++   +NY            Q    P     KK+RN PGNPDP++EVIALSPKTLMATNRFVCEICNK
Subjt:  LLFQQQAQAMEENLSNLTSASG-EASACSGNRSDQIPANY---------SGGQYFSAP--PPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCD
        GFQRDQNLQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSD KAHSKTCGT+EYRCD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCD

Query:  CGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF--------NNNN
        CGTLFSRRDSFITHRAFC+ALAEE+AR +      +   N N NQ  PL+    +S PH + QTQ                     HF        N+NN
Subjt:  CGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF--------NNNN

Query:  NPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQ
        +   +++F +KKE     +  N +++ IPPWL+       H     NPN S G          ASP+   MSATALLQKAAQMG+T +            
Subjt:  NPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQ

Query:  HHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG--
          +P ++A    + N  L  + +      +M S    +S              NNNN+V              L Q    S  +++         FGG  
Subjt:  HHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG--

Query:  ASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG
         + E+ A  G  K   S     EGL TRDFLGLR + SH+E LS     AG G+C+N    S   Q+  +P WQG
Subjt:  ASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG

AT3G13810.2 indeterminate(ID)-domain 113.2e-10145.96Show/hide
Query:  LLFQQQAQAMEENLSNLTSASG-EASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFV
        LL Q Q    +EN+SNLTSASG +AS  SGN ++   +NY            ++ + L  +                   P++EVIALSPKTLMATNRFV
Subjt:  LLFQQQAQAMEENLSNLTSASG-EASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFV

Query:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGT
        CEICNKGFQRDQNLQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSD KAHSKTCGT
Subjt:  CEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGT

Query:  REYRCDCGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF------
        +EYRCDCGTLFSRRDSFITHRAFC+ALAEE+AR +      +   N N NQ  PL+    +S PH + QTQ                     HF      
Subjt:  REYRCDCGTLFSRRDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF------

Query:  --NNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNN
          N+NN+   +++F +KKE     +  N +++ IPPWL+       H     NPN S G          ASP+   MSATALLQKAAQMG+T +      
Subjt:  --NNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNN

Query:  NNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPS
                +P ++A    + N  L  + +      +M S    +S              NNNN+V              L Q    S  +++        
Subjt:  NNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPS

Query:  AFGG--ASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG
         FGG   + E+ A  G  K   S     EGL TRDFLGLR + SH+E LS     AG G+C+N    S   Q+  +P WQG
Subjt:  AFGG--ASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG

AT3G13810.3 indeterminate(ID)-domain 111.5e-9845.95Show/hide
Query:  LSNLTSASG-EASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQN
        +SNLTSASG +AS  SGN ++   +NY            ++ + L  +                   P++EVIALSPKTLMATNRFVCEICNKGFQRDQN
Subjt:  LSNLTSASG-EASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQN

Query:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR
        LQLHRRGHNLPWKLKQR+NKEVIRKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSK+YAVQSD KAHSKTCGT+EYRCDCGTLFSR
Subjt:  LQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSR

Query:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF--------NNNNNPLDINS
        RDSFITHRAFC+ALAEE+AR +      +   N N NQ  PL+    +S PH + QTQ                     HF        N+NN+   +++
Subjt:  RDSFITHRAFCDALAEESARAITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQ--------------------THF--------NNNNNPLDINS

Query:  FSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSS
        F +KKE     +  N +++ IPPWL+       H     NPN S G          ASP+   MSATALLQKAAQMG+T +              +P ++
Subjt:  FSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSS

Query:  ATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG--ASFEIDA
        A    + N  L  + +      +M S    +S              NNNN+V              L Q    S  +++         FGG   + E+ A
Subjt:  ATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQSTFISNSNSNSSTTFSPSAFGG--ASFEIDA

Query:  FGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG
          G  K   S     EGL TRDFLGLR + SH+E LS     AG G+C+N    S   Q+  +P WQG
Subjt:  FGGVLKKDNSAITTSEGLSTRDFLGLRAI-SHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG

AT3G45260.1 C2H2-like zinc finger protein4.9e-9455.46Show/hide
Query:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR
        K+KRNLPGNPDPDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQR NKE ++KKVY+CPE +CVHHDP+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCR

Query:  KHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARAITSNP----------ILITNNNNNQNQNPPLISTP
        KHGEKKWKCDKCSK+YAV SDWKAHSK CGT+EYRCDCGTLFSR+DSFITHRAFCDALAEESAR ++  P          + + + N NQN     ++T 
Subjt:  KHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARAITSNP----------ILITNNNNNQNQNPPLISTP

Query:  ISSIPHLNFQTQTH---FNNNNNPLDI-----------------NSFSLKKEHHHQ---QNSPNNNNNFIPPWLSQDHLNNHHQNLILNPN--NSLGPTS
         S +    F T  +   F     P ++                 N + L+ +  HQ     + NNNNN +   +S++   +  +N+I N +  +S    +
Subjt:  ISSIPHLNFQTQTH---FNNNNNPLDI-----------------NSFSLKKEHHHQ---QNSPNNNNNFIPPWLSQDHLNNHHQNLILNPN--NSLGPTS

Query:  LHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNN
         +++ Q+    +  MSATALLQKAAQMG+  SSSS++N+
Subjt:  LHHHLQSASPSLPHMSATALLQKAAQMGATMSSSSNNNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAAAAGTTTGTTGTTTCAACAACAAGCTCAAGCTATGGAGGAAAATTTGTCGAATTTAACTTCAGCTTCTGGTGAAGCTAGTGCCTGCTCCGGCAACCGTTCCGA
TCAGATTCCGGCGAACTATTCCGGCGGCCAGTATTTTTCTGCCCCACCACCACCAAAAAAGAAGAGAAACCTCCCCGGAAATCCAGACCCGGATGCGGAAGTGATAGCTT
TGTCACCGAAGACGCTAATGGCGACGAATAGATTCGTGTGCGAGATCTGCAACAAGGGGTTTCAGAGAGATCAGAATCTTCAGCTTCATAGAAGAGGGCACAATTTGCCA
TGGAAGTTGAAGCAAAGAGCAAACAAAGAGGTGATAAGGAAGAAAGTTTATGTGTGTCCAGAAACAAGCTGTGTTCACCATGATCCATCAAGGGCTCTTGGAGACTTGAC
AGGGATCAAGAAGCACTTTTGCAGAAAGCATGGTGAGAAGAAATGGAAATGTGATAAGTGCTCTAAGAGGTACGCAGTTCAATCAGATTGGAAAGCACATTCTAAGACTT
GTGGCACCAGAGAGTACAGATGTGACTGTGGAACCCTTTTCTCAAGGAGGGATAGTTTCATCACCCACAGAGCATTTTGTGATGCTTTAGCAGAGGAAAGTGCAAGAGCC
ATTACATCAAACCCAATACTAATCACCAATAACAACAATAACCAAAATCAAAACCCTCCACTAATTTCCACTCCCATTTCTTCAATCCCTCACTTAAACTTCCAAACACA
AACCCACTTCAACAACAACAACAACCCCTTAGACATCAACTCATTTTCCTTAAAAAAAGAACACCACCATCAACAAAACTCACCCAATAACAACAACAATTTCATTCCTC
CATGGCTTTCCCAAGACCACCTTAATAATCACCATCAAAACCTCATCTTAAACCCTAATAACAGTCTTGGGCCCACTTCCCTTCATCACCATCTTCAAAGTGCTTCCCCG
TCTCTTCCTCACATGTCAGCCACAGCACTTCTCCAGAAAGCAGCTCAAATGGGAGCTACCATGAGCAGCAGCAGCAACAACAATAATAATATTGCTGAGCAGCATCACGT
GCCTGATTCTTCTGCCACCAACAATCCAAGTTGTAATTTTGGCCTTAACTTGTCCTCCTCACGTGACAATAATCAGATTTTGATGGGCAGTGAAGGTGCAGGGCTCTCTC
ATGCACTGCCACACTACAGGAACAAATCCAACAATGATGATAATAATAATAATAATGTTGGTGTTGGTCTTCTTGTTGGTCATTCTAATTCTTCTTCACTTCTTCTTCAA
AGCACTTTCATCAGCAACAGCAACAGCAACAGCTCTACTACTTTTTCTCCCAGTGCTTTTGGAGGGGCTTCTTTTGAAATAGATGCATTTGGAGGGGTTTTGAAGAAAGA
TAATAGTGCAATTACAACAAGTGAAGGGTTGAGTACAAGAGATTTCTTGGGGCTGAGAGCAATTTCTCACAGTGAGTTTTTGAGTAATATGGCAGCTGCTGCTGGTTATG
GAAATTGCATGAATGGGGCTGGTCAAAGCCCTCAAAGTCAAATTCAAAAGCAACCCAGCTGGCAAGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAAAAAGTTTGTTGTTTCAACAACAAGCTCAAGCTATGGAGGAAAATTTGTCGAATTTAACTTCAGCTTCTGGTGAAGCTAGTGCCTGCTCCGGCAACCGTTCCGA
TCAGATTCCGGCGAACTATTCCGGCGGCCAGTATTTTTCTGCCCCACCACCACCAAAAAAGAAGAGAAACCTCCCCGGAAATCCAGACCCGGATGCGGAAGTGATAGCTT
TGTCACCGAAGACGCTAATGGCGACGAATAGATTCGTGTGCGAGATCTGCAACAAGGGGTTTCAGAGAGATCAGAATCTTCAGCTTCATAGAAGAGGGCACAATTTGCCA
TGGAAGTTGAAGCAAAGAGCAAACAAAGAGGTGATAAGGAAGAAAGTTTATGTGTGTCCAGAAACAAGCTGTGTTCACCATGATCCATCAAGGGCTCTTGGAGACTTGAC
AGGGATCAAGAAGCACTTTTGCAGAAAGCATGGTGAGAAGAAATGGAAATGTGATAAGTGCTCTAAGAGGTACGCAGTTCAATCAGATTGGAAAGCACATTCTAAGACTT
GTGGCACCAGAGAGTACAGATGTGACTGTGGAACCCTTTTCTCAAGGAGGGATAGTTTCATCACCCACAGAGCATTTTGTGATGCTTTAGCAGAGGAAAGTGCAAGAGCC
ATTACATCAAACCCAATACTAATCACCAATAACAACAATAACCAAAATCAAAACCCTCCACTAATTTCCACTCCCATTTCTTCAATCCCTCACTTAAACTTCCAAACACA
AACCCACTTCAACAACAACAACAACCCCTTAGACATCAACTCATTTTCCTTAAAAAAAGAACACCACCATCAACAAAACTCACCCAATAACAACAACAATTTCATTCCTC
CATGGCTTTCCCAAGACCACCTTAATAATCACCATCAAAACCTCATCTTAAACCCTAATAACAGTCTTGGGCCCACTTCCCTTCATCACCATCTTCAAAGTGCTTCCCCG
TCTCTTCCTCACATGTCAGCCACAGCACTTCTCCAGAAAGCAGCTCAAATGGGAGCTACCATGAGCAGCAGCAGCAACAACAATAATAATATTGCTGAGCAGCATCACGT
GCCTGATTCTTCTGCCACCAACAATCCAAGTTGTAATTTTGGCCTTAACTTGTCCTCCTCACGTGACAATAATCAGATTTTGATGGGCAGTGAAGGTGCAGGGCTCTCTC
ATGCACTGCCACACTACAGGAACAAATCCAACAATGATGATAATAATAATAATAATGTTGGTGTTGGTCTTCTTGTTGGTCATTCTAATTCTTCTTCACTTCTTCTTCAA
AGCACTTTCATCAGCAACAGCAACAGCAACAGCTCTACTACTTTTTCTCCCAGTGCTTTTGGAGGGGCTTCTTTTGAAATAGATGCATTTGGAGGGGTTTTGAAGAAAGA
TAATAGTGCAATTACAACAAGTGAAGGGTTGAGTACAAGAGATTTCTTGGGGCTGAGAGCAATTTCTCACAGTGAGTTTTTGAGTAATATGGCAGCTGCTGCTGGTTATG
GAAATTGCATGAATGGGGCTGGTCAAAGCCCTCAAAGTCAAATTCAAAAGCAACCCAGCTGGCAAGGTTAG
Protein sequenceShow/hide protein sequence
MIKSLLFQQQAQAMEENLSNLTSASGEASACSGNRSDQIPANYSGGQYFSAPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
WKLKQRANKEVIRKKVYVCPETSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKRYAVQSDWKAHSKTCGTREYRCDCGTLFSRRDSFITHRAFCDALAEESARA
ITSNPILITNNNNNQNQNPPLISTPISSIPHLNFQTQTHFNNNNNPLDINSFSLKKEHHHQQNSPNNNNNFIPPWLSQDHLNNHHQNLILNPNNSLGPTSLHHHLQSASP
SLPHMSATALLQKAAQMGATMSSSSNNNNNIAEQHHVPDSSATNNPSCNFGLNLSSSRDNNQILMGSEGAGLSHALPHYRNKSNNDDNNNNNVGVGLLVGHSNSSSLLLQ
STFISNSNSNSSTTFSPSAFGGASFEIDAFGGVLKKDNSAITTSEGLSTRDFLGLRAISHSEFLSNMAAAAGYGNCMNGAGQSPQSQIQKQPSWQG