CuGenDBv2

PI0020939 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0020939
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: protein indeterminate-domain 7
Genome location: chr01:28732086..28736206
RNA-Seq Expression: PI0020939
Synteny: PI0020939
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
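
As a rough illustration of how the genome location string above can be read, the sketch below splits "chr01:28732086..28736206" into a sequence name, a start, and an end, and derives the span length. The names Location and parse_location are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type for this sketch)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the one shown on this page."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("chr01:28732086..28736206")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 4121 bp; with half-open coordinates the figure would be 4120 bp.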


Homology
GenBank top hits (columns: hit | e-value | % identity; alignment follows each hit)
ADN34113.1 nucleic acid binding protein [Cucumis melo subsp. melo] | e-value: 2.0e-283 | identity: 92.17%
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANKSEF NQYFAPQTTQQQ PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN   NSQDHQFCNNLALKRDFD++SN+NNNN     LRVEIPPWLQPSSDHLM
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM

Query:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL
        VGSGGQDENNDETVNPNP   SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGE+GL
Subjt:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL

Query:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH
        WSGDVEIGR      GGGGGAVSCSSSSCTDYGNKAA     ASA+ASASASASTTFLHDIINNSLSSPSPSH PFLQQHNSSFPDT FAA+HH    HH
Subjt:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH

Query:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        H   VIPTTAP+SGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

KAA0059426.1 protein indeterminate-domain 7 [Cucumis melo var. makuwa] | e-value: 1.2e-275 | identity: 92.58%
Query:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        MDENLSNLTSASGEAT SVSSANKSEF NQYFAPQTTQQQ PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSY--NHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP---S
        DALADESARSAMALNPLLSSY  N+NNNNNSQDHQFCNNLALKRDFD++SN+NNNN     LRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP   S
Subjt:  DALADESARSAMALNPLLSSY--NHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP---S

Query:  SRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAV
        SRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGE+GLWSGDVEIGR      GGGGGAV
Subjt:  SRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAV

Query:  SCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAM---HHHHHTVPVIPTTAPSSGGRNDGLTRD
        SCSSSSCTDYGNKAA     ASA+ASASASASTTFLHDIINNSLSSPSPSH PFLQQHNSSFPDT FAA+   HHHHH   VIPTTAP+SGGRNDGLTRD
Subjt:  SCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAM---HHHHHTVPVIPTTAPSSGGRNDGLTRD

Query:  FLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        FLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  FLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

KAE8646512.1 hypothetical protein Csa_016174 [Cucumis sativus] | e-value: 1.1e-281 | identity: 95.5%
Query:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT-QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT-QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSYNHNNNN-NSQDHQFCNNLALKRDFDD-NSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP----SSRGC
        DALADESARSAMALNPLLSSYNHNNNN NSQDHQFCNNLALKRDFDD N++NNNN+LRVEIPPWLQPSSDHLMVGSGGQ ENNDETVNPNP    SSRGC
Subjt:  DALADESARSAMALNPLLSSYNHNNNN-NSQDHQFCNNLALKRDFDD-NSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP----SSRGC

Query:  GASRRSVGVGVGTPNNPN-PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGR-GGGGGGGGGGGAVSC
        GA+RRSVGVGVGTPNNPN PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGR GGGGGGGGGGGAVSC
Subjt:  GASRRSVGVGVGTPNNPN-PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGR-GGGGGGGGGGGAVSC

Query:  SSSSCTDYGNKAA--ASATASASASASTTFLHDIINNSLSSPSPSHP-FLQQHNSSFPDTTFAAMHH--HHHTVPVIPTTAPSSGGRNDGLTRDFLGLRP
        SSSSCTDYGNKAA  ASATASASASASTTFLHDIINNSLSSPSPSHP FLQQHNSSFPDT FAAMHH  HHH VP+IPTTAP+SGGR+DGLTRDFLGLRP
Subjt:  SSSSCTDYGNKAA--ASATASASASASTTFLHDIINNSLSSPSPSHP-FLQQHNSSFPDTTFAAMHH--HHHTVPVIPTTAPSSGGRNDGLTRDFLGLRP

Query:  LSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        LSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  LSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

XP_004141684.2 protein indeterminate-domain 7 [Cucumis sativus] | e-value: 1.5e-289 | identity: 95.47%
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT-QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
        MMMKGNFLS QQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT-QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN

Query:  KGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
        KGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
Subjt:  KGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC

Query:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN-NSQDHQFCNNLALKRDFDD-NSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDE
        DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN NSQDHQFCNNLALKRDFDD N++NNNN+LRVEIPPWLQPSSDHLMVGSGGQ E
Subjt:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN-NSQDHQFCNNLALKRDFDD-NSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDE

Query:  NNDETVNPNP----SSRGCGASRRSVGVGVGTPNNPN-PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVE
        NNDETVNPNP    SSRGCGA+RRSVGVGVGTPNNPN PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVE
Subjt:  NNDETVNPNP----SSRGCGASRRSVGVGVGTPNNPN-PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVE

Query:  IGR-GGGGGGGGGGGAVSCSSSSCTDYGNKAA--ASATASASASASTTFLHDIINNSLSSPSPSHP-FLQQHNSSFPDTTFAAMHH--HHHTVPVIPTTA
        IGR GGGGGGGGGGGAVSCSSSSCTDYGNKAA  ASATASASASASTTFLHDIINNSLSSPSPSHP FLQQHNSSFPDT FAAMHH  HHH VP+IPTTA
Subjt:  IGR-GGGGGGGGGGGAVSCSSSSCTDYGNKAA--ASATASASASASTTFLHDIINNSLSSPSPSHP-FLQQHNSSFPDTTFAAMHH--HHHTVPVIPTTA

Query:  PSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        P+SGGR+DGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  PSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

XP_008462349.1 PREDICTED: protein indeterminate-domain 7 [Cucumis melo] | e-value: 1.7e-282 | identity: 91.99%
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANKSEF NQYFAPQTTQQQ PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN   NSQDHQFCNNLALKRDFD++SN+NNNN     LRVEIPPWLQPSSDHLM
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM

Query:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL
        VGSGGQDENNDETVNPNP   SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGE+GL
Subjt:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL

Query:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH
        WSGDVEIGR      GGGGGAVSCSSSSCTDYGNKAA     ASA+ASASASASTTFLHDIINNSLSSPS SH PFLQQHNSSFPDT FAA+HH    HH
Subjt:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH

Query:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        H   VIPTTAP+SGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
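
The % identity figures reported with each hit can, in principle, be recomputed from an alignment block. The sketch below is a minimal illustration that assumes the usual BLAST-style convention for the middle row: a letter marks an identical column, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name midline_identity and the short toy rows are hypothetical. The alignment rows as captured on this page may have lost their column spacing, so they would need to be re-aligned before being passed in.

```python
def midline_identity(query: str, midline: str, subject: str) -> float:
    """Percent identity of one aligned segment.

    Counts columns where the middle row repeats the query residue and
    divides by the total number of alignment columns (gaps included).
    """
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Toy rows (not taken from this page): one conservative substitution, one gap.
toy_query = "MDENLS-NLT"
toy_midline = "MDE+LS NLT"
toy_subject = "MDEQLSANLT"
print(midline_identity(toy_query, toy_midline, toy_subject))
```

For a hit spanning several alignment rows, the rows would be concatenated per track (query, middle, subject) before calling the function, so that the denominator covers the whole alignment.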

TrEMBL top hits (columns: hit | e-value | % identity; alignment follows each hit)
A0A0A0K7W2 C2H2-type domain-containing protein | e-value: 7.1e-290 | identity: 95.47%
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT-QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
        MMMKGNFLS QQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTT-QQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICN

Query:  KGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
        KGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC
Subjt:  KGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRC

Query:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN-NSQDHQFCNNLALKRDFDD-NSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDE
        DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN NSQDHQFCNNLALKRDFDD N++NNNN+LRVEIPPWLQPSSDHLMVGSGGQ E
Subjt:  DCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN-NSQDHQFCNNLALKRDFDD-NSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDE

Query:  NNDETVNPNP----SSRGCGASRRSVGVGVGTPNNPN-PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVE
        NNDETVNPNP    SSRGCGA+RRSVGVGVGTPNNPN PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVE
Subjt:  NNDETVNPNP----SSRGCGASRRSVGVGVGTPNNPN-PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVE

Query:  IGR-GGGGGGGGGGGAVSCSSSSCTDYGNKAA--ASATASASASASTTFLHDIINNSLSSPSPSHP-FLQQHNSSFPDTTFAAMHH--HHHTVPVIPTTA
        IGR GGGGGGGGGGGAVSCSSSSCTDYGNKAA  ASATASASASASTTFLHDIINNSLSSPSPSHP FLQQHNSSFPDT FAAMHH  HHH VP+IPTTA
Subjt:  IGR-GGGGGGGGGGGAVSCSSSSCTDYGNKAA--ASATASASASASTTFLHDIINNSLSSPSPSHP-FLQQHNSSFPDTTFAAMHH--HHHTVPVIPTTA

Query:  PSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        P+SGGR+DGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  PSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

A0A1S3CGT8 protein indeterminate-domain 7 | e-value: 8.4e-283 | identity: 91.99%
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANKSEF NQYFAPQTTQQQ PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN   NSQDHQFCNNLALKRDFD++SN+NNNN     LRVEIPPWLQPSSDHLM
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM

Query:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL
        VGSGGQDENNDETVNPNP   SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGE+GL
Subjt:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL

Query:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH
        WSGDVEIGR      GGGGGAVSCSSSSCTDYGNKAA     ASA+ASASASASTTFLHDIINNSLSSPS SH PFLQQHNSSFPDT FAA+HH    HH
Subjt:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH

Query:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        H   VIPTTAP+SGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

A0A5A7UZ06 Protein indeterminate-domain 7 | e-value: 5.8e-276 | identity: 92.58%
Query:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        MDENLSNLTSASGEAT SVSSANKSEF NQYFAPQTTQQQ PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSY--NHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP---S
        DALADESARSAMALNPLLSSY  N+NNNNNSQDHQFCNNLALKRDFD++SN+NNNN     LRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP   S
Subjt:  DALADESARSAMALNPLLSSY--NHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNP---S

Query:  SRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAV
        SRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGE+GLWSGDVEIGR      GGGGGAV
Subjt:  SRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAV

Query:  SCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAM---HHHHHTVPVIPTTAPSSGGRNDGLTRD
        SCSSSSCTDYGNKAA     ASA+ASASASASTTFLHDIINNSLSSPSPSH PFLQQHNSSFPDT FAA+   HHHHH   VIPTTAP+SGGRNDGLTRD
Subjt:  SCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAM---HHHHHTVPVIPTTAPSSGGRNDGLTRD

Query:  FLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        FLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  FLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

A0A6J1GRP3 protein indeterminate-domain 7-like | e-value: 1.3e-195 | identity: 71.79%
Query:  MMMKGNFLSQQQQQ-QQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQ---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLSQQQQQ Q  V+MDENLSNLTSASGEATASVSSA      N YFAPQ+       PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLSQQQQQ-QQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQ---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE++KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRVEIPPWLQPSS--DHLMVGSGG
        YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPL+SSYN+NNNN        NN  +KRDF     +N NN+R EIPPWL  +     + +GS  
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRVEIPPWLQPSS--DHLMVGSGG

Query:  QDEN---NDETVNPNPSSRGCGASR-RSVGVGVGTPNNPNPCELYQ----SSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL
        QDE+   N ET+NPN    G G       G  +G P    P   YQ    SS HISATALLQKAAQMGATMSSTTTTSGS  RPH ++HVSTG++G+I  
Subjt:  QDEN---NDETVNPNPSSRGCGASR-RSVGVGVGTPNNPNPCELYQ----SSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL

Query:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDIINNSLSSPSPSHPFLQQHNSSFPD-TTFAAMHHHHHTVPVIPTT
                         GGGAVSC SSSCTDYG+KAAA      SAS   TFLHD+I NSLSS S SHPFLQ  +SSF D T F AMHHH        T 
Subjt:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDIINNSLSSPSPSHPFLQQHNSSFPD-TTFAAMHHHHHTVPVIPTT

Query:  APSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
          +SGGR+DGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNL  QIQKPWQG
Subjt:  APSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

E5GCB4 Nucleic acid binding protein | e-value: 9.9e-284 | identity: 92.17%
Query:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLS  QQQQQQQIVVMDENLSNLTSASGEAT SVSSANKSEF NQYFAPQTTQQQ PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLS--QQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQ-PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM
        RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYN NNNN   NSQDHQFCNNLALKRDFD++SN+NNNN     LRVEIPPWLQPSSDHLM
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNN---NSQDHQFCNNLALKRDFDDNSNNNNNN-----LRVEIPPWLQPSSDHLM

Query:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL
        VGSGGQDENNDETVNPNP   SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGE+GL
Subjt:  VGSGGQDENNDETVNPNP---SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGL

Query:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH
        WSGDVEIGR      GGGGGAVSCSSSSCTDYGNKAA     ASA+ASASASASTTFLHDIINNSLSSPSPSH PFLQQHNSSFPDT FAA+HH    HH
Subjt:  WSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAA-----ASATASASASASTTFLHDIINNSLSSPSPSH-PFLQQHNSSFPDTTFAAMHH----HH

Query:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
        H   VIPTTAP+SGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
Subjt:  HTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG

SwissProt top hits (columns: hit | e-value | % identity; alignment follows each hit)
Q8H1F5 Protein indeterminate-domain 7 | e-value: 1.2e-100 | identity: 57.44%
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE---------FPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATN
        MMM  + L  QQQQQQ   M+EN+SNLTSASG+  ASVSS N++E            Q F PQ++       K+KRN PGNPDP+AEV+ALSPKTLMATN
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE---------FPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATN

Query:  RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKI
        RF+CE+CNKGFQRDQNLQLH+RGHNLPWKLKQRSNK++++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K 
Subjt:  RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKI

Query:  CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNN--NNNNLR------------VEI
        CGTKEY+CDCGTLFSRRDSFITHRAFCDALA+ESAR+    NP++     +N+ +   HQ   N+     F  +S N  +N+NL               I
Subjt:  CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNN--NNNNLR------------VEI

Query:  PPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSST
        PPWL  S+ +        + NN     P  SS           V  G  + P+P      S  +SATALLQKAAQMG+T S+T
Subjt:  PPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSST

Q944L3 Zinc finger protein BALDIBIS | e-value: 4.2e-90 | identity: 57.23%
Query:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCR
        K+KRNLPGNPDPDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQR+NKE +KKKVY+CPE +CVHHDP+RALGDLTGIKKHF R
Subjt:  KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCR

Query:  KHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESAR-------SAMALNPLLSSYNHNNNNNSQDHQFCNNL
        KHGEKKWKCDKCSKKYAV SDWKAHSKICGTKEYRCDCGTLFSR+DSFITHRAFCDALA+ESAR        A   N L    NH N N +   +  N  
Subjt:  KHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESAR-------SAMALNPLLSSYNHNNNNNSQDHQFCNNL

Query:  ALKRD---FDDNSNN--------NNNNLRVEIPPWLQPSSD------HLMVGSGGQ---DENNDETVNPNPSSRGCGASRR--------SVGVGVGTPNN
        + + D   F+ N NN          N       P  + +SD      HL   S  Q   +ENN+   N N   RG   ++         S G    +   
Subjt:  ALKRD---FDDNSNN--------NNNNLRVEIPPWLQPSSD------HLMVGSGGQ---DENNDETVNPNPSSRGCGASRR--------SVGVGVGTPNN

Query:  PNPCELYQSSSHI---SATALLQKAAQMGATMSSTTTTS
         N     Q+   I   SATALLQKAAQMG+  SS+++++
Subjt:  PNPCELYQSSSHI---SATALLQKAAQMGATMSSTTTTS

Q9LRW7 Protein indeterminate-domain 11 | e-value: 2.4e-109 | identity: 47.07%
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE------FPNQYFAPQTTQQQPPPP-----KKKRNLPGNPDPDAEVIALSPKTLMA
        MM K   L Q QQ QQ    DEN+SNLTSASG+  ASVSS N +E      FP+     +  QQQ   P     KK+RN PGNPDP++EVIALSPKTLMA
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE------FPNQYFAPQTTQQQPPPP-----KKKRNLPGNPDPDAEVIALSPKTLMA

Query:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS
        TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+I+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHS
Subjt:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS

Query:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFD
        K CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H            +++++S +H   N+L         
Subjt:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFD

Query:  DNSNNNNNNLRV-------------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISA
        +NSNN+NN+L                      IPPWL P   H +  S           NPNPS+ G G                    L+  +S  +SA
Subjt:  DNSNNNNNNLRV-------------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISA

Query:  TALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDII
        TALLQKAAQMG+T +     + ++ R       ST N                                    +     AA  T+ +   +S    H + 
Subjt:  TALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDII

Query:  NNSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG
         +  +S   +H        +F DT    +  +  T       +  SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  ++S+ LHP   KPWQG
Subjt:  NNSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG

Q9LVQ7 Zinc finger protein ENHYDROUS | e-value: 2.3e-88 | identity: 78.47%
Query:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPW
        M  +L N ++ SG+  ASVSS       NQ   P++        KKKRNLPG PDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPW
Subjt:  MDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPW

Query:  KLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCD
        KL+QRS KE ++KKVYVCP   CVHHDPSRALGDLTGIKKHFCRKHGEKKWKC+KCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCD
Subjt:  KLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCD

Query:  ALADESARS
        ALA+ESA++
Subjt:  ALADESARS

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 1 | e-value: 3.5e-92 | identity: 53.83%
Query:  MDENLSNLTSASGEATASVSS-ANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        M  +L N ++ SGEA+ S+SS  N++  PN               KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSS-ANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKL+Q+SNKE +KKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRV------EIPPWL-----QPSSDHLMVGSGGQDENNDETVNPNP
        DALA+E+ARS  +      S   N    ++ +   N +    D +     +++ L +      + PP +     +P+S + +V S G      E+ + +P
Subjt:  DALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRV------EIPPWL-----QPSSDHLMVGSGGQDENNDETVNPNP

Query:  SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSH---------------ISATALLQKAAQMGATMS-----------STTTTSGSFPRPHNL
        S     +S +S+     + ++  P  L  S+SH               +SATALLQKAAQMGA  S           S+T+TS     PH L
Subjt:  SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSH---------------ISATALLQKAAQMGATMS-----------STTTTSGSFPRPHNL

Arabidopsis top hits (columns: hit | e-value | % identity; alignment follows each hit)
AT1G55110.1 indeterminate(ID)-domain 7 | e-value: 8.4e-102 | identity: 57.44%
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE---------FPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATN
        MMM  + L  QQQQQQ   M+EN+SNLTSASG+  ASVSS N++E            Q F PQ++       K+KRN PGNPDP+AEV+ALSPKTLMATN
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE---------FPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATN

Query:  RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKI
        RF+CE+CNKGFQRDQNLQLH+RGHNLPWKLKQRSNK++++KKVYVCPE  CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K 
Subjt:  RFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKI

Query:  CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNN--NNNNLR------------VEI
        CGTKEY+CDCGTLFSRRDSFITHRAFCDALA+ESAR+    NP++     +N+ +   HQ   N+     F  +S N  +N+NL               I
Subjt:  CGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNN--NNNNLR------------VEI

Query:  PPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSST
        PPWL  S+ +        + NN     P  SS           V  G  + P+P      S  +SATALLQKAAQMG+T S+T
Subjt:  PPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQSSSHISATALLQKAAQMGATMSST

AT3G13810.1 indeterminate(ID)-domain 11 | e-value: 1.7e-110 | identity: 47.07%
Query:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE------FPNQYFAPQTTQQQPPPP-----KKKRNLPGNPDPDAEVIALSPKTLMA
        MM K   L Q QQ QQ    DEN+SNLTSASG+  ASVSS N +E      FP+     +  QQQ   P     KK+RN PGNPDP++EVIALSPKTLMA
Subjt:  MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSE------FPNQYFAPQTTQQQPPPP-----KKKRNLPGNPDPDAEVIALSPKTLMA

Query:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS
        TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+I+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHS
Subjt:  TNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHS

Query:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFD
        K CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H            +++++S +H   N+L         
Subjt:  KICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFD

Query:  DNSNNNNNNLRV-------------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISA
        +NSNN+NN+L                      IPPWL P   H +  S           NPNPS+ G G                    L+  +S  +SA
Subjt:  DNSNNNNNNLRV-------------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISA

Query:  TALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDII
        TALLQKAAQMG+T +     + ++ R       ST N                                    +     AA  T+ +   +S    H + 
Subjt:  TALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDII

Query:  NNSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG
         +  +S   +H        +F DT    +  +  T       +  SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  ++S+ LHP   KPWQG
Subjt:  NNSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG

AT3G13810.2 indeterminate(ID)-domain 11 | e-value: 2.9e-102 | identity: 44.97%
Query:  LSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMAT
        L Q QQ QQ    DEN+SNLTSASG+  ASVSS N +E     + P   QQQ    ++ + L  +                   P++EVIALSPKTLMAT
Subjt:  LSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMAT

Query:  NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSK
        NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+I+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK
Subjt:  NRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSK

Query:  ICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFDD
         CGTKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H            +++++S +H   N+L         +
Subjt:  ICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFDD

Query:  NSNNNNNNLRV-------------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISAT
        NSNN+NN+L                      IPPWL P   H +  S           NPNPS+ G G                    L+  +S  +SAT
Subjt:  NSNNNNNNLRV-------------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISAT

Query:  ALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDIIN
        ALLQKAAQMG+T +     + ++ R       ST N                                    +     AA  T+ +   +S    H +  
Subjt:  ALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDIIN

Query:  NSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG
        +  +S   +H        +F DT    +  +  T       +  SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  ++S+ LHP   KPWQG
Subjt:  NSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG

AT3G13810.3 indeterminate(ID)-domain 11 | e-value: 7.9e-100 | identity: 44.66%
Query:  LSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        +SNLTSASG+  ASVSS N +E     + P   QQQ    ++ + L  +                   P++EVIALSPKTLMATNRFVCEICNKGFQRDQ
Subjt:  LSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPD-----------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS
        NLQLHRRGHNLPWKLKQRSNKE+I+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGTKEYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS

Query:  RRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFDDNSNNNNNNLRV-----
        RRDSFITHRAFC+ALA+E+AR  +          NPLL    +S+ H            +++++S +H   N+L         +NSNN+NN+L       
Subjt:  RRDSFITHRAFCDALADESARSAM--------ALNPLL----SSYNH------------NNNNNSQDHQFCNNLAL--KRDFDDNSNNNNNNLRV-----

Query:  --------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISATALLQKAAQMGATMSST
                       IPPWL P   H +  S           NPNPS+ G G                    L+  +S  +SATALLQKAAQMG+T +  
Subjt:  --------------EIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPNPCELYQ-SSSHISATALLQKAAQMGATMSST

Query:  TTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDIINNSLSSPSPSHPFLQQH
           + ++ R       ST N                                    +     AA  T+ +   +S    H +  +  +S   +H      
Subjt:  TTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLHDIINNSLSSPSPSHPFLQQH

Query:  NSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG
          +F DT    +  +  T       +  SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  ++S+ LHP   KPWQG
Subjt:  NSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPL-SHGDILSLTGFGNCIVPNSSN-LHPQIQKPWQG

AT3G50700.1 indeterminate(ID)-domain 2    2.5e-93    53.83
Query:  MDENLSNLTSASGEATASVSS-ANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP
        M  +L N ++ SGEA+ S+SS  N++  PN               KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLP
Subjt:  MDENLSNLTSASGEATASVSS-ANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLP

Query:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC
        WKL+Q+SNKE +KKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFC
Subjt:  WKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFC

Query:  DALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRV------EIPPWL-----QPSSDHLMVGSGGQDENNDETVNPNP
        DALA+E+ARS  +      S   N    ++ +   N +    D +     +++ L +      + PP +     +P+S + +V S G      E+ + +P
Subjt:  DALADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRV------EIPPWL-----QPSSDHLMVGSGGQDENNDETVNPNP

Query:  SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSH---------------ISATALLQKAAQMGATMS-----------STTTTSGSFPRPHNL
        S     +S +S+     + ++  P  L  S+SH               +SATALLQKAAQMGA  S           S+T+TS     PH L
Subjt:  SSRGCGASRRSVGVGVGTPNNPNPCELYQSSSH---------------ISATALLQKAAQMGATMS-----------STTTTSGSFPRPHNL


Sequences
CDS sequence
ATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAACAACAAATTGTAGTAATGGATGAAAATTTGTCCAACTTGACTTCTGCATCTGGTGAAGCTACTGCTAG
TGTTTCTTCTGCCAATAAATCTGAATTTCCCAATCAATATTTTGCTCCTCAAACAACCCAACAACAACCTCCTCCTCCTAAGAAGAAGCGCAACCTTCCTGGAAATCCCG
ACCCAGATGCGGAAGTAATAGCGTTATCGCCGAAGACATTGATGGCAACGAATAGATTTGTGTGCGAGATATGCAACAAAGGGTTTCAGAGAGATCAGAATCTACAACTA
CATAGAAGAGGACACAATTTGCCATGGAAATTAAAGCAAAGATCAAACAAAGAGATAATAAAGAAGAAAGTATATGTTTGTCCAGAAGTGAGTTGTGTTCATCATGATCC
ATCAAGAGCACTTGGAGATTTAACAGGAATTAAGAAGCACTTTTGTAGAAAGCATGGTGAAAAGAAATGGAAATGTGATAAATGCTCTAAGAAATATGCTGTTCAATCTG
ATTGGAAAGCTCATTCCAAGATCTGTGGCACTAAAGAGTATAGATGTGACTGTGGTACTCTCTTTTCAAGGAGAGATAGTTTCATTACACATAGAGCTTTTTGTGATGCA
TTAGCAGATGAAAGTGCAAGATCAGCAATGGCATTAAACCCTCTTCTCTCATCTTACAACCACAACAACAATAATAATTCACAAGATCATCAATTTTGTAATAATCTCGC
TCTCAAACGGGATTTCGACGACAACAGCAACAACAACAATAACAATTTGAGAGTGGAAATTCCACCGTGGCTACAACCATCATCAGATCATCTGATGGTGGGGAGTGGCG
GCCAAGACGAGAACAATGATGAAACTGTAAACCCTAACCCTAGTAGTAGGGGGTGCGGGGCCAGCAGGAGAAGTGTTGGTGTTGGTGTTGGTACTCCTAATAATCCTAAT
CCTTGTGAGTTATATCAATCGTCTTCACATATATCAGCGACTGCACTGCTGCAGAAGGCAGCCCAGATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTTT
CCCAAGGCCCCACAACCTTCTTCACGTGTCTACAGGTAATTTTGGAGAGATAGGATTATGGTCAGGTGATGTTGAAATTGGTAGAGGAGGTGGAGGAGGAGGAGGAGGAG
GAGGAGGAGCTGTGAGTTGTAGCAGTAGTAGTTGTACTGATTACGGGAATAAAGCTGCTGCTTCTGCTACTGCTTCCGCTTCCGCTTCCGCTTCCACCACTTTTCTTCAC
GACATTATTAATAATTCCCTCTCGTCTCCTTCTCCTTCTCATCCTTTCCTCCAACAACACAATTCCTCCTTCCCCGACACCACTTTTGCTGCTATGCATCATCATCATCA
TACCGTACCCGTCATCCCCACCACTGCCCCATCTTCGGGGGGTCGAAACGACGGTTTAACCAGAGATTTCTTGGGACTTCGCCCTCTTTCCCATGGAGATATTCTAAGCC
TTACTGGTTTTGGAAACTGTATTGTTCCTAATTCCTCCAATCTTCACCCACAAATCCAGAAGCCATGGCAGGGTTAG
mRNA sequence
CACAGGGTTTCTTCAATTCTCATCCATATCTCTCTCTCTCTTCCTTCTTTTTCTCTCTCTAGACTTTCAAGAACTGGGTTTTGTATTTTTTTTTTTTAATTTATTTTCTT
ATATCATATAGTTCAAGATTTTAATTTTAATTTTCTTTTTGTATGTTAATCATATAAAAAATTTCAAAAAAAAAAAGAGTAGAGTGTTTGAATTCTCTCTCTCTCTCTCT
CTATTAAACAGCAGGCTGCTTGTTTCTTTTGAGCTTTCAAAAAAACTAAAAGTTGTCTCTTTGGTTGGAGCCAAATCCAAGATTGAATGCACTAAAGAGCATATATACAT
CTTATAATTTTTCTTTTTTTTTTTTTTTGTCTTGAATTAGATCTTGGAAATTTTTTTTGTTAATTACTCAAAATACAAGAATTGTTTCTTCTTGATTAGATCTGTTTTTT
GTTGTCTCTCATGAAACCATTATAAGAAGGGTTTCTTTTTTTTTTTCTTTTTTTTTTTGAATTTCATAAAATTTTAAGATGATGATGAAAGGTAATTTTTTGTCTCAACA
ACAACAACAACAACAAATTGTAGTAATGGATGAAAATTTGTCCAACTTGACTTCTGCATCTGGTGAAGCTACTGCTAGTGTTTCTTCTGCCAATAAATCTGAATTTCCCA
ATCAATATTTTGCTCCTCAAACAACCCAACAACAACCTCCTCCTCCTAAGAAGAAGCGCAACCTTCCTGGAAATCCCGACCCAGATGCGGAAGTAATAGCGTTATCGCCG
AAGACATTGATGGCAACGAATAGATTTGTGTGCGAGATATGCAACAAAGGGTTTCAGAGAGATCAGAATCTACAACTACATAGAAGAGGACACAATTTGCCATGGAAATT
AAAGCAAAGATCAAACAAAGAGATAATAAAGAAGAAAGTATATGTTTGTCCAGAAGTGAGTTGTGTTCATCATGATCCATCAAGAGCACTTGGAGATTTAACAGGAATTA
AGAAGCACTTTTGTAGAAAGCATGGTGAAAAGAAATGGAAATGTGATAAATGCTCTAAGAAATATGCTGTTCAATCTGATTGGAAAGCTCATTCCAAGATCTGTGGCACT
AAAGAGTATAGATGTGACTGTGGTACTCTCTTTTCAAGGAGAGATAGTTTCATTACACATAGAGCTTTTTGTGATGCATTAGCAGATGAAAGTGCAAGATCAGCAATGGC
ATTAAACCCTCTTCTCTCATCTTACAACCACAACAACAATAATAATTCACAAGATCATCAATTTTGTAATAATCTCGCTCTCAAACGGGATTTCGACGACAACAGCAACA
ACAACAATAACAATTTGAGAGTGGAAATTCCACCGTGGCTACAACCATCATCAGATCATCTGATGGTGGGGAGTGGCGGCCAAGACGAGAACAATGATGAAACTGTAAAC
CCTAACCCTAGTAGTAGGGGGTGCGGGGCCAGCAGGAGAAGTGTTGGTGTTGGTGTTGGTACTCCTAATAATCCTAATCCTTGTGAGTTATATCAATCGTCTTCACATAT
ATCAGCGACTGCACTGCTGCAGAAGGCAGCCCAGATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTTTCCCAAGGCCCCACAACCTTCTTCACGTGTCTA
CAGGTAATTTTGGAGAGATAGGATTATGGTCAGGTGATGTTGAAATTGGTAGAGGAGGTGGAGGAGGAGGAGGAGGAGGAGGAGGAGCTGTGAGTTGTAGCAGTAGTAGT
TGTACTGATTACGGGAATAAAGCTGCTGCTTCTGCTACTGCTTCCGCTTCCGCTTCCGCTTCCACCACTTTTCTTCACGACATTATTAATAATTCCCTCTCGTCTCCTTC
TCCTTCTCATCCTTTCCTCCAACAACACAATTCCTCCTTCCCCGACACCACTTTTGCTGCTATGCATCATCATCATCATACCGTACCCGTCATCCCCACCACTGCCCCAT
CTTCGGGGGGTCGAAACGACGGTTTAACCAGAGATTTCTTGGGACTTCGCCCTCTTTCCCATGGAGATATTCTAAGCCTTACTGGTTTTGGAAACTGTATTGTTCCTAAT
TCCTCCAATCTTCACCCACAAATCCAGAAGCCATGGCAGGGTTAGATTCGTAAATTACTTCCTACTTTCTTAATCTTTTTTTGATCTTTTTTTTTCCTCAAACAAGGGGA
AAAAATATGTAGTAATTGATCTCATTCTAAGAGATGAAGATTATTTTTATTTTCTTACCTTCTTTTTTTTTACGTGAAATTAAATACTTTATGGAATGTACTTCGACATT
TTATAAGGTTAAATTAATTATCATGAATTTTCTTTTATATGAGAACAATATATTTTAATA
Protein sequence
MMMKGNFLSQQQQQQQIVVMDENLSNLTSASGEATASVSSANKSEFPNQYFAPQTTQQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQL
HRRGHNLPWKLKQRSNKEIIKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
LADESARSAMALNPLLSSYNHNNNNNSQDHQFCNNLALKRDFDDNSNNNNNNLRVEIPPWLQPSSDHLMVGSGGQDENNDETVNPNPSSRGCGASRRSVGVGVGTPNNPN
PCELYQSSSHISATALLQKAAQMGATMSSTTTTSGSFPRPHNLLHVSTGNFGEIGLWSGDVEIGRGGGGGGGGGGGAVSCSSSSCTDYGNKAAASATASASASASTTFLH
DIINNSLSSPSPSHPFLQQHNSSFPDTTFAAMHHHHHTVPVIPTTAPSSGGRNDGLTRDFLGLRPLSHGDILSLTGFGNCIVPNSSNLHPQIQKPWQG
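
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record. It assumes the standard nuclear genetic code (NCBI translation table 1) applies to this gene, and the function name `translate` is a hypothetical helper introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: this table applies to the CDS above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped lines of the
    CDS block can be pasted in directly. Unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 33 nucleotides of the CDS listed above.
    print(translate("ATGATGATGAAAGGTAATTTTTTGTCTCAACAA"))
```

Passing the full CDS block to `translate` and comparing the result with the protein sequence lines is one way to confirm that the two records agree.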