; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014883 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014883
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein indeterminate-domain 7-like
Genome locationscaffold3:49025676..49029311
RNA-Seq ExpressionSpg014883
SyntenySpg014883
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34113.1 nucleic acid binding protein [Cucumis melo subsp. melo]2.2e-21074.91Show/hide
Query:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLSQQQQQ     VVM+ENLSNLTSASGEAT SVSSANK+EF NQYF PQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPLLSSYN NN+   SN+ ++   N  ALK+DF       ++NNNN+    EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE

Query:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN
        + +GSG    ++ +T+NPNP+ S            G+G G  N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTGN
Subjt:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN

Query:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH
        FG+MGL S + E+G   GGGAVSCSSSSCTDY NKAAAASASASA+ASASASA  TFLHD+I NSLSS S SH PFLQ  +      AFAA+      HH
Subjt:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH

Query:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        H  TV      ASGGR+DGLTRDFLGLRPLSHGDILS+TGFGNCIVPNSSNL  QIQKPWQG
Subjt:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

XP_008462349.1 PREDICTED: protein indeterminate-domain 7 [Cucumis melo]1.3e-21074.91Show/hide
Query:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLSQQQQQ     VVM+ENLSNLTSASGEAT SVSSANK+EF NQYF PQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPLLSSYN NN+   SN+ ++   N  ALK+DF       ++NNNN+    EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE

Query:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN
        + +GSG    ++ +T+NPNP+ S            G+G G  N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTGN
Subjt:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN

Query:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH
        FG+MGL S + E+G   GGGAVSCSSSSCTDY NKAAAASASASA+ASASASA  TFLHD+I NSLSS S+SH PFLQ  +      AFAA+      HH
Subjt:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH

Query:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        H  TV      ASGGR+DGLTRDFLGLRPLSHGDILS+TGFGNCIVPNSSNL  QIQKPWQG
Subjt:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

XP_022954034.1 protein indeterminate-domain 7-like [Cucurbita moschata]1.5e-21177.63Show/hide
Query:  MMMKGNFLSQQQQ----QVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLSQQQQ    QV+M+ENLSNLTSASGEATASVSSA      N YF PQ+Q      PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLSQQQQ----QVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQ
        YRCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPL+SSYN        NNNNNN F +K+DFS  N NNMR AEIPPWL  +DL+ EIFIGS  D+
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQ

Query:  HHH--HQTLNPN------PNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEM
        H H  H+TLNPN       +  G GCG S    PP  YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++GQ+         
Subjt:  HHH--HQTLNPN------PNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEM

Query:  GTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLT
          GGGAVSC SSSCTDY +KAAA           SAS P TFLHDMINSLSSASASHPFLQDSSF ND  AF AMHHH   AT      AASGGRSDGLT
Subjt:  GTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLT

Query:  RDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        RDFLGLRPLSHGDILS+TGFGNCIVPNSSNLQSQIQKPWQG
Subjt:  RDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

XP_023547451.1 protein indeterminate-domain 7-like [Cucurbita pepo subsp. pepo]5.9e-21177.32Show/hide
Query:  MMMKGNFLSQ---QQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ---------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF
        MMMKGNFLSQ   QQ QV+M+ENLSNLTSASGEATASVSSA      N YF PQ+Q          PPPPKKKRNLPGNPDPDAEVIALSP+TLMATNRF
Subjt:  MMMKGNFLSQ---QQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ---------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSG
        TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPL+SSY          NNNNN F +K+DFS  N NNMR AEIPPWL  +DL+ EIFIGS 
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSG

Query:  HDQHHH--HQTLNPNPNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTG
         D+H H  H+TLNPN +  G GCG S    PP  YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++GQ+           G
Subjt:  HDQHHH--HQTLNPNPNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTG

Query:  GGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLTRDF
        GGAVSC SSSCTDY +KAAA           SAS P TFLHDMINSLSSASASHPFLQDSSF ND  AF AMHHH   AT      AASGGRSDGLTRDF
Subjt:  GGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLTRDF

Query:  LGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        LGLRPLSHGDILS+TGFGNCIVPNSSNLQSQIQKPWQG
Subjt:  LGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

XP_038898698.1 protein indeterminate-domain 7-like [Benincasa hispida]2.8e-23783.96Show/hide
Query:  MMMKGNFLSQQQQQ--VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQ--QPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
        MMMKGNFLS+QQQQ  VVM+ENLSNLTSASGEATASVSSANK EFPNQYF PQ+Q   PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
Subjt:  MMMKGNFLSQQQQQ--VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQ--QPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
        FQRDQNLQLHRRGHNLPWKLKQR+NKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC

Query:  GTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSY---NNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHH
        GTLFSRRDSFITHRAFCDALADESARSA  LNPLLSSY   NNNNSQEF NNNN   FALK+DF++NNNNN+R AEIPPWLQPSDL+AEI +GSGH++ H
Subjt:  GTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSY---NNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHH

Query:  HHQTLNPNPNPSGIGCGP-SNLAAPPYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSS
        +H+TLNPNPNPSG GCGP S L  P YQS    SPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKL+HVSTGN+G+MGL SRE EMG GGGAVSCSS
Subjt:  HHQTLNPNPNPSGIGCGP-SNLAAPPYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSS

Query:  SSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDND----AAAFAAMHHHTATVPAA---NVAASGGRSDGLTRDFLGL
        SSCTDY NKA      A+A ASASASA  TFLHDMINSLSS S SHPFLQ +S  ND    AAAF+AMHHHTA VPA    +   SG RSDGLTRDFLGL
Subjt:  SSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDND----AAAFAAMHHHTATVPAA---NVAASGGRSDGLTRDFLGL

Query:  RPLSHGDILSITGFGNCIVP-NSSNLQSQIQKPWQG
        RPLSHGDILS+TGFGNCIVP NSSNLQ+QIQKPWQG
Subjt:  RPLSHGDILSITGFGNCIVP-NSSNLQSQIQKPWQG

TrEMBL top hitse value%identityAlignment
A0A0A0K7W2 C2H2-type domain-containing protein5.3e-21075.62Show/hide
Query:  MMMKGNFLSQQQQQ--VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        MMMKGNFLSQQQQQ  VVM+ENLSNLTSASGEATASVSSANK+EFPNQYF PQT   QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
Subjt:  MMMKGNFLSQQQQQ--VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
        GFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD

Query:  CGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNN--NNIFALKQDF----SSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGH
        CGTLFSRRDSFITHRAFCDALADESARSA  LNPLLSSYN+NN+   S ++   NN+ ALK+DF    +SNNNN++R  EIPPWLQPS     + +GSG 
Subjt:  CGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNN--NNIFALKQDF----SSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGH

Query:  DQHHHHQTLNPNP--NPSGIGCG------------PSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMG
           ++ +T+NPNP  N S  GCG            P+N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTGNFG++G
Subjt:  DQHHHHQTLNPNP--NPSGIGCG------------PSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMG

Query:  LCSREGEMGT---------GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASHP-FLQDSSFDNDAAAFAAM----HH
        L S + E+G          GGGAVSCSSSSCTDY NKAA   ASASATASASASA  TFLHD+I NSLSS S SHP FLQ  +      AFAAM    HH
Subjt:  LCSREGEMGT---------GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASHP-FLQDSSFDNDAAAFAAM----HH

Query:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        H   +      ASGGRSDGLTRDFLGLRPLSHGDILS+TGFGNCIVPNSSNL  QIQKPWQG
Subjt:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

A0A1S3CGT8 protein indeterminate-domain 76.3e-21174.91Show/hide
Query:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLSQQQQQ     VVM+ENLSNLTSASGEAT SVSSANK+EF NQYF PQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPLLSSYN NN+   SN+ ++   N  ALK+DF       ++NNNN+    EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE

Query:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN
        + +GSG    ++ +T+NPNP+ S            G+G G  N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTGN
Subjt:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN

Query:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH
        FG+MGL S + E+G   GGGAVSCSSSSCTDY NKAAAASASASA+ASASASA  TFLHD+I NSLSS S+SH PFLQ  +      AFAA+      HH
Subjt:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH

Query:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        H  TV      ASGGR+DGLTRDFLGLRPLSHGDILS+TGFGNCIVPNSSNL  QIQKPWQG
Subjt:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

A0A6J1GRP3 protein indeterminate-domain 7-like7.5e-21277.63Show/hide
Query:  MMMKGNFLSQQQQ----QVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
        MMMKGNFLSQQQQ    QV+M+ENLSNLTSASGEATASVSSA      N YF PQ+Q      PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE
Subjt:  MMMKGNFLSQQQQ----QVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-----PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCE

Query:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
        ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE
Subjt:  ICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKE

Query:  YRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQ
        YRCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPL+SSYN        NNNNNN F +K+DFS  N NNMR AEIPPWL  +DL+ EIFIGS  D+
Subjt:  YRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQ

Query:  HHH--HQTLNPN------PNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEM
        H H  H+TLNPN       +  G GCG S    PP  YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++GQ+         
Subjt:  HHH--HQTLNPN------PNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEM

Query:  GTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLT
          GGGAVSC SSSCTDY +KAAA           SAS P TFLHDMINSLSSASASHPFLQDSSF ND  AF AMHHH   AT      AASGGRSDGLT
Subjt:  GTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLT

Query:  RDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        RDFLGLRPLSHGDILS+TGFGNCIVPNSSNLQSQIQKPWQG
Subjt:  RDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

A0A6J1JP83 protein indeterminate-domain 7-like1.6e-20977.44Show/hide
Query:  MMMKGNFLSQQQQ----QVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG
        MMMKGNFLSQQQQ    QV+M+ENLSNLTSASGEATASVSSAN    P          PPPPKKKRNLPGNPDPDAEVIALSP+TLMATNRFVCEICNKG
Subjt:  MMMKGNFLSQQQQ----QVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKG

Query:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
        FQRDQNLQLHRRGHNLPWKLKQRSNKEV+KKKVYVCPE SCVHHDPSRALGDLTGIKKHF RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC
Subjt:  FQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDC

Query:  GTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHH--
        GTLFSRRDSFITHRAFCDALADESARSA  LNP++SSY         NNNNNN F +K+DFS  N+NNMR AEIPPWL  +DL+ EIFIGS  D+H H  
Subjt:  GTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHH--

Query:  HQTLNPN--PNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSC
        H+TLNPN   +  G GCG S    PP  YQ SS+SSPHISATALLQKAAQMGATMSSTTTTSGSM RPHK+VHVSTG++G +           GGGAVSC
Subjt:  HQTLNPN--PNPSGIGCGPSNLAAPP--YQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSC

Query:  SSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLTRDFLGLRPL
         SSSCTDY +KAAA           SAS P TFLHDMINSLSSASASHPFLQDSSF ND  AF AMHHH   AT      AASGGRSDGLTRDFLGLRPL
Subjt:  SSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFAAMHHH--TATVPAANVAASGGRSDGLTRDFLGLRPL

Query:  SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        SHGDILS+TGFGNCIVPNSSNLQSQIQKPWQG
Subjt:  SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

E5GCB4 Nucleic acid binding protein1.1e-21074.91Show/hide
Query:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
        MMMKGNFLSQQQQQ     VVM+ENLSNLTSASGEAT SVSSANK+EF NQYF PQT   Q PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI
Subjt:  MMMKGNFLSQQQQQ-----VVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQT---QQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEI

Query:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
        CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKE+IKKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY
Subjt:  CNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY

Query:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE
        RCDCGTLFSRRDSFITHRAFCDALADESARSA  LNPLLSSYN NN+   SN+ ++   N  ALK+DF       ++NNNN+    EIPPWLQPS     
Subjt:  RCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNN---NIFALKQDF-------SSNNNNNMRSAEIPPWLQPSDLQAE

Query:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN
        + +GSG    ++ +T+NPNP+ S            G+G G  N   P   YQSSS    HISATALLQKAAQMGATMSSTTTTSGS PRPH L+HVSTGN
Subjt:  IFIGSGHDQHHHHQTLNPNPNPS------------GIGCGPSNLAAP--PYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGN

Query:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH
        FG+MGL S + E+G   GGGAVSCSSSSCTDY NKAAAASASASA+ASASASA  TFLHD+I NSLSS S SH PFLQ  +      AFAA+      HH
Subjt:  FGQMGLCSREGEMGT--GGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMI-NSLSSASASH-PFLQDSSFDNDAAAFAAM------HH

Query:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        H  TV      ASGGR+DGLTRDFLGLRPLSHGDILS+TGFGNCIVPNSSNL  QIQKPWQG
Subjt:  HTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

SwissProt top hitse value%identityAlignment
O22759 Protein indeterminate-domain 123.0e-8564.03Show/hide
Query:  PKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFC
        PKKKR LPGNPDPDAEVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQ++ KE  KKKVYVCPE +C HH PSRALGDLTGIKKHFC
Subjt:  PKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFC

Query:  RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNN-NSQEFSNNNNNNIFALKQD
        RKHGEKKWKC+KCSK YAVQSDWKAH+KICGT++YRCDCGTLFSR+D+FITHRAFCDALA+ESAR  +T +  L++ N N     F  N ++++      
Subjt:  RKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNN-NSQEFSNNNNNNIFALKQD

Query:  FSSNNNNNMRSAEIPPWL---------QPSDLQAEIFIGSGHDQ---HHHHQT
             + +  +   PP           + + L +  F G G  +   HH H T
Subjt:  FSSNNNNNMRSAEIPPWL---------QPSDLQAEIFIGSGHDQ---HHHHQT

Q8H1F5 Protein indeterminate-domain 71.0e-10159.45Show/hide
Query:  MMMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFP----NQYFTPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        MMM  + L  QQQQ  MEEN+SNLTSASG+  ASVSS N+ E      NQ+   Q   P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+CNK
Subjt:  MMMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFP----NQYFTPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
        GFQRDQNLQLH+RGHNLPWKLKQRSNK+V++KKVYVCPEP CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGTKEY+CD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD

Query:  CGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHH
        CGTLFSRRDSFITHRAFCDALA+ESAR+    NP++   +N+          N        FSS++ N + ++ +   ++  + Q            HH+
Subjt:  CGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHH

Query:  QTLNP-----NPNPSGIGCGPSNLAAPPYQSSSLS-------SPHISATALLQKAAQMGATMSST
        Q + P     NPNP+G      NL  P   S +         SP +SATALLQKAAQMG+T S+T
Subjt:  QTLNP-----NPNPSGIGCGPSNLAAPPYQSSSLS-------SPHISATALLQKAAQMGATMSST

Q944L3 Zinc finger protein BALDIBIS1.2e-8653.52Show/hide
Query:  PNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPS
        PN    P        K+KRNLPGNPDPDAEVIALSP +LM TNRF+CE+CNKGF+RDQNLQLHRRGHNLPWKLKQR+NKE +KKKVY+CPE +CVHHDP+
Subjt:  PNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPS

Query:  RALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSATT------LNPLL------
        RALGDLTGIKKHF RKHGEKKWKCDKCSKKYAV SDWKAHSKICGTKEYRCDCGTLFSR+DSFITHRAFCDALA+ESAR  +       LN  L      
Subjt:  RALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADESARSATT------LNPLL------

Query:  SSYNNNNSQE------------FSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPY
         + N N+ Q               N N NNI  L Q   +   N   S+  P     SD    ++   G   H      N N N + +  G S       
Subjt:  SSYNNNNSQE------------FSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPY

Query:  QSSSLSS----------------------PHISATALLQKAAQMGATMSSTTTTS
          + +S+                        +SATALLQKAAQMG+  SS+++++
Subjt:  QSSSLSS----------------------PHISATALLQKAAQMGATMSSTTTTS

Q9LRW7 Protein indeterminate-domain 112.1e-11047.93Show/hide
Query:  MMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF
        MM  + L  Q QQ   +EN+SNLTSASG+  ASVSS N  E     + P  QQ                 KK+RN PGNPDP++EVIALSPKTLMATNRF
Subjt:  MMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SN
        TKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR             NPLL                         SS+N+N  NS  F         SN
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SN

Query:  NNNNNI--FALKQDFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQM
        N+NN++  F +K++  SN++  N   + IPPWL P                  H   + NPNPS  G G  +L        SL+SP +SATALLQKAAQM
Subjt:  NNNNNI--FALKQDFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQM

Query:  GATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFL
        G+T +     + +  R                                      + + N      A+   + S   S+ N   + ++    +AS      
Subjt:  GATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFL

Query:  QDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        ++ +FD+    F   +  TA   +     SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  QDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

Q9SCQ6 Zinc finger protein GAI-ASSOCIATED FACTOR 11.2e-8955.52Show/hide
Query:  MEENLSNLTSASGEATASVSS-ANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        M  +L N ++ SGEA+ S+SS  N+N  PN             KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
Subjt:  MEENLSNLTSASGEATASVSS-ANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
        L+Q+SNKEV KKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDA
Subjt:  LKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA

Query:  LADESARS------------ATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIG-----SGHDQHHHHQTL
        LA+E+ARS             T  NP+ +        E +   +++   +KQ  S      +      P          +F G     S     +   + 
Subjt:  LADESARS------------ATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIG-----SGHDQHHHHQTL

Query:  NPNPNPSGIGCGPSNLAAPPYQSSSL-------SSPHISATALLQKAAQMGATMSSTTTTSG
        + +   S     P +L       SS        + P +SATALLQKAAQMGA  S  +   G
Subjt:  NPNPNPSGIGCGPSNLAAPPYQSSSL-------SSPHISATALLQKAAQMGATMSSTTTTSG

Arabidopsis top hitse value%identityAlignment
AT1G55110.1 indeterminate(ID)-domain 77.4e-10359.45Show/hide
Query:  MMMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFP----NQYFTPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK
        MMM  + L  QQQQ  MEEN+SNLTSASG+  ASVSS N+ E      NQ+   Q   P    K+KRN PGNPDP+AEV+ALSPKTLMATNRF+CE+CNK
Subjt:  MMMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFP----NQYFTPQTQQPPPP-KKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNK

Query:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD
        GFQRDQNLQLH+RGHNLPWKLKQRSNK+V++KKVYVCPEP CVHH PSRALGDLTGIKKHF RKHGEKKWKC+KCSKKYAVQSDWKAH+K CGTKEY+CD
Subjt:  GFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCD

Query:  CGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHH
        CGTLFSRRDSFITHRAFCDALA+ESAR+    NP++   +N+          N        FSS++ N + ++ +   ++  + Q            HH+
Subjt:  CGTLFSRRDSFITHRAFCDALADESARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHH

Query:  QTLNP-----NPNPSGIGCGPSNLAAPPYQSSSLS-------SPHISATALLQKAAQMGATMSST
        Q + P     NPNP+G      NL  P   S +         SP +SATALLQKAAQMG+T S+T
Subjt:  QTLNP-----NPNPSGIGCGPSNLAAPPYQSSSLS-------SPHISATALLQKAAQMGATMSST

AT3G13810.1 indeterminate(ID)-domain 111.5e-11147.93Show/hide
Query:  MMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF
        MM  + L  Q QQ   +EN+SNLTSASG+  ASVSS N  E     + P  QQ                 KK+RN PGNPDP++EVIALSPKTLMATNRF
Subjt:  MMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQ-------------PPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SN
        TKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR             NPLL                         SS+N+N  NS  F         SN
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SN

Query:  NNNNNI--FALKQDFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQM
        N+NN++  F +K++  SN++  N   + IPPWL P                  H   + NPNPS  G G  +L        SL+SP +SATALLQKAAQM
Subjt:  NNNNNI--FALKQDFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQM

Query:  GATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFL
        G+T +     + +  R                                      + + N      A+   + S   S+ N   + ++    +AS      
Subjt:  GATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFL

Query:  QDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        ++ +FD+    F   +  TA   +     SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  QDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

AT3G13810.2 indeterminate(ID)-domain 115.6e-10346.03Show/hide
Query:  LSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRF
        L  Q QQ   +EN+SNLTSASG+  ASVSS N  E     + P  QQ    ++++    +                     P++EVIALSPKTLMATNRF
Subjt:  LSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRF

Query:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG
        VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CG
Subjt:  VCEICNKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICG

Query:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SN
        TKEYRCDCGTLFSRRDSFITHRAFC+ALA+E+AR             NPLL                         SS+N+N  NS  F         SN
Subjt:  TKEYRCDCGTLFSRRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SN

Query:  NNNNNI--FALKQDFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQM
        N+NN++  F +K++  SN++  N   + IPPWL P                  H   + NPNPS  G G  +L        SL+SP +SATALLQKAAQM
Subjt:  NNNNNI--FALKQDFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQM

Query:  GATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFL
        G+T +     + +  R                                      + + N      A+   + S   S+ N   + ++    +AS      
Subjt:  GATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFL

Query:  QDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
        ++ +FD+    F   +  TA   +     SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  QDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

AT3G13810.3 indeterminate(ID)-domain 111.2e-10046.03Show/hide
Query:  LSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ
        +SNLTSASG+  ASVSS N  E     + P  QQ    ++++    +                     P++EVIALSPKTLMATNRFVCEICNKGFQRDQ
Subjt:  LSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPD-------------------PDAEVIALSPKTLMATNRFVCEICNKGFQRDQ

Query:  NLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS
        NLQLHRRGHNLPWKLKQRSNKEVI+KKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSD KAHSK CGTKEYRCDCGTLFS
Subjt:  NLQLHRRGHNLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFS

Query:  RRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SNNNNNNI--FALKQ
        RRDSFITHRAFC+ALA+E+AR             NPLL                         SS+N+N  NS  F         SNN+NN++  F +K+
Subjt:  RRDSFITHRAFCDALADESARSA--------TTLNPLL-------------------------SSYNNN--NSQEF---------SNNNNNNI--FALKQ

Query:  DFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGS
        +  SN++  N   + IPPWL P                  H   + NPNPS  G G  +L        SL+SP +SATALLQKAAQMG+T +     + +
Subjt:  DFSSNNN-NNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHISATALLQKAAQMGATMSSTTTTSGS

Query:  MPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFA
          R                                      + + N      A+   + S   S+ N   + ++    +AS      ++ +FD+    F 
Subjt:  MPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPFLQDSSFDNDAAAFA

Query:  AMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG
          +  TA   +     SGG  +GLTRDFLGLRPL SH +ILS  G G+CI  NSS       KPWQG
Subjt:  AMHHHTATVPAANVAASGGRSDGLTRDFLGLRPL-SHGDILSITGFGNCIVPNSSNLQSQIQKPWQG

AT3G50700.1 indeterminate(ID)-domain 28.4e-9155.52Show/hide
Query:  MEENLSNLTSASGEATASVSS-ANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
        M  +L N ++ SGEA+ S+SS  N+N  PN             KKKRNLPG PDP++EVIALSPKTL+ATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK
Subjt:  MEENLSNLTSASGEATASVSS-ANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGHNLPWK

Query:  LKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA
        L+Q+SNKEV KKKVYVCPE SCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEY+CDCGTLFSRRDSFITHRAFCDA
Subjt:  LKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDA

Query:  LADESARS------------ATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIG-----SGHDQHHHHQTL
        LA+E+ARS             T  NP+ +        E +   +++   +KQ  S      +      P          +F G     S     +   + 
Subjt:  LADESARS------------ATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIG-----SGHDQHHHHQTL

Query:  NPNPNPSGIGCGPSNLAAPPYQSSSL-------SSPHISATALLQKAAQMGATMSSTTTTSG
        + +   S     P +L       SS        + P +SATALLQKAAQMGA  S  +   G
Subjt:  NPNPNPSGIGCGPSNLAAPPYQSSSL-------SSPHISATALLQKAAQMGATMSSTTTTSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAAGTAGTCATGGAGGAAAATTTGTCCAATTTGACTTCTGCTTCTGGTGAAGCCACTGCTAGTGTTTCTTC
TGCCAATAAAAATGAGTTCCCCAATCAATATTTTACTCCTCAAACCCAACAACCGCCCCCTCCAAAGAAGAAGCGGAACCTCCCAGGAAATCCCGACCCAGATGCGGAAG
TGATAGCGTTATCGCCGAAGACGCTGATGGCGACGAATCGGTTCGTGTGCGAGATCTGCAACAAGGGGTTTCAGAGAGATCAAAATCTTCAGCTGCATAGAAGAGGGCAC
AATCTTCCATGGAAGCTGAAGCAGAGATCAAACAAAGAGGTGATAAAGAAGAAGGTATATGTTTGTCCAGAGCCAAGCTGTGTCCACCATGATCCATCAAGAGCTCTTGG
AGACCTCACAGGAATCAAGAAGCACTTCTGCAGAAAGCACGGTGAAAAGAAGTGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGAAAGCTCACT
CCAAGATCTGTGGCACAAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGGAGAGATAGTTTCATTACACATAGAGCTTTCTGTGATGCATTAGCAGATGAAAGT
GCAAGATCAGCCACGACATTAAACCCTCTTCTCTCTTCTTACAACAATAACAATTCACAAGAATTCTCGAATAACAACAACAATAATATTTTCGCTCTGAAACAAGATTT
CAGCAGCAACAACAATAATAACATGAGATCGGCGGAAATCCCGCCGTGGCTGCAGCCTTCTGATCTCCAAGCGGAAATCTTCATTGGGAGCGGCCACGACCAGCATCATC
ATCATCAAACCCTAAACCCTAACCCTAACCCTAGCGGAATCGGGTGTGGGCCCAGTAATCTTGCTGCTCCTCCATATCAATCCTCTTCTCTTTCATCTCCTCATATTTCA
GCCACTGCACTGCTGCAGAAGGCAGCCCAAATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTATGCCAAGGCCCCACAAGCTGGTTCACGTGTCTACAGG
TAATTTTGGGCAGATGGGTTTATGCTCACGTGAAGGTGAAATGGGAACAGGAGGAGGGGCCGTCAGTTGTAGTAGTAGTAGTTGTACTGATTATAGGAATAAAGCTGCAG
CTGCTTCTGCTTCTGCTTCTGCTACTGCTTCTGCTTCTGCTTCTGCACCCAACACATTTCTTCATGACATGATAAATTCCCTCTCTTCTGCTTCTGCCTCTCATCCCTTC
CTCCAAGATTCCTCCTTCGACAACGACGCCGCCGCTTTCGCCGCTATGCATCATCACACAGCCACCGTCCCCGCCGCCAACGTCGCTGCTTCGGGCGGTCGAAGCGACGG
TTTGACGAGAGATTTCTTGGGGCTTCGCCCTCTTTCCCATGGAGATATTCTAAGCATTACTGGTTTTGGAAACTGCATTGTTCCCAATTCGTCCAATCTTCAGAGCCAAA
TTCAGAAGCCATGGCAGGGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGATGAAAGGTAATTTTTTGTCTCAACAACAACAACAAGTAGTCATGGAGGAAAATTTGTCCAATTTGACTTCTGCTTCTGGTGAAGCCACTGCTAGTGTTTCTTC
TGCCAATAAAAATGAGTTCCCCAATCAATATTTTACTCCTCAAACCCAACAACCGCCCCCTCCAAAGAAGAAGCGGAACCTCCCAGGAAATCCCGACCCAGATGCGGAAG
TGATAGCGTTATCGCCGAAGACGCTGATGGCGACGAATCGGTTCGTGTGCGAGATCTGCAACAAGGGGTTTCAGAGAGATCAAAATCTTCAGCTGCATAGAAGAGGGCAC
AATCTTCCATGGAAGCTGAAGCAGAGATCAAACAAAGAGGTGATAAAGAAGAAGGTATATGTTTGTCCAGAGCCAAGCTGTGTCCACCATGATCCATCAAGAGCTCTTGG
AGACCTCACAGGAATCAAGAAGCACTTCTGCAGAAAGCACGGTGAAAAGAAGTGGAAATGTGATAAGTGCTCTAAGAAATATGCAGTTCAATCTGATTGGAAAGCTCACT
CCAAGATCTGTGGCACAAAGGAGTACAGATGTGACTGTGGAACTCTCTTTTCAAGGAGAGATAGTTTCATTACACATAGAGCTTTCTGTGATGCATTAGCAGATGAAAGT
GCAAGATCAGCCACGACATTAAACCCTCTTCTCTCTTCTTACAACAATAACAATTCACAAGAATTCTCGAATAACAACAACAATAATATTTTCGCTCTGAAACAAGATTT
CAGCAGCAACAACAATAATAACATGAGATCGGCGGAAATCCCGCCGTGGCTGCAGCCTTCTGATCTCCAAGCGGAAATCTTCATTGGGAGCGGCCACGACCAGCATCATC
ATCATCAAACCCTAAACCCTAACCCTAACCCTAGCGGAATCGGGTGTGGGCCCAGTAATCTTGCTGCTCCTCCATATCAATCCTCTTCTCTTTCATCTCCTCATATTTCA
GCCACTGCACTGCTGCAGAAGGCAGCCCAAATGGGTGCGACCATGAGTAGTACCACTACCACGAGTGGCTCTATGCCAAGGCCCCACAAGCTGGTTCACGTGTCTACAGG
TAATTTTGGGCAGATGGGTTTATGCTCACGTGAAGGTGAAATGGGAACAGGAGGAGGGGCCGTCAGTTGTAGTAGTAGTAGTTGTACTGATTATAGGAATAAAGCTGCAG
CTGCTTCTGCTTCTGCTTCTGCTACTGCTTCTGCTTCTGCTTCTGCACCCAACACATTTCTTCATGACATGATAAATTCCCTCTCTTCTGCTTCTGCCTCTCATCCCTTC
CTCCAAGATTCCTCCTTCGACAACGACGCCGCCGCTTTCGCCGCTATGCATCATCACACAGCCACCGTCCCCGCCGCCAACGTCGCTGCTTCGGGCGGTCGAAGCGACGG
TTTGACGAGAGATTTCTTGGGGCTTCGCCCTCTTTCCCATGGAGATATTCTAAGCATTACTGGTTTTGGAAACTGCATTGTTCCCAATTCGTCCAATCTTCAGAGCCAAA
TTCAGAAGCCATGGCAGGGTTAG
Protein sequenceShow/hide protein sequence
MMMKGNFLSQQQQQVVMEENLSNLTSASGEATASVSSANKNEFPNQYFTPQTQQPPPPKKKRNLPGNPDPDAEVIALSPKTLMATNRFVCEICNKGFQRDQNLQLHRRGH
NLPWKLKQRSNKEVIKKKVYVCPEPSCVHHDPSRALGDLTGIKKHFCRKHGEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYRCDCGTLFSRRDSFITHRAFCDALADES
ARSATTLNPLLSSYNNNNSQEFSNNNNNNIFALKQDFSSNNNNNMRSAEIPPWLQPSDLQAEIFIGSGHDQHHHHQTLNPNPNPSGIGCGPSNLAAPPYQSSSLSSPHIS
ATALLQKAAQMGATMSSTTTTSGSMPRPHKLVHVSTGNFGQMGLCSREGEMGTGGGAVSCSSSSCTDYRNKAAAASASASATASASASAPNTFLHDMINSLSSASASHPF
LQDSSFDNDAAAFAAMHHHTATVPAANVAASGGRSDGLTRDFLGLRPLSHGDILSITGFGNCIVPNSSNLQSQIQKPWQG