; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0028146 (gene) of Chayote v1 genome

Gene IDSed0028146
OrganismSechium edule (Chayote v1)
DescriptionHydroxyproline-rich glycoprotein
Genome locationLG01:15552574..15556688
RNA-Seq ExpressionSed0028146
SyntenySed0028146
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442261.1 PREDICTED: uncharacterized protein At4g04980 isoform X1 [Cucumis melo]1.3e-28666.94Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SKT++KKPKLSK  RSKKSSRCKDN+VQMMELRKKILILRDIIDLP++E S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIGDSWMLNHEWRDKSKYN SS QENSSF EIVE+VL +IDCIVSMANERFD ++E+VNSK + +SR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VLANFR+++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ +RV I+EESTMDVDD+ TS+  DA+D N EM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQL
         LS++ SQKAD ++ IE+ D KEEKL+LS+  S K DRNEE+EV +I+E   +SK +SQ+ D  E T   D +   +      S     + + + +  ++
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQL

Query:  SQPPPPPPLP------------------------------QIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLP
        + PPPPPP P                              Q  V     +   P PPPPP  +I+     + +   PPPPPP+PQ+  QP AA S AP P
Subjt:  SQPPPPPPLP------------------------------QIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLP

Query:  PPPPFKVVASVVH---------------SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK
        PP   KV+ +V+                +MV AGVPPPPPM   KG    APPP M QGNG   PPPPPGGA +SLR KK +TKLKRS+QLG LYRTLK 
Subjt:  PPPPFKVVASVVH---------------SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK

Query:  KVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFE
        KVEG NQNLK A+GRKGG GNS GGKQGMADALAEMTKRSAYFQQIEED +KH+ SITALKSSIS+F+S++M++LL FHK VES+LE LTDESQVLARFE
Subjt:  KVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFE

Query:  GFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALK
        GFPIKKLETLR A ALYLKLD IV QLQNWK VSPM  LLDRVE+YF+KIKGE+DALERTKDEESKRF+ HGI+FDFSVLIRIKESMVDVSS CMELALK
Subjt:  GFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALK

Query:  EKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        EK ELKAA E+++   +SEN NKA SKMLW+ FQFAYRVYTFAGGHDERADRLTRELA+EIESESH L
Subjt:  EKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

XP_031736251.1 uncharacterized protein At4g04980 isoform X1 [Cucumis sativus]1.3e-28468.17Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SKT++KK KLSK  RSKKSS CKDN+VQMMELRKKILILRDIIDLP++E S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIGDSWMLNHEWRDKSKYN SS QENSSF EIVE+VL +IDCIVSMANERFD ++E+VNSK + YSR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VLANFRN++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ +RV I+EES MD+DD+  S+  DA+D N EM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEI-KEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQ
         LS++ SQKADR++ IE+ D KEEKL+LS+  S K DRNEE+EV +I +E L +++ +SQK D  E T+  D +          S     + + + +   
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEI-KEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQ

Query:  LSQPPPPPPLPQIGVQPIATTSMAPL---PPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVH-------------
        ++ PPPPPP P   ++P   T   P    PPPPP  +++           PPPPPP+PQ+  QP AA    P PPP   KV+ + +              
Subjt:  LSQPPPPPPLPQIGVQPIATTSMAPL---PPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVH-------------

Query:  -SMVNAGVPPPPPMSSLKGGA----PPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK
         +MV AGVPPPPPM+  KG A    PPP M QGNG   PPPPPGGA +SLR+KKA+TKLKRS+ LG LYRTLK KVEGSNQNLK A+GRKGG GNS GGK
Subjt:  -SMVNAGVPPPPPMSSLKGGA----PPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK

Query:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ
        QGMADALAEMTKRSAYFQQIEED +KH+ SITALK SIS+F+S++M +LL FHK VES+LE LTDESQVLARFEGFPIKKLETLR A ALYLKLD IV Q
Subjt:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ

Query:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACS
        LQNWK VSPM  LLDRVE+YF KIKGE+DALERTKDEESKRF+ HGI+FDFSVLIRIKESMVDVSS CMELALKEK ELKAA E+S+   +S+N NKA S
Subjt:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACS

Query:  KMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        KMLW+ FQFAYRVYTFAGGHD+RADRLTRELA+EIESESH L
Subjt:  KMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

XP_031736252.1 uncharacterized protein At4g04980 isoform X2 [Cucumis sativus]1.3e-28468.17Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SKT++KK KLSK  RSKKSS CKDN+VQMMELRKKILILRDIIDLP++E S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIGDSWMLNHEWRDKSKYN SS QENSSF EIVE+VL +IDCIVSMANERFD ++E+VNSK + YSR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VLANFRN++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ +RV I+EES MD+DD+  S+  DA+D N EM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEI-KEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQ
         LS++ SQKADR++ IE+ D KEEKL+LS+  S K DRNEE+EV +I +E L +++ +SQK D  E T+  D +          S     + + + +   
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEI-KEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQ

Query:  LSQPPPPPPLPQIGVQPIATTSMAPL---PPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVH-------------
        ++ PPPPPP P   ++P   T   P    PPPPP  +++           PPPPPP+PQ+  QP AA    P PPP   KV+ + +              
Subjt:  LSQPPPPPPLPQIGVQPIATTSMAPL---PPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVH-------------

Query:  -SMVNAGVPPPPPMSSLKGGA----PPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK
         +MV AGVPPPPPM+  KG A    PPP M QGNG   PPPPPGGA +SLR+KKA+TKLKRS+ LG LYRTLK KVEGSNQNLK A+GRKGG GNS GGK
Subjt:  -SMVNAGVPPPPPMSSLKGGA----PPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK

Query:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ
        QGMADALAEMTKRSAYFQQIEED +KH+ SITALK SIS+F+S++M +LL FHK VES+LE LTDESQVLARFEGFPIKKLETLR A ALYLKLD IV Q
Subjt:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ

Query:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACS
        LQNWK VSPM  LLDRVE+YF KIKGE+DALERTKDEESKRF+ HGI+FDFSVLIRIKESMVDVSS CMELALKEK ELKAA E+S+   +S+N NKA S
Subjt:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACS

Query:  KMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        KMLW+ FQFAYRVYTFAGGHD+RADRLTRELA+EIESESH L
Subjt:  KMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

XP_038895361.1 uncharacterized protein At4g04980-like isoform X1 [Benincasa hispida]5.3e-26760.76Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SK++TKKPK SK  RSKKSSRCKDN+VQMMELRKKI+ILRDIIDLP +E S SINELVVGTMEDL KLYPEIIS +QYSEMK TCIEQ+L +FCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIG+SWMLNHEWRDKSKYN SS QEN+SF EIVE+VLA+IDCIVSMANERFD ++E+VNSK + YSR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VL NFRN++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ + V I+EES MDVDD+ TS   DA+D NEEM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQK--KDRN-----------EEVEVSEIKEGLD-----------------------------------
         LS + SQ+AD ++ +E+SDIKEEKL+LS+ +SQK   +R            +EV  SE+   +                                    
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQK--KDRN-----------EEVEVSEIKEGLD-----------------------------------

Query:  -----------MSKEDSQKADRI------------EETEVSDIKE------------GELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIA
                   M ++ +  A ++            ++ EV  + +             E+    +       +++ EV   LSQPPPPPP P +    + 
Subjt:  -----------MSKEDSQKADRI------------EETEVSDIKE------------GELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIA

Query:  TTSMAPLPPPPPFKVIETVVHSMV---------------NAGV------PPPPPPLPQIGVQPTAAAS-MAPLPPPPPFKVVASVVH-------------
          S  P PPPPP      V+   +               NA +      PPPPPPLPQ+  QP AAA+ +AP PPP   KV+ +V+              
Subjt:  TTSMAPLPPPPPFKVIETVVHSMV---------------NAGV------PPPPPPLPQIGVQPTAAAS-MAPLPPPPPFKVVASVVH-------------

Query:  --SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK
          + V AG+PPPPPM+   GG   APPPPM QGNG   PPPPPGGA +SLR KK +TKLKRS+QLG LYRTLK KVEGSNQNLK A+GRKG  G+S GGK
Subjt:  --SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK

Query:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ
        QGMADALAEMTKRSAYFQQIE+D +KH+ SIT LKSSIS+F+S++M+++L FHK VES+LE LTDESQVLARFEGFP KKLET+R A ALYLKLDAIV Q
Subjt:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ

Query:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELK-AAIERSQNEVKSENLNKAC
        LQNWK VSPM QLLDRVESYF+KIKGE+DALERTKDEESKRF+SHGI+FDFSVLIRIKESMVDVSS+CMELALKEK ELK AA E++QN  +SEN NKA 
Subjt:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELK-AAIERSQNEVKSENLNKAC

Query:  SKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        SKMLW+ FQFAYRVYTFAGGHD+RADRLTRELALEIESES  L
Subjt:  SKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

XP_038895362.1 uncharacterized protein At4g04980-like isoform X2 [Benincasa hispida]5.3e-26760.76Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SK++TKKPK SK  RSKKSSRCKDN+VQMMELRKKI+ILRDIIDLP +E S SINELVVGTMEDL KLYPEIIS +QYSEMK TCIEQ+L +FCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIG+SWMLNHEWRDKSKYN SS QEN+SF EIVE+VLA+IDCIVSMANERFD ++E+VNSK + YSR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VL NFRN++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ + V I+EES MDVDD+ TS   DA+D NEEM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQK--KDRN-----------EEVEVSEIKEGLD-----------------------------------
         LS + SQ+AD ++ +E+SDIKEEKL+LS+ +SQK   +R            +EV  SE+   +                                    
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQK--KDRN-----------EEVEVSEIKEGLD-----------------------------------

Query:  -----------MSKEDSQKADRI------------EETEVSDIKE------------GELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIA
                   M ++ +  A ++            ++ EV  + +             E+    +       +++ EV   LSQPPPPPP P +    + 
Subjt:  -----------MSKEDSQKADRI------------EETEVSDIKE------------GELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIA

Query:  TTSMAPLPPPPPFKVIETVVHSMV---------------NAGV------PPPPPPLPQIGVQPTAAAS-MAPLPPPPPFKVVASVVH-------------
          S  P PPPPP      V+   +               NA +      PPPPPPLPQ+  QP AAA+ +AP PPP   KV+ +V+              
Subjt:  TTSMAPLPPPPPFKVIETVVHSMV---------------NAGV------PPPPPPLPQIGVQPTAAAS-MAPLPPPPPFKVVASVVH-------------

Query:  --SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK
          + V AG+PPPPPM+   GG   APPPPM QGNG   PPPPPGGA +SLR KK +TKLKRS+QLG LYRTLK KVEGSNQNLK A+GRKG  G+S GGK
Subjt:  --SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK

Query:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ
        QGMADALAEMTKRSAYFQQIE+D +KH+ SIT LKSSIS+F+S++M+++L FHK VES+LE LTDESQVLARFEGFP KKLET+R A ALYLKLDAIV Q
Subjt:  QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQ

Query:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELK-AAIERSQNEVKSENLNKAC
        LQNWK VSPM QLLDRVESYF+KIKGE+DALERTKDEESKRF+SHGI+FDFSVLIRIKESMVDVSS+CMELALKEK ELK AA E++QN  +SEN NKA 
Subjt:  LQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELK-AAIERSQNEVKSENLNKAC

Query:  SKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        SKMLW+ FQFAYRVYTFAGGHD+RADRLTRELALEIESES  L
Subjt:  SKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

TrEMBL top hitse value%identityAlignment
A0A0A0LSA3 Uncharacterized protein6.1e-28569.3Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SKT++KK KLSK  RSKKSS CKDN+VQMMELRKKILILRDIIDLP++E S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIGDSWMLNHEWRDKSKYN SS QENSSF EIVE+VL +IDCIVSMANERFD ++E+VNSK + YSR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VLANFRN++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ +RV I+EES MD+DD+  S+  DA+D N EM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEI-KEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEEIEVTQQL
         LS++ SQKADR++ IE+ D KEEKL+LS+  S K DRNEE+EV +I +E L +++ +SQK          DI E   D    DSQ    ++E+  +   
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEI-KEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEEIEVTQQL

Query:  SQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLK
        + PPPPPP+PQ+  QP A     P PPP   KVIET     +   VPPPPPP                          ++   +MV AGVPPPPPM+  K
Subjt:  SQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLK

Query:  GGA----PPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQ
        G A    PPP M QGNG   PPPPPGGA +SLR+KKA+TKLKRS+ LG LYRTLK KVEGSNQNLK A+GRKGG GNS GGKQGMADALAEMTKRSAYFQ
Subjt:  GGA----PPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQ

Query:  QIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVE
        QIEED +KH+ SITALK SIS+F+S++M +LL FHK VES+LE LTDESQVLARFEGFPIKKLETLR A ALYLKLD IV QLQNWK VSPM  LLDRVE
Subjt:  QIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVE

Query:  SYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAG
        +YF KIKGE+DALERTKDEESKRF+ HGI+FDFSVLIRIKESMVDVSS CMELALKEK ELKAA E+S+   +S+N NKA SKMLW+ FQFAYRVYTFAG
Subjt:  SYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAG

Query:  GHDERADRLTRELALEIESESHQL
        GHD+RADRLTRELA+EIESESH L
Subjt:  GHDERADRLTRELALEIESESHQL

A0A1S3B5Y5 uncharacterized protein At4g04980 isoform X16.5e-28766.94Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SKT++KKPKLSK  RSKKSSRCKDN+VQMMELRKKILILRDIIDLP++E S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIGDSWMLNHEWRDKSKYN SS QENSSF EIVE+VL +IDCIVSMANERFD ++E+VNSK + +SR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VLANFR+++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ +RV I+EESTMDVDD+ TS+  DA+D N EM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQL
         LS++ SQKAD ++ IE+ D KEEKL+LS+  S K DRNEE+EV +I+E   +SK +SQ+ D  E T   D +   +      S     + + + +  ++
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQL

Query:  SQPPPPPPLP------------------------------QIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLP
        + PPPPPP P                              Q  V     +   P PPPPP  +I+     + +   PPPPPP+PQ+  QP AA S AP P
Subjt:  SQPPPPPPLP------------------------------QIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLP

Query:  PPPPFKVVASVVH---------------SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK
        PP   KV+ +V+                +MV AGVPPPPPM   KG    APPP M QGNG   PPPPPGGA +SLR KK +TKLKRS+QLG LYRTLK 
Subjt:  PPPPFKVVASVVH---------------SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK

Query:  KVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFE
        KVEG NQNLK A+GRKGG GNS GGKQGMADALAEMTKRSAYFQQIEED +KH+ SITALKSSIS+F+S++M++LL FHK VES+LE LTDESQVLARFE
Subjt:  KVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFE

Query:  GFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALK
        GFPIKKLETLR A ALYLKLD IV QLQNWK VSPM  LLDRVE+YF+KIKGE+DALERTKDEESKRF+ HGI+FDFSVLIRIKESMVDVSS CMELALK
Subjt:  GFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALK

Query:  EKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        EK ELKAA E+++   +SEN NKA SKMLW+ FQFAYRVYTFAGGHDERADRLTRELA+EIESESH L
Subjt:  EKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

A0A5A7UTX7 Hydroxyproline-rich glycoprotein6.5e-28766.94Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSYV SKT++KKPKLSK  RSKKSSRCKDN+VQMMELRKKILILRDIIDLP++E S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+SIGDSWMLNHEWRDKSKYN SS QENSSF EIVE+VL +IDCIVSMANERFD ++E+VNSK + +SR SSF KS+SST+S SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI
        VLANFR+++RKS+EKE  SCSSPLLWSLRV+AVEKLNPID+KHLLLPRLSH  VN CP+ +RV I+EESTMDVDD+ TS+  DA+D N EM+V   KEE 
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEI

Query:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQL
         LS++ SQKAD ++ IE+ D KEEKL+LS+  S K DRNEE+EV +I+E   +SK +SQ+ D  E T   D +   +      S     + + + +  ++
Subjt:  GLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEE-IEVTQQL

Query:  SQPPPPPPLP------------------------------QIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLP
        + PPPPPP P                              Q  V     +   P PPPPP  +I+     + +   PPPPPP+PQ+  QP AA S AP P
Subjt:  SQPPPPPPLP------------------------------QIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLP

Query:  PPPPFKVVASVVH---------------SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK
        PP   KV+ +V+                +MV AGVPPPPPM   KG    APPP M QGNG   PPPPPGGA +SLR KK +TKLKRS+QLG LYRTLK 
Subjt:  PPPPFKVVASVVH---------------SMVNAGVPPPPPMSSLKGG---APPPPMPQGNGLT-PPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK

Query:  KVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFE
        KVEG NQNLK A+GRKGG GNS GGKQGMADALAEMTKRSAYFQQIEED +KH+ SITALKSSIS+F+S++M++LL FHK VES+LE LTDESQVLARFE
Subjt:  KVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFE

Query:  GFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALK
        GFPIKKLETLR A ALYLKLD IV QLQNWK VSPM  LLDRVE+YF+KIKGE+DALERTKDEESKRF+ HGI+FDFSVLIRIKESMVDVSS CMELALK
Subjt:  GFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALK

Query:  EKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL
        EK ELKAA E+++   +SEN NKA SKMLW+ FQFAYRVYTFAGGHDERADRLTRELA+EIESESH L
Subjt:  EKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL

A0A6J1JVH3 uncharacterized protein At4g04980-like isoform X11.3e-26357.58Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSY LSK ++KK KLSKVVR KKSSRCKDN+VQ+MELRKKIL+LRDIIDLP++E+S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+ IGDSWM+NHEW+DKSKYN  S QENSSFNEIVE+VLA+IDC+VSMANERFD +++YVN+K + YSR SSF KSTSS ES SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQ-PTSKKIDA-------------SD
        VL N RN+ RKS EKE  SCSSPLLWSLRV+AVEKLNPIDIKHLLLP+L H  VN CP+ +RV I+EES   VDD+  +SKK DA             +D
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQ-PTSKKIDA-------------SD

Query:  VNEEMKVSVTKEEIGLSRDDSQKADRSDGIEM-------------------------------------------------------------------S
         NEEM+VSV KEE   SR  SQ+AD ++ +E+                                                                    
Subjt:  VNEEMKVSVTKEEIGLSRDDSQKADRSDGIEM-------------------------------------------------------------------S

Query:  DIK----EEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSRE-------------DSQKADRIEEIEVTQQLSQP
        DI+    EE+ D S   SQK  RNE++EVS   E  D+S+  SQKADR EE EVSDIKE +L +SR              DSQ A  +EE+   +  S P
Subjt:  DIK----EEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSRE-------------DSQKADRIEEIEVTQQLSQP

Query:  PP---PPPL----PQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVV---------HSMVN
         P   PPP+    P   V  + T S  P PPPPP  + +  V S     +  PPPPLPQI V P AAA M P PPPPP K V +++          S  N
Subjt:  PP---PPPL----PQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVV---------HSMVN

Query:  AG----------------------------------VPPPPPMSSLKGG---------------------------------------------------
        AG                                  VPPPPPM+SL+GG                                                   
Subjt:  AG----------------------------------VPPPPPMSSLKGG---------------------------------------------------

Query:  APPPPMPQGNGL-TPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDF
        APPPPMPQG G+  PPPPPGGA +SLRSKKAAT+LKRS+QLG LYRTLK KVEGSNQNL+G+ G+KG GG+SAGGKQGMADALAEMTKRSAYFQQIEED 
Subjt:  APPPPMPQGNGL-TPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDF

Query:  RKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKI
        + H+ +I  LK  ISTF+S++M+EL+ FH+SVES+LE LTDESQVLARFEGFP KKLE LRTA ALYLKLDAIV QLQNWKIVSP  QLLDR+E+YF+KI
Subjt:  RKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKI

Query:  KGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAI-ERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDER
        KG+LDALERTKD+E+KRFKSHGI+FDF+VLIRIKESMVD+SS+CMELALKE  ELKAA  E+++N  K +N N A SKMLWK FQFAYRVYTFAGGHDER
Subjt:  KGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAI-ERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDER

Query:  ADRLTRELALEIESES
        ADRLTRELALEIESES
Subjt:  ADRLTRELALEIESES

A0A6J1K4W2 uncharacterized protein At4g04980-like isoform X21.3e-26357.58Show/hide
Query:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT
        SSY LSK ++KK KLSKVVR KKSSRCKDN+VQ+MELRKKIL+LRDIIDLP++E+S SINELVVGTMEDL KLYPEIISD+QYSEMK TCIEQSL YFCT
Subjt:  SSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCT

Query:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS
        AL+ IGDSWM+NHEW+DKSKYN  S QENSSFNEIVE+VLA+IDC+VSMANERFD +++YVN+K + YSR SSF KSTSS ES SETNSSCCSSPETPTS
Subjt:  ALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSK-NYYSRASSFRKSTSSTESSSETNSSCCSSPETPTS

Query:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQ-PTSKKIDA-------------SD
        VL N RN+ RKS EKE  SCSSPLLWSLRV+AVEKLNPIDIKHLLLP+L H  VN CP+ +RV I+EES   VDD+  +SKK DA             +D
Subjt:  VLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQ-PTSKKIDA-------------SD

Query:  VNEEMKVSVTKEEIGLSRDDSQKADRSDGIEM-------------------------------------------------------------------S
         NEEM+VSV KEE   SR  SQ+AD ++ +E+                                                                    
Subjt:  VNEEMKVSVTKEEIGLSRDDSQKADRSDGIEM-------------------------------------------------------------------S

Query:  DIK----EEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSRE-------------DSQKADRIEEIEVTQQLSQP
        DI+    EE+ D S   SQK  RNE++EVS   E  D+S+  SQKADR EE EVSDIKE +L +SR              DSQ A  +EE+   +  S P
Subjt:  DIK----EEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSRE-------------DSQKADRIEEIEVTQQLSQP

Query:  PP---PPPL----PQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVV---------HSMVN
         P   PPP+    P   V  + T S  P PPPPP  + +  V S     +  PPPPLPQI V P AAA M P PPPPP K V +++          S  N
Subjt:  PP---PPPL----PQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVV---------HSMVN

Query:  AG----------------------------------VPPPPPMSSLKGG---------------------------------------------------
        AG                                  VPPPPPM+SL+GG                                                   
Subjt:  AG----------------------------------VPPPPPMSSLKGG---------------------------------------------------

Query:  APPPPMPQGNGL-TPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDF
        APPPPMPQG G+  PPPPPGGA +SLRSKKAAT+LKRS+QLG LYRTLK KVEGSNQNL+G+ G+KG GG+SAGGKQGMADALAEMTKRSAYFQQIEED 
Subjt:  APPPPMPQGNGL-TPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDF

Query:  RKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKI
        + H+ +I  LK  ISTF+S++M+EL+ FH+SVES+LE LTDESQVLARFEGFP KKLE LRTA ALYLKLDAIV QLQNWKIVSP  QLLDR+E+YF+KI
Subjt:  RKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKI

Query:  KGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAI-ERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDER
        KG+LDALERTKD+E+KRFKSHGI+FDF+VLIRIKESMVD+SS+CMELALKE  ELKAA  E+++N  K +N N A SKMLWK FQFAYRVYTFAGGHDER
Subjt:  KGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAI-ERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDER

Query:  ADRLTRELALEIESES
        ADRLTRELALEIESES
Subjt:  ADRLTRELALEIESES

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049802.3e-10339.92Show/hide
Query:  MEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSKNY
        M+DL KL PEI++  Q  EM+   +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S + +VE VLA +D ++    ERF  ++     +  
Subjt:  MEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSKNY

Query:  YSR----ASSFRKSTSSTESSSETNSSCCSSPETPTSVLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRV
        ++     +S  R+S + + S SE+N+S   SP TP SVL          T   +++ +SP LW+LR +A+++L+P+D+K   +  LS  R +   S++++
Subjt:  YSR----ASSFRKSTSSTESSSETNSSCCSSPETPTSVLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRV

Query:  TIIEESTMD-------VDDQPTSKKIDASDVNEEMKVSVTKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLSKE-DSQKKDRNEEVEVSEIKEGLDMSK
         I EE+           ++      ++     +E+K    +E  G   +   K D S+G E    + E  +   E D+     + E E   I EGL+ ++
Subjt:  TIIEESTMD-------VDDQPTSKKIDASDVNEEMKVSVTKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLSKE-DSQKKDRNEEVEVSEIKEGLDMSK

Query:  EDSQKADRIE--ETEVSDIKEGELDMSREDSQKAD-RIEEIEVTQQLSQP--PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPP
         +    D IE  ETE   I+  E +   E    ++    E + T+   +   PPPPPL         T+   P P    F    ++         PPPPP
Subjt:  EDSQKADRIE--ETEVSDIKEGELDMSREDSQKAD-RIEEIEVTQQLSQP--PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPP

Query:  PLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLKGGAPPPPMPQGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK
        P P+           AP PPPPP    AS       +G       +    G   P MP      PP PPG   +SL  KKA +KL+RS Q+  LY  LK 
Subjt:  PLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLKGGAPPPPMPQGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKK

Query:  KVEGSNQNLKGAHGRKGGGGNSAGGK-------QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDES
        K+EG  + ++G   +   G NS   K        GMADALAEMTKRS+YFQQIEED +K++ SI  LKSSI +F++ +M ELL FH  VESILEKLTDE+
Subjt:  KVEGSNQNLKGAHGRKGGGGNSAGGK-------QGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDES

Query:  QVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSN
        QVLARFEGFP KKLE +RTA ALY KLD I+V+L+NWKI  P+  LLD++E YFNK KGE++ +ERTKDE++K FK + I  DF VL+++KE+MVDVSSN
Subjt:  QVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSN

Query:  CMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQ
        CMELALKE+ E   A E ++N  +S+ + +  +K LW+ FQFA++VYTFAGGHDERAD LTR+LA EI+++  Q
Subjt:  CMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQ

Q84ZL0 Formin-like protein 51.4e-0742.5Show/hide
Query:  PPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLKGG
        PPPPPP P+ GV      +  P PPPPP +       S V A  PPPPPP P +  +P++ A   P PPPPP     S   S   +  PPPPP   L   
Subjt:  PPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLKGG

Query:  APPPPMPQ--GNGLTPPPPP
         PPPP P    +   PPPPP
Subjt:  APPPPMPQ--GNGLTPPPPP

Q9FLQ7 Formin-like protein 206.2e-0842.11Show/hide
Query:  SQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASV----VHSMVNAGVPPPPPM
        S PPPPPP P  G  P       P PPPPP              G PPPPPP       P  +    P PPPPPF  V+S+        ++ G PPPPP 
Subjt:  SQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASV----VHSMVNAGVPPPPPM

Query:  SSLKGGAPPPPMP---QGNGLTPPPPP---GGA
          + GGAPPPP P    G    PPPPP   GGA
Subjt:  SSLKGGAPPPPMP---QGNGLTPPPPP---GGA

Q9JL04 Formin-25.6e-0938.84Show/hide
Query:  PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVA-SVVHSMVNAGVPPPPPMSSLKGG
        PPPPPLP +G+ P+       +PPPPP   +       +     PPPPPLP +G+ P        +PPPPP   V       +   G+PPPPP+  +  G
Subjt:  PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVA-SVVHSMVNAGVPPPPPMSSLKGG

Query:  APPPPMPQGNGLTPPPPPGGA
         PPPP   G G+ PPPP  G+
Subjt:  APPPPMPQGNGLTPPPPPGGA

Q9NZ56 Formin-21.7e-1040.77Show/hide
Query:  PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVP----------PPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPP
        PPPPPLP +G+ P      A +PPPPP          +  AG+P          PPPPPLP++G+ P      A +PPPPP          +  AG+PPP
Subjt:  PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVP----------PPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPP

Query:  PPMSSLKGGAPPPPMPQGNGLTPPPPPGGA
        PP+  +  G PPPP   G G+ PPPP  GA
Subjt:  PPMSSLKGGAPPPPMPQGNGLTPPPPPGGA

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown4.7e-1126.27Show/hide
Query:  PPPMP----QGNGLTPPPPPGGAFQSLRSKKAATK--LKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIE
        P P P    Q     PPPPP      L SK+   K  ++R+ ++   YR L K+ E    N    +G      N         + + E+  RS Y   I+
Subjt:  PPPMP----QGNGLTPPPPPGGAFQSLRSKKAATK--LKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIE

Query:  EDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWK--IVSPMTQLLDRVES
         D  +H + I  L S +     T++SE+ +F K ++  L  L DE  VL  F  +P +K+++LR A   Y +   +  ++ ++K      +TQ L R++S
Subjt:  EDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWK--IVSPMTQLLDRVES

Query:  YFNKIKGELDALERTKDEESKRFKSHGIEF----DFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYT
          ++++  ++  E+ +D   KR+K   I +    D  ++ ++K S + ++   M              +R   E++S    K  + ML  V +FAY ++ 
Subjt:  YFNKIKGELDALERTKDEESKRFKSHGIEF----DFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYT

Query:  FAGGHDERADRLTREL
        FAGG D     +  EL
Subjt:  FAGGHDERADRLTREL

AT1G11070.1 BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1)3.7e-10938.05Show/hide
Query:  LSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQS
        +   H +  K+S    S  SS+   N++ M+ELR+KI+  R IIDLP +   +SI+ +V+ TM+DLHKL PEII   Q  EM+   +++ L  F  AL+S
Subjt:  LSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQS

Query:  IGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEE-YVNSKNYYSRASSFRKSTSSTESSSETNSSCCSSPETPTSVLAN
        IGDSW+ +HEW  KSKY  S+ ++N S + +VE VLA +D ++   NER +T E   V  K         + S  ST++S++ + S    P  P +VL  
Subjt:  IGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEE-YVNSKNYYSRASSFRKSTSSTESSSETNSSCCSSPETPTSVLAN

Query:  FRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEIGLSR
           +       + +   S L  ++R++A+ KL+PID+K L +   S                                              KE +    
Subjt:  FRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEIGLSR

Query:  DDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEEIEVTQQLSQPPP
             ++  DG       +E L + K ++++K           KE + + K+DS K+ +I++ + S +                 ++   V++  + PPP
Subjt:  DDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEEIEVTQQLSQPPP

Query:  PPP-LPQIGVQPIAT-----TSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSL
        PPP    I V+P  T      S APLPPPP                  PPPPPLP + V+   AA   PLPPP               A +PPPPP+   
Subjt:  PPP-LPQIGVQPIAT-----TSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSL

Query:  KGGAPPPPMPQGNGL-TPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLK---GAHGRKGGGGNS-AGGKQGMADALAEMTKRSAYF
               PM  G G+  PPPPP GA   L +KK  +KLKRS  LG L+R LK K+EG N  ++      G KG  G++ A GKQGMADALAE+TK+S YF
Subjt:  KGGAPPPPMPQGNGL-TPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLK---GAHGRKGGGGNS-AGGKQGMADALAEMTKRSAYF

Query:  QQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRV
        Q+IEED R +  SI  LK+ I+ FK+ +++EL  FH  +ES+LEKL DE+QVLAR EGFP KKLE +R A ALY KL+ ++ +L+NWKI SP  QL D+ 
Subjt:  QQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRV

Query:  ESYFNK------------------IKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVK----SENL
        E YF K                  I+ E++ L++ K EE K+FKS+ I FDF++L++IKE MVD+SS CMELALKEK E K A + +   ++    ++N 
Subjt:  ESYFNK------------------IKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVK----SENL

Query:  NKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIE
            +K LW+ F FAYRVYTFAGGHD+RAD+LTRELA EIE
Subjt:  NKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIE

AT1G61080.1 Hydroxyproline-rich glycoprotein family protein3.0e-11138.12Show/hide
Query:  MRSSYVLSKTHTKKP---------KLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINE---------LVVGTMEDLHKLYPEIISD
        +  SY+ S      P         K ++  ++  S +   N++ M+ELR+KI   RDIIDL  ++ S SI +         +V+ TM+DL K+ PEII  
Subjt:  MRSSYVLSKTHTKKP---------KLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINE---------LVVGTMEDLHKLYPEIISD

Query:  VQYSEMKPTCIEQSLHYFCTALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIE-----------EYVNSKNYYSR
            E++   +++ L +F  AL+SIGDSW+ N EW  KSKY  SS  +N S + +VE VLA +D ++ M+ ERFD +E                K   SR
Subjt:  VQYSEMKPTCIEQSLHYFCTALQSIGDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIE-----------EYVNSKNYYSR

Query:  ASSFRKSTSSTESSSETNSSCCSSPETPTSVLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEEST
          S  +S S ++SS  + +S C SP TP SVL             + A+ +S LLW++RV+A+EKL+PID+K L +  LS       P++S     +E  
Subjt:  ASSFRKSTSSTESSSETNSSCCSSPETPTSVLANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEEST

Query:  MDVDDQPTSKKIDASDVNEEMKVSVTKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVS
        + V ++   KK +   ++ +M+   T+E + L  +        D I    +K   L+ + E   K + +E+ E S        S+             +S
Subjt:  MDVDDQPTSKKIDASDVNEEMKVSVTKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLSKEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVS

Query:  DIKEGELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIATTSMAPLPP--------------PPPFKVIETVVHS-------MVNAGVPPPP
         IK   L +          I +I ++     PPPPPP   + ++  A     PLPP              PP FK ++                  PPPP
Subjt:  DIKEGELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIATTSMAPLPP--------------PPPFKVIETVVHS-------MVNAGVPPPP

Query:  PPLPQIGVQP--------TAAA----------SMAPLPPPPP---FKVVASVVHSMVNAG----VPPPPPMSSLKGGAPPPPMP----------------
        PP P+  V P        TAAA            AP PPPPP    +  +     M N+G     PPPPPM    G  PPPP P                
Subjt:  PPLPQIGVQP--------TAAA----------SMAPLPPPPP---FKVVASVVHSMVNAG----VPPPPPMSSLKGGAPPPPMP----------------

Query:  ---QGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLK--GAHGRKGGGGNS-AGGKQGMADALAEMTKRSAYFQQIEEDFRK
             NG   PPPP GA +SLR KKAATKLKRS QLG LYR LK KVEG + N K     GRK G G++ AGGKQGMADALAE+TK+SAYF QI+ D  K
Subjt:  ---QGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLK--GAHGRKGGGGNS-AGGKQGMADALAEMTKRSAYFQQIEEDFRK

Query:  HSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKG
        +  SI  LK  I+ F++ +M+ELLSFH+ VES+LE LTDESQVLAR EGFP KKLE +R AVALY KL  ++ +LQN KI  P+ QLLD+VE YF K   
Subjt:  HSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKG

Query:  ELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADR
                                      IKE+MVD+SSNCMELALKEK + K     ++  +K        +KMLW+ FQFA++VYTFAGGHD+RAD 
Subjt:  ELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKVFQFAYRVYTFAGGHDERADR

Query:  LTRELALEIESES
        LTRELA EI+++S
Subjt:  LTRELALEIESES

AT4G04980.1 unknown protein6.3e-11739.88Show/hide
Query:  SKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQSI
        SKT    P+ + +  S  S +C  N++ M+ELRK I   RD+IDLP+++ S+S+ E++  TM+DL KL PEI++  Q  EM+   +++ L +F   L++I
Subjt:  SKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQSI

Query:  GDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSKNYYSR----ASSFRKSTSSTESSSETNSSCCSSPETPTSVL
        GDSW+++ +W  +SKY  S   +N S + +VE VLA +D ++    ERF  ++     +  ++     +S  R+S + + S SE+N+S   SP TP SVL
Subjt:  GDSWMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSKNYYSR----ASSFRKSTSSTESSSETNSSCCSSPETPTSVL

Query:  ANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMD-------VDDQPTSKKIDASDVNEEMKVSV
                  T   +++ +SP LW+LR +A+++L+P+D+K   +  LS  R +   S++++ I EE+           ++      ++     +E+K   
Subjt:  ANFRNNQRKSTEKENASCSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMD-------VDDQPTSKKIDASDVNEEMKVSV

Query:  TKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLSKE-DSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIE--ETEVSDIKEGELDMSREDSQKAD-RIE
         +E  G   +   K D S+G E    + E  +   E D+     + E E   I EGL+ ++ +    D IE  ETE   I+  E +   E    ++    
Subjt:  TKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLSKE-DSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIE--ETEVSDIKEGELDMSREDSQKAD-RIE

Query:  EIEVTQQLSQP--PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGV
        E + T+   +   PPPPPL         T+   P P    F    ++         PPPPPP P+           AP PPPPP    AS       +G 
Subjt:  EIEVTQQLSQP--PPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGV

Query:  PPPPPMSSLKGGAPPPPMPQGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK-------QGMADA
              +    G   P MP      PP PPG   +SL  KKA +KL+RS Q+  LY  LK K+EG  + ++G   +   G NS   K        GMADA
Subjt:  PPPPPMSSLKGGAPPPPMPQGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLKKKVEGSNQNLKGAHGRKGGGGNSAGGK-------QGMADA

Query:  LAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKI
        LAEMTKRS+YFQQIEED +K++ SI  LKSSI +F++ +M ELL FH  VESILEKLTDE+QVLARFEGFP KKLE +RTA ALY KLD I+V+L+NWKI
Subjt:  LAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLETLRTAVALYLKLDAIVVQLQNWKI

Query:  VSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKV
          P+  LLD++E YFNK KGE++ +ERTKDE++K FK + I  DF VL+++KE+MVDVSSNCMELALKE+ E   A E ++N  +S+ + +  +K LW+ 
Subjt:  VSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSENLNKACSKMLWKV

Query:  FQFAYRVYTFAGGHDERADRLTRELALEIESESHQ
        FQFA++VYTFAGGHDERAD LTR+LA EI+++  Q
Subjt:  FQFAYRVYTFAGGHDERADRLTRELALEIESESHQ

AT5G07740.1 actin binding4.4e-0942.11Show/hide
Query:  SQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASV----VHSMVNAGVPPPPPM
        S PPPPPP P  G  P       P PPPPP              G PPPPPP       P  +    P PPPPPF  V+S+        ++ G PPPPP 
Subjt:  SQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHSMVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASV----VHSMVNAGVPPPPPM

Query:  SSLKGGAPPPPMP---QGNGLTPPPPP---GGA
          + GGAPPPP P    G    PPPPP   GGA
Subjt:  SSLKGGAPPPPMP---QGNGLTPPPPP---GGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGTTCTTATGTGCTCTCCAAGACACATACAAAGAAACCAAAGCTTTCCAAAGTTGTTAGAAGTAAGAAATCTTCGAGATGCAAAGATAATTATGTTCAAATGAT
GGAGTTGCGAAAGAAAATCTTGATTCTCAGAGATATCATTGACTTGCCTGCTATAGAAAGCTCTATATCTATAAATGAGCTAGTGGTGGGAACGATGGAAGATCTTCATA
AGTTGTATCCAGAAATCATTTCGGATGTCCAATATTCTGAAATGAAGCCGACATGTATCGAACAGAGTCTGCACTACTTCTGTACTGCATTGCAATCGATTGGAGATTCG
TGGATGTTAAATCATGAATGGAGGGACAAGTCGAAATATAATTTTTCATCATGTCAGGAAAACTCGAGTTTCAATGAAATTGTTGAAACGGTGTTGGCTGTCATAGATTG
CATCGTTAGTATGGCGAATGAAAGGTTTGATACGATAGAGGAATATGTTAATTCTAAGAACTATTATTCACGAGCAAGTTCCTTTCGTAAAAGCACGAGCTCAACGGAAT
CGAGCTCCGAAACCAATAGCTCTTGTTGCTCTTCTCCAGAAACTCCAACTTCCGTCCTTGCAAACTTCCGGAACAACCAAAGAAAATCAACTGAAAAGGAGAATGCCTCT
TGTAGTTCTCCCCTTTTGTGGTCTCTTAGAGTTAAAGCTGTAGAAAAGTTGAACCCCATTGATATCAAGCATCTTTTGCTACCTAGGTTGTCTCATATTCGAGTAAATGC
GTGCCCTTCCAAAAGCAGGGTGACGATCATCGAGGAATCAACGATGGATGTTGATGACCAGCCCACCTCTAAAAAGATCGATGCTTCAGATGTAAATGAAGAAATGAAAG
TGAGTGTTACCAAAGAAGAGATAGGTTTAAGCAGGGATGATAGCCAAAAAGCAGATAGAAGTGATGGAATTGAAATGAGTGATATCAAAGAAGAGAAGTTGGATTTGAGC
AAAGAAGATAGCCAAAAGAAAGATAGAAATGAAGAAGTTGAAGTGAGTGAAATCAAAGAAGGGTTGGATATGAGCAAGGAAGATAGTCAAAAGGCAGATAGAATTGAAGA
AACTGAAGTAAGTGATATCAAAGAAGGGGAGTTGGATATGAGCAGGGAAGATAGTCAAAAGGCCGATAGAATTGAAGAAATTGAAGTAACTCAACAACTTTCACAACCTC
CACCTCCACCACCATTGCCTCAAATTGGGGTACAACCTATTGCAACAACTTCTATGGCGCCTCTTCCTCCTCCACCACCATTCAAGGTAATAGAAACTGTGGTGCATTCT
ATGGTTAATGCAGGTGTACCCCCACCTCCACCACCATTGCCTCAAATTGGGGTACAACCTACTGCCGCAGCTTCTATGGCGCCTCTTCCTCCTCCACCACCATTCAAGGT
AGTAGCAAGTGTAGTGCATTCTATGGTTAATGCGGGTGTACCACCACCTCCACCGATGTCGTCCTTAAAAGGGGGTGCACCCCCTCCTCCAATGCCCCAAGGAAATGGCT
TAACTCCACCACCCCCTCCTGGTGGTGCATTTCAATCCTTGCGCTCCAAGAAAGCCGCTACTAAACTTAAGAGATCTAATCAATTGGGAACCCTTTACCGGACGCTAAAA
AAGAAGGTAGAGGGATCCAATCAAAATCTTAAGGGTGCTCATGGAAGAAAAGGCGGCGGCGGTAACAGTGCCGGAGGAAAACAAGGAATGGCTGATGCATTGGCGGAGAT
GACAAAAAGATCAGCATACTTTCAGCAAATTGAAGAAGACTTTAGAAAACACTCAAACTCAATCACTGCGCTTAAATCTTCCATTTCGACTTTCAAGTCGACAAACATGA
GTGAGTTGCTCTCTTTCCACAAGAGTGTGGAGTCTATACTGGAAAAACTAACTGATGAATCACAGGTACTAGCAAGGTTTGAAGGATTTCCCATAAAAAAATTGGAAACT
TTAAGAACTGCAGTAGCGTTATATTTAAAGTTAGATGCAATCGTCGTTCAACTACAAAATTGGAAGATTGTTTCTCCCATGACACAGCTCCTTGATCGAGTCGAAAGCTA
CTTCAATAAGATTAAAGGAGAACTTGATGCACTCGAACGAACCAAGGATGAAGAATCAAAGAGATTCAAGAGTCACGGTATTGAATTCGATTTCAGTGTCTTAATACGAA
TCAAGGAATCGATGGTGGACGTCTCTTCCAACTGCATGGAGTTAGCTCTGAAGGAAAAAAATGAGTTGAAGGCAGCAATCGAAAGGTCACAAAATGAAGTTAAGTCCGAA
AATTTGAACAAGGCATGTTCTAAGATGCTATGGAAAGTGTTTCAATTTGCATACCGAGTTTACACCTTTGCCGGTGGGCACGACGAGCGCGCCGACAGACTGACACGAGA
ATTGGCTCTAGAAATAGAGAGTGAATCTCACCAACTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGTTCTTATGTGCTCTCCAAGACACATACAAAGAAACCAAAGCTTTCCAAAGTTGTTAGAAGTAAGAAATCTTCGAGATGCAAAGATAATTATGTTCAAATGAT
GGAGTTGCGAAAGAAAATCTTGATTCTCAGAGATATCATTGACTTGCCTGCTATAGAAAGCTCTATATCTATAAATGAGCTAGTGGTGGGAACGATGGAAGATCTTCATA
AGTTGTATCCAGAAATCATTTCGGATGTCCAATATTCTGAAATGAAGCCGACATGTATCGAACAGAGTCTGCACTACTTCTGTACTGCATTGCAATCGATTGGAGATTCG
TGGATGTTAAATCATGAATGGAGGGACAAGTCGAAATATAATTTTTCATCATGTCAGGAAAACTCGAGTTTCAATGAAATTGTTGAAACGGTGTTGGCTGTCATAGATTG
CATCGTTAGTATGGCGAATGAAAGGTTTGATACGATAGAGGAATATGTTAATTCTAAGAACTATTATTCACGAGCAAGTTCCTTTCGTAAAAGCACGAGCTCAACGGAAT
CGAGCTCCGAAACCAATAGCTCTTGTTGCTCTTCTCCAGAAACTCCAACTTCCGTCCTTGCAAACTTCCGGAACAACCAAAGAAAATCAACTGAAAAGGAGAATGCCTCT
TGTAGTTCTCCCCTTTTGTGGTCTCTTAGAGTTAAAGCTGTAGAAAAGTTGAACCCCATTGATATCAAGCATCTTTTGCTACCTAGGTTGTCTCATATTCGAGTAAATGC
GTGCCCTTCCAAAAGCAGGGTGACGATCATCGAGGAATCAACGATGGATGTTGATGACCAGCCCACCTCTAAAAAGATCGATGCTTCAGATGTAAATGAAGAAATGAAAG
TGAGTGTTACCAAAGAAGAGATAGGTTTAAGCAGGGATGATAGCCAAAAAGCAGATAGAAGTGATGGAATTGAAATGAGTGATATCAAAGAAGAGAAGTTGGATTTGAGC
AAAGAAGATAGCCAAAAGAAAGATAGAAATGAAGAAGTTGAAGTGAGTGAAATCAAAGAAGGGTTGGATATGAGCAAGGAAGATAGTCAAAAGGCAGATAGAATTGAAGA
AACTGAAGTAAGTGATATCAAAGAAGGGGAGTTGGATATGAGCAGGGAAGATAGTCAAAAGGCCGATAGAATTGAAGAAATTGAAGTAACTCAACAACTTTCACAACCTC
CACCTCCACCACCATTGCCTCAAATTGGGGTACAACCTATTGCAACAACTTCTATGGCGCCTCTTCCTCCTCCACCACCATTCAAGGTAATAGAAACTGTGGTGCATTCT
ATGGTTAATGCAGGTGTACCCCCACCTCCACCACCATTGCCTCAAATTGGGGTACAACCTACTGCCGCAGCTTCTATGGCGCCTCTTCCTCCTCCACCACCATTCAAGGT
AGTAGCAAGTGTAGTGCATTCTATGGTTAATGCGGGTGTACCACCACCTCCACCGATGTCGTCCTTAAAAGGGGGTGCACCCCCTCCTCCAATGCCCCAAGGAAATGGCT
TAACTCCACCACCCCCTCCTGGTGGTGCATTTCAATCCTTGCGCTCCAAGAAAGCCGCTACTAAACTTAAGAGATCTAATCAATTGGGAACCCTTTACCGGACGCTAAAA
AAGAAGGTAGAGGGATCCAATCAAAATCTTAAGGGTGCTCATGGAAGAAAAGGCGGCGGCGGTAACAGTGCCGGAGGAAAACAAGGAATGGCTGATGCATTGGCGGAGAT
GACAAAAAGATCAGCATACTTTCAGCAAATTGAAGAAGACTTTAGAAAACACTCAAACTCAATCACTGCGCTTAAATCTTCCATTTCGACTTTCAAGTCGACAAACATGA
GTGAGTTGCTCTCTTTCCACAAGAGTGTGGAGTCTATACTGGAAAAACTAACTGATGAATCACAGGTACTAGCAAGGTTTGAAGGATTTCCCATAAAAAAATTGGAAACT
TTAAGAACTGCAGTAGCGTTATATTTAAAGTTAGATGCAATCGTCGTTCAACTACAAAATTGGAAGATTGTTTCTCCCATGACACAGCTCCTTGATCGAGTCGAAAGCTA
CTTCAATAAGATTAAAGGAGAACTTGATGCACTCGAACGAACCAAGGATGAAGAATCAAAGAGATTCAAGAGTCACGGTATTGAATTCGATTTCAGTGTCTTAATACGAA
TCAAGGAATCGATGGTGGACGTCTCTTCCAACTGCATGGAGTTAGCTCTGAAGGAAAAAAATGAGTTGAAGGCAGCAATCGAAAGGTCACAAAATGAAGTTAAGTCCGAA
AATTTGAACAAGGCATGTTCTAAGATGCTATGGAAAGTGTTTCAATTTGCATACCGAGTTTACACCTTTGCCGGTGGGCACGACGAGCGCGCCGACAGACTGACACGAGA
ATTGGCTCTAGAAATAGAGAGTGAATCTCACCAACTATGA
Protein sequenceShow/hide protein sequence
MRSSYVLSKTHTKKPKLSKVVRSKKSSRCKDNYVQMMELRKKILILRDIIDLPAIESSISINELVVGTMEDLHKLYPEIISDVQYSEMKPTCIEQSLHYFCTALQSIGDS
WMLNHEWRDKSKYNFSSCQENSSFNEIVETVLAVIDCIVSMANERFDTIEEYVNSKNYYSRASSFRKSTSSTESSSETNSSCCSSPETPTSVLANFRNNQRKSTEKENAS
CSSPLLWSLRVKAVEKLNPIDIKHLLLPRLSHIRVNACPSKSRVTIIEESTMDVDDQPTSKKIDASDVNEEMKVSVTKEEIGLSRDDSQKADRSDGIEMSDIKEEKLDLS
KEDSQKKDRNEEVEVSEIKEGLDMSKEDSQKADRIEETEVSDIKEGELDMSREDSQKADRIEEIEVTQQLSQPPPPPPLPQIGVQPIATTSMAPLPPPPPFKVIETVVHS
MVNAGVPPPPPPLPQIGVQPTAAASMAPLPPPPPFKVVASVVHSMVNAGVPPPPPMSSLKGGAPPPPMPQGNGLTPPPPPGGAFQSLRSKKAATKLKRSNQLGTLYRTLK
KKVEGSNQNLKGAHGRKGGGGNSAGGKQGMADALAEMTKRSAYFQQIEEDFRKHSNSITALKSSISTFKSTNMSELLSFHKSVESILEKLTDESQVLARFEGFPIKKLET
LRTAVALYLKLDAIVVQLQNWKIVSPMTQLLDRVESYFNKIKGELDALERTKDEESKRFKSHGIEFDFSVLIRIKESMVDVSSNCMELALKEKNELKAAIERSQNEVKSE
NLNKACSKMLWKVFQFAYRVYTFAGGHDERADRLTRELALEIESESHQL