; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0814 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0814
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHydroxyproline-rich glycoprotein
Genome locationMC03:14829301..14833793
RNA-Seq ExpressionMC03g0814
SyntenyMC03g0814
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442261.1 PREDICTED: uncharacterized protein At4g04980 isoform X1 [Cucumis melo]0.074.08Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL
        MATGGWCGLGPLLFR+KAYGLETMK+SS Y FSK+ +KKPKLSK ARSKKSSR KDNFVQMMELRKKILILRDIIDLP LE +ASINELVVGTMEDLQKL
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL

Query:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS
        YPEIIS+IQYSEMK TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSS QENSSF EIVESVLGIIDCIVSMANERFD MDE+V+SKDSS +SRTS
Subjt:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS

Query:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE
        SFGKSSSS DSCSETNSSCCSSPETPTSVLANFR+S+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCG NVCP   +   VE
Subjt:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE

Query:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS
        ES MD+DDKL S+  DAA  N E+EV DIKEEK  +KE         +K D NE+IEV D KEEKL          +  EEIEV D I+EEK  LS A S
Subjt:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS

Query:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFP----QPFVVVVALQLPTPPPPPPP-MMQQNPVLAPPLSRPPPPPPLP
        Q++IAE ++D +SQAA   +ELP S LP V S+P PLL+ + PPPPPPPP  P    QP +V + L  PTPPPPPPP MMQQN VLA  LS+PPPPPP  
Subjt:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFP----QPFVVVVALQLPTPPPPPPP-MMQQNPVLAPPLSRPPPPPPLP

Query:  QITVLSTAAVPMAPPPPALPQ-ITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTP
                     PPPP + Q  T++   + P    PP +PQ+   P AA   APPPPPQLL VIETV+KV  PPPPP    ++GT V+   PPPPP  P
Subjt:  QITVLSTAAVPMAPPPPALPQ-ITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTP

Query:  SKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAY
        SKG +A  APPP M  GNG APPPPPPGGA+RS+R KK+STKLKRSHQLGNLYRTLKGKVEG NQNLKSA GRKGGVG+S GGKQGMADALAEMTKRSAY
Subjt:  SKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAY

Query:  FQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDR
        FQQIEEDVK HAK+IT LK SIS+FQSSDMN+LL FHK VES+LE+LTDESQVLARFEGFP KKLE +R AAALYLKLD IVY LQNWK VSPMG LLDR
Subjt:  FQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDR

Query:  VENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF
        VENYF+KIKGE+DALERTKD+ESK+FR HGIQFDF++LIRIKESMVDVSS CMELALKEKRELKAAA KT+   +SE +NK  +KMLWRAFQFAYRVYTF
Subjt:  VENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF

Query:  AGGHDERADRLTRELALEIESEPHNL
        AGGHDERADRLTRELA+EIESE H+L
Subjt:  AGGHDERADRLTRELALEIESEPHNL

XP_022139793.1 uncharacterized protein At4g04980-like [Momordica charantia]0.0100Show/hide
Query:  MMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFN
        MMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFN
Subjt:  MMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFN

Query:  EIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWS
        EIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWS
Subjt:  EIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWS

Query:  LRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGD
        LRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGD
Subjt:  LRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGD

Query:  IKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPP
        IKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPP
Subjt:  IKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPP

Query:  PPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPP
        PPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPP
Subjt:  PPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPP

Query:  PPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS
        PPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS
Subjt:  PPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS

Query:  TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDA
        TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDA
Subjt:  TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDA

Query:  IVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTN
        IVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTN
Subjt:  IVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTN

Query:  KGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL
        KGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL
Subjt:  KGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL

XP_031736251.1 uncharacterized protein At4g04980 isoform X1 [Cucumis sativus]0.071.84Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKS-------------------SSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLE
        MATGGWCGLGPLLFRKKAYGLET  S                   SSSY FSK+ +KK KLSK ARSKKSS  KDNFVQMMELRKKILILRDIIDLP LE
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKS-------------------SSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLE

Query:  GAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERF
         +ASINELVVGTMEDLQKLYPEIIS+IQYSEMK TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSS QENSSF EIVESVLGIIDCIVSMANERF
Subjt:  GAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERF

Query:  DMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPR
        D MDE+V+SKDSS YSRTSSFGKSSSS DSCSETNSSCCSSPETPTSVLANFRNS+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPR
Subjt:  DMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPR

Query:  LSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEE
        LSHCG NVCP   +   VEES MD+DDKL S+  DAA  N E+EV DIKEEK  +KE         +K D NEEIEV D KEEKL          +  EE
Subjt:  LSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEE

Query:  IEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTP---PPPPPPMMQQN
        IEV D I+EEKL L+   SQK+IAE +DD +SQA  TV+ELP S LP V S+P PLL  + PPPPPPPP        +V LQLPTP   PPPPPPMMQQN
Subjt:  IEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTP---PPPPPPMMQQN

Query:  PVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM--VSG
         VLA  LS+PPPPPP+PQ+                                           P AA P  PPPPPQLL VIET +KV+VPPPPP    +G
Subjt:  PVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM--VSG

Query:  TTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQ
        T V+   PPPPP  PSKG A  + PPP M  GNG APPPPPPGGA+RS+R+KK+STKLKRSH LGNLYRTLKGKVEGSNQNLKSA GRKGGVG+S GGKQ
Subjt:  TTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQ

Query:  GMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNL
        GMADALAEMTKRSAYFQQIEEDVK HAK+IT LKPSIS+FQSSDM +LL FHK VES+LE+LTDESQVLARFEGFP KKLE +R AAALYLKLD IV+ L
Subjt:  GMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNL

Query:  QNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAK
        QNWK VSPMG LLDRVENYFTKIKGE+DALERTKD+ESK+FR HGIQFDF++LIRIKESMVDVSS CMELALKEKRELKAAA K++   +S+ +NK  +K
Subjt:  QNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAK

Query:  MLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL
        MLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESE H+L
Subjt:  MLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL

XP_031736252.1 uncharacterized protein At4g04980 isoform X2 [Cucumis sativus]0.073.43Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL
        MATGGWCGLGPLLFRKKAYGLETMKSSS Y FSK+ +KK KLSK ARSKKSS  KDNFVQMMELRKKILILRDIIDLP LE +ASINELVVGTMEDLQKL
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL

Query:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS
        YPEIIS+IQYSEMK TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSS QENSSF EIVESVLGIIDCIVSMANERFD MDE+V+SKDSS YSRTS
Subjt:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS

Query:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE
        SFGKSSSS DSCSETNSSCCSSPETPTSVLANFRNS+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCG NVCP   +   VE
Subjt:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE

Query:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS
        ES MD+DDKL S+  DAA  N E+EV DIKEEK  +KE         +K D NEEIEV D KEEKL          +  EEIEV D I+EEKL L+   S
Subjt:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS

Query:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTP---PPPPPPMMQQNPVLAPPLSRPPPPPPLPQI
        QK+IAE +DD +SQA  TV+ELP S LP V S+P PLL  + PPPPPPPP        +V LQLPTP   PPPPPPMMQQN VLA  LS+PPPPPP+PQ+
Subjt:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTP---PPPPPPMMQQNPVLAPPLSRPPPPPPLPQI

Query:  TVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM--VSGTTVKLVGPPPPPPTPSKGG
                                                   P AA P  PPPPPQLL VIET +KV+VPPPPP    +GT V+   PPPPP  PSKG 
Subjt:  TVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM--VSGTTVKLVGPPPPPPTPSKGG

Query:  AAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAYFQQI
        A  + PPP M  GNG APPPPPPGGA+RS+R+KK+STKLKRSH LGNLYRTLKGKVEGSNQNLKSA GRKGGVG+S GGKQGMADALAEMTKRSAYFQQI
Subjt:  AAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAYFQQI

Query:  EEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENY
        EEDVK HAK+IT LKPSIS+FQSSDM +LL FHK VES+LE+LTDESQVLARFEGFP KKLE +R AAALYLKLD IV+ LQNWK VSPMG LLDRVENY
Subjt:  EEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENY

Query:  FTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGH
        FTKIKGE+DALERTKD+ESK+FR HGIQFDF++LIRIKESMVDVSS CMELALKEKRELKAAA K++   +S+ +NK  +KMLWRAFQFAYRVYTFAGGH
Subjt:  FTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGH

Query:  DERADRLTRELALEIESEPHNL
        D+RADRLTRELA+EIESE H+L
Subjt:  DERADRLTRELALEIESEPHNL

XP_038895362.1 uncharacterized protein At4g04980-like isoform X2 [Benincasa hispida]0.068.79Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL
        MATGGWCGLGPLLFRKKAYGLETMKSSS Y FSKS TKKPK SK ARSKKSSR KDNFVQMMELRKKI+ILRDIIDLPPLE +ASINELVVGTMEDLQKL
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL

Query:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS
        YPEIIS IQYSEMK TCIEQ+LA+FCTALKSIG+SWMLNHEWRDKSKYNLSS QEN+SF EIVESVL IIDCIVSMANERFD+MDE+V+SKDSS YSRTS
Subjt:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS

Query:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE
        SFGKSSSS DSCSETNSSCCSSPETPTSVL NFRNS+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCG NVCP       VE
Subjt:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE

Query:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDD
        ES MD+DDKL S   DAA  NEE+EV D+KEEK       ++S    ++ D NEE+EV DIKE                 EKL LS A SQK+IAE +DD
Subjt:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDD

Query:  NNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPP------------------------------------------
          SQ A TV+E+PAS+LP+V S+P P L  V PPP  PP   PQP VV+  LQLPTPP                                          
Subjt:  NNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPP------------------------------------------

Query:  -------PPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLP--------------------
               PPPPPMMQ+N VLA  LS+PPPPPP+ Q   +   + P  PPPP + Q  VL+ +  P    PP + Q  VL                     
Subjt:  -------PPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLP--------------------

Query:  -----------------------TAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPP
                                AAAP APPPPPQLL VIETV+KV VPPPPP    V+GTTV+   PPPPP  PS GGAA  APPPPM  GNG APPP
Subjt:  -----------------------TAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPP

Query:  PPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSIST
        PPPGGA+RS+R KK+STKLKRSHQLGNLYRTLKGKVEGSNQNLKSA GRKG VGSSTGGKQGMADALAEMTKRSAYFQQIE+DVK HAK+ITELK SIS+
Subjt:  PPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSIST

Query:  FQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESK
        FQSSDMN++L FHK VES+LE+LTDESQVLARFEGFPTKKLE +R A ALYLKLDAIV+ LQNWK VSPMGQLLDRVE+YF+KIKGE+DALERTKD+ESK
Subjt:  FQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESK

Query:  KFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAG-KTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEP
        +FRSHGIQFDF++LIRIKESMVDVSSSCMELALKEKRELKAAA  KTQN  +SE +NK  +KMLWRAFQFAYRVYTFAGGHD+RADRLTRELALEIESE 
Subjt:  KFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAG-KTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEP

Query:  HNL
         +L
Subjt:  HNL

TrEMBL top hitse value%identityAlignment
A0A0A0LSA3 Uncharacterized protein0.068.98Show/hide
Query:  SSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFC
        SSSY FSK+ +KK KLSK ARSKKSS  KDNFVQMMELRKKILILRDIIDLP LE +ASINELVVGTMEDLQKLYPEIIS+IQYSEMK TCIEQSLAYFC
Subjt:  SSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFC

Query:  TALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETP
        TALKSIGDSWMLNHEWRDKSKYNLSS QENSSF EIVESVLGIIDCIVSMANERFD MDE+V+SKDSS YSRTSSFGKSSSS DSCSETNSSCCSSPETP
Subjt:  TALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETP

Query:  TSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEV
        TSVLANFRNS+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCG NVCP   +   VEES MD+DDKL S+  DAA  N E+EV
Subjt:  TSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEV

Query:  SDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASK
         DIKEEK  +KE         +K D NEEIEV D KEEKL          +  EEIEV D I+EEKL L+   SQK+IAE +DD +SQA  TV+ELP S 
Subjt:  SDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASK

Query:  LPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAP
        LP V            PPPPPP                                                                              
Subjt:  LPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAP

Query:  TASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM--VSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRS
               +PQ+   P AA P  PPPPPQLL VIET +KV+VPPPPP    +GT V+   PPPPP  PSKG A  + PPP M  GNG APPPPPPGGA+RS
Subjt:  TASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM--VSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRS

Query:  VRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNEL
        +R+KK+STKLKRSH LGNLYRTLKGKVEGSNQNLKSA GRKGGVG+S GGKQGMADALAEMTKRSAYFQQIEEDVK HAK+IT LKPSIS+FQSSDM +L
Subjt:  VRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNEL

Query:  LKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQF
        L FHK VES+LE+LTDESQVLARFEGFP KKLE +R AAALYLKLD IV+ LQNWK VSPMG LLDRVENYFTKIKGE+DALERTKD+ESK+FR HGIQF
Subjt:  LKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQF

Query:  DFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL
        DF++LIRIKESMVDVSS CMELALKEKRELKAAA K++   +S+ +NK  +KMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESE H+L
Subjt:  DFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL

A0A1S3B5Y5 uncharacterized protein At4g04980 isoform X10.074.08Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL
        MATGGWCGLGPLLFR+KAYGLETMK+SS Y FSK+ +KKPKLSK ARSKKSSR KDNFVQMMELRKKILILRDIIDLP LE +ASINELVVGTMEDLQKL
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL

Query:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS
        YPEIIS+IQYSEMK TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSS QENSSF EIVESVLGIIDCIVSMANERFD MDE+V+SKDSS +SRTS
Subjt:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS

Query:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE
        SFGKSSSS DSCSETNSSCCSSPETPTSVLANFR+S+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCG NVCP   +   VE
Subjt:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE

Query:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS
        ES MD+DDKL S+  DAA  N E+EV DIKEEK  +KE         +K D NE+IEV D KEEKL          +  EEIEV D I+EEK  LS A S
Subjt:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS

Query:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFP----QPFVVVVALQLPTPPPPPPP-MMQQNPVLAPPLSRPPPPPPLP
        Q++IAE ++D +SQAA   +ELP S LP V S+P PLL+ + PPPPPPPP  P    QP +V + L  PTPPPPPPP MMQQN VLA  LS+PPPPPP  
Subjt:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFP----QPFVVVVALQLPTPPPPPPP-MMQQNPVLAPPLSRPPPPPPLP

Query:  QITVLSTAAVPMAPPPPALPQ-ITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTP
                     PPPP + Q  T++   + P    PP +PQ+   P AA   APPPPPQLL VIETV+KV  PPPPP    ++GT V+   PPPPP  P
Subjt:  QITVLSTAAVPMAPPPPALPQ-ITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTP

Query:  SKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAY
        SKG +A  APPP M  GNG APPPPPPGGA+RS+R KK+STKLKRSHQLGNLYRTLKGKVEG NQNLKSA GRKGGVG+S GGKQGMADALAEMTKRSAY
Subjt:  SKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAY

Query:  FQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDR
        FQQIEEDVK HAK+IT LK SIS+FQSSDMN+LL FHK VES+LE+LTDESQVLARFEGFP KKLE +R AAALYLKLD IVY LQNWK VSPMG LLDR
Subjt:  FQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDR

Query:  VENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF
        VENYF+KIKGE+DALERTKD+ESK+FR HGIQFDF++LIRIKESMVDVSS CMELALKEKRELKAAA KT+   +SE +NK  +KMLWRAFQFAYRVYTF
Subjt:  VENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF

Query:  AGGHDERADRLTRELALEIESEPHNL
        AGGHDERADRLTRELA+EIESE H+L
Subjt:  AGGHDERADRLTRELALEIESEPHNL

A0A5A7UTX7 Hydroxyproline-rich glycoprotein0.074.08Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL
        MATGGWCGLGPLLFR+KAYGLETMK+SS Y FSK+ +KKPKLSK ARSKKSSR KDNFVQMMELRKKILILRDIIDLP LE +ASINELVVGTMEDLQKL
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL

Query:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS
        YPEIIS+IQYSEMK TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSS QENSSF EIVESVLGIIDCIVSMANERFD MDE+V+SKDSS +SRTS
Subjt:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS

Query:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE
        SFGKSSSS DSCSETNSSCCSSPETPTSVLANFR+S+RK     SSEKE VSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCG NVCP   +   VE
Subjt:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE

Query:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS
        ES MD+DDKL S+  DAA  N E+EV DIKEEK  +KE         +K D NE+IEV D KEEKL          +  EEIEV D I+EEK  LS A S
Subjt:  ESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKL----------ETKEEIEVSDDIKEEKLELSTAAS

Query:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFP----QPFVVVVALQLPTPPPPPPP-MMQQNPVLAPPLSRPPPPPPLP
        Q++IAE ++D +SQAA   +ELP S LP V S+P PLL+ + PPPPPPPP  P    QP +V + L  PTPPPPPPP MMQQN VLA  LS+PPPPPP  
Subjt:  QKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFP----QPFVVVVALQLPTPPPPPPP-MMQQNPVLAPPLSRPPPPPPLP

Query:  QITVLSTAAVPMAPPPPALPQ-ITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTP
                     PPPP + Q  T++   + P    PP +PQ+   P AA   APPPPPQLL VIETV+KV  PPPPP    ++GT V+   PPPPP  P
Subjt:  QITVLSTAAVPMAPPPPALPQ-ITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPM---VSGTTVKLVGPPPPPPTP

Query:  SKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAY
        SKG +A  APPP M  GNG APPPPPPGGA+RS+R KK+STKLKRSHQLGNLYRTLKGKVEG NQNLKSA GRKGGVG+S GGKQGMADALAEMTKRSAY
Subjt:  SKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSA-GRKGGVGSSTGGKQGMADALAEMTKRSAY

Query:  FQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDR
        FQQIEEDVK HAK+IT LK SIS+FQSSDMN+LL FHK VES+LE+LTDESQVLARFEGFP KKLE +R AAALYLKLD IVY LQNWK VSPMG LLDR
Subjt:  FQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDR

Query:  VENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF
        VENYF+KIKGE+DALERTKD+ESK+FR HGIQFDF++LIRIKESMVDVSS CMELALKEKRELKAAA KT+   +SE +NK  +KMLWRAFQFAYRVYTF
Subjt:  VENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF

Query:  AGGHDERADRLTRELALEIESEPHNL
        AGGHDERADRLTRELA+EIESE H+L
Subjt:  AGGHDERADRLTRELALEIESEPHNL

A0A6J1CGJ6 uncharacterized protein At4g04980-like0.0100Show/hide
Query:  MMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFN
        MMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFN
Subjt:  MMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFN

Query:  EIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWS
        EIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWS
Subjt:  EIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWS

Query:  LRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGD
        LRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGD
Subjt:  LRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGD

Query:  IKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPP
        IKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPP
Subjt:  IKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPP

Query:  PPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPP
        PPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPP
Subjt:  PPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPP

Query:  PPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS
        PPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS
Subjt:  PPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS

Query:  TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDA
        TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDA
Subjt:  TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDA

Query:  IVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTN
        IVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTN
Subjt:  IVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTN

Query:  KGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL
        KGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL
Subjt:  KGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHNL

A0A6J1K4W2 uncharacterized protein At4g04980-like isoform X20.061.7Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL
        MATGGWCGLGPLLFRKKAYGLETM+SSS YA SK  +KK KLSKV R KKSSR KDNFVQ+MELRKKIL+LRDIIDLP LE +ASINELVVGTMEDLQKL
Subjt:  MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKL

Query:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS
        YPEIIS+IQYSEMK TCIEQSL+YFCTALK IGDSWM+NHEW+DKSKYNL S QENSSFNEIVESVL IIDC+VSMANERFD+MD+YV++K+SS YSRTS
Subjt:  YPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTS

Query:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE
        SFGKS+SS++SCSETNSSCCSSPETPTSVL N RNS RK     S EKE +SCSSPLLWSLRVQAVEKLNPID+KHLLLP+L H G NVCP  N+   VE
Subjt:  SFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVE

Query:  ESRMDMDDKLLS-KEIDA-------------AARNEEIEVSDIKEE---------KLDTKEEIEVSDIKEEKL---------------------------
        ES   +DDKL+S K+ DA             A RNEE+EVS  KEE         + D+ E++EV   +EE +                           
Subjt:  ESRMDMDDKLLS-KEIDA-------------AARNEEIEVSDIKEE---------KLDTKEEIEVSDIKEEKL---------------------------

Query:  -----------------------------DTNEEIEVGDIKEE----------------------------------KLETKEEIEVSDDIKEEKLELST
                                     D NE+I+V   +EE                                  K +  EE+EVSD IKEEKL LS 
Subjt:  -----------------------------DTNEEIEVGDIKEE----------------------------------KLETKEEIEVSDDIKEEKLELST

Query:  AASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTP-----PPPPPPMMQQNPVLAPPLSRPPPPP
        AASQK + E ++D +SQAA TVEE+PA +LP      SP L  VLPPP     H P+P  VV  LQLPTP     PPPPPPMMQQN VL+  L    PPP
Subjt:  AASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTP-----PPPPPPMMQQNPVLAPPLSRPPPPP

Query:  PLPQITVLSTAAVPMAPPPPALPQ--ITVLSTAAAPT----------ASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVK
        PLPQI V+  AA  M PPPP  P   +  L T   P            + PP  P   +   AAA   PPPPP   N      + AVPPPPPM S     
Subjt:  PLPQITVLSTAAVPMAPPPPALPQ--ITVLSTAAAPT----------ASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVK

Query:  LVGPPPPPPTPSKGGAA-------------------------------ASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKG
         + PPPPP  PSKGGAA                               A APPPPMP G G   PPPPPGGA RS+RSKK++T+LKRSHQLGNLYRTLKG
Subjt:  LVGPPPPPPTPSKGGAA-------------------------------ASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKG

Query:  KVEGSNQNLK-SAGRKGGVGSSTGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFE
        KVEGSNQNL+ S+G+KG  GSS GGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAI +LKP ISTFQSSDMNEL+ FH+ VES+LE+LTDESQVLARFE
Subjt:  KVEGSNQNLK-SAGRKGGVGSSTGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFE

Query:  GFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALK
        GFP KKLE +RTAAALYLKLDAIV  LQNWKIVSP GQLLDR+ENYF+KIKG++DALERTKDDE+K+F+SHGIQFDFN+LIRIKESMVD+SSSCMELALK
Subjt:  GFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALK

Query:  EKRELKAAAG-KTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHN
        E RELKAAA  K +N AK +  N   +KMLW+AFQFAYRVYTFAGGHDERADRLTRELALEIESE  N
Subjt:  EKRELKAAAG-KTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESEPHN

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049809.6e-9537.24Show/hide
Query:  MEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDS
        M+DLQKL PEI++  Q  EM+   +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S + +VE VL  +D ++    ERF MMD     + S
Subjt:  MEDLQKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDS

Query:  SSYSR--TSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCP
         +  +  +S   +S +   S SE+N+S   SP TP SVL               +   + + +SP LW+LR QA+++L+P+D+K   +  LS        
Subjt:  SSYSR--TSSFGKSSSSLDSCSETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCP

Query:  VSNKATTVEESRMDMDDKLLSKEIDAAARNEE------IEVSDIKEE-KLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLE
            + +V E+++ ++++    EI A  + EE      +E    ++E K +   E   ++ + E  D +E  E    + E  ET+ EI+  D I++ + E
Subjt:  VSNKATTVEESRMDMDDKLLSKEIDAAARNEE------IEVSDIKEE-KLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLE

Query:  LSTAASQKNIAETSDDNNSQAAETVEELPAS-KLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPP
                        ++ +  ET  E+ A+  + D  +E                 H  + F      +  T         + +   + P    PPPPP
Subjt:  LSTAASQKNIAETSDDNNSQAAETVEELPAS-KLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPP

Query:  LPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSK
        L             +P  P+ P ++  +T ++  +  PP                PPP P+         K   PPPPP +S  +             + 
Subjt:  LPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSK

Query:  GGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSSTG------GKQGMADALAEMTKR
        G  A S P PP P G+G            RS+  KK+++KL+RS Q+ NLY  LKGK+EG     K+     G  S          + GMADALAEMTKR
Subjt:  GGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSSTG------GKQGMADALAEMTKR

Query:  SAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQL
        S+YFQQIEEDV+ +AK+I ELK SI +FQ+ DM ELL+FH  VESILE LTDE+QVLARFEGFP KKLE IRTA ALY KLD I+  L+NWKI  P+  L
Subjt:  SAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQL

Query:  LDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKREL-KAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYR
        LD++E YF K KGEI+ +ERTKD+++K F+ + I  DF +L+++KE+MVDVSS+CMELALKE+RE  + A    +++ K ER     AK LWRAFQFA++
Subjt:  LDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKREL-KAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYR

Query:  VYTFAGGHDERADRLTRELALEIESEP
        VYTFAGGHDERAD LTR+LA EI+++P
Subjt:  VYTFAGGHDERADRLTRELALEIESEP

Q84ZL0 Formin-like protein 52.1e-0438.97Show/hide
Query:  LPASKLPDVASEP--SPLLASVLPPPPPPPPHFPQPFVVVV-ALQLPTPPPPPPPMMQQNPVLAPP----LSRPPPPPPLPQITVLSTAAVPMAPPPPAL
        +PA   P     P   P   +  PPPPPPPP  P P      A     PPPPPPP+++  P   PP     S  PPPPPLP       AA   APPPP  
Subjt:  LPASKLPDVASEP--SPLLASVLPPPPPPPPHFPQPFVVVV-ALQLPTPPPPPPPMMQQNPVLAPP----LSRPPPPPPLPQITVLSTAAVPMAPPPPAL

Query:  PQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPP------PPTPSKGGAAASAPPPPMPLG
        P  T  +   AP    PP + +    P+   P +PPPPP          +   PPPPP           PPPP      PP P  GG A++ PPPP P  
Subjt:  PQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPP------PPTPSKGGAAASAPPPPMPLG

Query:  NGCAPPPPPPGGA
           APPPPPP GA
Subjt:  NGCAPPPPPPGGA

Arabidopsis top hitse value%identityAlignment
AT1G11070.1 BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1)1.0e-10737.43Show/hide
Query:  TMKSSSSYAFSKSITKKP--KLSKVARSK-KSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIE
        + K+S   A S    K    + +K++ S   SS+   NF+ M+ELR+KI+  R IIDLPPL    SI+ +V+ TM+DL KL PEII   Q  EM+   ++
Subjt:  TMKSSSSYAFSKSITKKP--KLSKVARSK-KSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMKATCIE

Query:  QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDE-YVHSKDSSSYSRTSSFGKSSSSLDSCSETNSS
        + L  F  ALKSIGDSW+ +HEW  KSKY  S+ ++N S + +VE VL  +D ++   NER +  +   V  K   S   T +    S SL         
Subjt:  QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDE-YVHSKDSSSYSRTSSFGKSSSSLDSCSETNSS

Query:  CCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAA
            P  P +VL+         S   S+   NV          R+QA+ KL+PIDVK L +   S                                   
Subjt:  CCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAA

Query:  ARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLP
                                   ++E L  N + +   +K +K ET+E+                  +++ I E  D   SQ    +++   SK+ 
Subjt:  ARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLP

Query:  DVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTA
           S  S +++    PPPPPPP                                                    AA+P+ PP         L+ +A   +
Subjt:  DVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTA

Query:  SAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPL--GNGCAPPPPPPGGAMRSVR
         AP                 PPPP           + A+PPPPP+       +  PP PPP     G AA  PPPP+P+  G G A PPPPP GA   + 
Subjt:  SAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPL--GNGCAPPPPPPGGAMRSVR

Query:  SKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSSTG-----GKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMN
        +KK ++KLKRS  LG L+R LKGK+EG N  ++S G  GG   +TG     GKQGMADALAE+TK+S YFQ+IEEDV+ +  +I ELK  I+ F++ D+ 
Subjt:  SKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSSTG-----GKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMN

Query:  ELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTK------------------IKGEID
        EL KFH  +ES+LE L DE+QVLAR EGFP KKLEAIR AAALY KL+ ++  L+NWKI SP  QL D+ E YF K                  I+ EI+
Subjt:  ELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTK------------------IKGEID

Query:  ALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQN--RAKSERTNK--GCAKMLWRAFQFAYRVYTFAGGHDERAD
         L++ K +E KKF+S+ I FDFNIL++IKE MVD+SS CMELALKEKRE K A+  T     AK    NK  G AK LWRAF FAYRVYTFAGGHD+RAD
Subjt:  ALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQN--RAKSERTNK--GCAKMLWRAFQFAYRVYTFAGGHDERAD

Query:  RLTRELALEIE
        +LTRELA EIE
Subjt:  RLTRELALEIE

AT1G61080.1 Hydroxyproline-rich glycoprotein family protein1.7e-12641.79Show/hide
Query:  PLLFRKKAYGLETMKSSSSYAFSKSITKKPKLS-KVARSKKSS---RRKDNFVQMMELRKKILILRDIIDLPPLEGAASINE---------LVVGTMEDL
        P+  R     L     +SS   S  I     +S K ARS K+S   +   NF+ M+ELR+KI   RDIIDL  L+G+ SI +         +V+ TM+DL
Subjt:  PLLFRKKAYGLETMKSSSSYAFSKSITKKPKLS-KVARSKKSS---RRKDNFVQMMELRKKILILRDIIDLPPLEGAASINE---------LVVGTMEDL

Query:  QKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMM--DEYVHSKDS--
        QK+ PEII      E++   +++ L +F  ALKSIGDSW+ N EW  KSKY  SS  +N S + +VE VL  +D ++ M+ ERFDMM  DE    K+S  
Subjt:  QKLYPEIISEIQYSEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMM--DEYVHSKDS--

Query:  -----SSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVL-ANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGG
             +S SR  S  +S S   S   + +S C SP TP SVL  +   S  ++ +  +S       +S LLW++RVQA+EKL+PIDVK L +  LS    
Subjt:  -----SSYSRTSSFGKSSSSLDSCSETNSSCCSSPETPTSVL-ANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGG

Query:  NVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELST
                                 KE      + + +V  + EE    K+EIE  D+   K++T E +   ++ EE +    E    D I +     ST
Subjt:  NVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIKEEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELST

Query:  AASQKNIAETSDDNNSQ---------AAETVEELPASKLPDVASEPSPLLASV---LPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPP-
        + S+ N +E   +N+SQ             +  +  + LP  +  P+P +A +   +PPPPPPPP  P P V+ +    P PPPP PP +      APP 
Subjt:  AASQKNIAETSDDNNSQ---------AAETVEELPASKLPDVASEPSPLLASV---LPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPP-

Query:  ------------LSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAP---TASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPP
                       PPPPPPLP     +T A P  PPPP  P+  V      P   TA+APP  P      T AAP  PPPPP         M+   P 
Subjt:  ------------LSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAP---TASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPP

Query:  PPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPP---------------PPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGS
        PPPM  G +     PPPPPP P   GA    PPPPM + NG A P               PPPP GA RS+R KK++TKLKRS QLGNLYR LKGKVEG 
Subjt:  PPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPP---------------PPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGS

Query:  NQNLKS---AGRKGGVGSS-TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGF
        + N K+   +GRK G GS+  GGKQGMADALAE+TK+SAYF QI+ D+  +  +I ELK  I+ FQ+ DM ELL FH+ VES+LE+LTDESQVLAR EGF
Subjt:  NQNLKS---AGRKGGVGSS-TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGF

Query:  PTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEK
        P KKLEA+R A ALY KL  ++  LQN KI  P+ QLLD+VE YFTK                                 IKE+MVD+SS+CMELALKEK
Subjt:  PTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEK

Query:  RELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESE
        R+ K     + +   S +   G AKMLWRAFQFA++VYTFAGGHD+RAD LTRELA EI+++
Subjt:  RELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELALEIESE

AT4G04980.1 unknown protein3.0e-10736.87Show/hide
Query:  SSSSYAFSKSITKKPKLSKVARSKKSSRRKD-----------NFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMK
        +S+ +  + + T  P   K +++K ++ RK+           NF+ M+ELRK I   RD+IDLP L+G+ S+ E++  TM+DLQKL PEI++  Q  EM+
Subjt:  SSSSYAFSKSITKKPKLSKVARSKKSSRRKD-----------NFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQYSEMK

Query:  ATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSR--TSSFGKSSSSLDSC
           +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S + +VE VL  +D ++    ERF MMD     + S +  +  +S   +S +   S 
Subjt:  ATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSR--TSSFGKSSSSLDSC

Query:  SETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLS
        SE+N+S   SP TP SVL               +   + + +SP LW+LR QA+++L+P+D+K   +  LS            + +V E+++ ++++   
Subjt:  SETNSSCCSSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLS

Query:  KEIDAAARNEE------IEVSDIKEE-KLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQA
         EI A  + EE      +E    ++E K +   E   ++ + E  D +E  E    + E  ET+ EI+  D I++ + E                ++ + 
Subjt:  KEIDAAARNEE------IEVSDIKEE-KLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQA

Query:  AETVEELPAS-KLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPAL
         ET  E+ A+  + D  +E                 H  + F      +  T         + +   + P    PPPPPL             +P  P+ 
Subjt:  AETVEELPAS-KLPDVASEPSPLLASVLPPPPPPPPHFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPAL

Query:  PQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPP
        P ++  +T ++  +  PP                PPP P+         K   PPPPP +S  +             + G  A S P PP P G+G    
Subjt:  PQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIETVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPP

Query:  PPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSSTG------GKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITEL
                RS+  KK+++KL+RS Q+ NLY  LKGK+EG     K+     G  S          + GMADALAEMTKRS+YFQQIEEDV+ +AK+I EL
Subjt:  PPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSSTG------GKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITEL

Query:  KPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERT
        K SI +FQ+ DM ELL+FH  VESILE LTDE+QVLARFEGFP KKLE IRTA ALY KLD I+  L+NWKI  P+  LLD++E YF K KGEI+ +ERT
Subjt:  KPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKIVSPMGQLLDRVENYFTKIKGEIDALERT

Query:  KDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKREL-KAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELAL
        KD+++K F+ + I  DF +L+++KE+MVDVSS+CMELALKE+RE  + A    +++ K ER     AK LWRAFQFA++VYTFAGGHDERAD LTR+LA 
Subjt:  KDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKREL-KAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTFAGGHDERADRLTRELAL

Query:  EIESEP
        EI+++P
Subjt:  EIESEP

AT5G07740.1 actin binding7.2e-0536.84Show/hide
Query:  PASKLPDVASEPSPLLASVLPPPPPPPPHF---PQPFVVVVALQLPTPPPPPPPMMQQNPVLAPP---LSRPPPPPPLPQITVLSTAAVPMAPPPPAL--
        P S    V S P P   S  PPPPPPPP +   P P     +   P PPPPPPP     P   PP      PPPPPP P   V   +++P  PPPP +  
Subjt:  PASKLPDVASEPSPLLASVLPPPPPPPPHF---PQPFVVVVALQLPTPPPPPPPMMQQNPVLAPP---LSRPPPPPPLPQITVLSTAAVPMAPPPPAL--

Query:  -----PQITVLSTAAAPTASAPP---ALPQITVLPTAAAPTAPPPPPQLLNVIE-----TVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGA-----
             P    +   A P    PP     P     P       PPPPP +    +      +   A PPPPP + G      G PPPPP P +GGA     
Subjt:  -----PQITVLSTAAAPTASAPP---ALPQITVLPTAAAPTAPPPPPQLLNVIE-----TVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGA-----

Query:  ----AASAPPPPMPLGNGCAPPPPPPGG
              + PPPP P+  G  PPPPPPGG
Subjt:  ----AASAPPPPMPLGNGCAPPPPPPGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCGGTGGCTGGTGTGGTTTAGGTCCTTTGCTGTTTCGTAAGAAGGCTTATGGGCTCGAGACAATGAAGAGCTCGTCTTCTTACGCTTTCTCCAAATCCATTAC
AAAGAAACCAAAGTTGTCAAAAGTCGCCAGAAGTAAGAAGTCTTCGAGACGTAAAGATAACTTTGTTCAAATGATGGAGCTGCGGAAGAAAATCTTGATTCTCAGAGACA
TCATTGATTTGCCTCCTTTAGAAGGGGCTGCATCTATTAATGAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAACTCTATCCTGAAATCATATCAGAAATCCAATAT
TCCGAAATGAAAGCAACATGCATAGAACAGAGTCTGGCCTACTTCTGCACTGCATTGAAGTCGATTGGAGATTCGTGGATGCTGAATCATGAATGGAGGGACAAATCGAA
ATATAATCTGTCATCATGTCAGGAAAACTCCAGCTTCAATGAAATTGTTGAATCTGTGTTGGGGATCATTGATTGCATCGTTAGTATGGCGAATGAAAGGTTCGATATGA
TGGACGAATACGTTCATTCAAAGGACTCCTCCTCTTATTCGCGAACTAGCTCCTTCGGGAAGAGCTCGAGCTCGTTGGATTCGTGCTCCGAAACCAATAGCTCTTGCTGC
TCTTCTCCTGAAACTCCCACCTCCGTCCTAGCAAACTTTCGCAACAGCCAAAGAAAATTATCAGAAAAGAAATCTTCAGAAAAGGAGAATGTCTCTTGTAGTTCCCCCCT
TTTGTGGTCTCTTAGAGTTCAAGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTCTTGCTCCCCAGGTTGTCTCATTGTGGAGGCAATGTCTGCCCTGTTTCGA
ACAAGGCGACGACCGTTGAGGAATCGAGGATGGATATGGATGACAAGCTCCTCTCTAAAGAGATTGATGCTGCTGCTAGAAATGAAGAAATTGAAGTGAGTGATATCAAA
GAGGAGAAGTTGGATACAAAAGAAGAAATTGAAGTGAGTGATATCAAAGAGGAGAAGTTGGATACAAATGAAGAAATTGAAGTGGGTGATATCAAAGAGGAGAAGTTGGA
GACAAAGGAAGAAATTGAAGTGAGTGATGATATCAAAGAAGAGAAGTTGGAATTGAGCACGGCAGCAAGCCAAAAGAACATTGCTGAGACAAGTGATGATAATAATTCCC
AAGCTGCTGAAACTGTTGAAGAGTTGCCTGCATCAAAATTACCTGATGTTGCATCAGAGCCATCACCACTGCTCGCCTCAGTGTTGCCTCCTCCACCGCCCCCTCCTCCA
CATTTTCCGCAACCCTTCGTAGTCGTAGTCGCCCTGCAACTACCAACACCACCTCCGCCACCACCACCAATGATGCAACAAAATCCAGTACTGGCTCCACCCCTTTCACG
GCCGCCTCCTCCGCCACCATTGCCTCAGATCACAGTGCTGTCAACTGCAGCAGTTCCTATGGCACCTCCTCCACCAGCATTGCCTCAGATCACAGTATTGTCAACTGCAG
CAGCGCCTACAGCGTCTGCTCCACCAGCATTGCCTCAGATCACAGTACTGCCAACTGCAGCAGCTCCTACAGCACCTCCTCCGCCTCCACAACTGTTGAATGTAATAGAA
ACGGTGATGAAAGTGGCTGTACCTCCACCACCACCGATGGTTTCAGGGACGACGGTAAAGCTGGTTGGACCACCACCACCTCCGCCCACGCCCTCAAAAGGAGGAGCTGC
AGCTTCAGCTCCACCTCCTCCAATGCCCCTAGGAAATGGCTGTGCTCCACCACCACCTCCTCCGGGCGGTGCAATGAGATCCGTGCGCTCCAAGAAATCCTCTACTAAAT
TGAAGAGATCTCATCAATTGGGAAATCTGTACCGAACACTCAAAGGGAAGGTGGAGGGATCCAATCAAAATCTTAAATCGGCTGGGCGAAAAGGCGGTGTAGGGAGCAGT
ACAGGCGGAAAACAAGGAATGGCAGATGCATTGGCGGAGATGACAAAAAGATCAGCATACTTTCAACAAATTGAAGAAGACGTTAAAAACCATGCAAAAGCAATCACAGA
GCTTAAACCTTCCATTTCAACCTTCCAGTCATCGGACATGAATGAGCTGCTGAAGTTCCACAAGCATGTGGAGTCCATACTCGAGCATCTAACCGATGAATCGCAGGTAC
TAGCAAGATTTGAAGGATTTCCCACAAAGAAGTTGGAAGCTATAAGGACTGCGGCTGCGTTATATTTAAAGTTAGATGCAATTGTGTATAACCTACAGAATTGGAAGATT
GTTTCTCCCATGGGACAGCTTCTTGACAGAGTTGAAAACTACTTCACTAAGATCAAAGGAGAAATCGACGCACTCGAGCGAACGAAGGATGACGAATCAAAGAAATTCCG
AAGTCACGGTATTCAATTCGATTTCAATATCTTAATACGCATCAAGGAATCAATGGTGGACGTCTCCTCCAGCTGCATGGAGTTGGCTCTAAAGGAAAAGAGAGAGTTGA
AAGCAGCAGCAGGGAAGACACAAAACAGAGCCAAATCCGAAAGGACGAACAAGGGATGTGCCAAGATGCTATGGAGGGCGTTTCAGTTCGCGTACCGAGTCTACACATTC
GCCGGTGGACACGACGAGCGTGCCGATAGGCTGACCAGAGAATTGGCTCTAGAAATAGAGAGTGAGCCTCACAACCTA
mRNA sequenceShow/hide mRNA sequence
AAAAAGAAAAGAAAAAAAAAAACAACAACATTAGGATATTTGTTTCTCTTTCGTTTTTCCTTTCTGTACATAAACTTCTGTGGGTGGTTTTCCTTTTCCCTTTGAGAAAA
CTTAAGCACCGCTGAGCAGAGGAATTCAAAGAAAAAACAAAAAATATATACATATACATATATATTCATTCTTTTATGTAAAAAGTTGAAATCCCCTCCACCCCAATATT
TCATCTACGTTTTTTTTCATTGTTTTTGTGGCATTCAAGTCTTTTGAAGGCTTGTTTTGGCCACTTTTTATGGCTACCGGTGGCTGGTGTGGTTTAGGTCCTTTGCTGTT
TCGTAAGAAGGCTTATGGGCTCGAGACAATGAAGAGCTCGTCTTCTTACGCTTTCTCCAAATCCATTACAAAGAAACCAAAGTTGTCAAAAGTCGCCAGAAGTAAGAAGT
CTTCGAGACGTAAAGATAACTTTGTTCAAATGATGGAGCTGCGGAAGAAAATCTTGATTCTCAGAGACATCATTGATTTGCCTCCTTTAGAAGGGGCTGCATCTATTAAT
GAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAACTCTATCCTGAAATCATATCAGAAATCCAATATTCCGAAATGAAAGCAACATGCATAGAACAGAGTCTGGCCTA
CTTCTGCACTGCATTGAAGTCGATTGGAGATTCGTGGATGCTGAATCATGAATGGAGGGACAAATCGAAATATAATCTGTCATCATGTCAGGAAAACTCCAGCTTCAATG
AAATTGTTGAATCTGTGTTGGGGATCATTGATTGCATCGTTAGTATGGCGAATGAAAGGTTCGATATGATGGACGAATACGTTCATTCAAAGGACTCCTCCTCTTATTCG
CGAACTAGCTCCTTCGGGAAGAGCTCGAGCTCGTTGGATTCGTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCTCCTGAAACTCCCACCTCCGTCCTAGCAAACTTTCG
CAACAGCCAAAGAAAATTATCAGAAAAGAAATCTTCAGAAAAGGAGAATGTCTCTTGTAGTTCCCCCCTTTTGTGGTCTCTTAGAGTTCAAGCAGTGGAAAAGTTGAACC
CCATTGATGTCAAGCACCTCTTGCTCCCCAGGTTGTCTCATTGTGGAGGCAATGTCTGCCCTGTTTCGAACAAGGCGACGACCGTTGAGGAATCGAGGATGGATATGGAT
GACAAGCTCCTCTCTAAAGAGATTGATGCTGCTGCTAGAAATGAAGAAATTGAAGTGAGTGATATCAAAGAGGAGAAGTTGGATACAAAAGAAGAAATTGAAGTGAGTGA
TATCAAAGAGGAGAAGTTGGATACAAATGAAGAAATTGAAGTGGGTGATATCAAAGAGGAGAAGTTGGAGACAAAGGAAGAAATTGAAGTGAGTGATGATATCAAAGAAG
AGAAGTTGGAATTGAGCACGGCAGCAAGCCAAAAGAACATTGCTGAGACAAGTGATGATAATAATTCCCAAGCTGCTGAAACTGTTGAAGAGTTGCCTGCATCAAAATTA
CCTGATGTTGCATCAGAGCCATCACCACTGCTCGCCTCAGTGTTGCCTCCTCCACCGCCCCCTCCTCCACATTTTCCGCAACCCTTCGTAGTCGTAGTCGCCCTGCAACT
ACCAACACCACCTCCGCCACCACCACCAATGATGCAACAAAATCCAGTACTGGCTCCACCCCTTTCACGGCCGCCTCCTCCGCCACCATTGCCTCAGATCACAGTGCTGT
CAACTGCAGCAGTTCCTATGGCACCTCCTCCACCAGCATTGCCTCAGATCACAGTATTGTCAACTGCAGCAGCGCCTACAGCGTCTGCTCCACCAGCATTGCCTCAGATC
ACAGTACTGCCAACTGCAGCAGCTCCTACAGCACCTCCTCCGCCTCCACAACTGTTGAATGTAATAGAAACGGTGATGAAAGTGGCTGTACCTCCACCACCACCGATGGT
TTCAGGGACGACGGTAAAGCTGGTTGGACCACCACCACCTCCGCCCACGCCCTCAAAAGGAGGAGCTGCAGCTTCAGCTCCACCTCCTCCAATGCCCCTAGGAAATGGCT
GTGCTCCACCACCACCTCCTCCGGGCGGTGCAATGAGATCCGTGCGCTCCAAGAAATCCTCTACTAAATTGAAGAGATCTCATCAATTGGGAAATCTGTACCGAACACTC
AAAGGGAAGGTGGAGGGATCCAATCAAAATCTTAAATCGGCTGGGCGAAAAGGCGGTGTAGGGAGCAGTACAGGCGGAAAACAAGGAATGGCAGATGCATTGGCGGAGAT
GACAAAAAGATCAGCATACTTTCAACAAATTGAAGAAGACGTTAAAAACCATGCAAAAGCAATCACAGAGCTTAAACCTTCCATTTCAACCTTCCAGTCATCGGACATGA
ATGAGCTGCTGAAGTTCCACAAGCATGTGGAGTCCATACTCGAGCATCTAACCGATGAATCGCAGGTACTAGCAAGATTTGAAGGATTTCCCACAAAGAAGTTGGAAGCT
ATAAGGACTGCGGCTGCGTTATATTTAAAGTTAGATGCAATTGTGTATAACCTACAGAATTGGAAGATTGTTTCTCCCATGGGACAGCTTCTTGACAGAGTTGAAAACTA
CTTCACTAAGATCAAAGGAGAAATCGACGCACTCGAGCGAACGAAGGATGACGAATCAAAGAAATTCCGAAGTCACGGTATTCAATTCGATTTCAATATCTTAATACGCA
TCAAGGAATCAATGGTGGACGTCTCCTCCAGCTGCATGGAGTTGGCTCTAAAGGAAAAGAGAGAGTTGAAAGCAGCAGCAGGGAAGACACAAAACAGAGCCAAATCCGAA
AGGACGAACAAGGGATGTGCCAAGATGCTATGGAGGGCGTTTCAGTTCGCGTACCGAGTCTACACATTCGCCGGTGGACACGACGAGCGTGCCGATAGGCTGACCAGAGA
ATTGGCTCTAGAAATAGAGAGTGAGCCTCACAACCTA
Protein sequenceShow/hide protein sequence
MATGGWCGLGPLLFRKKAYGLETMKSSSSYAFSKSITKKPKLSKVARSKKSSRRKDNFVQMMELRKKILILRDIIDLPPLEGAASINELVVGTMEDLQKLYPEIISEIQY
SEMKATCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSCQENSSFNEIVESVLGIIDCIVSMANERFDMMDEYVHSKDSSSYSRTSSFGKSSSSLDSCSETNSSCC
SSPETPTSVLANFRNSQRKLSEKKSSEKENVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGGNVCPVSNKATTVEESRMDMDDKLLSKEIDAAARNEEIEVSDIK
EEKLDTKEEIEVSDIKEEKLDTNEEIEVGDIKEEKLETKEEIEVSDDIKEEKLELSTAASQKNIAETSDDNNSQAAETVEELPASKLPDVASEPSPLLASVLPPPPPPPP
HFPQPFVVVVALQLPTPPPPPPPMMQQNPVLAPPLSRPPPPPPLPQITVLSTAAVPMAPPPPALPQITVLSTAAAPTASAPPALPQITVLPTAAAPTAPPPPPQLLNVIE
TVMKVAVPPPPPMVSGTTVKLVGPPPPPPTPSKGGAAASAPPPPMPLGNGCAPPPPPPGGAMRSVRSKKSSTKLKRSHQLGNLYRTLKGKVEGSNQNLKSAGRKGGVGSS
TGGKQGMADALAEMTKRSAYFQQIEEDVKNHAKAITELKPSISTFQSSDMNELLKFHKHVESILEHLTDESQVLARFEGFPTKKLEAIRTAAALYLKLDAIVYNLQNWKI
VSPMGQLLDRVENYFTKIKGEIDALERTKDDESKKFRSHGIQFDFNILIRIKESMVDVSSSCMELALKEKRELKAAAGKTQNRAKSERTNKGCAKMLWRAFQFAYRVYTF
AGGHDERADRLTRELALEIESEPHNL