; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0002426 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0002426
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionHydroxyproline-rich glycoprotein
Genome locationchr12:21441539..21445897
RNA-Seq ExpressionIVF0002426
SyntenyIVF0002426
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442261.1 PREDICTED: uncharacterized protein At4g04980 isoform X1 [Cucumis melo]0.098.77Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
        DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV           RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA

Query:  AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS
        AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS
Subjt:  AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS

Query:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
        QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
Subjt:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS

Query:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
        LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
Subjt:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL

Query:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
        LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
Subjt:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF

Query:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
Subjt:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

XP_008442269.1 PREDICTED: uncharacterized protein At4g04980 isoform X2 [Cucumis melo]0.097.55Show/hide
Query:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
        PS   +     LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
Subjt:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA

Query:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
        NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
Subjt:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS

Query:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEE
        HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV           RTASLKADRNEEIEVIDIEEE
Subjt:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEE

Query:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ
        KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ
Subjt:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ

Query:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
        PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
Subjt:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA

Query:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
        PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
Subjt:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK

Query:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
        KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
Subjt:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK

Query:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
        GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
Subjt:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD

Query:  RLTRELAIEIESESHHL
        RLTRELAIEIESESHHL
Subjt:  RLTRELAIEIESESHHL

XP_031736251.1 uncharacterized protein At4g04980 isoform X1 [Cucumis sativus]0.087.87Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLET--------------------MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLET                    MK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRRKAYGLET--------------------MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE

Query:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
        RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
Subjt:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CL
        NVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NE+IEV           RTASLKADRNEEIEVIDIEEEK CL
Subjt:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CL

Query:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPP
        ++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTVVSKPLPLL  MAPPPPPPPPP  AR L+PS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPP
Subjt:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPP

Query:  PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPP
        PPP                         +PQMKAQPAAA  N PPPPPQLLKVIET IKV+ PPP PPPSN TGTMVRAGVPPPPPM PSKG AGPAPPP
Subjt:  PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPP

Query:  R-MAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKH
          MAQGNGFAPPPPPPGGALRSLR KK STKLKRSH LGNLYRTLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKH
Subjt:  R-MAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKH

Query:  AKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGE
        AKSITALK SISSFQSSDM DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIV+QLQNWKFVSPMG LLDRVENYF+KIKGE
Subjt:  AKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGE

Query:  VDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRL
        VDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRL
Subjt:  VDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRL

Query:  TRELAIEIESESHHL
        TRELA+EIESESHHL
Subjt:  TRELAIEIESESHHL

XP_031736252.1 uncharacterized protein At4g04980 isoform X2 [Cucumis sativus]0.089.83Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFR+KAYGLETMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSS+SRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEES MD+D
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQA
        DKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NE+IEV           RTASLKADRNEEIEVIDIEEEK CL++ NSQ+DIAERT+DFDSQA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQA

Query:  AAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHL
         A  QELPTSDLPTVVSKPLPLL  MAPPPPPPPPP  AR L+PS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPP                 
Subjt:  AAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHL

Query:  SQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPR-MAQGNGFAPPPPPPGGAL
                +PQMKAQPAAA  N PPPPPQLLKVIET IKV+ PPP PPPSN TGTMVRAGVPPPPPM PSKG AGPAPPP  MAQGNGFAPPPPPPGGAL
Subjt:  SQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPR-MAQGNGFAPPPPPPGGAL

Query:  RSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMN
        RSLR KK STKLKRSH LGNLYRTLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK SISSFQSSDM 
Subjt:  RSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMN

Query:  DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGI
        DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIV+QLQNWKFVSPMG LLDRVENYF+KIKGEVDALERTKDEESKRFRGHGI
Subjt:  DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGI

Query:  QFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        QFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESESHHL
Subjt:  QFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

XP_031736253.1 uncharacterized protein At4g04980 isoform X3 [Cucumis sativus]0.084.48Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLET--------------------MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLET                    MK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRRKAYGLET--------------------MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE

Query:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
        RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
Subjt:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CL
        NVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NE+IEV           RTASLKADRNEEIEVIDIEEEK CL
Subjt:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CL

Query:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPP
        ++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTVVSKPLPLL  MAPPPPPPPPP  AR L+PS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPP
Subjt:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPP

Query:  PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPP
        PPP                         +PQMKAQPAAA  N PPPPPQLLKVIET IKV+ PPP PPPSN TGTMVRAGVPPPPPM PSKG AGPAPPP
Subjt:  PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPP

Query:  R-MAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKH
          MAQGNGFAPPPPPPGGALRSLR KK STKLKRSH LGNLYRTLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKH
Subjt:  R-MAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKH

Query:  AKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGE
        AKSITALK SISSFQSSDM DLLLFHKQVESVLENLTDESQ                                LQNWKFVSPMG LLDRVENYF+KIKGE
Subjt:  AKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGE

Query:  VDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRL
        VDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRL
Subjt:  VDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRL

Query:  TRELAIEIESESHHL
        TRELA+EIESESHHL
Subjt:  TRELAIEIESESHHL

TrEMBL top hitse value%identityAlignment
A0A0A0LSA3 Uncharacterized protein0.0e+0083.49Show/hide
Query:  MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY
        MK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY
Subjt:  MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY

Query:  FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPET
        FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPET
Subjt:  FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPET

Query:  PTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIK
        PTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIK
Subjt:  PTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIK

Query:  EEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLL
        EEKDLSKEASQKAD NE+IEV           RTASLKADRNEEIEVIDIEEEK CL++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTV        
Subjt:  EEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLL

Query:  SKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNA
                                                                                       PPPPPP+PQMKAQPAAA  N 
Subjt:  SKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNA

Query:  PPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYR
        PPPPPQLLKVIET IKV+  PPPPPPSN TGTMVRAGVPPPPPM PSKG AGPA PPP MAQGNGFAPPPPPPGGALRSLR KK STKLKRSH LGNLYR
Subjt:  PPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYR

Query:  TLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVL
        TLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK SISSFQSSDM DLLLFHKQVESVLENLTDESQVL
Subjt:  TLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVL

Query:  ARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCME
        ARFEGFPIKKLETLRIAAALYLKLDTIV+QLQNWKFVSPMG LLDRVENYF+KIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCME
Subjt:  ARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCME

Query:  LALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        LALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESESHHL
Subjt:  LALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

A0A1S3B5Y5 uncharacterized protein At4g04980 isoform X10.0e+0098.77Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
        DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV           RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA

Query:  AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS
        AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS
Subjt:  AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS

Query:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
        QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
Subjt:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS

Query:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
        LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
Subjt:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL

Query:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
        LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
Subjt:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF

Query:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
Subjt:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

A0A1S3B627 uncharacterized protein At4g04980 isoform X20.0e+0097.55Show/hide
Query:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
        PS   +     LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
Subjt:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA

Query:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
        NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
Subjt:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS

Query:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEE
        HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV           RTASLKADRNEEIEVIDIEEE
Subjt:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEE

Query:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ
        KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ
Subjt:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ

Query:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
        PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
Subjt:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA

Query:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
        PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
Subjt:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK

Query:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
        KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
Subjt:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK

Query:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
        GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
Subjt:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD

Query:  RLTRELAIEIESESHHL
        RLTRELAIEIESESHHL
Subjt:  RLTRELAIEIESESHHL

A0A5A7UTX7 Hydroxyproline-rich glycoprotein0.0e+0098.77Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
        DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV           RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEV-----------RTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA

Query:  AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS
        AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS
Subjt:  AIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLS

Query:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
        QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
Subjt:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS

Query:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
        LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
Subjt:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL

Query:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
        LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
Subjt:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF

Query:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
Subjt:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

A0A6J1CGJ6 uncharacterized protein At4g04980-like2.3e-30673.33Show/hide
Query:  MMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQ
        MMELRKKILILRDIIDLP LE +ASINELVVGTMEDLQKLYPEIIS+IQYSEMK TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSS QENSSF 
Subjt:  MMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQ

Query:  EIVESVLGIIDCIVSMANERFDAMDEFVNSKD-SSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRS-----SERKSSEKEKVSCSSPLLWS
        EIVESVLGIIDCIVSMANERFD MDE+V+SKD SS+SRTSSFGKSSSS DSCSETNSSCCSSPETPTSVLANFR+     SE+KSSEKE VSCSSPLLWS
Subjt:  EIVESVLGIIDCIVSMANERFDAMDEFVNSKD-SSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRS-----SERKSSEKEKVSCSSPLLWS

Query:  LRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEK-------DLSKEASQKADGNEKIEV-R
        LRVQAVEKLNPIDVKHLLLPRLSHCG NVCP   +   VEES MD+DDKL S+  DAA  N E+EV DIKEEK       ++S    +K D NE+IEV  
Subjt:  LRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEK-------DLSKEASQKADGNEKIEV-R

Query:  TASLKADRNEEIEVI-DIEEEKC-LSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPT
            K +  EEIEV  DI+EEK  LS A SQ++IAE ++D +SQAA   +ELP S LP V S+P PLL+ + PPPPPPPP  P    QP +V + L  PT
Subjt:  TASLKADRNEEIEVI-DIEEEKC-LSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPT

Query:  PPPPPPPPMMQQNAVLAQQLSQPPPPPP---------------PPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKV
         PPPPPPPMMQQN VLA  LS+PPPPPP               PPPP + Q              PP +PQ+   P AA   APPPPPQLL VIETV+KV
Subjt:  PPPPPPPPMMQQNAVLAQQLSQPPPPPP---------------PPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKV

Query:  NGPPPPPPPSNITGTMVRAGVPPPPPMVPSK-GSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSAN
          PPPPP    ++GT V+   PPPPP  PSK G+A  APPP M  GNG APPPPPPGGA+RS+R KK+STKLKRSHQLGNLYRTLKGKVEG NQNLKSA 
Subjt:  NGPPPPPPPSNITGTMVRAGVPPPPPMVPSK-GSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSAN

Query:  GRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIA
        GRKGGVG+S GGKQGMADALAEMTKRSAYFQQIEEDVK HAK+IT LK SIS+FQSSDMN+LL FHK VES+LE+LTDESQVLARFEGFP KKLE +R A
Subjt:  GRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIA

Query:  AALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTR
        AALYLKLD IVY LQNWK VSPMG LLDRVENYF+KIKGE+DALERTKD+ESK+FR HGIQFDF++LIRIKESMVDVSS CMELALKEKRELKAAA KT+
Subjt:  AALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTR

Query:  KGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
           +SE +NK  +KMLWRAFQFAYRVYTFAGGHDERADRLTRELA+EIESE H+L
Subjt:  KGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049807.1e-10338.31Show/hide
Query:  MEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------E
        M+DLQKL PEI++  Q  EM+   +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S   +VE VL  +D ++    ERF  MD        
Subjt:  MEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------E

Query:  FVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPA
        F   K  S     SF +S+    S SE+N+S   SP TP SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   +
Subjt:  FVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPA

Query:  PTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDIKEE-------KDLSKEASQKADGNE----KIEVRTASLKADRNEEIEVIDIEEEKCLSKAN
         T++ I EE+     + ++   E+ D +    E    +IK E        +   EA   ++G E    +IE      + D ++ IE  + E         
Subjt:  PTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDIKEE-------KDLSKEASQKADGNE----KIEVRTASLKADRNEEIEVIDIEEEKCLSKAN

Query:  SQE-DIAERTNDFDSQAAAI-----AQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQP
          E D  +   DF+++   I       E  T D     +                           S  +    +P    PPPPP+              
Subjt:  SQE-DIAERTNDFDSQAAAI-----AQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQP

Query:  PPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAP
            P  P P +    T     SQPPPPPP P+ KA        APPPPP + K  E                 +G   +           + G   P+ 
Subjt:  PPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAP

Query:  PPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQGMADALAEMTKRSAYFQQIE
        P         APP PP  G  RSL  KK ++KL+RS Q+ NLY  LKGK+EG     K+    KG   V   +     + GMADALAEMTKRS+YFQQIE
Subjt:  PPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQGMADALAEMTKRSAYFQQIE

Query:  EDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYF
        EDV+K+AKSI  LKSSI SFQ+ DM +LL FH +VES+LE LTDE+QVLARFEGFP KKLE +R A ALY KLD I+ +L+NWK   P+  LLD++E YF
Subjt:  EDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYF

Query:  SKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHD
        +K KGE++ +ERTKDE++K F+ + I  DF VL+++KE+MVDVSS CMELALKE+RE    A +  K G      + R+K LWRAFQFA++VYTFAGGHD
Subjt:  SKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHD

Query:  ERADRLTRELAIEIESE
        ERAD LTR+LA EI+++
Subjt:  ERADRLTRELAIEIESE

Q6K8Z4 Formin-like protein 77.8e-0937.8Show/hide
Query:  SDLPTVVSKPLPLLSKMAPPPPPPPPPPPARAL----------QP--SIVNLQL-PTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATL
        S   +VVS PL      APPPPPPPPPPP   +          QP  ++  LQ  P P PPPPPPPPM+           + PP PPPPPP    Q A  
Subjt:  SDLPTVVSKPLPLLSKMAPPPPPPPPPPPARAL----------QP--SIVNLQL-PTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATL

Query:  VQHLSQPPPPPPMPQMKAQPAAAESNAPPPPP----QLLKVIETVIKVNGPPPPPP----------PSNITGTMVRAGVPPPPP----------MVPSKG
        V     PPPPP +P M+ +P       PPPPP     L+    +  K   PPPPPP          PS+  G  V A  PPPPP            PS  
Subjt:  VQHLSQPPPPPPMPQMKAQPAAAESNAPPPPP----QLLKVIETVIKVNGPPPPPP----------PSNITGTMVRAGVPPPPP----------MVPSKG

Query:  SAGPAPPPRMAQGNGFA-------PPPPPPGGALRSLRPKKTSTKL
         A P PPP++   +  +       PPPPPPG       PK +S  L
Subjt:  SAGPAPPPRMAQGNGFA-------PPPPPPGGALRSLRPKKTSTKL

Q7G6K7 Formin-like protein 34.0e-0527.79Show/hide
Query:  LPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPA---RALQPSI---VNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHL
        LP   +P     P PL +    PPPPPPPPPP+   R + P     +  + P P PPPPPP    +     A     PPPPPPPP PP  + N   V   
Subjt:  LPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPA---RALQPSI---VNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHL

Query:  SQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPP-------PRMAQGNGFAPPPP
          PPPPPP       P+A    APP PP L            PP PPPP  +TG    A  PPPP      G+  P PP       P      G   P P
Subjt:  SQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPP-------PRMAQGNGFAPPPP

Query:  PP-----------------GGALRSLRPKKTS------TKLKRSHQLGNLYRTLK-------------GKVEGCNQNLKSAN-GRKGGVGNSNGGKQGMA
        PP                 G A +S  PKK S       K+ R+ Q G+L+   +              ++E       + N   KGG    +   +   
Subjt:  PP-----------------GGALRSLRPKKTS------TKLKRSHQLGNLYRTLK-------------GKVEGCNQNLKSAN-GRKGGVGNSNGGKQGMA

Query:  DALAEMTKRS---AYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLF--HKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVY
          L +M + +       +I+  +     +I AL +S+    +  + +L+ F   K+   +L+N     ++L + E F ++ ++  R+ + L +    I +
Subjt:  DALAEMTKRS---AYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLF--HKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVY

Query:  QLQ
          Q
Subjt:  QLQ

Q84ZL0 Formin-like protein 55.4e-1038.4Show/hide
Query:  FDSQAAAIAQELPTSDLPTVVSKPLPLLS---------KMAPPPPPPPPPPPARALQPSI----VNLQLPTPTPPPP--PPPPMMQQNAVLAQQLSQPPP
        +DS  AA+A     ++    V K  PL+S         + APPPPPPPPPP A +   S+       Q P P PPPP  PPPP    ++ L+   S PPP
Subjt:  FDSQAAAIAQELPTSDLPTVVSKPLPLLS---------KMAPPPPPPPPPPPARALQPSI----VNLQLPTPTPPPP--PPPPMMQQNAVLAQQLSQPPP

Query:  PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPP
        PPPPP      Q  T V     PPPPPP P  ++        APPPPP     + + +    PPPPPPP  +  +   A  PPPPP  P      P PPP
Subjt:  PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPP

Query:  RMAQGNGF--APPPPPPGGALRSLRPKKTSTKLKRSH
               F  APPPPPP   LRS+ P      +  S+
Subjt:  RMAQGNGF--APPPPPPGGALRSLRPKKTSTKLKRSH

Q9C6S1 Formin-like protein 142.2e-1141.92Show/hide
Query:  AQELPTSDLPTVVSK-PLPLL----SKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQ
        +Q  P   LP+  ++ PL  L    +K  PPPPPPPPP P+R++ P +   Q P P PPPPPPPP   ++       S   PPPPPPPPP     +T  +
Subjt:  AQELPTSDLPTVVSK-PLPLL----SKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQ

Query:  HLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPP-PPPMVPSKGSAG-PAPPPRMAQGNGFAPPPPP
          +QPPPPPP P     PAA  +  PPPPP         +     PPPPPP      +  A  PP PPP+ PS    G P PPP        APPPPP
Subjt:  HLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPP-PPPMVPSKGSAG-PAPPPRMAQGNGFAPPPPP

Arabidopsis top hitse value%identityAlignment
AT1G11070.1 BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1)4.6e-10537.12Show/hide
Query:  RRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKT
        RR A   E +KN+       + +  K+S  + S  SS+   NF+ M+ELR+KI+  R IIDLP L    SI+ +V+ TM+DL KL PEII   Q  EM+ 
Subjt:  RRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKT

Query:  TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETN
          +++ L  F  ALKSIGDSW+ +HEW  KSKY  S+ ++N S   +VE VL  +D ++   NER +  +   N       +      S  ST + ++ +
Subjt:  TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETN

Query:  SSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADAN
         S    P  P +VL    S   K      +S S+ L  ++R+QA+ KL+PIDVK L +   S                                      
Subjt:  SSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADAN

Query:  NEMEVCDIKEEKDLSKEA-SQKADGNEKIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSK
                       KEA S   DG+++      SLK  ++E       EE+   +    Q+ +  + +D D    ++  E               ++SK
Subjt:  NEMEVCDIKEEKDLSKEA-SQKADGNEKIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSK

Query:  MAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPP
           PPPPPPP   A  ++P +                       + A++ S  P PPPP    +              PPPPP+P    +  AA      
Subjt:  MAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPP

Query:  PPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLK
                          PP PPP    GT   A +PPPPP+              MA G G A PPPPP GA   L  KK ++KLKRS  LG L+R LK
Subjt:  PPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLK

Query:  GKVEGCNQNLKS---ANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQV
        GK+EG N  ++S     G KG  G++   GKQGMADALAE+TK+S YFQ+IEEDV+ +  SI  LK+ I+ F++ D+ +L  FH ++ESVLE L DE+QV
Subjt:  GKVEGCNQNLKS---ANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQV

Query:  LARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSK------------------IKGEVDALERTKDEESKRFRGHGIQFDF
        LAR EGFP KKLE +R+AAALY KL+ ++ +L+NWK  SP   L D+ E YF+K                  I+ E++ L++ K EE K+F+ + I FDF
Subjt:  LARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSK------------------IKGEVDALERTKDEESKRFRGHGIQFDF

Query:  SVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKAR----SKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIE
        ++L++IKE MVD+SSGCMELALKEKRE K A++ T +   ++ S K +    +K LWRAF FAYRVYTFAGGHD+RAD+LTRELA EIE
Subjt:  SVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKAR----SKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIE

AT1G31810.1 Formin Homology 141.6e-1241.92Show/hide
Query:  AQELPTSDLPTVVSK-PLPLL----SKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQ
        +Q  P   LP+  ++ PL  L    +K  PPPPPPPPP P+R++ P +   Q P P PPPPPPPP   ++       S   PPPPPPPPP     +T  +
Subjt:  AQELPTSDLPTVVSK-PLPLL----SKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQ

Query:  HLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPP-PPPMVPSKGSAG-PAPPPRMAQGNGFAPPPPP
          +QPPPPPP P     PAA  +  PPPPP         +     PPPPPP      +  A  PP PPP+ PS    G P PPP        APPPPP
Subjt:  HLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPP-PPPMVPSKGSAG-PAPPPRMAQGNGFAPPPPP

AT1G61080.1 Hydroxyproline-rich glycoprotein family protein2.4e-13041.97Show/hide
Query:  KGARSKKSS---RCKDNFVQMMELRKKILILRDIIDLPSLERSASINE---------LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKS
        K ARS K+S   +   NF+ M+ELR+KI   RDIIDL +L+ S SI +         +V+ TM+DLQK+ PEII      E++   +++ L +F  ALKS
Subjt:  KGARSKKSS---RCKDNFVQMMELRKKILILRDIIDLPSLERSASINE---------LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKS

Query:  IGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAM--DEFVNSKDS--------SFSRTSSFGKSSSSTDSCSETNSSCCSS
        IGDSW+ N EW  KSKY  SS  +N S   +VE VL  +D ++ M+ ERFD M  DE    K+S        S SR  S  +S S + S   + +S C S
Subjt:  IGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAM--DEFVNSKDS--------SFSRTSSFGKSSSSTDSCSETNSSCCSS

Query:  PETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVC
        P TP SVL             +  + +S LLW++RVQA+EKL+PIDVK L +  LS                             +     + +N+ +V 
Subjt:  PETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVC

Query:  DIKEEKDLSKEASQKADGNEKIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELP---------------TSDLPTVVS
         + E      E  QK D  E I+V+  + ++   +E  V+   E+  + K +S E  +E   +   +    +Q  P               T+ LP    
Subjt:  DIKEEKDLSKEASQKADGNEKIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELP---------------TSDLPTVVS

Query:  KPLPLLSKMA---PPPPPPPPPPPA------------RALQPSIVNLQ---LPTPTPP---------PPPPPPMMQQNAVLAQQLSQPPP-----PPPPP
         P P ++ +A   PPPPPPPPPPPA              L P+++ L+    P PTPP         PPPPPP      + A     PPP     PPPPP
Subjt:  KPLPLLSKMA---PPPPPPPPPPPA------------RALQPSIVNLQ---LPTPTPP---------PPPPPPMMQQNAVLAQQLSQPPP-----PPPPP

Query:  PPPMIQQNATLVQHLSQPPPPPPMPQMKA-----QPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAG-PAPP
        PPP            + PPPPPP P  +A      P   ++ AP PPP  +    +     GPPPPPPP  +         PPPPPM  + G+AG P PP
Subjt:  PPPMIQQNATLVQHLSQPPPPPPMPQMKA-----QPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAG-PAPP

Query:  PRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLK--SANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQIEEDV
        PRM   NG A PPPPPG A RSLRPKK +TKLKRS QLGNLYR LKGKVEG + N K  S +GRK G G++  GGKQGMADALAE+TK+SAYF QI+ D+
Subjt:  PRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLK--SANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQIEEDV

Query:  KKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKI
         K+  SI  LK  I+ FQ+ DM +LL FH++VESVLENLTDESQVLAR EGFP KKLE +R+A ALY KL  ++ +LQN K   P+  LLD+VE YF+K 
Subjt:  KKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKI

Query:  KGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERA
                                        IKE+MVD+SS CMELALKEKR+ K  +   +    S       +KMLWRAFQFA++VYTFAGGHD+RA
Subjt:  KGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERA

Query:  DRLTRELAIEIESES
        D LTRELA EI+++S
Subjt:  DRLTRELAIEIESES

AT4G04980.1 unknown protein5.2e-11738.72Show/hide
Query:  SKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSI
        SKT +  P+  K      S +C  NF+ M+ELRK I   RD+IDLPSL+ S S+ E++  TM+DLQKL PEI++  Q  EM+   +++ L +F   L++I
Subjt:  SKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSI

Query:  GDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETP
        GDSW+++ +W  +SKY  S   +N S   +VE VL  +D ++    ERF  MD        F   K  S     SF +S+    S SE+N+S   SP TP
Subjt:  GDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETP

Query:  TSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDI
         SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   + T++ I EE+     + ++   E+ D +    E    +I
Subjt:  TSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDI

Query:  KEE-------KDLSKEASQKADGNE----KIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQE-DIAERTNDFDSQAAAI-----AQELPTSDLPTVVS
        K E        +   EA   ++G E    +IE      + D ++ IE  + E           E D  +   DF+++   I       E  T D     +
Subjt:  KEE-------KDLSKEASQKADGNE----KIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQE-DIAERTNDFDSQAAAI-----AQELPTSDLPTVVS

Query:  KPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPA
                                   S  +    +P    PPPPP+                  P  P P +    T     SQPPPPPP P+ KA   
Subjt:  KPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPA

Query:  AAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQL
             APPPPP + K  E                 +G   +           + G   P+ P         APP PP  G  RSL  KK ++KL+RS Q+
Subjt:  AAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQL

Query:  GNLYRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVL
         NLY  LKGK+EG     K+    KG   V   +     + GMADALAEMTKRS+YFQQIEEDV+K+AKSI  LKSSI SFQ+ DM +LL FH +VES+L
Subjt:  GNLYRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVL

Query:  ENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKES
        E LTDE+QVLARFEGFP KKLE +R A ALY KLD I+ +L+NWK   P+  LLD++E YF+K KGE++ +ERTKDE++K F+ + I  DF VL+++KE+
Subjt:  ENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKES

Query:  MVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESE
        MVDVSS CMELALKE+RE    A +  K G      + R+K LWRAFQFA++VYTFAGGHDERAD LTR+LA EI+++
Subjt:  MVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESE

AT5G07740.1 actin binding6.8e-0833.33Show/hide
Query:  SQAAAIAQELPTSDLPTVVSKPLP--------------LLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPP----
        + A AI     TS  PT    P P              L +   P PPPPPPPPP  +++ +   L LP P PPPPPP   +++N+        PP    
Subjt:  SQAAAIAQELPTSDLPTVVSKPLP--------------LLSKMAPPPPPPPPPPPARALQPSIVNLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPP----

Query:  ------------------PPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAA-----ESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTM
                          PPPPPPPPP    N T       PPPP P   +   P+         ++ P PP         +K   PPPPPPP +   ++
Subjt:  ------------------PPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAA-----ESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTM

Query:  VR------AGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPG
        +          PPPPP  PS GS  P PPP  + G+   PPPPPPG
Subjt:  VR------AGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCGGAGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAGAAAGGCCTATGGACTCGAGACAATGAAGAACTCTTCTTACGTGTTCTCGAAGACATATTCTAA
GAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTCTTCAAGATGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCA
TTGATTTGCCTTCTTTAGAACGCTCTGCTTCTATAAATGAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAGCTTTATCCTGAAATCATTTCGGATATCCAATATTCC
GAAATGAAGACGACATGTATTGAACAGAGTCTTGCCTATTTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAGTGGAGGGACAAATCTAAATA
TAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCGTTAGTATGGCGAACGAAAGGTTTGATGCGATGG
ACGAATTTGTTAACTCAAAGGATTCTTCTTTTTCAAGAACTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCT
CCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAGCAGTGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCTCTTTTATGGTCTCTTAGAGTTCA
AGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTAGGTTGTCTCACTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGG
AATCAACGATGGATGTGGATGACAAGCTCACCTCTGAAAACACTGATGCTGCTGATGCAAATAATGAAATGGAAGTGTGTGATATTAAAGAAGAGAAGGATTTGAGCAAG
GAAGCTAGTCAAAAGGCAGATGGAAATGAGAAAATTGAAGTCAGGACAGCTAGTCTAAAGGCAGATAGAAATGAAGAAATTGAAGTGATTGATATCGAAGAAGAGAAGTG
TTTGAGCAAGGCAAATAGCCAAGAAGACATTGCTGAGAGAACCAATGATTTTGATTCCCAAGCTGCTGCAATTGCTCAAGAATTGCCTACATCAGATTTACCAACTGTTG
TATCAAAGCCATTACCGCTGCTATCCAAGATGGCTCCCCCTCCCCCTCCCCCTCCCCCCCCTCCACCTGCCCGAGCTCTGCAACCCTCCATAGTCAACCTGCAACTGCCA
ACGCCAACGCCACCACCTCCACCACCACCACCAATGATGCAACAAAATGCAGTATTAGCTCAACAACTTTCACAGCCACCTCCTCCACCACCACCACCACCACCACCAAT
GATACAACAAAATGCAACATTGGTCCAACATCTTTCACAGCCACCTCCTCCCCCACCAATGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGTCAAATGCGCCTCCTC
CACCTCCACAATTGTTAAAGGTAATCGAAACGGTGATCAAAGTCAATGGACCACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCG
CCACCTCCCCCTATGGTGCCCTCAAAAGGGAGTGCAGGTCCAGCACCCCCTCCTCGGATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCATT
ACGGTCCTTGCGCCCCAAGAAAACCTCTACCAAACTAAAAAGATCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATGCAATCAAAATCTTA
AGTCGGCTAACGGAAGGAAAGGTGGCGTCGGAAACAGTAACGGAGGAAAACAAGGAATGGCTGATGCATTGGCAGAGATGACAAAAAGATCAGCATACTTCCAGCAAATT
GAAGAAGATGTTAAAAAACACGCCAAATCGATCACCGCGCTTAAATCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACCTGCTCCTTTTCCACAAGCAAGTGGA
ATCTGTACTAGAGAATTTAACTGATGAATCACAGGTACTAGCAAGGTTTGAAGGATTTCCCATCAAAAAGTTGGAAACTTTGAGAATTGCAGCAGCATTATATCTAAAGT
TAGATACAATTGTCTATCAACTACAGAACTGGAAGTTTGTTTCTCCCATGGGACTGCTTCTCGACCGAGTCGAAAACTACTTCTCTAAGATCAAAGGAGAAGTCGATGCA
CTTGAACGAACCAAGGATGAAGAATCAAAGAGATTCCGAGGTCACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGGAATCAATGGTGGATGTTTCTTCTGG
CTGCATGGAGTTGGCTCTGAAGGAAAAAAGAGAGTTGAAGGCAGCAGCAGAAAAGACACGAAAAGGAGGCCGATCTGAAAATTCGAACAAGGCACGTTCAAAGATGCTAT
GGAGGGCATTCCAATTCGCATACCGAGTTTACACCTTCGCCGGTGGACACGACGAGCGTGCTGATAGACTGACCAGAGAGTTGGCTATAGAAATAGAGAGTGAATCCCAT
CACCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACCGGAGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAGAAAGGCCTATGGACTCGAGACAATGAAGAACTCTTCTTACGTGTTCTCGAAGACATATTCTAA
GAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTCTTCAAGATGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCA
TTGATTTGCCTTCTTTAGAACGCTCTGCTTCTATAAATGAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAGCTTTATCCTGAAATCATTTCGGATATCCAATATTCC
GAAATGAAGACGACATGTATTGAACAGAGTCTTGCCTATTTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAGTGGAGGGACAAATCTAAATA
TAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCGTTAGTATGGCGAACGAAAGGTTTGATGCGATGG
ACGAATTTGTTAACTCAAAGGATTCTTCTTTTTCAAGAACTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCT
CCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAGCAGTGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCTCTTTTATGGTCTCTTAGAGTTCA
AGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTAGGTTGTCTCACTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGG
AATCAACGATGGATGTGGATGACAAGCTCACCTCTGAAAACACTGATGCTGCTGATGCAAATAATGAAATGGAAGTGTGTGATATTAAAGAAGAGAAGGATTTGAGCAAG
GAAGCTAGTCAAAAGGCAGATGGAAATGAGAAAATTGAAGTCAGGACAGCTAGTCTAAAGGCAGATAGAAATGAAGAAATTGAAGTGATTGATATCGAAGAAGAGAAGTG
TTTGAGCAAGGCAAATAGCCAAGAAGACATTGCTGAGAGAACCAATGATTTTGATTCCCAAGCTGCTGCAATTGCTCAAGAATTGCCTACATCAGATTTACCAACTGTTG
TATCAAAGCCATTACCGCTGCTATCCAAGATGGCTCCCCCTCCCCCTCCCCCTCCCCCCCCTCCACCTGCCCGAGCTCTGCAACCCTCCATAGTCAACCTGCAACTGCCA
ACGCCAACGCCACCACCTCCACCACCACCACCAATGATGCAACAAAATGCAGTATTAGCTCAACAACTTTCACAGCCACCTCCTCCACCACCACCACCACCACCACCAAT
GATACAACAAAATGCAACATTGGTCCAACATCTTTCACAGCCACCTCCTCCCCCACCAATGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGTCAAATGCGCCTCCTC
CACCTCCACAATTGTTAAAGGTAATCGAAACGGTGATCAAAGTCAATGGACCACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCG
CCACCTCCCCCTATGGTGCCCTCAAAAGGGAGTGCAGGTCCAGCACCCCCTCCTCGGATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCATT
ACGGTCCTTGCGCCCCAAGAAAACCTCTACCAAACTAAAAAGATCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATGCAATCAAAATCTTA
AGTCGGCTAACGGAAGGAAAGGTGGCGTCGGAAACAGTAACGGAGGAAAACAAGGAATGGCTGATGCATTGGCAGAGATGACAAAAAGATCAGCATACTTCCAGCAAATT
GAAGAAGATGTTAAAAAACACGCCAAATCGATCACCGCGCTTAAATCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACCTGCTCCTTTTCCACAAGCAAGTGGA
ATCTGTACTAGAGAATTTAACTGATGAATCACAGGTACTAGCAAGGTTTGAAGGATTTCCCATCAAAAAGTTGGAAACTTTGAGAATTGCAGCAGCATTATATCTAAAGT
TAGATACAATTGTCTATCAACTACAGAACTGGAAGTTTGTTTCTCCCATGGGACTGCTTCTCGACCGAGTCGAAAACTACTTCTCTAAGATCAAAGGAGAAGTCGATGCA
CTTGAACGAACCAAGGATGAAGAATCAAAGAGATTCCGAGGTCACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGGAATCAATGGTGGATGTTTCTTCTGG
CTGCATGGAGTTGGCTCTGAAGGAAAAAAGAGAGTTGAAGGCAGCAGCAGAAAAGACACGAAAAGGAGGCCGATCTGAAAATTCGAACAAGGCACGTTCAAAGATGCTAT
GGAGGGCATTCCAATTCGCATACCGAGTTTACACCTTCGCCGGTGGACACGACGAGCGTGCTGATAGACTGACCAGAGAGTTGGCTATAGAAATAGAGAGTGAATCCCAT
CACCTATGATTCTCTCCTCTCTCTTTTAAAAAGTTGTTTTCCTCTCCTTTCTTCTTCTTTTTGGCTGCA
Protein sequenceShow/hide protein sequence
MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYS
EMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSS
PETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSK
EASQKADGNEKIEVRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPPPPPPARALQPSIVNLQLP
TPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVP
PPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQI
EEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDA
LERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESH
HL