; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000166 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000166
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionHydroxyproline-rich glycoprotein
Genome locationchr12:23051367..23055662
RNA-Seq ExpressionPay0000166
SyntenyPay0000166
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442261.1 PREDICTED: uncharacterized protein At4g04980 isoform X1 [Cucumis melo]0.0e+0092.95Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
        DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNE+IEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA

Query:  AIAQELPTSDLPTV--------------------------------------------------------------PPPPPPPPPPPMIQQNATLVQHLS
        AIAQELPTSDLPTV                                                              PPPPPPPPPPPMIQQNATLVQHLS
Subjt:  AIAQELPTSDLPTV--------------------------------------------------------------PPPPPPPPPPPMIQQNATLVQHLS

Query:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
        QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
Subjt:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS

Query:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
        LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
Subjt:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL

Query:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
        LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
Subjt:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF

Query:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
Subjt:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

XP_008442269.1 PREDICTED: uncharacterized protein At4g04980 isoform X2 [Cucumis melo]0.0e+0091.19Show/hide
Query:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
        PS   +     LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
Subjt:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA

Query:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
        NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
Subjt:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS

Query:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEE
        HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNE+IEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEE
Subjt:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEE

Query:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV--------------------------------------------------------------
        KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV                                                              
Subjt:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV--------------------------------------------------------------

Query:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
        PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
Subjt:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA

Query:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
        PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
Subjt:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK

Query:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
        KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
Subjt:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK

Query:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
        GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
Subjt:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD

Query:  RLTRELAIEIESESHHL
        RLTRELAIEIESESHHL
Subjt:  RLTRELAIEIESESHHL

XP_031736251.1 uncharacterized protein At4g04980 isoform X1 [Cucumis sativus]0.0e+0087.95Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE

Query:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
        RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
Subjt:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CL
        NVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NEEIEV D KEEKLNLSRTASLKADRNEEIEVIDIEEEK CL
Subjt:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CL

Query:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV------------------PPP-----------------PPPPPPPPMIQQNATLVQHLSQPPPPP
        ++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTV                  PPP                 PPPPPPPPM+QQNA L Q LSQPPPPP
Subjt:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV------------------PPP-----------------PPPPPPPPMIQQNATLVQHLSQPPPPP

Query:  PMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKK
        P+PQMKAQPAAA  N PPPPPQLLKVIET IKV+  PPPPPPSN TGTMVRAGVPPPPPM PSKG AGPA PPP MAQGNGFAPPPPPPGGALRSLR KK
Subjt:  PMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKK

Query:  TSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHK
         STKLKRSH LGNLYRTLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK SISSFQSSDM DLLLFHK
Subjt:  TSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHK

Query:  QVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVL
        QVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIV+QLQNWKFVSPMG LLDRVENYF+KIKGEVDALERTKDEESKRFRGHGIQFDFSVL
Subjt:  QVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVL

Query:  IRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        IRIKESMVDVSSGCMELALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESESHHL
Subjt:  IRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

XP_031736252.1 uncharacterized protein At4g04980 isoform X2 [Cucumis sativus]0.0e+0089.98Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFR+KAYGLETMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSS+SRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEES MD+D
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQA
        DKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NEEIEV D KEEKLNLSRTASLKADRNEEIEVIDIEEEK CL++ NSQ+DIAERT+DFDSQA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQA

Query:  AAIAQELPTSDLPTV------------------PPP-----------------PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPP
         A  QELPTSDLPTV                  PPP                 PPPPPPPPM+QQNA L Q LSQPPPPPP+PQMKAQPAAA  N PPPP
Subjt:  AAIAQELPTSDLPTV------------------PPP-----------------PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPP

Query:  PQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKG
        PQLLKVIET IKV+  PPPPPPSN TGTMVRAGVPPPPPM PSKG AGPA PPP MAQGNGFAPPPPPPGGALRSLR KK STKLKRSH LGNLYRTLKG
Subjt:  PQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKG

Query:  KVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFE
        KVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK SISSFQSSDM DLLLFHKQVESVLENLTDESQVLARFE
Subjt:  KVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFE

Query:  GFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALK
        GFPIKKLETLRIAAALYLKLDTIV+QLQNWKFVSPMG LLDRVENYF+KIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALK
Subjt:  GFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALK

Query:  EKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        EKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESESHHL
Subjt:  EKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

XP_031736253.1 uncharacterized protein At4g04980 isoform X3 [Cucumis sativus]0.0e+0084.46Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE

Query:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
        RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF
Subjt:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CL
        NVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NEEIEV D KEEKLNLSRTASLKADRNEEIEVIDIEEEK CL
Subjt:  NVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CL

Query:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV------------------PPP-----------------PPPPPPPPMIQQNATLVQHLSQPPPPP
        ++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTV                  PPP                 PPPPPPPPM+QQNA L Q LSQPPPPP
Subjt:  SKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV------------------PPP-----------------PPPPPPPPMIQQNATLVQHLSQPPPPP

Query:  PMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKK
        P+PQMKAQPAAA  N PPPPPQLLKVIET IKV+  PPPPPPSN TGTMVRAGVPPPPPM PSKG AGPA PPP MAQGNGFAPPPPPPGGALRSLR KK
Subjt:  PMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQGNGFAPPPPPPGGALRSLRPKK

Query:  TSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHK
         STKLKRSH LGNLYRTLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK SISSFQSSDM DLLLFHK
Subjt:  TSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHK

Query:  QVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVL
        QVESVLENLTDES                                QLQNWKFVSPMG LLDRVENYF+KIKGEVDALERTKDEESKRFRGHGIQFDFSVL
Subjt:  QVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVL

Query:  IRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        IRIKESMVDVSSGCMELALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESESHHL
Subjt:  IRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

TrEMBL top hitse value%identityAlignment
A0A0A0LSA3 Uncharacterized protein0.0e+0091.11Show/hide
Query:  MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY
        MK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY
Subjt:  MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY

Query:  FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPET
        FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPET
Subjt:  FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPET

Query:  PTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIK
        PTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIK
Subjt:  PTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIK

Query:  EEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVPPPPPPPP
        EEKDLSKEASQKAD NEEIEV D KEEKLNLSRTASLKADRNEEIEVIDIEEEK CL++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTV        
Subjt:  EEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVPPPPPPPP

Query:  PPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQ
                         PPPPPP+PQMKAQPAAA  N PPPPPQLLKVIET IKV+  PPPPPPSN TGTMVRAGVPPPPPM PSKG AGPA PPP MAQ
Subjt:  PPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA-PPPRMAQ

Query:  GNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSIT
        GNGFAPPPPPPGGALRSLR KK STKLKRSH LGNLYRTLKGKVEG NQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSIT
Subjt:  GNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSIT

Query:  ALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALE
        ALK SISSFQSSDM DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIV+QLQNWKFVSPMG LLDRVENYF+KIKGEVDALE
Subjt:  ALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALE

Query:  RTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELA
        RTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEK+RKGGRS+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA
Subjt:  RTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELA

Query:  IEIESESHHL
        +EIESESHHL
Subjt:  IEIESESHHL

A0A1S3B5Y5 uncharacterized protein At4g04980 isoform X10.0e+0092.95Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
        DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNE+IEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA

Query:  AIAQELPTSDLPTV--------------------------------------------------------------PPPPPPPPPPPMIQQNATLVQHLS
        AIAQELPTSDLPTV                                                              PPPPPPPPPPPMIQQNATLVQHLS
Subjt:  AIAQELPTSDLPTV--------------------------------------------------------------PPPPPPPPPPPMIQQNATLVQHLS

Query:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
        QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
Subjt:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS

Query:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
        LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
Subjt:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL

Query:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
        LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
Subjt:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF

Query:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
Subjt:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

A0A1S3B627 uncharacterized protein At4g04980 isoform X20.0e+0091.19Show/hide
Query:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
        PS   +     LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA
Subjt:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMA

Query:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
        NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
Subjt:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS

Query:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEE
        HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNE+IEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEE
Subjt:  HCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEE

Query:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV--------------------------------------------------------------
        KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV                                                              
Subjt:  KCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTV--------------------------------------------------------------

Query:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
        PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
Subjt:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA

Query:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
        PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK
Subjt:  PPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVK

Query:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
        KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK
Subjt:  KHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIK

Query:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
        GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
Subjt:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD

Query:  RLTRELAIEIESESHHL
        RLTRELAIEIESESHHL
Subjt:  RLTRELAIEIESESHHL

A0A5A7UTX7 Hydroxyproline-rich glycoprotein0.0e+0092.95Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
        DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNE+IEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA
Subjt:  DKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAA

Query:  AIAQELPTSDLPTV--------------------------------------------------------------PPPPPPPPPPPMIQQNATLVQHLS
        AIAQELPTSDLPTV                                                              PPPPPPPPPPPMIQQNATLVQHLS
Subjt:  AIAQELPTSDLPTV--------------------------------------------------------------PPPPPPPPPPPMIQQNATLVQHLS

Query:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
        QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS
Subjt:  QPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRS

Query:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
        LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL
Subjt:  LRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDL

Query:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
        LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF
Subjt:  LLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQF

Query:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
        DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL
Subjt:  DFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL

A0A6J1K4W2 uncharacterized protein At4g04980-like isoform X22.5e-30260.31Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFR+KAYGLETM++SSY  SK YSKK KLSK  R KKSSRCKDNFVQ+MELRKKIL+LRDIIDLPSLE SASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF
        PEIISDIQYSEMK TCIEQSL+YFCTALK IGDSWM+NHEW+DKSKYNL S QENSSF EIVESVL IIDC+VSMANERFD MD++VN+K+SS+SRTSSF
Subjt:  PEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSF

Query:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD
        GKS+SS +SCSETNSSCCSSPETPTSVL N R+S RKS EKEK+SCSSPLLWSLRVQAVEKLNPID+KHLLLP+L H GVNVCPAP RVAIVEES   VD
Subjt:  GKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVD

Query:  DKL-TSENTDA-------------ADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKL----------------------------------
        DKL +S+  DA             AD N EMEV   KEE + S+ ASQ+AD NE++EV   +EE +                                  
Subjt:  DKL-TSENTDA-------------ADANNEMEVCDIKEEKDLSKEASQKADGNEEIEVLDTKEEKL----------------------------------

Query:  --------------------------------------------------------------NLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIA
                                                                      +LSR AS KADR EE+EV DI+EEK  LS+A SQ+++ 
Subjt:  --------------------------------------------------------------NLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIA

Query:  ERTNDFDSQAAAIAQELPTSDLPT------VPPP-------------------PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPP
        ER ND DSQAAA  +E+P  +LP+      +PPP                   PPPPPPPPM+QQNA L Q L    PPPP+PQ+K  P AA    PPPP
Subjt:  ERTNDFDSQAAAIAQELPTSDLPT------VPPP-------------------PPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPP

Query:  PQLLKVIETVIKVNGPPPPPP---------------------------------PSNITGTMV-------------------------------------
        P   K + T+I ++ PPPPPP                                 PSN  G +                                      
Subjt:  PQLLKVIETVIKVNGPPPPPP---------------------------------PSNITGTMV-------------------------------------

Query:  ------RAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGG
               A  PPPPPM PSKG A  APPP M QG G   PPPPPGGA RSLR KK +T+LKRSHQLGNLYRTLKGKVEG NQNL+ ++G+KG  G+S GG
Subjt:  ------RAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGG

Query:  KQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVY
        KQGMADALAEMTKRSAYFQQIEEDVK HAK+I  LK  IS+FQSSDMN+L+ FH+ VESVLENLTDESQVLARFEGFP KKLE LR AAALYLKLD IV 
Subjt:  KQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVY

Query:  QLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAA-EKTRKGGRSENSNKA
        QLQNWK VSP G LLDR+ENYFSKIKG++DALERTKD+E+KRF+ HGIQFDF+VLIRIKESMVD+SS CMELALKE RELKAAA EK + G + +N N A
Subjt:  QLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAA-EKTRKGGRSENSNKA

Query:  RSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHH
         SKMLW+AFQFAYRVYTFAGGHDERADRLTRELA+EIESES +
Subjt:  RSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHH

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049809.7e-11039.87Show/hide
Query:  MEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------E
        M+DLQKL PEI++  Q  EM+   +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S   +VE VL  +D ++    ERF  MD        
Subjt:  MEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------E

Query:  FVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPA
        F   K  S     SF +S+    S SE+N+S   SP TP SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   +
Subjt:  FVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPA

Query:  PTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDIK--------------EEKDLSK------------EASQKADGNEEIEVLDTKEEKLNLSRT
         T++ I EE+     + ++   E+ D +    E    +IK              E KD S+            E   + D ++ IE  +T+   +    T
Subjt:  PTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDIK--------------EEKDLSK------------EASQKADGNEEIEVLDTKEEKLNLSRT

Query:  ASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAE
           + D N+ IE  + E           E+  E  ++  +      +  P  D+P  PP   P  P P +    T     SQPPPPPP P+ KA      
Subjt:  ASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAE

Query:  SNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNL
          APPPPP + K  E                 +G   +           + G   P+ P         APP PP  G  RSL  KK ++KL+RS Q+ NL
Subjt:  SNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNL

Query:  YRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENL
        Y  LKGK+EG     K+    KG   V   +     + GMADALAEMTKRS+YFQQIEEDV+K+AKSI  LKSSI SFQ+ DM +LL FH +VES+LE L
Subjt:  YRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENL

Query:  TDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVD
        TDE+QVLARFEGFP KKLE +R A ALY KLD I+ +L+NWK   P+  LLD++E YF+K KGE++ +ERTKDE++K F+ + I  DF VL+++KE+MVD
Subjt:  TDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVD

Query:  VSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESE
        VSS CMELALKE+RE    A +  K G      + R+K LWRAFQFA++VYTFAGGHDERAD LTR+LA EI+++
Subjt:  VSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESE

Q27J81 Inverted formin-23.8e-0538.97Show/hide
Query:  SQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRA
        SQ  A+ Q+  T         PPPPPPPP++  ++      ++PPPPPP P + +  A A   APPPPP         +    PP PP P  + G+    
Subjt:  SQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRA

Query:  GVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPP
         +PPPPP +P  G   P PPP +  G G+ PPPPPP
Subjt:  GVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPP

Q7G6K7 Formin-like protein 39.0e-0744.85Show/hide
Query:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA
        PPPPPPPPPPP+ Q N       SQPPPPPP P +        S  PPPPP  +    +V     PPPPPPP  +    V    PPPPP  PS  +    
Subjt:  PPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPA

Query:  PPPRMAQGNGF-APPPPPPGGALRSLRPKKTSTKLK
        PPP    GN F APPPPPP     S  P   +T  K
Subjt:  PPPRMAQGNGF-APPPPPPGGALRSLRPKKTSTKLK

Q84ZL0 Formin-like protein 51.1e-0440Show/hide
Query:  VPPPPPPPPPP-----------PMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPP
        VPPPPPPPPPP           P      + V  +S PPPPPP P   +  A      PPPPP       +    +  PPPPPP  +        VPPPP
Subjt:  VPPPPPPPPPP-----------PMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPP

Query:  PMVPSKGSAGPAPPPRMAQGNGFAPPPPPP
        P  P   S  P PPP  A      PPPPPP
Subjt:  PMVPSKGSAGPAPPPRMAQGNGFAPPPPPP

Q9AKP3 Arp2/3 complex-activating protein rickA7.6e-0638.67Show/hide
Query:  LPTSDLPTVPPPP----------PPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVR
        LP +++P+ PPPP          PPPPPPP+   N       S PPPPPP+P           + PPPPP L     +   +  PPPPPPP  ++G  + 
Subjt:  LPTSDLPTVPPPP----------PPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVR

Query:  AGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTS
        +  PPPPP+  +   + P PPP ++Q N   PPPPPP  A  S + +K S
Subjt:  AGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTS

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown2.8e-1125.36Show/hide
Query:  SNITGTMVRAGVPPPPPMVPSKGSAGPAPPPR---MAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVG
        SN  G+ ++A   P      +KG     P P+     Q     PPPPPP  + R+L  +     ++R+ ++   YR L  K E    N  + NG      
Subjt:  SNITGTMVRAGVPPPPPMVPSKGSAGPAPPPR---MAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVG

Query:  NSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKL
        N N         + E+  RS Y   I+ D  +H   I  L S + +   +D++++  F K ++  L +L DE  VL  F  +P +K+++LR AA  Y + 
Subjt:  NSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKL

Query:  DTIVYQLQNWKFVSPMGL--LLDRVENYFSKIKGEVDALERTKDEESKRFRGHGI----QFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRK
          +  ++ ++K      L   L R+++   +++  V+  E+ +D   KR++   I      D  ++ ++K S + ++   M+   KE             
Subjt:  DTIVYQLQNWKFVSPMGL--LLDRVENYFSKIKGEVDALERTKDEESKRFRGHGI----QFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRK

Query:  GGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTREL
           S  S K  + ML +  +FAY ++ FAGG D     +  EL
Subjt:  GGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTREL

AT1G11070.1 BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1)2.4e-11139.14Show/hide
Query:  RRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKT
        RR A   E +KN+       + +  K+S  + S  SS+   NF+ M+ELR+KI+  R IIDLP L    SI+ +V+ TM+DL KL PEII   Q  EM+ 
Subjt:  RRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKT

Query:  TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETN
          +++ L  F  ALKSIGDSW+ +HEW  KSKY  S+ ++N S   +VE VL  +D ++   NER +  +   N       +      S  ST + ++ +
Subjt:  TCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETN

Query:  SSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADAN
         S    P  P +VL    S   K      +S S+ L  ++R+QA+ KL+PIDVK L +   S                                      
Subjt:  SSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADAN

Query:  NEMEVCDIKEEKDLSKEA-SQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPT
                       KEA S   DG++E                 SLK  ++E       EE+   +    Q+ +  + +D D    ++  E+ +     
Subjt:  NEMEVCDIKEEKDLSKEA-SQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPT

Query:  VPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGP
         PPPPPPP        NA +       P  PP+       A  ES AP PPP     +        PPPPP P       VR GV  PP   P   +  P
Subjt:  VPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGP

Query:  APPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKS---ANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQI
         PP  MA G G A PPPPP GA   L  KK ++KLKRS  LG L+R LKGK+EG N  ++S     G KG  G++   GKQGMADALAE+TK+S YFQ+I
Subjt:  APPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKS---ANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQI

Query:  EEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENY
        EEDV+ +  SI  LK+ I+ F++ D+ +L  FH ++ESVLE L DE+QVLAR EGFP KKLE +R+AAALY KL+ ++ +L+NWK  SP   L D+ E Y
Subjt:  EEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENY

Query:  FSK------------------IKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKAR---
        F+K                  I+ E++ L++ K EE K+F+ + I FDF++L++IKE MVD+SSGCMELALKEKRE K A++ T +   ++ S K +   
Subjt:  FSK------------------IKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKAR---

Query:  -SKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIE
         +K LWRAF FAYRVYTFAGGHD+RAD+LTRELA EIE
Subjt:  -SKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIE

AT1G61080.1 Hydroxyproline-rich glycoprotein family protein1.3e-12040.04Show/hide
Query:  KGARSKKSS---RCKDNFVQMMELRKKILILRDIIDLPSLERSASINE---------LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKS
        K ARS K+S   +   NF+ M+ELR+KI   RDIIDL +L+ S SI +         +V+ TM+DLQK+ PEII      E++   +++ L +F  ALKS
Subjt:  KGARSKKSS---RCKDNFVQMMELRKKILILRDIIDLPSLERSASINE---------LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKS

Query:  IGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAM--DEFVNSKDS--------SFSRTSSFGKSSSSTDSCSETNSSCCSS
        IGDSW+ N EW  KSKY  SS  +N S   +VE VL  +D ++ M+ ERFD M  DE    K+S        S SR  S  +S S + S   + +S C S
Subjt:  IGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAM--DEFVNSKDS--------SFSRTSSFGKSSSSTDSCSETNSSCCSS

Query:  PETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVC
        P TP SVL             +  + +S LLW++RVQA+EKL+PIDVK L +  LS                             +     + +N+ +V 
Subjt:  PETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVC

Query:  DIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDS-------------QAAAIAQELP
         + E      E  QK D  E I+V    EE +NL   + +     + I  I   E    SK N  E     +  F               + A++    P
Subjt:  DIKEEKDLSKEASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDS-------------QAAAIAQELP

Query:  TSDLP------TVPPPPPPPPPPPMIQQNATLVQHLSQPPPP-------------PPMPQMKA-QPAAAESNAPPPPPQLLKVIETV--------IKVNG
            P      ++PPPPPPPPPPP +      ++H + PPPP             PP P   A +P    +  PPPPP L   I             V  
Subjt:  TSDLP------TVPPPPPPPPPPPMIQQNATLVQHLSQPPPP-------------PPMPQMKA-QPAAAESNAPPPPPQLLKVIETV--------IKVNG

Query:  PPPPPPPSNIT-----------------------------------GTMVRAGVPPPPPMVP----------------SKGSAG-PAPPPRMAQGNGFAP
        PPPPPPP                                       G     G PPPPP +P                + G+AG P PPPRM   NG A 
Subjt:  PPPPPPPSNIT-----------------------------------GTMVRAGVPPPPPMVP----------------SKGSAG-PAPPPRMAQGNGFAP

Query:  PPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLK--SANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK
        PPPPPG A RSLRPKK +TKLKRS QLGNLYR LKGKVEG + N K  S +GRK G G++  GGKQGMADALAE+TK+SAYF QI+ D+ K+  SI  LK
Subjt:  PPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLK--SANGRKGGVGNS-NGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALK

Query:  SSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTK
          I+ FQ+ DM +LL FH++VESVLENLTDESQVLAR EGFP KKLE +R+A ALY KL  ++ +LQN K   P+  LLD+VE YF+K            
Subjt:  SSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTK

Query:  DEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEI
                             IKE+MVD+SS CMELALKEKR+ K  +   +    S       +KMLWRAFQFA++VYTFAGGHD+RAD LTRELA EI
Subjt:  DEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEI

Query:  ESES
        +++S
Subjt:  ESES

AT4G04980.1 unknown protein7.1e-12440.19Show/hide
Query:  SKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSI
        SKT +  P+  K      S +C  NF+ M+ELRK I   RD+IDLPSL+ S S+ E++  TM+DLQKL PEI++  Q  EM+   +++ L +F   L++I
Subjt:  SKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSI

Query:  GDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETP
        GDSW+++ +W  +SKY  S   +N S   +VE VL  +D ++    ERF  MD        F   K  S     SF +S+    S SE+N+S   SP TP
Subjt:  GDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETP

Query:  TSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDI
         SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   + T++ I EE+     + ++   E+ D +    E    +I
Subjt:  TSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDI

Query:  K--------------EEKDLSK------------EASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFD
        K              E KD S+            E   + D ++ IE  +T+   +    T   + D N+ IE  + E           E+  E  ++  
Subjt:  K--------------EEKDLSK------------EASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFD

Query:  SQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRA
        +      +  P  D+P  PP   P  P P +    T     SQPPPPPP P+ KA        APPPPP + K  E                 +G   + 
Subjt:  SQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQPPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRA

Query:  GVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQG
                  + G   P+ P         APP PP  G  RSL  KK ++KL+RS Q+ NLY  LKGK+EG     K+    KG   V   +     + G
Subjt:  GVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKG--GVGNSNG---GKQG

Query:  MADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQ
        MADALAEMTKRS+YFQQIEEDV+K+AKSI  LKSSI SFQ+ DM +LL FH +VES+LE LTDE+QVLARFEGFP KKLE +R A ALY KLD I+ +L+
Subjt:  MADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQ

Query:  NWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKM
        NWK   P+  LLD++E YF+K KGE++ +ERTKDE++K F+ + I  DF VL+++KE+MVDVSS CMELALKE+RE    A +  K G      + R+K 
Subjt:  NWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKM

Query:  LWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESE
        LWRAFQFA++VYTFAGGHDERAD LTR+LA EI+++
Subjt:  LWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESE

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.6e-1426.67Show/hide
Query:  PPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSL
        P  P P P+       +  N   PPPQ         K   PPPPPPP  +         PPPPP      S   APPP         PPPPPP  +L   
Subjt:  PPPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSL

Query:  RPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGM------ADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSS
             S K++R  ++   Y +L  +         S N R+   G  N   + +       D + E+  RS Y   I+ DV+     I  L   + +   S
Subjt:  RPKKTSTKLKRSHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGM------ADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSS

Query:  DMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKF--VSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRF
        D+ D++ F K ++  L  L DE  VL  FE +P +K + LR AA  Y  L  ++ +   ++          L +++  F K++  V +L R ++  + +F
Subjt:  DMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKF--VSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRF

Query:  RGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELA-------IEI
        +   I  D+ +   I  S + ++S  ++LA+K  + + A  E    GG  E       +++ +  +FA+RV+ FAGG D    +   EL        ++ 
Subjt:  RGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKAAAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELA-------IEI

Query:  ESESH
        +S++H
Subjt:  ESESH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCGGAGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAGAAAGGCCTATGGACTCGAGACAATGAAGAACTCTTCTTACGTGTTCTCGAAGACATATTCTAA
GAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTCTTCAAGATGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCA
TTGATTTGCCTTCTTTAGAACGCTCTGCTTCTATAAATGAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAGCTTTATCCTGAAATCATTTCGGATATCCAATATTCC
GAAATGAAGACGACATGTATTGAACAGAGTCTTGCCTATTTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAGTGGAGGGACAAATCTAAATA
TAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCGTTAGTATGGCGAACGAAAGGTTTGATGCGATGG
ACGAATTTGTTAACTCAAAGGATTCTTCTTTTTCAAGAACTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCT
CCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAGCAGTGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCTCTTTTATGGTCTCTTAGAGTTCA
AGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTAGGTTGTCTCACTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGG
AATCAACGATGGATGTGGATGACAAGCTCACCTCTGAAAACACTGATGCTGCTGATGCAAATAATGAAATGGAAGTGTGTGATATTAAAGAAGAGAAGGATTTGAGCAAG
GAAGCTAGTCAAAAGGCAGATGGAAATGAGGAAATTGAAGTGTTAGATACCAAAGAAGAAAAATTGAATTTGAGCAGGACAGCTAGTCTAAAGGCAGATAGAAATGAAGA
AATTGAAGTGATTGATATCGAAGAAGAGAAGTGTTTGAGCAAGGCAAATAGCCAAGAAGACATTGCTGAGAGAACCAATGATTTTGATTCCCAAGCTGCTGCAATTGCTC
AAGAATTGCCTACATCAGATTTACCAACTGTTCCACCTCCTCCACCACCACCACCACCACCACCAATGATACAACAAAATGCAACATTGGTCCAACATCTTTCACAGCCA
CCTCCTCCCCCACCAATGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGTCAAATGCGCCTCCTCCACCTCCACAATTGTTAAAGGTAATCGAAACGGTGATCAAAGT
CAATGGACCACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCGCCACCTCCCCCTATGGTGCCCTCAAAAGGGAGTGCAGGTCCAG
CACCCCCTCCTCGGATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCATTACGGTCCTTGCGCCCCAAGAAAACCTCTACCAAACTAAAAAGA
TCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATGCAATCAAAATCTTAAGTCGGCTAACGGAAGGAAAGGTGGCGTCGGAAACAGTAACGG
AGGAAAACAAGGAATGGCTGATGCATTGGCAGAGATGACAAAAAGATCAGCATACTTCCAGCAAATTGAAGAAGATGTTAAAAAACACGCCAAATCGATCACCGCGCTTA
AATCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACCTGCTCCTTTTCCACAAGCAAGTGGAATCTGTACTAGAGAATTTAACTGATGAATCACAGGTACTAGCA
AGGTTTGAAGGATTTCCCATCAAAAAGTTGGAAACTTTGAGAATTGCAGCAGCATTATATCTAAAGTTAGATACAATTGTCTATCAACTACAGAACTGGAAGTTTGTTTC
TCCCATGGGACTGCTTCTCGACCGAGTCGAAAACTACTTCTCTAAGATCAAAGGAGAAGTCGATGCACTTGAACGAACCAAGGATGAAGAATCAAAGAGATTCCGAGGTC
ACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGGAATCAATGGTGGATGTTTCTTCTGGCTGCATGGAGTTGGCTTTGAAGGAAAAAAGAGAGTTGAAGGCA
GCAGCAGAAAAGACACGAAAAGGAGGCCGATCTGAAAATTCGAACAAGGCACGTTCAAAGATGCTATGGAGGGCATTCCAATTCGCATACCGAGTTTACACCTTCGCCGG
TGGACACGACGAGCGTGCTGATAGACTGACCAGAGAGTTGGCTATAGAAATAGAGAGTGAATCCCATCACCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACCGGAGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAGAAAGGCCTATGGACTCGAGACAATGAAGAACTCTTCTTACGTGTTCTCGAAGACATATTCTAA
GAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTCTTCAAGATGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCA
TTGATTTGCCTTCTTTAGAACGCTCTGCTTCTATAAATGAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAGCTTTATCCTGAAATCATTTCGGATATCCAATATTCC
GAAATGAAGACGACATGTATTGAACAGAGTCTTGCCTATTTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAGTGGAGGGACAAATCTAAATA
TAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCGTTAGTATGGCGAACGAAAGGTTTGATGCGATGG
ACGAATTTGTTAACTCAAAGGATTCTTCTTTTTCAAGAACTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCT
CCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAGCAGTGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCTCTTTTATGGTCTCTTAGAGTTCA
AGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTAGGTTGTCTCACTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGG
AATCAACGATGGATGTGGATGACAAGCTCACCTCTGAAAACACTGATGCTGCTGATGCAAATAATGAAATGGAAGTGTGTGATATTAAAGAAGAGAAGGATTTGAGCAAG
GAAGCTAGTCAAAAGGCAGATGGAAATGAGGAAATTGAAGTGTTAGATACCAAAGAAGAAAAATTGAATTTGAGCAGGACAGCTAGTCTAAAGGCAGATAGAAATGAAGA
AATTGAAGTGATTGATATCGAAGAAGAGAAGTGTTTGAGCAAGGCAAATAGCCAAGAAGACATTGCTGAGAGAACCAATGATTTTGATTCCCAAGCTGCTGCAATTGCTC
AAGAATTGCCTACATCAGATTTACCAACTGTTCCACCTCCTCCACCACCACCACCACCACCACCAATGATACAACAAAATGCAACATTGGTCCAACATCTTTCACAGCCA
CCTCCTCCCCCACCAATGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGTCAAATGCGCCTCCTCCACCTCCACAATTGTTAAAGGTAATCGAAACGGTGATCAAAGT
CAATGGACCACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCGCCACCTCCCCCTATGGTGCCCTCAAAAGGGAGTGCAGGTCCAG
CACCCCCTCCTCGGATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCATTACGGTCCTTGCGCCCCAAGAAAACCTCTACCAAACTAAAAAGA
TCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATGCAATCAAAATCTTAAGTCGGCTAACGGAAGGAAAGGTGGCGTCGGAAACAGTAACGG
AGGAAAACAAGGAATGGCTGATGCATTGGCAGAGATGACAAAAAGATCAGCATACTTCCAGCAAATTGAAGAAGATGTTAAAAAACACGCCAAATCGATCACCGCGCTTA
AATCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACCTGCTCCTTTTCCACAAGCAAGTGGAATCTGTACTAGAGAATTTAACTGATGAATCACAGGTACTAGCA
AGGTTTGAAGGATTTCCCATCAAAAAGTTGGAAACTTTGAGAATTGCAGCAGCATTATATCTAAAGTTAGATACAATTGTCTATCAACTACAGAACTGGAAGTTTGTTTC
TCCCATGGGACTGCTTCTCGACCGAGTCGAAAACTACTTCTCTAAGATCAAAGGAGAAGTCGATGCACTTGAACGAACCAAGGATGAAGAATCAAAGAGATTCCGAGGTC
ACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGGAATCAATGGTGGATGTTTCTTCTGGCTGCATGGAGTTGGCTTTGAAGGAAAAAAGAGAGTTGAAGGCA
GCAGCAGAAAAGACACGAAAAGGAGGCCGATCTGAAAATTCGAACAAGGCACGTTCAAAGATGCTATGGAGGGCATTCCAATTCGCATACCGAGTTTACACCTTCGCCGG
TGGACACGACGAGCGTGCTGATAGACTGACCAGAGAGTTGGCTATAGAAATAGAGAGTGAATCCCATCACCTATGA
Protein sequenceShow/hide protein sequence
MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYS
EMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSS
PETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSK
EASQKADGNEEIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVPPPPPPPPPPPMIQQNATLVQHLSQP
PPPPPMPQMKAQPAAAESNAPPPPPQLLKVIETVIKVNGPPPPPPPSNITGTMVRAGVPPPPPMVPSKGSAGPAPPPRMAQGNGFAPPPPPPGGALRSLRPKKTSTKLKR
SHQLGNLYRTLKGKVEGCNQNLKSANGRKGGVGNSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKSSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLA
RFEGFPIKKLETLRIAAALYLKLDTIVYQLQNWKFVSPMGLLLDRVENYFSKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSGCMELALKEKRELKA
AAEKTRKGGRSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHHL