; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0024198 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0024198
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionHydroxyproline-rich glycoprotein
Genome locationchr12:4818360..4823352
RNA-Seq ExpressionPI0024198
SyntenyPI0024198
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008442261.1 PREDICTED: uncharacterized protein At4g04980 isoform X1 [Cucumis melo]0.0e+0091.14Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKKPKLSKGARSKKSSR KDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPTRVAIVEES MDVDDKLTS NTDAADA +EMEVCDIKEE+DLSKEAS KAD NE+IEV D KEEKLNLSRTASLKADRNEE EVIDI+EEK CL
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ---
        S+AN+Q+DIAERT++FDSQA A AQELPTSDLP VVSKPLPLL    P  PPPPPPPPARALQPS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLSQ   
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ---

Query:  PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPP
        PPPPPPPPM+QQNAT VQHLSQPPPPPP+PQMKAQPAAAE NAPPPPPQLLKVIETVIKV+  PPPPPPSNITGTMVRAGVPPPPPMVPS+GSAGPAPPP
Subjt:  PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPP

Query:  PMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA
         MAQGNGFAPPPPPPGGA+RSLRPKK STKLKRSHQLGNLYRTLKGKVEG NQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA
Subjt:  PMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA

Query:  KSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEV
        KSITALK SISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLD IVYQLQNWKFVSP+G LLDRVENYF+KIKGEV
Subjt:  KSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEV

Query:  DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
        DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEKTRKGG+SENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
Subjt:  DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT

Query:  RELAIEIESESHQL
        RELAIEIESESH L
Subjt:  RELAIEIESESHQL

XP_031736251.1 uncharacterized protein At4g04980 isoform X1 [Cucumis sativus]0.0e+0091.42Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFRKKAYGLETHSSLSFSICP+GKWVCLKKTMKSSSYVFSKTYSKK KLSKGARSKKSS  KDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPTRVAIVEESMMD+DDKL S NTDAADA +EMEVCDIKEE+DLSKEAS KADRNEEIEVFD KEEKLNLSRTASLKADRNEE EVIDI+EEKLCL
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVA-PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPP
        +R N+QKDIAERTD+FDSQA AT QELPTSDLP VVSKPLPLLPP+A PPPPPPPPAR L+PS+VTLQLPTPTPPPPPPPPMMQQNAVLAQQ        
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVA-PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPP

Query:  PPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQG
                      LSQPPPPPPVPQMKAQPAAA PN PPPPPQLLKVIET IKVSVPPPPPPSN TGTMVRAGVPPPPPM PS+G AGPAPPPP MAQG
Subjt:  PPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQG

Query:  NGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA
        NGFAPPPPPPGGA+RSLR KKASTKLKRSH LGNLYRTLKGKVEGSNQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA
Subjt:  NGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA

Query:  LKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALER
        LKPSISSFQSSDM DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLD IV+QLQNWKFVSP+G LLDRVENYFTKIKGEVDALER
Subjt:  LKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALER

Query:  TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAI
        TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEK+RKGG+S+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+
Subjt:  TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAI

Query:  EIESESHQL
        EIESESH L
Subjt:  EIESESHQL

XP_031736252.1 uncharacterized protein At4g04980 isoform X2 [Cucumis sativus]0.0e+0089.33Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFRKKAYGLE                    TMKSSSYVFSKTYSKK KLSKGARSKKSS  KDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPTRVAIVEESMMD+DDKL S NTDAADA +EMEVCDIKEE+DLSKEAS KADRNEEIEVFD KEEKLNLSRTASLKADRNEE EVIDI+EEKLCL
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVA-PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPP
        +R N+QKDIAERTD+FDSQA AT QELPTSDLP VVSKPLPLLPP+A PPPPPPPPAR L+PS+VTLQLPTPTPPPPPPPPMMQQNAVLAQQ        
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVA-PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPP

Query:  PPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQG
                      LSQPPPPPPVPQMKAQPAAA PN PPPPPQLLKVIET IKVSVPPPPPPSN TGTMVRAGVPPPPPM PS+G AGPAPPPP MAQG
Subjt:  PPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQG

Query:  NGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA
        NGFAPPPPPPGGA+RSLR KKASTKLKRSH LGNLYRTLKGKVEGSNQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA
Subjt:  NGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA

Query:  LKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALER
        LKPSISSFQSSDM DLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLD IV+QLQNWKFVSP+G LLDRVENYFTKIKGEVDALER
Subjt:  LKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALER

Query:  TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAI
        TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEK+RKGG+S+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+
Subjt:  TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAI

Query:  EIESESHQL
        EIESESH L
Subjt:  EIESESHQL

XP_031736253.1 uncharacterized protein At4g04980 isoform X3 [Cucumis sativus]0.0e+0088.12Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFRKKAYGLETHSSLSFSICP+GKWVCLKKTMKSSSYVFSKTYSKK KLSKGARSKKSS  KDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPTRVAIVEESMMD+DDKL S NTDAADA +EMEVCDIKEE+DLSKEAS KADRNEEIEVFD KEEKLNLSRTASLKADRNEE EVIDI+EEKLCL
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVA-PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPP
        +R N+QKDIAERTD+FDSQA AT QELPTSDLP VVSKPLPLLPP+A PPPPPPPPAR L+PS+VTLQLPTPTPPPPPPPPMMQQNAVLAQQ        
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVA-PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPP

Query:  PPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQG
                      LSQPPPPPPVPQMKAQPAAA PN PPPPPQLLKVIET IKVSVPPPPPPSN TGTMVRAGVPPPPPM PS+G AGPAPPPP MAQG
Subjt:  PPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQG

Query:  NGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA
        NGFAPPPPPPGGA+RSLR KKASTKLKRSH LGNLYRTLKGKVEGSNQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA
Subjt:  NGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITA

Query:  LKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALER
        LKPSISSFQSSDM DLLLFHKQVESVLENLTDES                                QLQNWKFVSP+G LLDRVENYFTKIKGEVDALER
Subjt:  LKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALER

Query:  TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAI
        TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEK+RKGG+S+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+
Subjt:  TKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAI

Query:  EIESESHQL
        EIESESH L
Subjt:  EIESESHQL

XP_038895361.1 uncharacterized protein At4g04980-like isoform X1 [Benincasa hispida]0.0e+0077.12Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFRKKAYGLETHSSL FSICPLGKWVCLKKTMKSSSYVFSK+Y+KKPK SK ARSKKSSR KDNFVQMMELRKKI+ILRDIIDLP LE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIIS IQYSEMK TCIEQ+LA+FCTALKSIG+SWMLNHEWRDKSKYNLSS+QEN+SF+EIVESVL IIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        D MDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVL NFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPT VAIVEESMMDVDDKLTSVNTDAADA +EMEVCD+KEE+DLS EAS +AD NEE+EV D+KEEKLN                          L
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPT----PTPPPPPPPPMMQQNAVLAQQLSQPP
        SRAN+QKDIAERTD+F SQ  AT QE+P S+LP VVSKPLP LP VA PPP  PPARA QPSVV LQLPT    P PPPPPPPPMMQQ+AV AQQLSQPP
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPT----PTPPPPPPPPMMQQNAVLAQQLSQPP

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  PPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQP-AAAEPNAPPPPPQLLKVIETVIKVSV-PPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP
        PP PPPMMQQNA   Q LSQPPPPPP+PQMK QP AAA P APPPPPQLLKVIETVIKV V PPPPPPSN+TGT VRAG+PPPPPM PS G A PAPPPP
Subjt:  PPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQP-AAAEPNAPPPPPQLLKVIETVIKVSV-PPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP

Query:  MAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAK
        MAQGNGFAPPPPPPGGA+RSLRPKK STKLKRSHQLGNLYRTLKGKVEGSNQNL+SANGRKG VGSS GGKQGMADALAEMTKRSAYFQQIE+DVKKHAK
Subjt:  MAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAK

Query:  SITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVD
        SIT LK SISSFQSSDMND+L+FHKQVESVLENLTDESQVLARFEGFP KKLET+RIA ALYLKLDAIV+QLQNWKFVSP+G LLDRVE+YF+KIKGEVD
Subjt:  SITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVD

Query:  ALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESK-AAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
        ALERTKDEESKRFR HGIQFDFSVLIRIKESMVDVSSSCMELALKEKRE K AAAEKT+ GG+SENSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLT
Subjt:  ALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESK-AAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT

Query:  RELAIEIESESHQL
        RELA+EIESES  L
Subjt:  RELAIEIESESHQL

TrEMBL top hitse value%identityAlignment
A0A0A0LSA3 Uncharacterized protein0.0e+0084.86Show/hide
Query:  MKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY
        MKSSSYVFSKTYSKK KLSKGARSKKSS  KDNFVQMMELRKKILILRDIIDLPSLE SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY
Subjt:  MKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAY

Query:  FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPET
        FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERFDAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPET
Subjt:  FCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPET

Query:  PTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIK
        PTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESMMD+DDKL S NTDAADA +EMEVCDIK
Subjt:  PTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIK

Query:  EERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL
        EE+DLSKEAS KADRNEEIEVFD KEEKLNLSRTASLKADRNEE EVIDI+EEKLCL+R N+QKDIAERTD+FDSQA AT QELPTSDLP V        
Subjt:  EERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL

Query:  PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQ
                                                                                 PPPPPPVPQMKAQPAAA PN PPPPPQ
Subjt:  PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQ

Query:  LLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVE
        LLKVIET IKVSVPPPPPPSN TGTMVRAGVPPPPPM PS+G AGPAPPPP MAQGNGFAPPPPPPGGA+RSLR KKASTKLKRSH LGNLYRTLKGKVE
Subjt:  LLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPP-MAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVE

Query:  GSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFP
        GSNQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDM DLLLFHKQVESVLENLTDESQVLARFEGFP
Subjt:  GSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFP

Query:  IKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKR
        IKKLETLRIAAALYLKLD IV+QLQNWKFVSP+G LLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKR
Subjt:  IKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKR

Query:  ESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHQL
        E KAAAEK+RKGG+S+NSNKA SKMLWRAFQFAYRVYTFAGGHD+RADRLTRELA+EIESESH L
Subjt:  ESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESESHQL

A0A1S3B5Y5 uncharacterized protein At4g04980 isoform X10.0e+0091.14Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKKPKLSKGARSKKSSR KDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPTRVAIVEES MDVDDKLTS NTDAADA +EMEVCDIKEE+DLSKEAS KAD NE+IEV D KEEKLNLSRTASLKADRNEE EVIDI+EEK CL
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ---
        S+AN+Q+DIAERT++FDSQA A AQELPTSDLP VVSKPLPLL    P  PPPPPPPPARALQPS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLSQ   
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ---

Query:  PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPP
        PPPPPPPPM+QQNAT VQHLSQPPPPPP+PQMKAQPAAAE NAPPPPPQLLKVIETVIKV+  PPPPPPSNITGTMVRAGVPPPPPMVPS+GSAGPAPPP
Subjt:  PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPP

Query:  PMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA
         MAQGNGFAPPPPPPGGA+RSLRPKK STKLKRSHQLGNLYRTLKGKVEG NQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA
Subjt:  PMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA

Query:  KSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEV
        KSITALK SISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLD IVYQLQNWKFVSP+G LLDRVENYF+KIKGEV
Subjt:  KSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEV

Query:  DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
        DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEKTRKGG+SENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
Subjt:  DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT

Query:  RELAIEIESESHQL
        RELAIEIESESH L
Subjt:  RELAIEIESESHQL

A0A1S3B627 uncharacterized protein At4g04980 isoform X20.0e+0091.93Show/hide
Query:  PSLEGSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMA
        PS   +     LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMA
Subjt:  PSLEGSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMA

Query:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
        NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS
Subjt:  NERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLS

Query:  HCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEE
        HCGVNVCPAPTRVAIVEES MDVDDKLTS NTDAADA +EMEVCDIKEE+DLSKEAS KAD NE+IEV D KEEKLNLSRTASLKADRNEE EVIDI+EE
Subjt:  HCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEE

Query:  KLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLS
        K CLS+AN+Q+DIAERT++FDSQA A AQELPTSDLP VVSKPLPLL    P  PPPPPPPPARALQPS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLS
Subjt:  KLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLS

Query:  Q---PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGP
        Q   PPPPPPPPM+QQNAT VQHLSQPPPPPP+PQMKAQPAAAE NAPPPPPQLLKVIETVIKV+  PPPPPPSNITGTMVRAGVPPPPPMVPS+GSAGP
Subjt:  Q---PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGP

Query:  APPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDV
        APPP MAQGNGFAPPPPPPGGA+RSLRPKK STKLKRSHQLGNLYRTLKGKVEG NQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDV
Subjt:  APPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDV

Query:  KKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKI
        KKHAKSITALK SISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLD IVYQLQNWKFVSP+G LLDRVENYF+KI
Subjt:  KKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKI

Query:  KGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERA
        KGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEKTRKGG+SENSNKARSKMLWRAFQFAYRVYTFAGGHDERA
Subjt:  KGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERA

Query:  DRLTRELAIEIESESHQL
        DRLTRELAIEIESESH L
Subjt:  DRLTRELAIEIESESHQL

A0A5A7UTX7 Hydroxyproline-rich glycoprotein0.0e+0091.14Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKKPKLSKGARSKKSSR KDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCI+SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL
        NVCPAPTRVAIVEES MDVDDKLTS NTDAADA +EMEVCDIKEE+DLSKEAS KAD NE+IEV D KEEKLNLSRTASLKADRNEE EVIDI+EEK CL
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCL

Query:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ---
        S+AN+Q+DIAERT++FDSQA A AQELPTSDLP VVSKPLPLL    P  PPPPPPPPARALQPS+V LQLPTPTPPPPPPPPMMQQNAVLAQQLSQ   
Subjt:  SRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLL---PPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQ---

Query:  PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPP
        PPPPPPPPM+QQNAT VQHLSQPPPPPP+PQMKAQPAAAE NAPPPPPQLLKVIETVIKV+  PPPPPPSNITGTMVRAGVPPPPPMVPS+GSAGPAPPP
Subjt:  PPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS-VPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPP

Query:  PMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA
         MAQGNGFAPPPPPPGGA+RSLRPKK STKLKRSHQLGNLYRTLKGKVEG NQNL+SANGRKGGVG+SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA
Subjt:  PMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHA

Query:  KSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEV
        KSITALK SISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLD IVYQLQNWKFVSP+G LLDRVENYF+KIKGEV
Subjt:  KSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEV

Query:  DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
        DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSS CMELALKEKRE KAAAEKTRKGG+SENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT
Subjt:  DALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLT

Query:  RELAIEIESESHQL
        RELAIEIESESH L
Subjt:  RELAIEIESESHQL

A0A6J1JVH3 uncharacterized protein At4g04980-like isoform X10.0e+0062.31Show/hide
Query:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFRKKAYGLE+HSSLS SICP G WVCL KTM+SSSY  SK YSKK KLSK  R KKSSR KDNFVQ+MELRKKIL+LRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLE

Query:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF
         SASINELVVGTMEDLQKLYPEIISDIQYSEMK TCIEQSL+YFCTALK IGDSWM+NHEW+DKSKYNL S QENSSF EIVESVL IIDC++SMANERF
Subjt:  GSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERF

Query:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV
        D MD++VN+K+SS+SRTSSFGKS+SS +SCSETNSSCCSSPETPTSVL N RNS RKS EKEK+SCSSPLLWSLRVQAVEKLNPID+KHLLLP+L H GV
Subjt:  DAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGV

Query:  NVCPAPTRVAIVEESMMDVDDKLTS--------------VNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKL--------------
        NVCPAP RVAIVEESM  VDDKL S              V +  AD  +EMEV   KEE + S+ AS +AD NE++EV   +EE +              
Subjt:  NVCPAPTRVAIVEESMMDVDDKLTS--------------VNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKL--------------

Query:  ----------------------------------------------------------------------------------NLSRTASLKADRNEETEV
                                                                                          +LSR AS KADR EE EV
Subjt:  ----------------------------------------------------------------------------------NLSRTASLKADRNEETEV

Query:  IDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPT--PPPPPPPPMMQQNAVL
         DIKEEKL LSRA +QK++ ER ++ DSQA AT +E+P  +LP   S PLPL+         PPP  A +PS V LQLPTP+  PPPPPPPPMMQQNAVL
Subjt:  IDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPT--PPPPPPPPMMQQNAVL

Query:  AQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPP------------VPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMV------
        +QQL    PPPP P ++       H+  PPPPPP             P     P+ A    PPPPP           V  PPPP PSN  G +       
Subjt:  AQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPP------------VPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMV------

Query:  -------------------------------------RAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNL
                                              A  PPPPPM PS+G A  APPPPM QG G   PPPPPGGA RSLR KKA+T+LKRSHQLGNL
Subjt:  -------------------------------------RAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNL

Query:  YRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQ
        YRTLKGKVEGSNQNLR ++G+KG  GSS GGKQGMADALAEMTKRSAYFQQIEEDVK HAK+I  LKP IS+FQSSDMN+L+ FH+ VESVLENLTDESQ
Subjt:  YRTLKGKVEGSNQNLRSANGRKGGVGSSNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQ

Query:  VLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSC
        VLARFEGFP KKLE LR AAALYLKLDAIV QLQNWK VSP G LLDR+ENYF+KIKG++DALERTKD+E+KRF+ HGIQFDF+VLIRIKESMVD+SSSC
Subjt:  VLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSC

Query:  MELALKEKRESKAAA-EKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESES
        MELALKE RE KAAA EK + G + +N N A SKMLW+AFQFAYRVYTFAGGHDERADRLTRELA+EIESES
Subjt:  MELALKEKRESKAAA-EKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESES

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049801.1e-10336.81Show/hide
Query:  MEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMD-------E
        M+DLQKL PEI++  Q  EM+   +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S   +VE VL  +D +I    ERF  MD        
Subjt:  MEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMD-------E

Query:  FVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPA
        F   K  S     SF +S+    S SE+N+S   SP TP SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   +
Subjt:  FVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPA

Query:  PTRVAIVEE------------------SMMDVDDKLTSVNTD----------AADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRT
         T++ I EE                  S+++ +     + T+            +A D  E  + +  R    E   + D ++ IE F+ +   +    T
Subjt:  PTRVAIVEE------------------SMMDVDDKLTSVNTD----------AADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRT

Query:  ASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPP
           + D N+  E  + +   +       + +  + ++   S+ D+T                                              +P    PP
Subjt:  ASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPP

Query:  PPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPP
        PPP+               P  P P +    T     SQPPPPPP P+ KA        APPPPP + K  E                +G   +      
Subjt:  PPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPP

Query:  PPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKG--GVGSSNG---GKQGMADAL
             + G   P+ P         APP PP  G+ RSL  KKA++KL+RS Q+ NLY  LKGK+EG     ++    KG   V   +     + GMADAL
Subjt:  PPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKG--GVGSSNG---GKQGMADAL

Query:  AEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFV
        AEMTKRS+YFQQIEEDV+K+AKSI  LK SI SFQ+ DM +LL FH +VES+LE LTDE+QVLARFEGFP KKLE +R A ALY KLD I+ +L+NWK  
Subjt:  AEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFV

Query:  SPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAF
         P+  LLD++E YF K KGE++ +ERTKDE++K F+ + I  DF VL+++KE+MVDVSS+CMELALKE+RE    A +  K G+     + R+K LWRAF
Subjt:  SPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAF

Query:  QFAYRVYTFAGGHDERADRLTRELAIEIESESHQ
        QFA++VYTFAGGHDERAD LTR+LA EI+++  Q
Subjt:  QFAYRVYTFAGGHDERADRLTRELAIEIESESHQ

Q6K8Z4 Formin-like protein 72.9e-0638.02Show/hide
Query:  SDLPAVVSKPL-----PLLPPVAPPPPPPP-PARALQP---------SVVTLQL-PTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQH
        S   +VVS PL     P  PP  PPPPPPP P  +  P         +V  LQ  P P PPPPPPPPM     +   +    PPPPPP    Q A  V  
Subjt:  SDLPAVVSKPL-----PLLPPVAPPPPPPP-PARALQP---------SVVTLQL-PTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQH

Query:  LSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS--VPPPPPPSNITGTMV------RAG-------VPPPPPM------VPSRGSAGPAPP
           PPPPP VP M+ +P       PPPP Q   ++ +    S  +PPPPPP + T ++V      R G        PPPPP+      V S  SA PAPP
Subjt:  LSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVS--VPPPPPPSNITGTMV------RAG-------VPPPPPM------VPSRGSAGPAPP

Query:  PPMAQGNGFA----------PPPPPPGGAVRSLRPKKASTKL
         P  +  G +          PPPPPPG       PK +S  L
Subjt:  PPMAQGNGFA----------PPPPPPGGAVRSLRPKKASTKL

Q7G6K7 Formin-like protein 35.8e-0737.13Show/hide
Query:  DSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQ
        ++Q     Q    SD       P    PP  PPPPPPPP    +P+      P P PPPPPPPP+ Q N       SQPPPPPPPP +          S 
Subjt:  DSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQ

Query:  PPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPS------------NITGTMVRAGVPPPPPMVPSRGSAG---------PAPPPPM
        PPPPPP P +   P  + P  PPPPP L     +V+    PPPPPPS             I         PPPPP   SR   G         P PPPP+
Subjt:  PPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPS------------NITGTMVRAGVPPPPPMVPSRGSAG---------PAPPPPM

Query:  AQGN--------GFAPPPPPPGGAVRSLRPKKASTKL
           N           PPPPPP  A RS  P   +  L
Subjt:  AQGN--------GFAPPPPPPGGAVRSLRPKKASTKL

Q84ZL0 Formin-like protein 53.0e-1139.91Show/hide
Query:  ATAQELPTSDLPAVVSKPLPLLPPVA------PPPPPPPPARALQPSVVTLQLPTPTPPPPP-----------PPPMMQQNAVLAQQLSQPPPPPPPPMM
        AT Q+ P    P  +  P P  PP +      PPPPPPPP  +      T   P P PPPPP           PPP   ++ V A  +S PPPPPPPP+ 
Subjt:  ATAQELPTSDLPAVVSKPLPLLPPVA------PPPPPPPPARALQPSVVTLQLPTPTPPPPP-----------PPPMMQQNAVLAQQLSQPPPPPPPPMM

Query:  QQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPP
          +         PPPPPP P   +  A +    PPPPP LL+        SVPPPPPP  I+     +  PPPPP+  +R +A P PPPP    +  APP
Subjt:  QQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPP

Query:  PPPPGGAVRSLRP
        PPPP    RS  P
Subjt:  PPPPGGAVRSLRP

Q9C6S1 Formin-like protein 143.9e-1144.51Show/hide
Query:  KPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAA-AEPN
        +P+   PP  PPPPPP P+R++ P +   Q P P PPPPPPPP   ++  +    + PPPPPPPP     +T  +  +QPPPPPP P     PAA   P 
Subjt:  KPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAA-AEPN

Query:  APPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPP-PPPMVPSRGSAG-PAPPPPMAQGNGFAPPPPP
         PPPPP        V   S PPPPPP      +  A  PP PPP+ PS    G P PPPP       APPPPP
Subjt:  APPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPP-PPPMVPSRGSAG-PAPPPPMAQGNGFAPPPPP

Arabidopsis top hitse value%identityAlignment
AT1G11070.1 BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1)3.1e-10438.18Show/hide
Query:  SSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNL
        SS+   NF+ M+ELR+KI+  R IIDLP L    SI+ +V+ TM+DL KL PEII   Q  EM+   +++ L  F  ALKSIGDSW+ +HEW  KSKY  
Subjt:  SSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNL

Query:  SSFQENSSFQEIVESVLGIIDCIISMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSP
        S+ ++N S   +VE VL  +D +I   NER +  +   N       +      S  ST + ++ + S    P  P +VL    +   K      +S S+ 
Subjt:  SSFQENSSFQEIVESVLGIIDCIISMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSP

Query:  LLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEA-SPKADRNEEIEVFDVK
        L  ++R+QA+ KL+PIDVK L +   S                                                     KEA S   D ++E       
Subjt:  LLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEA-SPKADRNEEIEVFDVK

Query:  EEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQL
                  SLK  ++E  E                ++ I E+ D+  SQ D         D   V  K   +    APPPPPPP   A+         
Subjt:  EEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQL

Query:  PTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGT
               P  PP+     + A++ S  P PPPP             +  PPPPP+P    +  AA                       PP PPP    GT
Subjt:  PTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGT

Query:  MVRAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRS---ANGRKGGVGSS-NGG
           A +PPPPP+             PMA G G A PPPPP GA   L  KK ++KLKRS  LG L+R LKGK+EG N  +RS     G KG  GS+   G
Subjt:  MVRAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRS---ANGRKGGVGSS-NGG

Query:  KQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVY
        KQGMADALAE+TK+S YFQ+IEEDV+ +  SI  LK  I+ F++ D+ +L  FH ++ESVLE L DE+QVLAR EGFP KKLE +R+AAALY KL+ ++ 
Subjt:  KQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVY

Query:  QLQNWKFVSPVGPLLDRVENYFTK------------------IKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKA
        +L+NWK  SP   L D+ E YF K                  I+ E++ L++ K EE K+F+ + I FDF++L++IKE MVD+SS CMELALKEKRE+K 
Subjt:  QLQNWKFVSPVGPLLDRVENYFTK------------------IKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKA

Query:  AAEKTRKGGQSENSNKAR----SKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIE
        A++ T +  +++ S K +    +K LWRAF FAYRVYTFAGGHD+RAD+LTRELA EIE
Subjt:  AAEKTRKGGQSENSNKAR----SKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIE

AT1G31810.1 Formin Homology 142.7e-1244.51Show/hide
Query:  KPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAA-AEPN
        +P+   PP  PPPPPP P+R++ P +   Q P P PPPPPPPP   ++  +    + PPPPPPPP     +T  +  +QPPPPPP P     PAA   P 
Subjt:  KPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAA-AEPN

Query:  APPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPP-PPPMVPSRGSAG-PAPPPPMAQGNGFAPPPPP
         PPPPP        V   S PPPPPP      +  A  PP PPP+ PS    G P PPPP       APPPPP
Subjt:  APPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPP-PPPMVPSRGSAG-PAPPPPMAQGNGFAPPPPP

AT1G61080.1 Hydroxyproline-rich glycoprotein family protein9.8e-12741.08Show/hide
Query:  KTMKSSSYVFSKTYSKKP---------KLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINE---------LVVGTMEDLQKLYPEI
        K++ S SY+ S      P         K ++  ++  S +   NF+ M+ELR+KI   RDIIDL +L+GS SI +         +V+ TM+DLQK+ PEI
Subjt:  KTMKSSSYVFSKTYSKKP---------KLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINE---------LVVGTMEDLQKLYPEI

Query:  ISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAM--DEFVNSKDS--------S
        I      E++   +++ L +F  ALKSIGDSW+ N EW  KSKY  SS  +N S   +VE VL  +D +I M+ ERFD M  DE    K+S        S
Subjt:  ISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAM--DEFVNSKDS--------S

Query:  FSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVE
         SR  S  +S S + S   + +S C SP TP SVL             +  + +S LLW++RVQA+EKL+PIDVK L +  LS                +
Subjt:  FSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVE

Query:  ESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERT
        E     D+ + SV  +     DE+E  D+K E              EE    D +   LN  +   +K    E T      E KL         + +E+ 
Subjt:  ESMMDVDDKLTSVNTDAADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERT

Query:  DNFDSQADATAQELPTSDLPAVVSKPLPL-----LPPVA------PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQN----------AVLAQQL
        +N            P   L  + +  LPL      PP+A      PPPPPPPP     P+V+ L+   P PPPP PP +M             A    + 
Subjt:  DNFDSQADATAQELPTSDLPAVVSKPLPL-----LPPVA------PPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQN----------AVLAQQL

Query:  SQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQP--------AAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNIT--------GTMVRAGVPP
        S PPPPPPPP+          ++ PPPPPP P+    P         AA P  PPPPP       T      PPPPP  N          G     G PP
Subjt:  SQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQP--------AAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNIT--------GTMVRAGVPP

Query:  PPPMVP-SRGSAGPAPPPPMAQGNGFAPP---------------PPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLR--SANGRKGGV
        PPP +P + G+  P PPPPMA  NG A P               PPPP GA RSLRPKKA+TKLKRS QLGNLYR LKGKVEG + N +  S +GRK G 
Subjt:  PPPMVP-SRGSAGPAPPPPMAQGNGFAPP---------------PPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLR--SANGRKGGV

Query:  GSS-NGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYL
        GS+  GGKQGMADALAE+TK+SAYF QI+ D+ K+  SI  LK  I+ FQ+ DM +LL FH++VESVLENLTDESQVLAR EGFP KKLE +R+A ALY 
Subjt:  GSS-NGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYL

Query:  KLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQS
        KL  ++ +LQN K   P+  LLD+VE YFTK                                 IKE+MVD+SS+CMELALKEKR+ K  +   +    S
Subjt:  KLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQS

Query:  ENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESES
               +KMLWRAFQFA++VYTFAGGHD+RAD LTRELA EI+++S
Subjt:  ENSNKARSKMLWRAFQFAYRVYTFAGGHDERADRLTRELAIEIESES

AT4G04980.1 unknown protein8.3e-11836.68Show/hide
Query:  KTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKD-----------NFVQMMELRKKILILRDIIDLPSLEGSASINELVVGTMEDLQKLYPEIISDIQYS
        K+  S+ +V + T +  P   K +++K ++  K+           NF+ M+ELRK I   RD+IDLPSL+GS S+ E++  TM+DLQKL PEI++  Q  
Subjt:  KTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKD-----------NFVQMMELRKKILILRDIIDLPSLEGSASINELVVGTMEDLQKLYPEIISDIQYS

Query:  EMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMD-------EFVNSKDSSFSRTSSFGKS
        EM+   +++ L +F   L++IGDSW+++ +W  +SKY  S   +N S   +VE VL  +D +I    ERF  MD        F   K  S     SF +S
Subjt:  EMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMD-------EFVNSKDSSFSRTSSFGKS

Query:  SSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEE---------
        +    S SE+N+S   SP TP SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   + T++ I EE         
Subjt:  SSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEE---------

Query:  ---------SMMDVDDKLTSVNTD----------AADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKE
                 S+++ +     + T+            +A D  E  + +  R    E   + D ++ IE F+ +   +    T   + D N+  E  + + 
Subjt:  ---------SMMDVDDKLTSVNTD----------AADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKE

Query:  EKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQP
          +       + +  + ++   S+ D+T                                              +P    PPPPP+              
Subjt:  EKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQP

Query:  PPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPPM
         P  P P +    T     SQPPPPPP P+ KA        APPPPP + K  E                +G   +           + G   P+ P   
Subjt:  PPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPPM

Query:  AQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKG--GVGSSNG---GKQGMADALAEMTKRSAYFQQIEEDVK
              APP PP  G+ RSL  KKA++KL+RS Q+ NLY  LKGK+EG     ++    KG   V   +     + GMADALAEMTKRS+YFQQIEEDV+
Subjt:  AQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKG--GVGSSNG---GKQGMADALAEMTKRSAYFQQIEEDVK

Query:  KHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIK
        K+AKSI  LK SI SFQ+ DM +LL FH +VES+LE LTDE+QVLARFEGFP KKLE +R A ALY KLD I+ +L+NWK   P+  LLD++E YF K K
Subjt:  KHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWKFVSPVGPLLDRVENYFTKIK

Query:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD
        GE++ +ERTKDE++K F+ + I  DF VL+++KE+MVDVSS+CMELALKE+RE    A +  K G+     + R+K LWRAFQFA++VYTFAGGHDERAD
Subjt:  GEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYTFAGGHDERAD

Query:  RLTRELAIEIESESHQ
         LTR+LA EI+++  Q
Subjt:  RLTRELAIEIESESHQ

AT5G07740.1 actin binding2.1e-0432.14Show/hide
Query:  SRTASLK-ADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPV---------------APPPPPPPPARA
        S T SLK   R    + +  K+ K    RA+             S  D++ +E P+S  PA   +  P LP +               +PPPPPPPP   
Subjt:  SRTASLK-ADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPLPLLPPV---------------APPPPPPPPARA

Query:  LQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKV----
              T QLP   PPPPPPPP   +       L  PPPPPPPP           +  PPPPPP+P    +P +     PPP P    V  + + +    
Subjt:  LQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIETVIKV----

Query:  ---------SVPPPPPP--------SNITGTMVRAGVPPPPPMVP----SRGSAGPAPPPPMAQGNGFAPPPPPPGGAVR
                   PPPPPP        S+   T      PPPPP  P     R S    PPPP        PPPPPP  +VR
Subjt:  ---------SVPPPPPP--------SNITGTMVRAGVPPPPPMVP----SRGSAGPAPPPPMAQGNGFAPPPPPPGGAVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCGGTGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAAAAAGGCTTATGGACTCGAGACTCATTCTTCTCTTTCATTTTCCATCTGTCCTTTGGGGAAGTG
GGTTTGCTTAAAAAAGACAATGAAGAGCTCTTCTTACGTGTTCTCCAAGACATATTCTAAGAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTCTTCAAGAGGCA
AAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCATTGATTTGCCTTCTTTAGAAGGCTCTGCTTCTATAAATGAGCTGGTGGTA
GGAACGATGGAAGATCTTCAAAAGCTTTATCCCGAAATCATTTCGGATATCCAATATTCCGAAATGAAGACGACATGTATTGAACAGAGTCTTGCCTATTTCTGCACTGC
ACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAATGGAGGGACAAATCTAAATATAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAAT
CTGTGTTGGGTATTATTGATTGCATCATTAGTATGGCGAACGAAAGGTTTGATGCGATGGACGAATTTGTTAACTCAAAGGACTCTTCTTTTTCAAGAACTAGTTCCTTT
GGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCTCCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAACAGCGAAAGAAA
ATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCCCTTTTGTGGTCTCTTAGAGTTCAAGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTA
GGTTGTCTCATTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGGAATCAATGATGGATGTGGATGACAAGCTCACCTCTGTAAACACTGATGCT
GCTGATGCAATTGACGAAATGGAAGTGTGTGATATTAAAGAAGAGAGGGATTTGAGCAAGGAAGCTAGTCCAAAGGCAGATAGAAATGAGGAAATTGAAGTGTTTGATGT
CAAAGAAGAAAAATTGAATTTGAGCAGGACAGCTAGCCTAAAGGCAGATAGAAATGAAGAAACTGAAGTGATTGATATCAAAGAAGAGAAGTTGTGTTTGAGCAGGGCAA
ATACCCAAAAAGATATTGCTGAGAGAACAGATAATTTTGATTCCCAAGCTGATGCAACTGCTCAAGAATTACCTACATCAGATTTACCAGCTGTTGTATCAAAGCCATTA
CCGCTGCTACCCCCGGTGGCTCCTCCCCCTCCCCCCCCACCGCCTGCCCGAGCTCTGCAACCCTCCGTAGTCACCCTGCAACTGCCAACGCCAACACCACCACCTCCACC
ACCACCACCAATGATGCAACAAAATGCAGTATTAGCTCAACAACTTTCACAGCCACCTCCTCCACCACCACCACCAATGATGCAACAAAATGCAACATTTGTCCAACATC
TTTCACAGCCACCTCCTCCACCACCAGTGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGCCTAATGCGCCTCCTCCACCTCCACAATTGTTAAAGGTAATCGAAACA
GTGATCAAAGTCAGCGTACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCGCCACCTCCACCGATGGTGCCCTCGAGAGGGAGTGC
AGGTCCAGCACCCCCTCCTCCAATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCCGTACGGTCCTTGCGCCCCAAGAAAGCCTCTACCAAAC
TAAAAAGATCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATCCAATCAAAATCTTAGATCAGCTAACGGTAGGAAAGGTGGTGTCGGGAGC
AGTAACGGAGGAAAACAAGGAATGGCTGATGCTTTGGCAGAGATGACAAAAAGATCAGCATACTTTCAGCAAATTGAAGAAGATGTTAAAAAACACGCAAAATCGATCAC
GGCGCTTAAACCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACCTGCTCCTTTTCCACAAGCAAGTGGAATCTGTACTAGAGAATTTAACTGATGAATCACAGG
TACTAGCAAGGTTTGAAGGATTTCCCATAAAAAAGTTGGAAACTTTGAGAATTGCAGCAGCATTATATCTAAAGTTAGATGCAATTGTCTATCAACTACAGAATTGGAAG
TTTGTTTCTCCTGTGGGACCGCTTCTCGACCGAGTTGAAAATTACTTCACTAAGATCAAAGGAGAAGTGGATGCACTTGAACGAACCAAGGATGAAGAATCAAAGAGATT
CCGGGGTCACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGGAATCAATGGTGGATGTTTCTTCCAGCTGTATGGAGTTGGCTTTGAAGGAAAAAAGAGAGT
CGAAGGCAGCAGCAGAGAAGACACGAAAAGGAGGGCAATCTGAAAATTCGAACAAGGCACGTTCAAAGATGCTATGGAGGGCATTCCAATTCGCATACCGAGTTTACACC
TTCGCCGGTGGACACGACGAGCGTGCTGATAGACTGACCAGAGAATTGGCTATAGAAATAGAGAGTGAATCTCATCAACTATGA
mRNA sequenceShow/hide mRNA sequence
GTTCCAATGCAGTAAATTCCATTTGGTTCTTTTTCTTTTTTTCTTTTTTCCCTTTTTTCCTTCTTCTTTAGGATCTTCCATCAATCCTAACCCAAAAAGCTCCATCCTTT
CAGTTCCATTTTGAATCTTCCCCTTTTGTGTTGTGCTTCAAAAAAAAAAAAAAAAAAGAAAAAACAAAAAGCCTTAGAATATATATCTTTTTGTTGTTTGTTGGATCTGT
ATATATAAAATTTTATGTAAATAAGTTTTGTGAATGGTTGGTTTTCATTTTCCCCTTGAGAAACCTTCAACACAGCTGACCAGAGGAATAAAAAAAACAAACTTTTTCTG
TAAAAAGATTGAAAAATTTCCCTTCACCCCACTATTTCATCTGCGTTAGTTTAAAATATATATATTTTTTTTCTTTCTGTGCCATTCAAGTCTTTTTAAAGGCTTGTTTT
GGTCACATTTCATGGCTACCGGTGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAAAAAGGCTTATGGACTCGAGACTCATTCTTCTCTTTCATTTTCCATCTGTCCT
TTGGGGAAGTGGGTTTGCTTAAAAAAGACAATGAAGAGCTCTTCTTACGTGTTCTCCAAGACATATTCTAAGAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTC
TTCAAGAGGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCATTGATTTGCCTTCTTTAGAAGGCTCTGCTTCTATAAATG
AGCTGGTGGTAGGAACGATGGAAGATCTTCAAAAGCTTTATCCCGAAATCATTTCGGATATCCAATATTCCGAAATGAAGACGACATGTATTGAACAGAGTCTTGCCTAT
TTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAATGGAGGGACAAATCTAAATATAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGA
AATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCATTAGTATGGCGAACGAAAGGTTTGATGCGATGGACGAATTTGTTAACTCAAAGGACTCTTCTTTTTCAAGAA
CTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCTCCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAAC
AGCGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCCCTTTTGTGGTCTCTTAGAGTTCAAGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCT
TTTGCTTCCTAGGTTGTCTCATTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGGAATCAATGATGGATGTGGATGACAAGCTCACCTCTGTAA
ACACTGATGCTGCTGATGCAATTGACGAAATGGAAGTGTGTGATATTAAAGAAGAGAGGGATTTGAGCAAGGAAGCTAGTCCAAAGGCAGATAGAAATGAGGAAATTGAA
GTGTTTGATGTCAAAGAAGAAAAATTGAATTTGAGCAGGACAGCTAGCCTAAAGGCAGATAGAAATGAAGAAACTGAAGTGATTGATATCAAAGAAGAGAAGTTGTGTTT
GAGCAGGGCAAATACCCAAAAAGATATTGCTGAGAGAACAGATAATTTTGATTCCCAAGCTGATGCAACTGCTCAAGAATTACCTACATCAGATTTACCAGCTGTTGTAT
CAAAGCCATTACCGCTGCTACCCCCGGTGGCTCCTCCCCCTCCCCCCCCACCGCCTGCCCGAGCTCTGCAACCCTCCGTAGTCACCCTGCAACTGCCAACGCCAACACCA
CCACCTCCACCACCACCACCAATGATGCAACAAAATGCAGTATTAGCTCAACAACTTTCACAGCCACCTCCTCCACCACCACCACCAATGATGCAACAAAATGCAACATT
TGTCCAACATCTTTCACAGCCACCTCCTCCACCACCAGTGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGCCTAATGCGCCTCCTCCACCTCCACAATTGTTAAAGG
TAATCGAAACAGTGATCAAAGTCAGCGTACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCGCCACCTCCACCGATGGTGCCCTCG
AGAGGGAGTGCAGGTCCAGCACCCCCTCCTCCAATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCCGTACGGTCCTTGCGCCCCAAGAAAGC
CTCTACCAAACTAAAAAGATCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATCCAATCAAAATCTTAGATCAGCTAACGGTAGGAAAGGTG
GTGTCGGGAGCAGTAACGGAGGAAAACAAGGAATGGCTGATGCTTTGGCAGAGATGACAAAAAGATCAGCATACTTTCAGCAAATTGAAGAAGATGTTAAAAAACACGCA
AAATCGATCACGGCGCTTAAACCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACCTGCTCCTTTTCCACAAGCAAGTGGAATCTGTACTAGAGAATTTAACTGA
TGAATCACAGGTACTAGCAAGGTTTGAAGGATTTCCCATAAAAAAGTTGGAAACTTTGAGAATTGCAGCAGCATTATATCTAAAGTTAGATGCAATTGTCTATCAACTAC
AGAATTGGAAGTTTGTTTCTCCTGTGGGACCGCTTCTCGACCGAGTTGAAAATTACTTCACTAAGATCAAAGGAGAAGTGGATGCACTTGAACGAACCAAGGATGAAGAA
TCAAAGAGATTCCGGGGTCACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGGAATCAATGGTGGATGTTTCTTCCAGCTGTATGGAGTTGGCTTTGAAGGA
AAAAAGAGAGTCGAAGGCAGCAGCAGAGAAGACACGAAAAGGAGGGCAATCTGAAAATTCGAACAAGGCACGTTCAAAGATGCTATGGAGGGCATTCCAATTCGCATACC
GAGTTTACACCTTCGCCGGTGGACACGACGAGCGTGCTGATAGACTGACCAGAGAATTGGCTATAGAAATAGAGAGTGAATCTCATCAACTATGA
Protein sequenceShow/hide protein sequence
MATGGWCGLGPLLFRKKAYGLETHSSLSFSICPLGKWVCLKKTMKSSSYVFSKTYSKKPKLSKGARSKKSSRGKDNFVQMMELRKKILILRDIIDLPSLEGSASINELVV
GTMEDLQKLYPEIISDIQYSEMKTTCIEQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIISMANERFDAMDEFVNSKDSSFSRTSSF
GKSSSSTDSCSETNSSCCSSPETPTSVLANFRNSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESMMDVDDKLTSVNTDA
ADAIDEMEVCDIKEERDLSKEASPKADRNEEIEVFDVKEEKLNLSRTASLKADRNEETEVIDIKEEKLCLSRANTQKDIAERTDNFDSQADATAQELPTSDLPAVVSKPL
PLLPPVAPPPPPPPPARALQPSVVTLQLPTPTPPPPPPPPMMQQNAVLAQQLSQPPPPPPPPMMQQNATFVQHLSQPPPPPPVPQMKAQPAAAEPNAPPPPPQLLKVIET
VIKVSVPPPPPPSNITGTMVRAGVPPPPPMVPSRGSAGPAPPPPMAQGNGFAPPPPPPGGAVRSLRPKKASTKLKRSHQLGNLYRTLKGKVEGSNQNLRSANGRKGGVGS
SNGGKQGMADALAEMTKRSAYFQQIEEDVKKHAKSITALKPSISSFQSSDMNDLLLFHKQVESVLENLTDESQVLARFEGFPIKKLETLRIAAALYLKLDAIVYQLQNWK
FVSPVGPLLDRVENYFTKIKGEVDALERTKDEESKRFRGHGIQFDFSVLIRIKESMVDVSSSCMELALKEKRESKAAAEKTRKGGQSENSNKARSKMLWRAFQFAYRVYT
FAGGHDERADRLTRELAIEIESESHQL