CuGenDBv2

Sed0005077 (gene) of Chayote v1 genome

Gene ID: Sed0005077
Organism: Sechium edule (Chayote v1)
Description: Zinc finger family protein, putative isoform 1
Genome location: LG03:15447579..15453307
RNA-Seq Expression: Sed0005077
Synteny: Sed0005077
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004175 - endopeptidase activity (molecular function)
  GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domains: NA
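
The genome location in this record follows a `seqid:start..end` pattern. The following is a minimal sketch of how such a string could be split and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which the page does not state; the helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG03:15447579..15453307")
print(seqid, start, end, span_length(start, end))
# prints: LG03 15447579 15453307 5729
```

Under that assumption the gene spans 5,729 bp on LG03. If the coordinates were 0-based or half-open, the length would differ by one.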


Homology
GenBank top hits | e value | % identity
KAG6595439.1 hypothetical protein SDJN03_11992, partial [Cucurbita argyrosperma subsp. sororia] | 4.8e-226 | 83.14
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       H+SPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIV  PSPSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

XP_022925200.1 uncharacterized protein LOC111432513 isoform X1 [Cucurbita moschata] | 1.4e-228 | 83.73
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIVPSPSPSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

XP_022925201.1 uncharacterized protein LOC111432513 isoform X2 [Cucurbita moschata] | 2.8e-226 | 83.53
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRG DIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIVPSPSPSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

XP_022925202.1 uncharacterized protein LOC111432513 isoform X3 [Cucurbita moschata] | 5.4e-225 | 82.94
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIV    PSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida] | 3.1e-225 | 83.95
Query:  MGKTDGEQPPPSA---AASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNI
        MGK DGEQP PSA     SG+V  GRCC GC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATF+VERPVSLLEDNI
Subjt:  MGKTDGEQPPPSA---AASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNI

Query:  EQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAF
        EQLR DIF EF IPSIKVDILSLESL GSNRTKVVF +DPD D+SEISS +LSLIRST+  +V NQ  LRITKSMFG+AF FEVLKFPGGITIIPPQSAF
Subjt:  EQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILY+KLWNAEGSTVTAPTIVQSSVLLEVGN PSMRRLKQLAQTISGSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPE-SAASPKKSYTAKPPGCQY-KR
        GKVKQVRLSSILKHSLNGS+GNGP  RSPSPAP PQPHN    PP HHHHHHT LTPAISPAPA E GAPEYGSPAPE S ASPK+SYTAKPPGCQY KR
Subjt:  GKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPE-SAASPKKSYTAKPPGCQY-KR

Query:  KSGRKEGKQSHLTPVASPNISPIHSPASPP--PRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPR
        KSGRKEGKQSHLTP+ASPN+SP HS ASP   P+HKV PPAA + P PALTPLPNVIYAHVQPPSKS+S+HP KSTTN      PS +PSPSPSGA R  
Subjt:  KSGRKEGKQSHLTPVASPNISPIHSPASPP--PRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPR

Query:  TITQWGFIPFLIIACIM
         ITQWGF  FLI+AC M
Subjt:  TITQWGFIPFLIIACIM

TrEMBL top hits | e value | % identity
A0A0A0KYS3 Uncharacterized protein | 1.7e-224 | 83.14
Query:  MGKTDGEQPPPSA---AASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNI
        MGK DGEQP PSA     SG V  GRCC GC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATF+VER VSLLEDN 
Subjt:  MGKTDGEQPPPSA---AASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNI

Query:  EQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAF
        +QLR DIF EF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEISS +LSLIRS +  +V NQ  L ITKS FG+A+ FEVLKFPGGITIIPPQSAF
Subjt:  EQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGSTVT PTIVQ+SVLLEVGN PSMRRLKQLAQTISGSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPE-SAASPKKSYTAKPPGCQ--YK
        GKVKQVRLSSILKHSLNGSDGNGPV RSPSPAPTPQPHN HHPP HHHHHHHTPLTPAISPAPA E GAPEYGSPAPE +AASPK+SYTAKPPGCQ  YK
Subjt:  GKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPE-SAASPKKSYTAKPPGCQ--YK

Query:  RKSGRKEGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRT
        RKSGRKEGKQSHLTP+ASPNISP HS ASP P+H++ PPAA VSP PALTPLPNVIYAHVQPPSKSDS+HP     NPSI        +PSPSGA R   
Subjt:  RKSGRKEGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRT

Query:  ITQWGFIPFLIIACIM
        ITQWGF  FLI+AC M
Subjt:  ITQWGFIPFLIIACIM

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X3 | 2.6e-225 | 82.94
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIV    PSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

A0A6J1EBJ8 uncharacterized protein LOC111432513 isoform X2 | 1.4e-226 | 83.53
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRG DIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIVPSPSPSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X4 | 4.9e-224 | 82.55
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIV      PSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

A0A6J1EH92 uncharacterized protein LOC111432513 isoform X1 | 6.6e-229 | 83.73
Query:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL
        MGK DGE PPPSA  S     GRCCSGC SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VERPVSLL+DNIE+L
Subjt:  MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQL

Query:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
        R DIF EF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EI S +LSLIRST A +V NQS LRITKSMFG+AF FEVLKFPGGITIIPPQSAFLLQ
Subjt:  RADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
        KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGN PSM+RLKQLAQTIS SNSSNLGLNNTEFGKV
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK
        KQVRLSSILKHSLNG DG GP+ RSPSPAPTPQPHNFHHPP HHHHHHH PLTP ISPAPA E GAPEYG  AP+SAASPK+SY AKPPGCQYKRKSGRK
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRK

Query:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF
        EGKQ HL+P+ASP+ISP+HS ASP  +H       HVSPT A TPLP+VIYAHVQPPSKSDS+HP KSTT+PSIVPSPSPSPSPSPS A+    IT+WGF
Subjt:  EGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGF

Query:  IPFLIIACIM
           LI+A  M
Subjt:  IPFLIIACIM

SwissProt top hits | e value | % identity
P13983 Extensin | 2.3e-05 | 32.26
Query:  PVRRSPSPAP-----TPQPHNFHHPPPHHHH----HHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRKEGKQSHLTPVA
        P    P+P+P      PQP    H PP H H    H  +PL   + P+P  +   P Y  P P  A SP+ S T  PP   Y           S   P  
Subjt:  PVRRSPSPAP-----TPQPHNFHHPPPHHHH----HHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRKEGKQSHLTPVA

Query:  SPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQ
        SP+  P  +P   PP     PP  +  P P   PLP+       PP  S    P  S   P+ +P P PS  P PS +  P T  Q
Subjt:  SPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQ

Arabidopsis top hits | e value | % identity
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2) | 4.8e-38 | 36.14
Query:  DGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLG---LNPSYRGHDIVATFDVERPVSLLEDNIEQLR
        D E P  S  +S     GR CS   S  RL+G RC+ +L+LS A+ +SA+FWL P    ++ K  G   LN S     + A+F +++PVS +  +  ++ 
Subjt:  DGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLG---LNPSYRGHDIVATFDVERPVSLLEDNIEQLR

Query:  ADIFGEF-LIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ
         DI     L  + KV +LSL     SN T V F V P   D EIS   LSL+RS+   + A +S L++T S FG    F+VLKFPGGIT+ P + A +  
Subjt:  ADIFGEF-LIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQ

Query:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
           +LF+ T+  SI  +Q     L    +  L L PYE ++ +L N +GST++ P   Q  V   +      +RL    Q I  S + NLGL+   FG+V
Subjt:  KVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

Query:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTP
        K +  S+ L   +  SD         +PAPTP
Subjt:  KQVRLSSILKHSLNGSDGNGPVRRSPSPAPTP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein | 9.5e-95 | 45.7
Query:  MGKTDGEQPPPSAAASGEVPGGRC-----CSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLED
        MGKT  E       A GE  G        C  C  I   +GF+C+F+LLLSVALF+SA+F L PF    D++D  L+P +RGH IVA+F + R  S L +
Subjt:  MGKTDGEQPPPSAAASGEVPGGRC-----CSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLED

Query:  NIEQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQS
        N  QL+ DIF E    SIKV IL++E     N TKVVFG+DPD    EI    LS I+     ++ NQS+L++TKS+FG+ FLFEVLKFPGGIT+IPPQS
Subjt:  NIEQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQS

Query:  AFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNT
        AF LQK +I+FNFTLN+SIHQIQ++F+ L SQL  GL LAPYE LY+ L N+EGSTV+ PT V SSVLL VG   S  RLKQL  TI+GS S NLGLNNT
Subjt:  AFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNT

Query:  EFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQP----------HNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYT
         FGKVKQVRLSS L +S + S       +SPSP+P+P            H+ HH   +HHHHHH  L+P ++P            SPAP    S K++ +
Subjt:  EFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQP----------HNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYT

Query:  AKPPGCQYKRKSGRKEGKQSHLTPVASPNI-SPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSP
        A PP     R   +++  Q   TP  +P+  +P H   SP P   +    +H+ P  A  PLP+V++AH   P  ++   P     + + V  P P    
Subjt:  AKPPGCQYKRKSGRKEGKQSHLTPVASPNI-SPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSP

Query:  SPSGAYRPRTITQWGFIPFLIIA
        S S A        W  +  LI+A
Subjt:  SPSGAYRPRTITQWGFIPFLIIA

AT3G56590.1 hydroxyproline-rich glycoprotein family protein | 3.6e-102 | 47.28
Query:  MGKTDGEQ---PPPSAAASGEVPGG---RCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLE
        MGK   E+   P    AAS    GG     C  C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DL L+P ++ H IVA+FDV +P+S +E
Subjt:  MGKTDGEQ---PPPSAAASGEVPGG---RCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLE

Query:  DNIEQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQ
        DN+ QL  DI  E   P  KV +L+LE L   NRT V+F +DP+ ++S+I +   SLI++    +V  Q S R+T+S+FG+ F FEVLKFPGGIT+IPPQ
Subjt:  DNIEQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQ

Query:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNN
          F LQK Q+LFNFTLNFSI+QIQ +F EL SQL  G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+
Subjt:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNN

Query:  TEFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYK
        T FGKVKQVRLSSIL HS   S        +PSP+P P+ H + H  PHHHHHHH      ++P P+L   +P     AP SA +       + P C Y+
Subjt:  TEFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYK

Query:  RKSGRKEGKQSHLT--PVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGA
        ++  +     +H T  P  +P+ S  H PA  P      PP  H    P  +PLP+V++AH+ PPSKS         + P+   SPSP+P+P  S +
Subjt:  RKSGRKEGKQSHLT--PVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGA

AT3G56590.2 hydroxyproline-rich glycoprotein family protein | 1.6e-102 | 47.67
Query:  MGKTDGEQ---PPPSAAASGEVPGG---RCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLE
        MGK   E+   P    AAS    GG     C  C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DL L+P ++ H IVA+FDV +P+S +E
Subjt:  MGKTDGEQ---PPPSAAASGEVPGG---RCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLE

Query:  DNIEQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQ
        DN+ QL  DI  E   P  KV +L+LE L   NRT V+F +DP+ ++S+I +   SLI++    +V  Q S R+T+S+FG+ F FEVLKFPGGIT+IPPQ
Subjt:  DNIEQLRADIFGEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQ

Query:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNN
          F LQK Q+LFNFTLNFSI+QIQ +F EL SQL  G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+
Subjt:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNN

Query:  TEFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYK
        T FGKVKQVRLSSIL HS   S        +PSP+P P+ H + H  PHHHHHHH      ++P P+L   +P     AP SA +       + P C Y+
Subjt:  TEFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAPTPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYK

Query:  RKSGRKEGKQSHLT--PVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPS
        ++  +     +H T  P  +P+ S  H PA  P      PP  H    P  +PLP+V++AH+ PPSKS         + P+   SPSP+P+PS
Subjt:  RKSGRKEGKQSHLT--PVASPNISPIHSPASPPPRHKVYPPAAHVSPTPALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPS
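
In the alignments above, the unlabeled middle row repeats a residue where query and subject are identical and shows `+` for a conservative substitution. The following is a minimal sketch of how identities and positives could be tallied from one alignment block, using only the query row and the middle row with the `Query:  ` prefix removed. The function name `midline_stats` is hypothetical, and the counts cover a single block, so they will not reproduce the whole-alignment % identity values listed for each hit.

```python
def midline_stats(query, midline):
    """Return (identities, positives, columns) for one alignment block.

    `query` and `midline` are the residue strings only, without the
    'Query:  ' label or its matching leading spaces.
    """
    # Trailing spaces are often stripped from text dumps; pad to full width.
    midline = midline.ljust(len(query))
    # An identity column repeats the query residue in the middle row.
    identities = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the first GenBank hit on this page.
print(midline_stats("IPFLIIACIM", "   LI+A  M"))
# prints: (4, 5, 10)
```

Summing the three numbers over every block of a hit and dividing identities by columns would give a percent identity in the usual BLAST sense, assuming gap columns are counted in the alignment length.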


Sequences
CDS sequence
ATGGGGAAAACCGACGGAGAACAGCCGCCGCCGTCCGCCGCCGCCTCCGGCGAGGTTCCCGGTGGCCGATGCTGTTCTGGGTGTGGTTCGATTCGGAGGCTCATTGGGTT
CAGATGCATCTTCATTCTCTTATTGTCCGTTGCCCTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCCTTTTCTCCATTATGCAGATCAGAAGGATCTGGGTCTAAATCCAT
CGTATCGAGGTCATGATATAGTAGCAACGTTTGATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCGGGCTGACATTTTTGGAGAGTTCCTTATT
CCTTCTATCAAAGTGGATATTCTATCTCTAGAATCATTATCAGGATCAAACCGGACAAAAGTTGTGTTTGGTGTCGATCCAGATGCTGATGATTCCGAGATTTCGTCACC
TTTTCTAAGTTTAATCAGGTCGACCGTTGCAGATATAGTAGCGAATCAGTCGTCCCTCCGCATTACTAAATCCATGTTTGGGGATGCCTTTTTGTTTGAAGTGCTGAAAT
TCCCTGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTACAT
TTCAGTGAACTGACAAGCCAACTGGACGCCGGTTTACGACTAGCTCCATATGAGATTTTGTACATTAAACTGTGGAATGCGGAAGGTTCGACTGTGACTGCCCCAACAAT
TGTCCAGTCATCTGTTCTTCTGGAAGTTGGAAATCCTCCATCGATGCGACGACTGAAGCAGCTAGCTCAGACAATCTCTGGCTCTAATTCTAGCAACCTCGGCCTGAATA
ATACCGAGTTTGGAAAAGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAGCACTCCCTCAATGGCAGTGACGGCAACGGTCCCGTAAGGAGGTCTCCTTCTCCTGCTCCT
ACACCCCAGCCCCATAACTTCCATCACCCCCCGCCTCACCACCACCACCACCATCACACTCCTCTAACACCTGCAATTTCACCTGCCCCTGCATTAGAGATGGGTGCACC
AGAATATGGTTCGCCTGCCCCCGAAAGTGCTGCATCGCCTAAGAAAAGTTACACAGCTAAGCCACCTGGTTGTCAATATAAGAGGAAGTCTGGTAGGAAAGAGGGAAAAC
AATCACATTTAACCCCGGTTGCTTCACCCAACATTTCTCCTATTCATTCTCCCGCATCGCCACCACCACGACATAAAGTATACCCACCTGCAGCACACGTCTCTCCAACT
CCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCGCCATCGAAAAGCGACTCCAGCCACCCTGGAAAATCCACAACAAATCCATCCATTGTGCCATC
TCCATCACCATCACCATCACCATCACCATCTGGTGCCTATCGTCCCCGTACAATTACTCAATGGGGATTCATACCGTTTCTAATTATCGCATGCATCATGTAA
mRNA sequence
AGAAATTTCAATCGCCAAGCAATTCTGTGACGGCAATAATGGTCGCACCCAACTCATCCCCAGCGCAGCAGAGCCGTAATGGCCCTTGACCCACTTCGCCGGCATTGTTT
CCGATGGGGAAAACCGACGGAGAACAGCCGCCGCCGTCCGCCGCCGCCTCCGGCGAGGTTCCCGGTGGCCGATGCTGTTCTGGGTGTGGTTCGATTCGGAGGCTCATTGG
GTTCAGATGCATCTTCATTCTCTTATTGTCCGTTGCCCTGTTCGTTTCTGCTGTTTTTTGGTTGCCCCCTTTTCTCCATTATGCAGATCAGAAGGATCTGGGTCTAAATC
CATCGTATCGAGGTCATGATATAGTAGCAACGTTTGATGTTGAGAGACCGGTTTCTTTGCTGGAAGACAATATCGAGCAACTCCGGGCTGACATTTTTGGAGAGTTCCTT
ATTCCTTCTATCAAAGTGGATATTCTATCTCTAGAATCATTATCAGGATCAAACCGGACAAAAGTTGTGTTTGGTGTCGATCCAGATGCTGATGATTCCGAGATTTCGTC
ACCTTTTCTAAGTTTAATCAGGTCGACCGTTGCAGATATAGTAGCGAATCAGTCGTCCCTCCGCATTACTAAATCCATGTTTGGGGATGCCTTTTTGTTTGAAGTGCTGA
AATTCCCTGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTA
CATTTCAGTGAACTGACAAGCCAACTGGACGCCGGTTTACGACTAGCTCCATATGAGATTTTGTACATTAAACTGTGGAATGCGGAAGGTTCGACTGTGACTGCCCCAAC
AATTGTCCAGTCATCTGTTCTTCTGGAAGTTGGAAATCCTCCATCGATGCGACGACTGAAGCAGCTAGCTCAGACAATCTCTGGCTCTAATTCTAGCAACCTCGGCCTGA
ATAATACCGAGTTTGGAAAAGTGAAGCAAGTTCGCCTTTCGTCGATTCTTAAGCACTCCCTCAATGGCAGTGACGGCAACGGTCCCGTAAGGAGGTCTCCTTCTCCTGCT
CCTACACCCCAGCCCCATAACTTCCATCACCCCCCGCCTCACCACCACCACCACCATCACACTCCTCTAACACCTGCAATTTCACCTGCCCCTGCATTAGAGATGGGTGC
ACCAGAATATGGTTCGCCTGCCCCCGAAAGTGCTGCATCGCCTAAGAAAAGTTACACAGCTAAGCCACCTGGTTGTCAATATAAGAGGAAGTCTGGTAGGAAAGAGGGAA
AACAATCACATTTAACCCCGGTTGCTTCACCCAACATTTCTCCTATTCATTCTCCCGCATCGCCACCACCACGACATAAAGTATACCCACCTGCAGCACACGTCTCTCCA
ACTCCGGCATTAACTCCATTGCCAAACGTCATTTACGCTCATGTTCAACCGCCATCGAAAAGCGACTCCAGCCACCCTGGAAAATCCACAACAAATCCATCCATTGTGCC
ATCTCCATCACCATCACCATCACCATCACCATCTGGTGCCTATCGTCCCCGTACAATTACTCAATGGGGATTCATACCGTTTCTAATTATCGCATGCATCATGTAACATT
CAGAAAGAAGACTACTGGTTTTCTGATGAACGTATGTCGATGATGTCGAGAGATGCCGGAGTTCTTATAAAATGACAAGTGCAGGAGTTTTTTAGAAGTGTGAGCAGAGG
AAAGCAAAGCAAAGCAAAAGAAGCATTGCTTGTTGTAAGATGGTTTTCTAAGTGTGTAAATATCATCTGATTAAGAAACTTGTTGCAGATGCAGTTTCAGGTCAAAGTCC
ACAGGGGTGGCAGGCCTTCAGAAACTTGCATATTTTTCCACTGTTTTTGTGTATTATCATCTTCTTCTCCATAAAATGTAAGGAGATAGAGAAGGAAGAAAAAAAAAATG
AAAACAGAGCAAATGCACAACTTTTTTCTTTTCCATTGTGTTTTCCCTTCACTTTCCCCCCCTCAAACTCTCTTAAATTTTTTGTACCCATAATGGCCTTATTTTGTATT
GCTCAAAGTTGAAGTTTGTATGTATATATATAAACAAACACATGACACAACTTAAGCCCCACTTGAATGTAAAAACTTGTAGTTTTTTGTGTTTGTTTAAAGTTTATGTA
TTTACATGGTCGAAGG
Protein sequence
MGKTDGEQPPPSAAASGEVPGGRCCSGCGSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERPVSLLEDNIEQLRADIFGEFLI
PSIKVDILSLESLSGSNRTKVVFGVDPDADDSEISSPFLSLIRSTVADIVANQSSLRITKSMFGDAFLFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVH
FSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNPPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGSDGNGPVRRSPSPAP
TPQPHNFHHPPPHHHHHHHTPLTPAISPAPALEMGAPEYGSPAPESAASPKKSYTAKPPGCQYKRKSGRKEGKQSHLTPVASPNISPIHSPASPPPRHKVYPPAAHVSPT
PALTPLPNVIYAHVQPPSKSDSSHPGKSTTNPSIVPSPSPSPSPSPSGAYRPRTITQWGFIPFLIIACIM
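
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene (the page does not state the translation table) and, to stay short, translates only the first 24 and last 9 bases of the CDS as copied from this record. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGGGAAAACCGACGGAGAACAG"  # first 24 bases of the CDS
cds_end = "ATCATGTAA"                   # last 9 bases of the CDS
print(translate(cds_start))  # prints: MGKTDGEQ
print(translate(cds_end))    # prints: IM
```

Both fragments agree with the protein sequence, which begins MGKTDGEQ and ends IM. Passing the full CDS to the same function would allow a complete comparison.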