CuGenDBv2

CmoCh09G008110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G008110
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Bulb-type lectin domain-containing protein
Genome location: Cmo_Chr09:4171305..4172663
RNA-Seq expression: CmoCh09G008110
Synteny: CmoCh09G008110
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR001480 - Bulb-type lectin domain
                  IPR035446 - S-locus-specific glycoprotein/EP1
                  IPR036426 - Bulb-type lectin domain superfamily
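
The genome location field can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function names `parse_location` and `span_length` are hypothetical and chosen only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr09:4171305..4172663")
print(seqid, span_length(start, end))  # Cmo_Chr09 1359
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention is worth confirming before relying on the number.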


Homology
GenBank top hits (E-value, % identity, alignment)
KAG6591915.1 EP1-like glycoprotein 2, partial [Cucurbita argyrosperma subsp. sororia]; E-value: 8.3e-270; identity: 99.12%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT FHLLLPPLCFLLCTVLLAAIAT+AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_022937366.1 EP1-like glycoprotein 2 [Cucurbita moschata]; E-value: 2.3e-272; identity: 100%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_022976498.1 EP1-like glycoprotein 2 [Cucurbita maxima]; E-value: 1.7e-262; identity: 97.12%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT FHLLL PLCFL+ TVLLAAIAT+AQVPANATFHF+NQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGR KLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPI S  ALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYY KVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_023535213.1 EP1-like glycoprotein 2 [Cucurbita pepo subsp. pepo]; E-value: 1.9e-266; identity: 98.23%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT FHLLLPPLCFLL TVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIG RNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_038896945.1 EP1-like glycoprotein 2 [Benincasa hispida]; E-value: 2.3e-248; identity: 92.2%
Query:  LLLPP--LCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDAN
        L LPP   CFLL T+LLAA+ATEAQVPAN TFHF+NQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIP DESLMRWVWDAN
Subjt:  LLLPP--LCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDAN

Query:  RNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYS
        RNDPVRENATLTFGRDGNFVLADVDGR+VWQTNTKNRGVTGIKMLPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQS+RIGGRNKLISRKSEIDGSDGPYS
Subjt:  RNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYS

Query:  LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNLKAFT
        LVLDRTGLTMFLSH GQLLTYGGWP TD  +RVTF+ EPEN+NATAYELLLL+N+DTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRL HDGNLKAFT
Subjt:  LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNLKAFT

Query:  YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCS-GGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVG
        YYD  SYLKWEESFAFFSSYFIRECALPSKCGAYGYC+RGMCVACPSPKGLLGWSESCAPPKTPPCS GGKGK+GYYKIVGVEHFLNPYK+DGEGPIKVG
Subjt:  YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCS-GGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVG

Query:  DCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        DCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKD+NSSSVGYIKYS+
Subjt:  DCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0L3A7 Bulb-type lectin domain-containing protein; E-value: 2.9e-236; identity: 88.39%
Query:  HLL-LPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDAN
        HLL LP LCF L T+L AAIAT+AQVPAN TFHF+NQGEFGDRIIEYDASYRVIRN+VYTFYTFPFRLCFYNTTPDSFIFAIRAGIP DESLMRWVWDAN
Subjt:  HLL-LPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDAN

Query:  RNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYS
        RNDPVRENATLTFG DGNFVLADVDGR+VWQTNTKN+GVTGIKMLPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQS+RIGGRNKLISRKSEIDGSDGPYS
Subjt:  RNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYS

Query:  LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNLKAFT
        L+L RTGLTMFL++ GQ LTYGGW  TD  S VTF  EPEN+NATAYELLL +N+DT RRRLLQVRPI SGGALNLNKLNYNATYSFLRL  DGNL+AFT
Subjt:  LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNLKAFT

Query:  YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGD
        YYD  SYLKWEESFAFFSSYFIREC LPSKCGAYGYC+RGMCV CPSPKGLLGWSE CAPPKTP C GGK KFGYYKIVGVEHFLNPYK DGEGP+KVGD
Subjt:  YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGD

Query:  CRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        CRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKD+NSSSVGYIKYS+
Subjt:  CRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

A0A438E5D3 EP1-like glycoprotein 2; E-value: 7.7e-205; identity: 74.67%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT+F  +     F+L  ++L   A  A VPAN TF FVNQGEFGDRIIEYDASYRVIRN VYTF+TFPFRLCFYNTTPD++IFAIRAG+P DESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRN+P  EN+TLTFGRDGNFVLA+ DGRVVWQTNT N+GVTGIK+LPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQ +RI GRNKL+SR SE+DGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIGSGGALNLNKLNYNATYSFLR
        G YSLV D+ GLTM++++ G+LL YGGWPG D G+ V+F A PENDNATA+EL+L   ++T        RRRLLQVRPI SGG  NLNKLNYNATYSFLR
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIGSGGALNLNKLNYNATYSFLR

Query:  LSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYK
        LSHDGNL+A+TYYD+VSYLKW+E+FAFFSSYFIRECALPSKCG++G CN+GMCVACPSPKGLLGWSESCAPP+ PPC GG  K  YYKI+GVE+FLNPY 
Subjt:  LSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYK

Query:  EDGEGPIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS
        +DG+GP+KV +CR +C RDCKC GFIYKE +SKCL  PLL TLIKD N++SVGYIKYS
Subjt:  EDGEGPIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS

A0A6J1FA56 EP1-like glycoprotein 2; E-value: 1.1e-272; identity: 100%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

A0A6J1IMC1 EP1-like glycoprotein 2; E-value: 8.1e-263; identity: 97.12%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT FHLLL PLCFL+ TVLLAAIAT+AQVPANATFHF+NQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGR KLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPI S  ALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYY KVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

F6H2N4 Uncharacterized protein; E-value: 7.7e-205; identity: 74.67%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT+F  +     F+L  ++L   A  A VPAN TF FVNQGEFGDRIIEYDASYRVIRN VYTF+TFPFRLCFYNTTPD++IFAIRAG+P DESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRN+P  EN+TLTFGRDGNFVLA+ DGRVVWQTNT N+GVTGIK+LPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQ +RI GRNKL+SR SE+DGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIGSGGALNLNKLNYNATYSFLR
        G YSLV D+ GLTM++++ G+LL YGGWPG D G+ V+F A PENDNATA+EL+L   ++T        RRRLLQVRPI SGG  NLNKLNYNATYSFLR
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIGSGGALNLNKLNYNATYSFLR

Query:  LSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYK
        LSHDGNL+A+TYYD+VSYLKW+E+FAFFSSYFIRECALPSKCG++G CN+GMCVACPSPKGLLGWSESCAPP+ PPC GG  K  YYKI+GVE+FLNPY 
Subjt:  LSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYK

Query:  EDGEGPIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS
        +DG+GP+KV +CR +C RDCKC GFIYKE +SKCL  PLL TLIKD N++SVGYIKYS
Subjt:  EDGEGPIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS

SwissProt top hits (E-value, % identity, alignment)
Q39688 Epidermis-specific secreted glycoprotein EP1; E-value: 1.3e-84; identity: 43.98%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MA  F L L  L F +  +          VPAN TF FVN+GE G  I EY   YR +       +T PF+LCFYN TP +F  A+R G+   ESLMRWV
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        W+ANR +PV ENATLTFG DGN VLA  +G+V WQT+T N+GV G+K+LPNGN++L+D  GKF+WQSFD PTDTLLVGQS+++G   KL+SR S  +  +
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFL--SHDGQLLTYGGWP------GTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFL
        GPYSLV++  GL ++   +   + + Y  +         +    VTF  E END   A+ L L                   GGA  LN++ YN T SFL
Subjt:  GPYSLVLDRTGLTMFL--SHDGQLLTYGGWP------GTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFL

Query:  RLSHDGNLKAFTYYDKVSYLKWEESFAFF--------------SSYFIRECALPSKCGAYGYCNRGMCVACPSPKG-LLGWSESCAPPKTPPCSGGKGKF
        RL  DGN+K +TY DKV Y  WE ++  F              +     EC LP KCG +G C    CV CP+  G +L WS++C PPK   C  G   F
Subjt:  RLSHDGNLKAFTYYDKVSYLKWEESFAFF--------------SSYFIRECALPSKCGAYGYCNRGMCVACPSPKG-LLGWSESCAPPKTPPCSGGKGKF

Query:  GYYKIVG
         Y K+ G
Subjt:  GYYKIVG

Q9ZVA1 EP1-like glycoprotein 1; E-value: 5.6e-160; identity: 60%
Query:  FLLCTVLLAAIAT--EAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE
        +LL T L  +  +   AQVP    F  +N+  +   I EYDASYR + +    F+T PF+L FYNTTP +++ A+R G   D S  RW+WDANRN+PV +
Subjt:  FLLCTVLLAAIAT--EAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE

Query:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG
        N+TL+FGR+GN VLA+++G+V WQTNT N+GVTG ++LPNGN++LHDK+GKF+WQSFD+PTDTLLVGQS+++ G NKL+SR S+++GSDGPYS+VLD  G
Subjt:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG

Query:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNLKA
        LTM+++  G  L YGGW   D    VTFA   E DN T   AYELLL             RRLLQVRPIGS GG LNLNK+NYN T S+LRL  DG+LKA
Subjt:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNLKA

Query:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI
        F+Y+   +YL+WEE+FAFFS+YF+R+C LP+ CG YGYC+RGMCV CP+PKGLL WS+ CAPPKT   CSGGKGK   YYKIVGVEHF  PY  DG+GP 
Subjt:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
         V DC+AKCDRDCKC G+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

Q9ZVA2 EP1-like glycoprotein 2; E-value: 9.4e-168; identity: 63.27%
Query:  FLLCTVLLAAIATE----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV
        F +   L  AIAT     AQVP    F  VN+GEFG+ I EYDASYR I +   +F+T PF+L FYNTTP ++I A+R G+  DES MRW+WDANRN+PV
Subjt:  FLLCTVLLAAIATE----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV

Query:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR
         ENATL+ GR+GN VLA+ DGRV WQTNT N+GVTG ++LPNGN++LHDKNGKF+WQSFD+PTDTLL GQS+++ G NKL+SR S+ +GSDGPYS+VLD+
Subjt:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR

Query:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNL
         GLTM+++  G  L YGGWP  D    VTFA   E DN T   AYELLL             RRLLQVRPIGS GG LNLNK+NYN T S+LRL  DG+L
Subjt:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG
        KA++Y+   +YLKWEESF+FFS+YF+R+C LPS CG YGYC+RGMC ACP+PKGLLGWS+ CAPPKT   CSG KGK   YYKIVGVEHF  PY  DG+G
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
        P  V DC+AKCDRDCKC G+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

Q9ZVA4 EP1-like glycoprotein 3; E-value: 5.1e-89; identity: 40.35%
Query:  LCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR
        LCF L   L   I ++A+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTTP+++  A+R G    ES +RWVW+ANR  PV+
Subjt:  LCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR

Query:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT
        ENATLTFG DGN VLA+ DGR+VWQTNT N+G  GIK+L NGN++++D +GKF+WQSFD PTDTLLVGQS+++ GR KL+SR S    ++GPYSLV++  
Subjt:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT

Query:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLK
         L ++ + +          Y  +        +TF A  ++D                    L +  + SG   N    L++  +NAT SF+RL  DGN++
Subjt:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLK

Query:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG
         ++Y    +   W+ ++  F++       EC +P  C  +G C +G C ACPS KGLLGW E+C  P    C      F Y+KI G + F+  Y  +G  
Subjt:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
              C  KC RDCKC GF Y   SS+C     L TL +  +SS V Y+K
Subjt:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

Q9ZVA5 EP1-like glycoprotein 4; E-value: 4.3e-88; identity: 40.41%
Query:  LLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA
        L  T+ +  +  +A+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTT +++  A+R G    ES +RWVW+ANR  PV+ENA
Subjt:  LLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA

Query:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT
        TLTFG DGN VLA+ DGRVVWQTNT N+GV GIK+L NGN++++D NGKF+WQSFD PTDTLLVGQS+++ G+NKL+SR S    ++GPYSLV++   L 
Subjt:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT

Query:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV
        ++ + +      G            +  E     A    +     +D      L +  + SG   N    L++  +NAT SFLRL  DGN++ ++Y    
Subjt:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV

Query:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR
        +   W+ ++  F++       EC +P  C  +G C +G C ACPS  GLLGW E+C  P    C      F Y+KI G + F+  Y  +G        C 
Subjt:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR

Query:  AKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
         KC RDCKC GF Y   SS+C     L TL K  ++S V Y+K
Subjt:  AKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

Arabidopsis top hits (E-value, % identity, alignment)
AT1G16905.1 Curculin-like (mannose-binding) lectin family protein; E-value: 6.9e-89; identity: 42.08%
Query:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYR---VIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLM
        MA + H+L+    FLL +++        QVP    F F+N G+FG+  +EY ASYR   VIRN         FRLCF+NTTP++F  AI  G  + +S++
Subjt:  MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYR---VIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLM

Query:  RWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRI-GGRNKLISRKSEI
        RWVW AN   PV+E A+L+FG +GN VLA  DGRVVWQT T+N+GV G+ M  NGNL+L D  G  +WQSF++PTDTLLVGQS+ + G +NKL+SR    
Subjt:  RWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRI-GGRNKLISRKSEI

Query:  DGSDGPYSLVL--DRTGLTMFLSH-DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLL---VNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYS
          ++G YSL+L  DR  L   +   + + L Y    G    S   ++A+   D  T  +L L    +  + P +  L  RP             +NA+ S
Subjt:  DGSDGPYSLVL--DRTGLTMFLSH-DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLL---VNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYS

Query:  FLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLN
        FLRL  DGNL+ +++  KV++L WE +F  F+     EC LPSKCGA+G C    CVACP   GL+GWS++C P K   C      F YY++ GVEHF+ 
Subjt:  FLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLN

Query:  PYKEDGEGPIKVGD--CRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
         Y       + +G+  CR  C  DCKC G+ + + S KC     LGTL+K  +S  V YIK
Subjt:  PYKEDGEGPIKVGD--CRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

AT1G78820.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain; E-value: 3.9e-161; identity: 60%
Query:  FLLCTVLLAAIAT--EAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE
        +LL T L  +  +   AQVP    F  +N+  +   I EYDASYR + +    F+T PF+L FYNTTP +++ A+R G   D S  RW+WDANRN+PV +
Subjt:  FLLCTVLLAAIAT--EAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE

Query:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG
        N+TL+FGR+GN VLA+++G+V WQTNT N+GVTG ++LPNGN++LHDK+GKF+WQSFD+PTDTLLVGQS+++ G NKL+SR S+++GSDGPYS+VLD  G
Subjt:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG

Query:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNLKA
        LTM+++  G  L YGGW   D    VTFA   E DN T   AYELLL             RRLLQVRPIGS GG LNLNK+NYN T S+LRL  DG+LKA
Subjt:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNLKA

Query:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI
        F+Y+   +YL+WEE+FAFFS+YF+R+C LP+ CG YGYC+RGMCV CP+PKGLL WS+ CAPPKT   CSGGKGK   YYKIVGVEHF  PY  DG+GP 
Subjt:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
         V DC+AKCDRDCKC G+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

AT1G78830.1 Curculin-like (mannose-binding) lectin family protein; E-value: 6.7e-169; identity: 63.27%
Query:  FLLCTVLLAAIATE----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV
        F +   L  AIAT     AQVP    F  VN+GEFG+ I EYDASYR I +   +F+T PF+L FYNTTP ++I A+R G+  DES MRW+WDANRN+PV
Subjt:  FLLCTVLLAAIATE----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV

Query:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR
         ENATL+ GR+GN VLA+ DGRV WQTNT N+GVTG ++LPNGN++LHDKNGKF+WQSFD+PTDTLL GQS+++ G NKL+SR S+ +GSDGPYS+VLD+
Subjt:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR

Query:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNL
         GLTM+++  G  L YGGWP  D    VTFA   E DN T   AYELLL             RRLLQVRPIGS GG LNLNK+NYN T S+LRL  DG+L
Subjt:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIGS-GGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG
        KA++Y+   +YLKWEESF+FFS+YF+R+C LPS CG YGYC+RGMC ACP+PKGLLGWS+ CAPPKT   CSG KGK   YYKIVGVEHF  PY  DG+G
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
        P  V DC+AKCDRDCKC G+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

AT1G78850.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain; E-value: 3.6e-90; identity: 40.35%
Query:  LCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR
        LCF L   L   I ++A+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTTP+++  A+R G    ES +RWVW+ANR  PV+
Subjt:  LCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR

Query:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT
        ENATLTFG DGN VLA+ DGR+VWQTNT N+G  GIK+L NGN++++D +GKF+WQSFD PTDTLLVGQS+++ GR KL+SR S    ++GPYSLV++  
Subjt:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT

Query:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLK
         L ++ + +          Y  +        +TF A  ++D                    L +  + SG   N    L++  +NAT SF+RL  DGN++
Subjt:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLK

Query:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG
         ++Y    +   W+ ++  F++       EC +P  C  +G C +G C ACPS KGLLGW E+C  P    C      F Y+KI G + F+  Y  +G  
Subjt:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
              C  KC RDCKC GF Y   SS+C     L TL +  +SS V Y+K
Subjt:  PIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

AT1G78860.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain; E-value: 3.1e-89; identity: 40.41%
Query:  LLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA
        L  T+ +  +  +A+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTT +++  A+R G    ES +RWVW+ANR  PV+ENA
Subjt:  LLCTVLLAAIATEAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA

Query:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT
        TLTFG DGN VLA+ DGRVVWQTNT N+GV GIK+L NGN++++D NGKF+WQSFD PTDTLLVGQS+++ G+NKL+SR S    ++GPYSLV++   L 
Subjt:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT

Query:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV
        ++ + +      G            +  E     A    +     +D      L +  + SG   N    L++  +NAT SFLRL  DGN++ ++Y    
Subjt:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV

Query:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR
        +   W+ ++  F++       EC +P  C  +G C +G C ACPS  GLLGW E+C  P    C      F Y+KI G + F+  Y  +G        C 
Subjt:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR

Query:  AKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
         KC RDCKC GF Y   SS+C     L TL K  ++S V Y+K
Subjt:  AKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
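
The percent-identity figures in the hit tables can be approximated from the middle line of each alignment block. The sketch below assumes the usual BLAST-style convention, where a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap; the record does not say how its identity column was computed, so this is an illustration and not the database's method. The helper name `identity_from_midline` is hypothetical. The example strings are the final block of the first GenBank alignment.

```python
def identity_from_midline(midline: str, aligned_length: int) -> tuple[int, float]:
    """Return (identical positions, percent identity) for one alignment block.

    ASSUMPTION: letters on the midline mark identities; '+' and spaces do not.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, 100.0 * identities / aligned_length


# Last block of the alignment against KAG6591915.1 (query and middle line).
query = "KVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI"
midline = "KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI"

identities, percent = identity_from_midline(midline, len(query))
print(identities, len(query), round(percent, 2))
```

To reproduce a whole-alignment figure, the identity counts and aligned lengths of all blocks for a hit would be summed before dividing.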


Sequences
CDS sequence
ATGGCGACAAGTTTTCATCTTCTTCTTCCCCCTCTCTGTTTCCTGCTCTGCACTGTTCTTCTCGCCGCCATAGCCACAGAAGCTCAAGTTCCTGCAAATGCCACCTTCCA
TTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGCGTAATTCGAAACCACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTT
TTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGCTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGT
GAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTAGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGCATCAA
AATGCTCCCTAATGGAAACTTAATCCTCCATGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCG
GCGGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTCTAGATCGAACAGGGCTCACAATGTTTCTTTCCCACGACGGT
CAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGT
AAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGACCAATTGGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTACAATGCGACGTACTCGTTTCTCCGGC
TAAGTCACGACGGGAATTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTTTCGAGCTATTTCATAAGGGAATGTGCT
CTGCCGAGCAAATGTGGAGCTTATGGGTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGCTGTGCGCCGCCGAAGAC
GCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATAGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGGGAAGGGCCGATTAAGGTGGGGG
ATTGCAGAGCCAAATGTGATAGAGATTGTAAGTGTTCAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATGTT
AATTCGTCCTCGGTGGGTTATATTAAGTATTCGATTTAG
mRNA sequence
ATGGCGACAAGTTTTCATCTTCTTCTTCCCCCTCTCTGTTTCCTGCTCTGCACTGTTCTTCTCGCCGCCATAGCCACAGAAGCTCAAGTTCCTGCAAATGCCACCTTCCA
TTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGCGTAATTCGAAACCACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTT
TTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGCTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGT
GAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTAGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGCATCAA
AATGCTCCCTAATGGAAACTTAATCCTCCATGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCG
GCGGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTCTAGATCGAACAGGGCTCACAATGTTTCTTTCCCACGACGGT
CAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGT
AAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGACCAATTGGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTACAATGCGACGTACTCGTTTCTCCGGC
TAAGTCACGACGGGAATTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTTTCGAGCTATTTCATAAGGGAATGTGCT
CTGCCGAGCAAATGTGGAGCTTATGGGTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGCTGTGCGCCGCCGAAGAC
GCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATAGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGGGAAGGGCCGATTAAGGTGGGGG
ATTGCAGAGCCAAATGTGATAGAGATTGTAAGTGTTCAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATGTT
AATTCGTCCTCGGTGGGTTATATTAAGTATTCGATTTAG
Protein sequence
MATSFHLLLPPLCFLLCTVLLAAIATEAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR
ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDG
QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIGSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECA
LPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCSGFIYKEYSSKCLRVPLLGTLIKDV
NSSSVGYIKYSI
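
The relationship between the CDS and the protein sequence can be checked by translating the CDS codon by codon. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are hypothetical, and the two short inputs are the first 30 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS; whitespace is ignored, unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


print(translate("ATGGCGACAAGTTTTCATCTTCTTCTTCCC"))  # MATSFHLLLP
print(translate("TATTCGATTTAG"))                    # YSI*
```

The trailing `*` is the stop codon, which is why the protein sequence is one residue shorter than the number of codons in the CDS.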