; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg17053 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg17053
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionBulb-type lectin domain-containing protein
Genome locationCarg_Chr09:4358175..4359533
RNA-Seq ExpressionCarg17053
SyntenyCarg17053
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
GO:0030246 - carbohydrate binding (molecular function)
InterPro domainsIPR001480 - Bulb-type lectin domain
IPR035446 - S-locus-specific glycoprotein/EP1
IPR036426 - Bulb-type lectin domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591915.1 EP1-like glycoprotein 2, partial [Cucurbita argyrosperma subsp. sororia]3.0e-272100Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_022937366.1 EP1-like glycoprotein 2 [Cucurbita moschata]1.1e-26999.12Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT FHLLLPPLCFLLCTVLLAAIAT+AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_022976498.1 EP1-like glycoprotein 2 [Cucurbita maxima]6.1e-26598.01Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MATRFHLLL PLCFL+ TVLLAAIATQAQVPANATFHF+NQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGR KLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRS  ALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYY KVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_023535213.1 EP1-like glycoprotein 2 [Cucurbita pepo subsp. pepo]3.5e-26898.67Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MATRFHLLLPPLCFLL TVLLAAIAT+AQVPANATFHFVNQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIG RNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSE CAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKD+NSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

XP_038896945.1 EP1-like glycoprotein 2 [Benincasa hispida]1.6e-24991.87Show/hide
Query:  MATRFHLLLPP--LCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMR
        MA R  L LPP   CFLL T+LLAA+AT+AQVPAN TFHF+NQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIP DESLMR
Subjt:  MATRFHLLLPP--LCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMR

Query:  WVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDG
        WVWDANRNDPVRENATLTFGRDGNFVLADVDGR+VWQTNTKNRGVTGIKMLPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQS+RIGGRNKLISRKSEIDG
Subjt:  WVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDG

Query:  SDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDG
        SDGPYSLVLDRTGLTMFLSH GQLLTYGGWP TD  +RVTF+ EPEN+NATAYELLLL+N+DTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRL HDG
Subjt:  SDGPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDG

Query:  NLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCS-GGKGKFGYYKIVGVEHFLNPYKEDGE
        NLKAFTYYD  SYLKWEESFAFFSSYFIRECALPSKCGAYGYC+RGMCVACPSPKGLLGWSESCAPPKTPPCS GGKGK+GYYKIVGVEHFLNPYK+DGE
Subjt:  NLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCS-GGKGKFGYYKIVGVEHFLNPYKEDGE

Query:  GPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        GPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKD+NSSSVGYIKYS+
Subjt:  GPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

TrEMBL top hitse value%identityAlignment
A0A0A0L3A7 Bulb-type lectin domain-containing protein1.5e-23788.84Show/hide
Query:  HLL-LPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDAN
        HLL LP LCF L T+L AAIAT+AQVPAN TFHF+NQGEFGDRIIEYDASYRVIRN+VYTFYTFPFRLCFYNTTPDSFIFAIRAGIP DESLMRWVWDAN
Subjt:  HLL-LPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDAN

Query:  RNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYS
        RNDPVRENATLTFG DGNFVLADVDGR+VWQTNTKN+GVTGIKMLPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQS+RIGGRNKLISRKSEIDGSDGPYS
Subjt:  RNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYS

Query:  LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFT
        L+L RTGLTMFL++ GQ LTYGGW  TD  S VTF  EPEN+NATAYELLL +N+DT RRRLLQVRPIRSGGALNLNKLNYNATYSFLRL  DGNL+AFT
Subjt:  LVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFT

Query:  YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGD
        YYD  SYLKWEESFAFFSSYFIREC LPSKCGAYGYC+RGMCV CPSPKGLLGWSE CAPPKTP C GGK KFGYYKIVGVEHFLNPYK DGEGP+KVGD
Subjt:  YYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGD

Query:  CRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        CRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKD+NSSSVGYIKYS+
Subjt:  CRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

A0A438E5D3 EP1-like glycoprotein 21.2e-20576.59Show/hide
Query:  VLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFG
        ++L   A  A VPAN TF FVNQGEFGDRIIEYDASYRVIRN VYTF+TFPFRLCFYNTTPD++IFAIRAG+P DESLMRWVWDANRN+P  EN+TLTFG
Subjt:  VLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFG

Query:  RDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSH
        RDGNFVLA+ DGRVVWQTNT N+GVTGIK+LPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQ +RI GRNKL+SR SE+DGSDG YSLV D+ GLTM++++
Subjt:  RDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSH

Query:  DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSY
         G+LL YGGWPG D G+ V+F A PENDNATA+EL+L   ++T        RRRLLQVRPI SGG  NLNKLNYNATYSFLRLSHDGNL+A+TYYD+VSY
Subjt:  DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSY

Query:  LKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDR
        LKW+E+FAFFSSYFIRECALPSKCG++G CN+GMCVACPSPKGLLGWSESCAPP+ PPC GG  K  YYKI+GVE+FLNPY +DG+GP+KV +CR +C R
Subjt:  LKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDR

Query:  DCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS
        DCKCLGFIYKE +SKCL  PLL TLIKD N++SVGYIKYS
Subjt:  DCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS

A0A6J1FA56 EP1-like glycoprotein 25.2e-27099.12Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MAT FHLLLPPLCFLLCTVLLAAIAT+AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPI SGGALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKC GFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

A0A6J1IMC1 EP1-like glycoprotein 23.0e-26598.01Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MATRFHLLL PLCFL+ TVLLAAIATQAQVPANATFHF+NQGEFGDRIIEYDASYRVIRN VYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGR KLISRKSEIDGSD
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL
        GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRS  ALNLNKLNYNATYSFLRLSHDGNL
Subjt:  GPYSLVLDRTGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
        KAFTYY KVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
        KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYSI

F6H2N4 Uncharacterized protein1.2e-20576.59Show/hide
Query:  VLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFG
        ++L   A  A VPAN TF FVNQGEFGDRIIEYDASYRVIRN VYTF+TFPFRLCFYNTTPD++IFAIRAG+P DESLMRWVWDANRN+P  EN+TLTFG
Subjt:  VLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENATLTFG

Query:  RDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSH
        RDGNFVLA+ DGRVVWQTNT N+GVTGIK+LPNGNL+LHDKNGKFIWQSFDYPTDTLLVGQ +RI GRNKL+SR SE+DGSDG YSLV D+ GLTM++++
Subjt:  RDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSH

Query:  DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSY
         G+LL YGGWPG D G+ V+F A PENDNATA+EL+L   ++T        RRRLLQVRPI SGG  NLNKLNYNATYSFLRLSHDGNL+A+TYYD+VSY
Subjt:  DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTP-------RRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSY

Query:  LKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDR
        LKW+E+FAFFSSYFIRECALPSKCG++G CN+GMCVACPSPKGLLGWSESCAPP+ PPC GG  K  YYKI+GVE+FLNPY +DG+GP+KV +CR +C R
Subjt:  LKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDR

Query:  DCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS
        DCKCLGFIYKE +SKCL  PLL TLIKD N++SVGYIKYS
Subjt:  DCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKYS

SwissProt top hitse value%identityAlignment
Q39688 Epidermis-specific secreted glycoprotein EP12.9e-8443.98Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV
        MA  F L L  L F +  +          VPAN TF FVN+GE G  I EY   YR +       +T PF+LCFYN TP +F  A+R G+   ESLMRWV
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWV

Query:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD
        W+ANR +PV ENATLTFG DGN VLA  +G+V WQT+T N+GV G+K+LPNGN++L+D  GKF+WQSFD PTDTLLVGQS+++G   KL+SR S  +  +
Subjt:  WDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSD

Query:  GPYSLVLDRTGLTMFL--SHDGQLLTYGGWP------GTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFL
        GPYSLV++  GL ++   +   + + Y  +         +    VTF  E END   A+ L L                   GGA  LN++ YN T SFL
Subjt:  GPYSLVLDRTGLTMFL--SHDGQLLTYGGWP------GTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFL

Query:  RLSHDGNLKAFTYYDKVSYLKWEESFAFF--------------SSYFIRECALPSKCGAYGYCNRGMCVACPSPKG-LLGWSESCAPPKTPPCSGGKGKF
        RL  DGN+K +TY DKV Y  WE ++  F              +     EC LP KCG +G C    CV CP+  G +L WS++C PPK   C  G   F
Subjt:  RLSHDGNLKAFTYYDKVSYLKWEESFAFF--------------SSYFIRECALPSKCGAYGYCNRGMCVACPSPKG-LLGWSESCAPPKTPPCSGGKGKF

Query:  GYYKIVG
         Y K+ G
Subjt:  GYYKIVG

Q9ZVA1 EP1-like glycoprotein 19.5e-16060Show/hide
Query:  FLLCTVLLAAIAT--QAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE
        +LL T L  +  +   AQVP    F  +N+  +   I EYDASYR + +    F+T PF+L FYNTTP +++ A+R G   D S  RW+WDANRN+PV +
Subjt:  FLLCTVLLAAIAT--QAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE

Query:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG
        N+TL+FGR+GN VLA+++G+V WQTNT N+GVTG ++LPNGN++LHDK+GKF+WQSFD+PTDTLLVGQS+++ G NKL+SR S+++GSDGPYS+VLD  G
Subjt:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG

Query:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNLKA
        LTM+++  G  L YGGW   D    VTFA   E DN T   AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+LKA
Subjt:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNLKA

Query:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI
        F+Y+   +YL+WEE+FAFFS+YF+R+C LP+ CG YGYC+RGMCV CP+PKGLL WS+ CAPPKT   CSGGKGK   YYKIVGVEHF  PY  DG+GP 
Subjt:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
         V DC+AKCDRDCKCLG+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

Q9ZVA2 EP1-like glycoprotein 22.7e-16763.27Show/hide
Query:  FLLCTVLLAAIATQ----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV
        F +   L  AIAT     AQVP    F  VN+GEFG+ I EYDASYR I +   +F+T PF+L FYNTTP ++I A+R G+  DES MRW+WDANRN+PV
Subjt:  FLLCTVLLAAIATQ----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV

Query:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR
         ENATL+ GR+GN VLA+ DGRV WQTNT N+GVTG ++LPNGN++LHDKNGKF+WQSFD+PTDTLL GQS+++ G NKL+SR S+ +GSDGPYS+VLD+
Subjt:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR

Query:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNL
         GLTM+++  G  L YGGWP  D    VTFA   E DN T   AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+L
Subjt:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG
        KA++Y+   +YLKWEESF+FFS+YF+R+C LPS CG YGYC+RGMC ACP+PKGLLGWS+ CAPPKT   CSG KGK   YYKIVGVEHF  PY  DG+G
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
        P  V DC+AKCDRDCKCLG+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

Q9ZVA4 EP1-like glycoprotein 36.1e-9040.8Show/hide
Query:  LCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR
        LCF L   L   I +QA+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTTP+++  A+R G    ES +RWVW+ANR  PV+
Subjt:  LCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR

Query:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT
        ENATLTFG DGN VLA+ DGR+VWQTNT N+G  GIK+L NGN++++D +GKF+WQSFD PTDTLLVGQS+++ GR KL+SR S    ++GPYSLV++  
Subjt:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT

Query:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLK
         L ++ + +          Y  +        +TF A  ++D                    L +  + SG   N    L++  +NAT SF+RL  DGN++
Subjt:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLK

Query:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG
         ++Y    +   W+ ++  F++       EC +P  C  +G C +G C ACPS KGLLGW E+C  P    C      F Y+KI G + F+  Y  +G  
Subjt:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
              C  KC RDCKCLGF Y   SS+C     L TL +  +SS V Y+K
Subjt:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

Q9ZVA5 EP1-like glycoprotein 46.7e-8940.86Show/hide
Query:  LLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA
        L  T+ +  +  QA+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTT +++  A+R G    ES +RWVW+ANR  PV+ENA
Subjt:  LLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA

Query:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT
        TLTFG DGN VLA+ DGRVVWQTNT N+GV GIK+L NGN++++D NGKF+WQSFD PTDTLLVGQS+++ G+NKL+SR S    ++GPYSLV++   L 
Subjt:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT

Query:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV
        ++ + +      G            +  E     A    +     +D      L +  + SG   N    L++  +NAT SFLRL  DGN++ ++Y    
Subjt:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV

Query:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR
        +   W+ ++  F++       EC +P  C  +G C +G C ACPS  GLLGW E+C  P    C      F Y+KI G + F+  Y  +G        C 
Subjt:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR

Query:  AKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
         KC RDCKCLGF Y   SS+C     L TL K  ++S V Y+K
Subjt:  AKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

Arabidopsis top hitse value%identityAlignment
AT1G16905.1 Curculin-like (mannose-binding) lectin family protein1.8e-8942.3Show/hide
Query:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYR---VIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLM
        MA   H+L+    FLL +++      + QVP    F F+N G+FG+  +EY ASYR   VIRN         FRLCF+NTTP++F  AI  G  + +S++
Subjt:  MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYR---VIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLM

Query:  RWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRI-GGRNKLISRKSEI
        RWVW AN   PV+E A+L+FG +GN VLA  DGRVVWQT T+N+GV G+ M  NGNL+L D  G  +WQSF++PTDTLLVGQS+ + G +NKL+SR    
Subjt:  RWVWDANRNDPVRENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRI-GGRNKLISRKSEI

Query:  DGSDGPYSLVL--DRTGLTMFLSH-DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLL---VNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYS
          ++G YSL+L  DR  L   +   + + L Y    G    S   ++A+   D  T  +L L    +  + P +  L  RP             +NA+ S
Subjt:  DGSDGPYSLVL--DRTGLTMFLSH-DGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLL---VNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYS

Query:  FLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLN
        FLRL  DGNL+ +++  KV++L WE +F  F+     EC LPSKCGA+G C    CVACP   GL+GWS++C P K   C      F YY++ GVEHF+ 
Subjt:  FLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLN

Query:  PYKEDGEGPIKVGD--CRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
         Y       + +G+  CR  C  DCKCLG+ + + S KC     LGTL+K  +S  V YIK
Subjt:  PYKEDGEGPIKVGD--CRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

AT1G78820.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain6.7e-16160Show/hide
Query:  FLLCTVLLAAIAT--QAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE
        +LL T L  +  +   AQVP    F  +N+  +   I EYDASYR + +    F+T PF+L FYNTTP +++ A+R G   D S  RW+WDANRN+PV +
Subjt:  FLLCTVLLAAIAT--QAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRE

Query:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG
        N+TL+FGR+GN VLA+++G+V WQTNT N+GVTG ++LPNGN++LHDK+GKF+WQSFD+PTDTLLVGQS+++ G NKL+SR S+++GSDGPYS+VLD  G
Subjt:  NATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTG

Query:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNLKA
        LTM+++  G  L YGGW   D    VTFA   E DN T   AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+LKA
Subjt:  LTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNLKA

Query:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI
        F+Y+   +YL+WEE+FAFFS+YF+R+C LP+ CG YGYC+RGMCV CP+PKGLL WS+ CAPPKT   CSGGKGK   YYKIVGVEHF  PY  DG+GP 
Subjt:  FTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEGPI

Query:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
         V DC+AKCDRDCKCLG+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  KVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

AT1G78830.1 Curculin-like (mannose-binding) lectin family protein2.0e-16863.27Show/hide
Query:  FLLCTVLLAAIATQ----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV
        F +   L  AIAT     AQVP    F  VN+GEFG+ I EYDASYR I +   +F+T PF+L FYNTTP ++I A+R G+  DES MRW+WDANRN+PV
Subjt:  FLLCTVLLAAIATQ----AQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPV

Query:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR
         ENATL+ GR+GN VLA+ DGRV WQTNT N+GVTG ++LPNGN++LHDKNGKF+WQSFD+PTDTLL GQS+++ G NKL+SR S+ +GSDGPYS+VLD+
Subjt:  RENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDR

Query:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNL
         GLTM+++  G  L YGGWP  D    VTFA   E DN T   AYELLL             RRLLQVRPI S GG LNLNK+NYN T S+LRL  DG+L
Subjt:  TGLTMFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNAT---AYELLL-----LVNQDTPRRRLLQVRPIRS-GGALNLNKLNYNATYSFLRLSHDGNL

Query:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG
        KA++Y+   +YLKWEESF+FFS+YF+R+C LPS CG YGYC+RGMC ACP+PKGLLGWS+ CAPPKT   CSG KGK   YYKIVGVEHF  PY  DG+G
Subjt:  KAFTYYDKVSYLKWEESFAFFSSYFIRECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPP-CSGGKGK-FGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY
        P  V DC+AKCDRDCKCLG+ YKE   KCL  PLLGTLIKD N+SSV YIKY
Subjt:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIKY

AT1G78850.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain4.3e-9140.8Show/hide
Query:  LCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR
        LCF L   L   I +QA+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTTP+++  A+R G    ES +RWVW+ANR  PV+
Subjt:  LCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR

Query:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT
        ENATLTFG DGN VLA+ DGR+VWQTNT N+G  GIK+L NGN++++D +GKF+WQSFD PTDTLLVGQS+++ GR KL+SR S    ++GPYSLV++  
Subjt:  ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRT

Query:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLK
         L ++ + +          Y  +        +TF A  ++D                    L +  + SG   N    L++  +NAT SF+RL  DGN++
Subjt:  GLTMFLSHDG-----QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLK

Query:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG
         ++Y    +   W+ ++  F++       EC +P  C  +G C +G C ACPS KGLLGW E+C  P    C      F Y+KI G + F+  Y  +G  
Subjt:  AFTYYDKVSYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEG

Query:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
              C  KC RDCKCLGF Y   SS+C     L TL +  +SS V Y+K
Subjt:  PIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK

AT1G78860.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain4.8e-9040.86Show/hide
Query:  LLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA
        L  T+ +  +  QA+VP +  F  VN+G + D   IEY+   R      +  ++  FRLCFYNTT +++  A+R G    ES +RWVW+ANR  PV+ENA
Subjt:  LLCTVLLAAIATQAQVPANATFHFVNQGEFGD-RIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVRENA

Query:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT
        TLTFG DGN VLA+ DGRVVWQTNT N+GV GIK+L NGN++++D NGKF+WQSFD PTDTLLVGQS+++ G+NKL+SR S    ++GPYSLV++   L 
Subjt:  TLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLT

Query:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV
        ++ + +      G            +  E     A    +     +D      L +  + SG   N    L++  +NAT SFLRL  DGN++ ++Y    
Subjt:  MFLSHDGQLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALN----LNKLNYNATYSFLRLSHDGNLKAFTYYDKV

Query:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR
        +   W+ ++  F++       EC +P  C  +G C +G C ACPS  GLLGW E+C  P    C      F Y+KI G + F+  Y  +G        C 
Subjt:  SYLKWEESFAFFSSYFI---RECALPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCR

Query:  AKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK
         KC RDCKCLGF Y   SS+C     L TL K  ++S V Y+K
Subjt:  AKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDVNSSSVGYIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACACGTTTTCATCTTCTTCTTCCTCCTCTCTGTTTCCTGCTCTGCACTGTTCTTCTCGCCGCCATAGCCACACAAGCTCAAGTTCCTGCAAATGCCACCTTCCA
TTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGTGTAATTCGAAACCACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTT
TTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGCTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGT
GAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTCGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGCATCAA
AATGCTCCCTAATGGAAACTTAATCCTCCATGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCG
GCGGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTCTAGATCGAACAGGGCTCACAATGTTTCTTTCCCACGACGGT
CAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGT
AAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGGCCAATTAGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTACAATGCGACGTACTCGTTTCTCCGGC
TCAGTCACGACGGGAATTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTCTCGAGCTATTTCATCAGGGAATGTGCT
CTGCCGAGCAAATGTGGGGCTTATGGGTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGCTGTGCGCCACCGAAGAC
GCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATAGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGTGAAGGGCCGATTAAGGTGGGGG
ATTGCAGAGCCAAATGTGATAGAGATTGTAAGTGTTTAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATGTT
AATTCGTCCTCGGTGGGTTATATTAAGTATTCGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGACACGTTTTCATCTTCTTCTTCCTCCTCTCTGTTTCCTGCTCTGCACTGTTCTTCTCGCCGCCATAGCCACACAAGCTCAAGTTCCTGCAAATGCCACCTTCCA
TTTCGTAAACCAAGGCGAATTCGGCGACCGAATCATTGAATACGACGCCAGCTACCGTGTAATTCGAAACCACGTGTACACCTTCTACACATTCCCCTTCCGCCTCTGTT
TTTACAACACCACCCCTGATTCCTTCATTTTCGCCATTAGAGCTGGAATCCCCAACGACGAGAGCTTAATGCGATGGGTTTGGGACGCCAATCGCAACGACCCAGTTCGT
GAAAACGCCACCCTCACCTTTGGCCGCGACGGAAACTTCGTCCTCGCCGACGTCGACGGCCGTGTCGTCTGGCAAACCAACACCAAAAACAGAGGAGTCACCGGCATCAA
AATGCTCCCTAATGGAAACTTAATCCTCCATGACAAGAACGGGAAATTCATCTGGCAGAGCTTTGATTACCCTACTGATACTCTGTTAGTCGGTCAATCGATTCGAATCG
GCGGCCGGAATAAATTAATTAGCCGGAAATCCGAAATCGACGGCTCTGATGGCCCTTACAGCCTTGTTCTAGATCGAACAGGGCTCACAATGTTTCTTTCCCACGACGGT
CAGCTTTTAACCTACGGCGGTTGGCCGGGGACGGATCATGGAAGCAGAGTAACATTCGCCGCCGAACCAGAGAATGACAACGCCACCGCGTACGAGCTTCTTCTTTTAGT
AAATCAGGACACCCCACGGCGGCGATTGTTACAAGTCCGGCCAATTAGAAGCGGCGGAGCGTTGAATTTGAACAAATTGAACTACAATGCGACGTACTCGTTTCTCCGGC
TCAGTCACGACGGGAATTTGAAGGCATTCACGTACTACGATAAAGTGAGTTACTTGAAATGGGAAGAGAGCTTTGCGTTTTTCTCGAGCTATTTCATCAGGGAATGTGCT
CTGCCGAGCAAATGTGGGGCTTATGGGTACTGCAACAGGGGAATGTGTGTGGCGTGTCCGAGCCCAAAAGGGCTTTTGGGGTGGAGCGAGAGCTGTGCGCCACCGAAGAC
GCCGCCGTGCAGCGGCGGAAAAGGGAAATTTGGGTACTATAAGATAGTGGGGGTGGAGCATTTTTTGAACCCGTACAAGGAGGACGGTGAAGGGCCGATTAAGGTGGGGG
ATTGCAGAGCCAAATGTGATAGAGATTGTAAGTGTTTAGGGTTTATCTATAAGGAGTATAGCTCAAAATGCTTGAGGGTTCCATTGTTGGGAACTTTGATTAAGGATGTT
AATTCGTCCTCGGTGGGTTATATTAAGTATTCGATTTAG
Protein sequenceShow/hide protein sequence
MATRFHLLLPPLCFLLCTVLLAAIATQAQVPANATFHFVNQGEFGDRIIEYDASYRVIRNHVYTFYTFPFRLCFYNTTPDSFIFAIRAGIPNDESLMRWVWDANRNDPVR
ENATLTFGRDGNFVLADVDGRVVWQTNTKNRGVTGIKMLPNGNLILHDKNGKFIWQSFDYPTDTLLVGQSIRIGGRNKLISRKSEIDGSDGPYSLVLDRTGLTMFLSHDG
QLLTYGGWPGTDHGSRVTFAAEPENDNATAYELLLLVNQDTPRRRLLQVRPIRSGGALNLNKLNYNATYSFLRLSHDGNLKAFTYYDKVSYLKWEESFAFFSSYFIRECA
LPSKCGAYGYCNRGMCVACPSPKGLLGWSESCAPPKTPPCSGGKGKFGYYKIVGVEHFLNPYKEDGEGPIKVGDCRAKCDRDCKCLGFIYKEYSSKCLRVPLLGTLIKDV
NSSSVGYIKYSI