CuGenDBv2

Sgr020098 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020098
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: epidermis-specific secreted glycoprotein EP1-like
Genome location: tig00153447:404578..407261
RNA-Seq Expression: Sgr020098
Synteny: Sgr020098
Gene Ontology terms:
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0030246 - carbohydrate binding (molecular function)
InterPro domains:
  IPR001480 - Bulb-type lectin domain
  IPR036426 - Bulb-type lectin domain superfamily
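
The genome location in this record is written as `contig:start..end`. The following is a minimal Python sketch for splitting such a string into its parts. The function names `parse_location` and `span_length` are hypothetical, and the assumption that the coordinates are 1-based and inclusive is not stated in the record itself.

```python
import re

# Pattern for "seqid:start..end", e.g. "tig00153447:404578..407261".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("tig00153447:404578..407261")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2684 bp on the contig.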


Homology
GenBank top hits (e value, % identity, alignment)
KAB2095672.1 hypothetical protein ES319_A01G055000v1 [Gossypium barbadense] | e value: 1.7e-53 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

TYH30014.1 hypothetical protein ES288_A01G059800v1 [Gossypium darwinii] | e value: 1.7e-53 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

TYI41998.1 hypothetical protein ES332_A01G067200v1 [Gossypium tomentosum] | e value: 1.7e-53 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

XP_038880567.1 epidermis-specific secreted glycoprotein EP1-like [Benincasa hispida] | e value: 1.7e-53 | % identity: 83.33
Query:  FFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLV
        FFF+SLSLALVPPNETF+FVNEG+FG FIVEYDA YR LSI  +PFQL FYNTTPNA+TLALRM + RSES  RWVWEANRG PVRENATFSLGADGNLV
Subjt:  FFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLV

Query:  LADSDGTVVWQSNTANKGVVGFKLLP
        LA+SDGTVVWQSNTANKGVV  +LLP
Subjt:  LADSDGTVVWQSNTANKGVVGFKLLP

XP_040930958.1 epidermis-specific secreted glycoprotein EP1 [Gossypium hirsutum] | e value: 1.7e-53 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

TrEMBL top hits (e value, % identity, alignment)
A0A061F8Q5 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain, putative | e value: 5.4e-53 | % identity: 85.59
Query:  ALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVLADSDGTV
        A VPP+ TFKFVN+GEFGPF+VEYD NYRVLSI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFSLG DGNLVLAD+DG +
Subjt:  ALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVLADSDGTV

Query:  VWQSNTANKGVVGFKLLP
         WQSNTANKGVVGFKLLP
Subjt:  VWQSNTANKGVVGFKLLP

A0A1U8KDH7 epidermis-specific secreted glycoprotein EP1-like | e value: 8.3e-54 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

A0A5D2HIE6 Uncharacterized protein | e value: 8.3e-54 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

A0A5D2RPN4 Uncharacterized protein | e value: 8.3e-54 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

A0A5J5WVA8 Uncharacterized protein | e value: 8.3e-54 | % identity: 79.1
Query:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS
        S  SF FL L    + A+VPP+ETF+FVN+GEFGPF+VEYDANYRV+SI   PFQLAFYNTTPNAFTLALRM  TRSESLFRWVWEANRG PVRENATFS
Subjt:  SAASFFFLSL----SLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFS

Query:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP
Subjt:  LGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

SwissProt top hits (e value, % identity, alignment)
Q39688 Epidermis-specific secreted glycoprotein EP1 | e value: 3.1e-42 | % identity: 70.94
Query:  LVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVLADSDGTVV
        LVP NETFKFVNEGE G +I EY  +YR L    +PFQL FYN TP AFTLALRMGL R+ESL RWVWEANRG PV ENAT + G DGNLVLA S+G V 
Subjt:  LVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVLADSDGTVV

Query:  WQSNTANKGVVGFKLLP
        WQ++TANKGVVG K+LP
Subjt:  WQSNTANKGVVGFKLLP

Q9ZVA1 EP1-like glycoprotein 1 | e value: 4.1e-34 | % identity: 51.11
Query:  SAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATF
        +A +   +S+ +A VPP + F+ +NE  + P+I EYDA+YR L     +    PFQL FYNTTP+A+ LALR+G  R  S  RW+W+ANR  PV +N+T 
Subjt:  SAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATF

Query:  SLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        S G +GNLVLA+ +G V WQ+NTANKGV GF++LP
Subjt:  SLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

Q9ZVA2 EP1-like glycoprotein 2 | e value: 2.3e-40 | % identity: 57.86
Query:  AIVDASAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVR
        AI+   A +   +S+ +A VPP + F+ VNEGEFG +I EYDA+YR +     S   +PFQL FYNTTP+A+ LALR+GL R ES  RW+W+ANR  PV 
Subjt:  AIVDASAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVR

Query:  ENATFSLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        ENAT SLG +GNLVLA++DG V WQ+NTANKGV GF++LP
Subjt:  ENATFSLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

Q9ZVA4 EP1-like glycoprotein 3 | e value: 2.3e-32 | % identity: 56.45
Query:  FLSLSLALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVL
        FL  S A VP ++ F+ VNEG +  +  +EY+ + R        F+L FYNTTPNA+TLALR+G    ES  RWVWEANRG PV+ENAT + G DGNLVL
Subjt:  FLSLSLALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVL

Query:  ADSDGTVVWQSNTANKGVVGFKLL
        A++DG +VWQ+NTANKG VG K+L
Subjt:  ADSDGTVVWQSNTANKGVVGFKLL

Q9ZVA5 EP1-like glycoprotein 4 | e value: 1.0e-32 | % identity: 56.06
Query:  ASFFFLSLSL----ALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSL
        A FF LS+ L    A VP ++ F+ VNEG +  +  +EY+ + R        F+L FYNTT NA+TLALR+G    ES  RWVWEANRG PV+ENAT + 
Subjt:  ASFFFLSLSL----ALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSL

Query:  GADGNLVLADSDGTVVWQSNTANKGVVGFKLL
        G DGNLVLA++DG VVWQ+NTANKGVVG K+L
Subjt:  GADGNLVLADSDGTVVWQSNTANKGVVGFKLL

Arabidopsis top hits (e value, % identity, alignment)
AT1G16905.1 Curculin-like (mannose-binding) lectin family protein | e value: 1.1e-34 | % identity: 54.03
Query:  FFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLV
        F  +SL    VPP E F+F+N G+FG   VEY A+YR L + +  F+L F+NTTPNAFTLA+ MG   S+S+ RWVW+AN  +PV+E A+ S G +GNLV
Subjt:  FFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLV

Query:  LADSDGTVVWQSNTANKGVVGFKL
        LA  DG VVWQ+ T NKGV+G  +
Subjt:  LADSDGTVVWQSNTANKGVVGFKL

AT1G78820.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain | e value: 2.9e-35 | % identity: 51.11
Query:  SAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATF
        +A +   +S+ +A VPP + F+ +NE  + P+I EYDA+YR L     +    PFQL FYNTTP+A+ LALR+G  R  S  RW+W+ANR  PV +N+T 
Subjt:  SAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATF

Query:  SLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        S G +GNLVLA+ +G V WQ+NTANKGV GF++LP
Subjt:  SLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

AT1G78830.1 Curculin-like (mannose-binding) lectin family protein | e value: 1.6e-41 | % identity: 57.86
Query:  AIVDASAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVR
        AI+   A +   +S+ +A VPP + F+ VNEGEFG +I EYDA+YR +     S   +PFQL FYNTTP+A+ LALR+GL R ES  RW+W+ANR  PV 
Subjt:  AIVDASAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVL-----SIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVR

Query:  ENATFSLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP
        ENAT SLG +GNLVLA++DG V WQ+NTANKGV GF++LP
Subjt:  ENATFSLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLP

AT1G78850.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain | e value: 1.6e-33 | % identity: 56.45
Query:  FLSLSLALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVL
        FL  S A VP ++ F+ VNEG +  +  +EY+ + R        F+L FYNTTPNA+TLALR+G    ES  RWVWEANRG PV+ENAT + G DGNLVL
Subjt:  FLSLSLALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSLGADGNLVL

Query:  ADSDGTVVWQSNTANKGVVGFKLL
        A++DG +VWQ+NTANKG VG K+L
Subjt:  ADSDGTVVWQSNTANKGVVGFKLL

AT1G78860.1 D-mannose binding lectin protein with Apple-like carbohydrate-binding domain | e value: 7.2e-34 | % identity: 56.06
Query:  ASFFFLSLSL----ALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSL
        A FF LS+ L    A VP ++ F+ VNEG +  +  +EY+ + R        F+L FYNTT NA+TLALR+G    ES  RWVWEANRG PV+ENAT + 
Subjt:  ASFFFLSLSL----ALVPPNETFKFVNEGEFGPFI-VEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRPVRENATFSL

Query:  GADGNLVLADSDGTVVWQSNTANKGVVGFKLL
        G DGNLVLA++DG VVWQ+NTANKGVVG K+L
Subjt:  GADGNLVLADSDGTVVWQSNTANKGVVGFKLL
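
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough illustration, the Python sketch below counts identities and positives from a single midline block. The function name `midline_stats` is hypothetical, and the sketch assumes the midline has already been trimmed to exactly the columns of the aligned block, with interior and trailing spaces preserved. The example uses only the short second block of the first GenBank hit, so it is not expected to reproduce the whole-alignment % identity shown in the hit line.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for one BLAST midline block.

    Letters are identical residues, '+' is a conservative substitution,
    and anything else (spaces) is a mismatch or gap column.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the second alignment block of hit KAB2095672.1,
# trimmed to the 34 aligned columns.
block = "LG DGNLVLAD+DG + WQSNTANKGVVGF+LLP"
identities, positives, columns = midline_stats(block)
print(identities, positives, columns)
print(f"identity in this block: {100 * identities / columns:.2f}%")
```

To get a figure for a whole hit, the counts from all of its blocks would be summed before dividing.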


Sequences
CDS sequence
ATGATCTTCGAGTTACAGAAAACCCGCATTGGATCCGAACATGAGAATGCCGCCATTGTTGACGCCTCTGCTGCTTCTTTCTTCTTCCTTTCTCTCTCTCTAGCTCTGGT
TCCTCCCAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTCGGTCCCTTCATCGTCGAGTACGACGCCAATTACCGAGTCCTCAGCATCGGCCAAACGCCGTTTCAGC
TCGCTTTCTACAACACGACGCCCAACGCCTTCACCCTCGCCCTGCGGATGGGCCTCACTCGCTCCGAGTCCCTCTTCCGGTGGGTGTGGGAGGCCAACCGAGGCCGGCCG
GTGCGCGAGAACGCCACTTTCTCCCTCGGCGCCGACGGGAATCTGGTTCTCGCCGATTCCGACGGCACCGTCGTTTGGCAGTCGAACACCGCCAATAAGGGCGTCGTTGG
ATTCAAACTGCTCCCAACGGCAACATGGTTCTCCACGACTCCAAAGGAAAATTCCTCTGGCAGAGCTTCGATTCTCCGACCGACACTCTCTTAG
mRNA sequence
ATGATCTTCGAGTTACAGAAAACCCGCATTGGATCCGAACATGAGAATGCCGCCATTGTTGACGCCTCTGCTGCTTCTTTCTTCTTCCTTTCTCTCTCTCTAGCTCTGGT
TCCTCCCAACGAGACCTTCAAGTTCGTCAATGAAGGCGAGTTCGGTCCCTTCATCGTCGAGTACGACGCCAATTACCGAGTCCTCAGCATCGGCCAAACGCCGTTTCAGC
TCGCTTTCTACAACACGACGCCCAACGCCTTCACCCTCGCCCTGCGGATGGGCCTCACTCGCTCCGAGTCCCTCTTCCGGTGGGTGTGGGAGGCCAACCGAGGCCGGCCG
GTGCGCGAGAACGCCACTTTCTCCCTCGGCGCCGACGGGAATCTGGTTCTCGCCGATTCCGACGGCACCGTCGTTTGGCAGTCGAACACCGCCAATAAGGGCGTCGTTGG
ATTCAAACTGCTCCCAACGGCAACATGGTTCTCCACGACTCCAAAGGAAAATTCCTCTGGCAGAGCTTCGATTCTCCGACCGACACTCTCTTAG
Protein sequence
MIFELQKTRIGSEHENAAIVDASAASFFFLSLSLALVPPNETFKFVNEGEFGPFIVEYDANYRVLSIGQTPFQLAFYNTTPNAFTLALRMGLTRSESLFRWVWEANRGRP
VRENATFSLGADGNLVLADSDGTVVWQSNTANKGVVGFKLLPTATWFSTTPKENSSGRASILRPTLS
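
The protein sequence above corresponds to a translation of the CDS. A minimal Python sketch of that step follows, assuming the standard genetic code (NCBI translation table 1); the record does not say which table or tool was used, and the function name `translate` is hypothetical. For brevity the example translates only the first 30 and the last 12 nucleotides of the CDS rather than the full sequence.

```python
# Standard genetic code (NCBI table 1), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Unknown codons (e.g. containing N) are written as 'X'; a trailing
    partial codon is ignored.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGATCTTCGAGTTACAGAAAACCCGCATT"  # first 30 nt of the CDS above
cds_end = "ACACTCTCTTAG"                      # last 12 nt, ending in the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last three residues of the protein sequence listed in this record.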