; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr004204 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr004204
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationtig00002668:649..1426
RNA-Seq ExpressionSgr004204
SyntenySgr004204
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583745.1 hypothetical protein SDJN03_19677, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8281.05Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+S   FWL  FFFFLLFG GF +VVE AEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHAT SFEK CSIKILRT KNTRCRPAH AG EQL+LSSFGGGIRTYTSG+LRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

XP_022142515.1 uncharacterized protein LOC111012615 [Momordica charantia]2.5e-8985.34Show/hide
Query:  MMTKFCSFHDSVKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC
        MMT+FC FHDSVKPFW S FFFFFLL G GF VVVESAEG+TP+ DLLSR+D RQMAGYGEERLSTVLVTGSVLCEACLHG EPQLH+WPI+GAMVGV+C
Subjt:  MMTKFCSFHDSVKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC

Query:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        HN  +NSKSS+W HGVTDEFGDFIIDIPSHLHAT SFEKVCSIKIL+T KN RCRPAHFAGREQL+LSSFGGGIRTYTSG L+LQHRTSRP
Subjt:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

XP_022927337.1 uncharacterized protein LOC111434195 [Cucurbita moschata]2.2e-8281.05Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+SV  FWL  FFFFLLFG GF +VVE AEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHAT SFEK CSIKILRT KNTRCRPAH AG EQL+LSS GGGIRTYTSG+LRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

XP_023001613.1 uncharacterized protein LOC111495689 [Cucurbita maxima]3.8e-8281.05Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+SV  F L FFFFFLLFG GF +VVESAEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHA  SFEK CSIKILRT KNTRCRPAH AG EQL+LSSFGGG RTYTSG+LRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

XP_023520170.1 uncharacterized protein LOC111783471 [Cucurbita pepo subsp. pepo]5.5e-8180.63Show/hide
Query:  MMTKFCSFHDSVKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC
        M+T+ CSFH+SV  F +  FFFFFLLFG GF +VVE AEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC
Subjt:  MMTKFCSFHDSVKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC

Query:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
         NT KNSKS  WV+GVTDEFGDF+IDIPSHLHAT SFEK CSIKILRT KNTRCRPAH AG EQL+LSSFGGGIRTYTSG+LRLQH+TSRP
Subjt:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

TrEMBL top hitse value%identityAlignment
A0A0A0LT20 Uncharacterized protein7.8e-8178.95Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+ +  SFH SV  FWL  FFF+L+ G GF +V+ESAE ETP+ DLLSR+  R++AGYGEERLSTVLVTGSVLCEACLHG EPQ+HAWPI GAMVGVNCH
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKSS+WVHGVTDEFGDF+IDIPSHLHAT SFE VCSIKILRT KNT CRPAH AGR+ L+LSSFGGGIRTYTSGVLRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

A0A5D3E5G2 Pollen_Ole_e_I domain-containing protein1.1e-7978.42Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        MM +  SFH SV  FWL  FFF+L+ G GF +V ESAE ETP+ DLL+R+  R++AGYGEERLSTVLVTGSVLCE+CLHG EPQ+HAWPI GAMVGVNCH
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKSS+WVHGVTDEFGDF+IDIPS LHAT SFE VCSIKILRT KNT CRPAH AGR+QL+LSSFGGGIRTYTSGVLRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

A0A6J1CL55 uncharacterized protein LOC1110126151.2e-8985.34Show/hide
Query:  MMTKFCSFHDSVKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC
        MMT+FC FHDSVKPFW S FFFFFLL G GF VVVESAEG+TP+ DLLSR+D RQMAGYGEERLSTVLVTGSVLCEACLHG EPQLH+WPI+GAMVGV+C
Subjt:  MMTKFCSFHDSVKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC

Query:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        HN  +NSKSS+W HGVTDEFGDFIIDIPSHLHAT SFEKVCSIKIL+T KN RCRPAHFAGREQL+LSSFGGGIRTYTSG L+LQHRTSRP
Subjt:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

A0A6J1EGW2 uncharacterized protein LOC1114341951.1e-8281.05Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+SV  FWL  FFFFLLFG GF +VVE AEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHAT SFEK CSIKILRT KNTRCRPAH AG EQL+LSS GGGIRTYTSG+LRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

A0A6J1KR10 uncharacterized protein LOC1114956891.8e-8281.05Show/hide
Query:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+SV  F L FFFFFLLFG GF +VVESAEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHA  SFEK CSIKILRT KNTRCRPAH AG EQL+LSSFGGG RTYTSG+LRLQH+TSRP
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G40113.1 Pollen Ole e 1 allergen and extensin family protein1.1e-2134.76Show/hide
Query:  VKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWV
        ++ F L FFFF   F                   L S  ++  MAGYGE +LS+V++TGS+LC             P++GA V + CH   K  + S W+
Subjt:  VKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWV

Query:  HGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKN-TRCRPAHFAG--REQLRLSSFGGGIRTYTSGVLRL----QHRTSRPCK
          VT++FG+F+I +PSHLHA    EK C +K +   K+  RC          + ++L S   G R YTSG ++L      RTS+P K
Subjt:  HGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKN-TRCRPAHFAG--REQLRLSSFGGGIRTYTSGVLRL----QHRTSRPCK

AT4G17215.1 Pollen Ole e 1 allergen and extensin family protein7.3e-2337.8Show/hide
Query:  FLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFI
        FLL      ++V   E      D  SR+++ +MAGYGE++LS+VL+T S+L  +         + PI GA +G  CH    + + S W+  VT+E G F+
Subjt:  FLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFI

Query:  IDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRT
        ID+PSHLHA    +K C IK L   K  RC      G   ++L S   G R YT+G + LQ  T
Subjt:  IDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRT

AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein1.4e-2639.53Show/hide
Query:  LSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTD
        LSFF F   FG                 +L +R ++ +MAGYGE++LS+V++TGS+LC+      P LH+ PI GA V + CH  +K  + S W+  VTD
Subjt:  LSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTD

Query:  EFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSR
        E G+F ID+PS LHA    E  C IK +   +  RC        + ++L S   G R YTSG +RLQ  +SR
Subjt:  EFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTATATAAACTGGAGGAATCTGTCTCTCAAATGGGTGTCCGTTGATTTGTGTGATCATATTGATAAAGTTCAGCTAAATAGCCAAAGCAAGAAGATGATGACCAA
ATTTTGCAGCTTCCATGATTCTGTCAAGCCATTTTGGCTCAGTTTCTTCTTCTTCTTCCTCCTCTTTGGCTGTGGATTTACAGTGGTTGTAGAATCTGCAGAAGGTGAGA
CCCCGTTGTTCGATCTTTTGAGTCGAGAGGATTTGAGGCAGATGGCTGGATATGGTGAGGAGAGACTGTCCACAGTTTTGGTCACAGGGTCTGTTCTTTGTGAAGCTTGT
TTGCATGGCGAACCTCAGCTCCATGCATGGCCTATAGCAGGTGCCATGGTGGGCGTGAATTGCCACAACACTGCAAAAAACAGCAAATCTTCTAACTGGGTACATGGAGT
CACTGATGAATTTGGAGACTTCATTATTGATATTCCATCCCATCTTCATGCAACACACAGCTTTGAAAAGGTTTGTTCCATCAAGATTCTTCGGACACTGAAGAACACAC
GCTGCCGACCTGCTCATTTCGCTGGTCGGGAACAGCTGCGATTATCGTCATTTGGAGGTGGCATCCGTACATATACTTCTGGCGTCCTCAGGCTGCAGCACCGAACATCT
CGACCCTGCAAGCTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCATTATATAAACTGGAGGAATCTGTCTCTCAAATGGGTGTCCGTTGATTTGTGTGATCATATTGATAAAGTTCAGCTAAATAGCCAAAGCAAGAAGATGATGACCAA
ATTTTGCAGCTTCCATGATTCTGTCAAGCCATTTTGGCTCAGTTTCTTCTTCTTCTTCCTCCTCTTTGGCTGTGGATTTACAGTGGTTGTAGAATCTGCAGAAGGTGAGA
CCCCGTTGTTCGATCTTTTGAGTCGAGAGGATTTGAGGCAGATGGCTGGATATGGTGAGGAGAGACTGTCCACAGTTTTGGTCACAGGGTCTGTTCTTTGTGAAGCTTGT
TTGCATGGCGAACCTCAGCTCCATGCATGGCCTATAGCAGGTGCCATGGTGGGCGTGAATTGCCACAACACTGCAAAAAACAGCAAATCTTCTAACTGGGTACATGGAGT
CACTGATGAATTTGGAGACTTCATTATTGATATTCCATCCCATCTTCATGCAACACACAGCTTTGAAAAGGTTTGTTCCATCAAGATTCTTCGGACACTGAAGAACACAC
GCTGCCGACCTGCTCATTTCGCTGGTCGGGAACAGCTGCGATTATCGTCATTTGGAGGTGGCATCCGTACATATACTTCTGGCGTCCTCAGGCTGCAGCACCGAACATCT
CGACCCTGCAAGCTTGTGTAA
Protein sequenceShow/hide protein sequence
MHYINWRNLSLKWVSVDLCDHIDKVQLNSQSKKMMTKFCSFHDSVKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEAC
LHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTLKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTS
RPCKLV