; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020878 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020878
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationtig00153574:833025..833835
RNA-Seq ExpressionSgr020878
SyntenySgr020878
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583745.1 hypothetical protein SDJN03_19677, partial [Cucurbita argyrosperma subsp. sororia]4.9e-8882.05Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+SD  FWL  FFFFLLFG GF +VVE AEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACV
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHAT SFEK CSIKILRTPKNTRCRPAH AG EQL+LSSFGGGIRTYTSG+LRLQH+TSRPLQAC+
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACV

KAG7012641.1 hypothetical protein SDJN02_25393 [Cucurbita argyrosperma subsp. argyrosperma]6.0e-8677.94Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSF+         FFFFFLLF  GF +VVESAE ETPL DLLSR++ RQ AGYGEERLSTVLVTGS+LCEACLHG EPQ+HAWP+ GAMVGV CH
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSG
        N+ KNSKSS+WVHG+TDEFGDFIIDIPS  HAT SFEKVCSIKILRTPKN RCRPAHFAGR+QL+LSSFGGGIRTYTSG+LRLQH+TS+PLQAC NKG  
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSG

Query:  DQQT
        DQQT
Subjt:  DQQT

XP_004139527.1 uncharacterized protein LOC101215830 [Cucumis sativus]1.2e-8677.94Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+ +  SFH S   FWL  FFF+L+ G GF +V+ESAE ETP+ DLLSR+  R++AGYGEERLSTVLVTGSVLCEACLHG EPQ+HAWPI GAMVGVNCH
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSG
        NT KNSKSS+WVHGVTDEFGDF+IDIPSHLHAT SFE VCSIKILRTPKNT CRPAH AGR+ L+LSSFGGGIRTYTSGVLRLQH+TSRPLQAC N+G G
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSG

Query:  DQQT
         +QT
Subjt:  DQQT

XP_022142515.1 uncharacterized protein LOC111012615 [Momordica charantia]2.0e-9784.88Show/hide
Query:  MMTKFCSFHDSDKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC
        MMT+FC FHDS KPFW S FFFFFLL G GF VVVESAEG+TP+ DLLSR+D RQMAGYGEERLSTVLVTGSVLCEACLHG EPQLH+WPI+GAMVGV+C
Subjt:  MMTKFCSFHDSDKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC

Query:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGS
        HN  +NSKSS+W HGVTDEFGDFIIDIPSHLHAT SFEKVCSIKIL+TPKN RCRPAHFAGREQL+LSSFGGGIRTYTSG L+LQHRTSRPLQ C+NKGS
Subjt:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGS

Query:  GDQQT
        GD+QT
Subjt:  GDQQT

XP_038895694.1 uncharacterized protein LOC120083866 [Benincasa hispida]6.4e-8878.82Show/hide
Query:  MTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCHN
        M +  S H S   FW+  FFFF LFG GF + VE+AE ETP+ DLLSR+D RQ+AGYGEERLSTVLVTGSVLCEACLHG EPQ+HAWPI GAMVGVNCHN
Subjt:  MTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCHN

Query:  TAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSGD
          KNSKSS+WVHGVTDEFGDFIIDIPSHLHAT SFE VCSIKIL+TPKNT CRPAH AG +QL+LSSFGGGIRTYTSGVLRLQH+TSRPLQAC N+G GD
Subjt:  TAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSGD

Query:  QQT
        +QT
Subjt:  QQT

TrEMBL top hitse value%identityAlignment
A0A0A0LT20 Uncharacterized protein5.8e-8777.94Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+ +  SFH S   FWL  FFF+L+ G GF +V+ESAE ETP+ DLLSR+  R++AGYGEERLSTVLVTGSVLCEACLHG EPQ+HAWPI GAMVGVNCH
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSG
        NT KNSKSS+WVHGVTDEFGDF+IDIPSHLHAT SFE VCSIKILRTPKNT CRPAH AGR+ L+LSSFGGGIRTYTSGVLRLQH+TSRPLQAC N+G G
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGSG

Query:  DQQT
         +QT
Subjt:  DQQT

A0A5D3E5G2 Pollen_Ole_e_I domain-containing protein1.9e-8577.18Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        MM +  SFH S   FWL  FFF+L+ G GF +V ESAE ETP+ DLL+R+  R++AGYGEERLSTVLVTGSVLCE+CLHG EPQ+HAWPI GAMVGVNCH
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKG-S
        NT KNSKSS+WVHGVTDEFGDF+IDIPS LHAT SFE VCSIKILRTPKNT CRPAH AGR+QL+LSSFGGGIRTYTSGVLRLQH+TSRPLQAC N+G S
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKG-S

Query:  GDQQTW
        G Q +W
Subjt:  GDQQTW

A0A6J1CL55 uncharacterized protein LOC1110126159.6e-9884.88Show/hide
Query:  MMTKFCSFHDSDKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC
        MMT+FC FHDS KPFW S FFFFFLL G GF VVVESAEG+TP+ DLLSR+D RQMAGYGEERLSTVLVTGSVLCEACLHG EPQLH+WPI+GAMVGV+C
Subjt:  MMTKFCSFHDSDKPFWLS-FFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNC

Query:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGS
        HN  +NSKSS+W HGVTDEFGDFIIDIPSHLHAT SFEKVCSIKIL+TPKN RCRPAHFAGREQL+LSSFGGGIRTYTSG L+LQHRTSRPLQ C+NKGS
Subjt:  HNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACVNKGS

Query:  GDQQT
        GD+QT
Subjt:  GDQQT

A0A6J1EGW2 uncharacterized protein LOC1114341952.9e-8681.03Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+S   FWL  FFFFLLFG GF +VVE AEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACV
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHAT SFEK CSIKILRTPKNTRCRPAH AG EQL+LSS GGGIRTYTSG+LRLQH+TSRPLQAC+
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACV

A0A6J1KR10 uncharacterized protein LOC1114956894.9e-8681.03Show/hide
Query:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH
        M+T+ CSFH+S   F L FFFFFLLFG GF +VVESAEGETPL DLL R+D RQ+AGYGEERLSTVLVTGSVLCEACLHG E Q+HAWPI GAMVGVNC 
Subjt:  MMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHG-EPQLHAWPIAGAMVGVNCH

Query:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACV
        NT KNSKS  WV+GVTDEFGDF+IDIPSHLHA  SFEK CSIKILRTPKNTRCRPAH AG EQL+LSSFGGG RTYTSG+LRLQH+TSRPLQAC+
Subjt:  NTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSRPLQACV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G40113.1 Pollen Ole e 1 allergen and extensin family protein1.2e-2338.31Show/hide
Query:  LLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILR
        L S  ++  MAGYGE +LS+V++TGS+LC             P++GA V + CH   K  + S W+  VT++FG+F+I +PSHLHA    EK C +K + 
Subjt:  LLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILR

Query:  TPKN-TRCRPAHFAG--REQLRLSSFGGGIRTYTSGVLRL----QHRTSRPLQA
         PK+  RC          + ++L S   G R YTSG ++L      RTS+P +A
Subjt:  TPKN-TRCRPAHFAG--REQLRLSSFGGGIRTYTSGVLRL----QHRTSRPLQA

AT4G17215.1 Pollen Ole e 1 allergen and extensin family protein4.1e-2438.41Show/hide
Query:  FLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFI
        FLL      ++V   E      D  SR+++ +MAGYGE++LS+VL+T S+L  +         + PI GA +G  CH    + + S W+  VT+E G F+
Subjt:  FLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFI

Query:  IDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRT
        ID+PSHLHA    +K C IK L  PK  RC      G   ++L S   G R YT+G + LQ  T
Subjt:  IDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRT

AT5G47635.1 Pollen Ole e 1 allergen and extensin family protein7.9e-2840.12Show/hide
Query:  LSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTD
        LSFF F   FG                 +L +R ++ +MAGYGE++LS+V++TGS+LC+      P LH+ PI GA V + CH  +K  + S W+  VTD
Subjt:  LSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEACLHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTD

Query:  EFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSR
        E G+F ID+PS LHA    E  C IK +  P+  RC        + ++L S   G R YTSG +RLQ  +SR
Subjt:  EFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTATATAAACTGGAGGAATCTGTCTCTCAAATGGGTGTCCGTTGATTTGTGTGATCATATTGATAAAGTTCAGCTAAATAGCCAAAGCAAGAAGATGATGACCAA
ATTTTGCAGCTTCCATGATTCTGACAAGCCATTTTGGCTCAGTTTCTTCTTCTTCTTCCTCCTCTTTGGCTGTGGATTTACAGTGGTTGTAGAATCTGCAGAAGGCGAGA
CCCCGTTGTTCGATCTTTTGAGTCGAGAGGATTTGAGGCAGATGGCTGGATATGGTGAGGAGAGACTGTCCACAGTTTTGGTCACAGGGTCTGTTCTTTGTGAAGCTTGT
TTGCATGGTGAACCTCAGCTTCATGCATGGCCTATAGCAGGTGCCATGGTGGGCGTGAATTGCCACAACACTGCAAAAAACAGCAAATCTTCTAACTGGGTACATGGAGT
CACTGATGAATTTGGAGACTTCATTATTGATATTCCATCCCATCTTCATGCAACGCACAGCTTTGAAAAGGTTTGTTCCATCAAGATTCTTCGGACACCGAAGAACACAC
GCTGCCGACCTGCTCATTTCGCTGGTCGGGAACAGCTGCGATTATCGTCATTTGGAGGTGGCATCCGTACATATACTTCTGGCGTCCTCAGGCTGCAGCACCGAACATCT
CGACCCCTGCAAGCTTGTGTAAACAAGGGGAGTGGTGACCAACAGACATGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATTATATAAACTGGAGGAATCTGTCTCTCAAATGGGTGTCCGTTGATTTGTGTGATCATATTGATAAAGTTCAGCTAAATAGCCAAAGCAAGAAGATGATGACCAA
ATTTTGCAGCTTCCATGATTCTGACAAGCCATTTTGGCTCAGTTTCTTCTTCTTCTTCCTCCTCTTTGGCTGTGGATTTACAGTGGTTGTAGAATCTGCAGAAGGCGAGA
CCCCGTTGTTCGATCTTTTGAGTCGAGAGGATTTGAGGCAGATGGCTGGATATGGTGAGGAGAGACTGTCCACAGTTTTGGTCACAGGGTCTGTTCTTTGTGAAGCTTGT
TTGCATGGTGAACCTCAGCTTCATGCATGGCCTATAGCAGGTGCCATGGTGGGCGTGAATTGCCACAACACTGCAAAAAACAGCAAATCTTCTAACTGGGTACATGGAGT
CACTGATGAATTTGGAGACTTCATTATTGATATTCCATCCCATCTTCATGCAACGCACAGCTTTGAAAAGGTTTGTTCCATCAAGATTCTTCGGACACCGAAGAACACAC
GCTGCCGACCTGCTCATTTCGCTGGTCGGGAACAGCTGCGATTATCGTCATTTGGAGGTGGCATCCGTACATATACTTCTGGCGTCCTCAGGCTGCAGCACCGAACATCT
CGACCCCTGCAAGCTTGTGTAAACAAGGGGAGTGGTGACCAACAGACATGGTAG
Protein sequenceShow/hide protein sequence
MHYINWRNLSLKWVSVDLCDHIDKVQLNSQSKKMMTKFCSFHDSDKPFWLSFFFFFLLFGCGFTVVVESAEGETPLFDLLSREDLRQMAGYGEERLSTVLVTGSVLCEAC
LHGEPQLHAWPIAGAMVGVNCHNTAKNSKSSNWVHGVTDEFGDFIIDIPSHLHATHSFEKVCSIKILRTPKNTRCRPAHFAGREQLRLSSFGGGIRTYTSGVLRLQHRTS
RPLQACVNKGSGDQQTW