CuGenDBv2

Sgr030202 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030202
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: tig00153574:1286568..1287011
RNA-Seq Expression: Sgr030202
Synteny: Sgr030202
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139493.1 uncharacterized protein LOC101207654 [Cucumis sativus] (e value: 1.1e-42; % identity: 67.13)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSN--CIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ-H
        C       GN+I+VKC++ KNL IAITK DGSFET+LPS+ A S+AAPS+  CIAKL+GG HQL+ASRK+M ST++K + SK FFTIATALKFSTCK+  
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSN--CIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ-H

Query:  RKCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        R C+A+K+E V DSKT D PLPPEWG PP+SYY+PVLPIIGIP
Subjt:  RKCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

XP_022142556.1 uncharacterized protein LOC111012639 [Momordica charantia] (e value: 1.3e-54; % identity: 82.58)
Query:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHRKCQAMKEEFV
        GN IVVKC+KVKNLA+AIT+ DGSFETTLPSD +NSQ+ PSNCIAKLVGGPHQLYASRKDMAS V+KA+ SK FFTIATALKFSTCKQH KCQAMK++F+
Subjt:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHRKCQAMKEEFV

Query:  GDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
         DSKT+DLPLP EWGL PSSYYLP LPIIGIP
Subjt:  GDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

XP_022967108.1 uncharacterized protein LOC111466611 [Cucurbita maxima] (e value: 1.0e-43; % identity: 68.31)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDT--ANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR
        C       GNMIVV C  VKNL++AIT+ +GSF+TTLPSDT   +S+AAPSNCIAKL+GGPHQLYASRK MAS V+K + S  FFT+A AL FSTCK + 
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDT--ANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR

Query:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        KC ++K  FV DSKTIDLPLPPEWG PP+SYY PVLPIIGIP
Subjt:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

XP_023542373.1 uncharacterized protein LOC111802294 [Cucurbita pepo subsp. pepo] (e value: 1.0e-43; % identity: 68.31)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR
        C       GNMIVV C  VKNL++AIT+ +GSF+TTLPSD A  +S+AAPSNCIAKL GGPHQLYASRK M+STV+K + S  FFT+A AL FSTCK + 
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR

Query:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        KC ++K  FV DSKTIDLPLPPEWG PP+SYY PVLPIIGIP
Subjt:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

XP_038895206.1 uncharacterized protein LOC120083500 [Benincasa hispida] (e value: 9.2e-45; % identity: 68.06)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAAPSN--CIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ
        C     L GN+I+V C++VKNL +AITK DGSFET LPSD A  + +AA S+  CIAKLVGG HQL+ASRKDMAST++K +  K FFTIAT LKFSTCK+
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAAPSN--CIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ

Query:  HRKCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        ++KC+AMK++F+ DSKTIDLPLPPEWG PP+SYY PVLPIIGIP
Subjt:  HRKCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LYA1 Uncharacterized protein (e value: 5.5e-43; % identity: 67.13)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSN--CIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ-H
        C       GN+I+VKC++ KNL IAITK DGSFET+LPS+ A S+AAPS+  CIAKL+GG HQL+ASRK+M ST++K + SK FFTIATALKFSTCK+  
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSN--CIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ-H

Query:  RKCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        R C+A+K+E V DSKT D PLPPEWG PP+SYY+PVLPIIGIP
Subjt:  RKCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

A0A1S3CDU1 uncharacterized protein LOC103499896 (e value: 9.3e-43; % identity: 67.12)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAA--PSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ
        C       GN+I+VKC++VKNL IAITK DGSFET LPSD A  +S+AA  P  CIAKLVGG HQL+ASRK++ ST++K + SK FFTIATAL+FSTCK+
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAA--PSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQ

Query:  -HRKCQAMKEEFVGDSKTIDLP-LPPEWGLPPSSYYLPVLPIIGIP
         +RKC+A+K+E + DSKT DLP LPPEWG PP+SYYLPVLPIIGIP
Subjt:  -HRKCQAMKEEFVGDSKTIDLP-LPPEWGLPPSSYYLPVLPIIGIP

A0A6J1CMJ0 uncharacterized protein LOC111012639 (e value: 6.2e-55; % identity: 82.58)
Query:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHRKCQAMKEEFV
        GN IVVKC+KVKNLA+AIT+ DGSFETTLPSD +NSQ+ PSNCIAKLVGGPHQLYASRKDMAS V+KA+ SK FFTIATALKFSTCKQH KCQAMK++F+
Subjt:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHRKCQAMKEEFV

Query:  GDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
         DSKT+DLPLP EWGL PSSYYLP LPIIGIP
Subjt:  GDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

A0A6J1G179 uncharacterized protein LOC111449745 (e value: 1.6e-42; % identity: 66.2)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR
        C       GNM+ V C  VKNL+IAIT+ +GSF+TTLPS+ A  +S+ APSNCIAKL+GGPHQLYASRK MAS V+K + S  FFT+A AL FSTCK + 
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTA--NSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR

Query:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        KC ++K  FV DSKTIDLPLPPEWG PP+SYY PVLPIIGIP
Subjt:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

A0A6J1HU61 uncharacterized protein LOC111466611 (e value: 4.9e-44; % identity: 68.31)
Query:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDT--ANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR
        C       GNMIVV C  VKNL++AIT+ +GSF+TTLPSDT   +S+AAPSNCIAKL+GGPHQLYASRK MAS V+K + S  FFT+A AL FSTCK + 
Subjt:  CDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDT--ANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHR

Query:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        KC ++K  FV DSKTIDLPLPPEWG PP+SYY PVLPIIGIP
Subjt:  KCQAMKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein (e value: 1.6e-15; % identity: 38.41)
Query:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKAS------TSKLFFTIATALKFSTCKQHRKCQA
        G  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S +VK        +SKLFF  +    F +         
Subjt:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKAS------TSKLFFTIATALKFSTCKQHRKCQA

Query:  MKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
                SKT+DLP+PPEWGL P+SYY+P LPIIGIP
Subjt:  MKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein (e value: 1.6e-15; % identity: 38.41)
Query:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKAS------TSKLFFTIATALKFSTCKQHRKCQA
        G  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S +VK        +SKLFF  +    F +         
Subjt:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKAS------TSKLFFTIATALKFSTCKQHRKCQA

Query:  MKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
                SKT+DLP+PPEWGL P+SYY+P LPIIGIP
Subjt:  MKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein (e value: 1.6e-15; % identity: 38.41)
Query:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKAS------TSKLFFTIATALKFSTCKQHRKCQA
        G  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S +VK        +SKLFF  +    F +         
Subjt:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKAS------TSKLFFTIATALKFSTCKQHRKCQA

Query:  MKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
                SKT+DLP+PPEWGL P+SYY+P LPIIGIP
Subjt:  MKEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein (e value: 3.9e-17; % identity: 39.85)
Query:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFS-TCKQHRKCQAMKEEF
        G  +++KCD  K    A+   DGSF + LP  TA+ + +  NC+AKL+GGP QLYA + ++ S +VK+       T +  L FS +C +  +        
Subjt:  GNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFS-TCKQHRKCQAMKEEF

Query:  VGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP
        +GDSKTI+ P    +G PP+S++ P LPIIGIP
Subjt:  VGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP


Sequences
CDS sequence
ATGTGTTATCCTAGGGCTCTTTGTGACATATGTATCTCTCTTTTAGGGAACATGATCGTGGTGAAATGTGACAAAGTGAAGAACCTAGCCATAGCAATCACCAAAGTCGA
TGGATCATTCGAAACTACGCTTCCCTCCGACACCGCCAACTCCCAAGCAGCTCCTTCCAACTGCATTGCCAAGCTTGTTGGGGGACCTCACCAGCTCTATGCATCAAGGA
AAGACATGGCTTCAACTGTTGTCAAGGCAAGCACCTCCAAGTTGTTCTTCACCATTGCCACTGCTCTCAAGTTCTCCACATGCAAACAACACAGAAAATGCCAAGCCATG
AAAGAGGAGTTTGTTGGAGATTCAAAGACCATTGATTTGCCTCTACCACCTGAGTGGGGGCTGCCACCCTCCAGCTACTATCTTCCTGTGCTTCCAATCATAGGCATTCC
TTGA
mRNA sequence
ATGTGTTATCCTAGGGCTCTTTGTGACATATGTATCTCTCTTTTAGGGAACATGATCGTGGTGAAATGTGACAAAGTGAAGAACCTAGCCATAGCAATCACCAAAGTCGA
TGGATCATTCGAAACTACGCTTCCCTCCGACACCGCCAACTCCCAAGCAGCTCCTTCCAACTGCATTGCCAAGCTTGTTGGGGGACCTCACCAGCTCTATGCATCAAGGA
AAGACATGGCTTCAACTGTTGTCAAGGCAAGCACCTCCAAGTTGTTCTTCACCATTGCCACTGCTCTCAAGTTCTCCACATGCAAACAACACAGAAAATGCCAAGCCATG
AAAGAGGAGTTTGTTGGAGATTCAAAGACCATTGATTTGCCTCTACCACCTGAGTGGGGGCTGCCACCCTCCAGCTACTATCTTCCTGTGCTTCCAATCATAGGCATTCC
TTGA
Protein sequence
MCYPRALCDICISLLGNMIVVKCDKVKNLAIAITKVDGSFETTLPSDTANSQAAPSNCIAKLVGGPHQLYASRKDMASTVVKASTSKLFFTIATALKFSTCKQHRKCQAM
KEEFVGDSKTIDLPLPPEWGLPPSSYYLPVLPIIGIP