; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027320 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027320
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1677)
Genome locationtig00153048:3059396..3059869
RNA-Seq ExpressionSgr027320
SyntenySgr027320
Gene Ontology termsNA
InterPro domainsIPR012876 - Protein of unknown function DUF1677, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7021978.1 hypothetical protein SDJN02_15706, partial [Cucurbita argyrosperma subsp. argyrosperma]9.8e-6985.99Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AMAL N  SQ   A   A   AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALA+HASFC+EFRSSNPLD
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGR+LRRSLDSPRVLRSNSS +I+LE IV IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

XP_022147607.1 uncharacterized protein LOC111016490 [Momordica charantia]4.7e-7190.51Show/hide
Query:  MLMAMALSNKESQVQTAKAA-AAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPL
        ML AMAL N ESQ  T   A +AA AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSS+PL
Subjt:  MLMAMALSNKESQVQTAKAA-AAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPL

Query:  DETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
         ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIV+IGDS RLQRSGSCFPSLSS
Subjt:  DETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

XP_022927044.1 uncharacterized protein LOC111433987 [Cucurbita moschata]4.9e-6886.62Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AMAL N  SQ     AAA   AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALA+HASFC+EFRSSNPLD
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGR+LRRSLDSPRVLRSNSS +I+LE IV IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

XP_022968416.1 uncharacterized protein LOC111467661 [Cucurbita maxima]2.8e-6887.9Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AMAL N  SQ     AAAAA AEVECVKCYSCGFTEDCTPAYISRV DRFHGRWICGLCIEAVKDEVVRSGTLISTEEALA+HASFC+EFRSSNPLD
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGR+LRRSLDSPRVLRSNSSS I+LE IV IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

XP_023530607.1 uncharacterized protein LOC111793105 [Cucurbita pepo subsp. pepo]2.8e-6887.26Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AMAL N  SQ   A AAA   AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALA+HASFC+EFRS+NPLD
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGR+LRRSLDSPRVLRSNSS VI+LE IV IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

TrEMBL top hitse value%identityAlignment
A0A1S3BRF7 uncharacterized protein LOC1034923776.4e-6683.44Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AM L N +SQ        A  AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLC+EAVKDEVVRSG LIST+EAL RHASFCKEFRS+NP+D
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGRLLRRSLDSPRVLRSNSS+VI+LE IV+IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

A0A5A7UNA4 Uncharacterized protein6.4e-6683.44Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AM L N +SQ        A  AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLC+EAVKDEVVRSG LIST+EAL RHASFCKEFRS+NP+D
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGRLLRRSLDSPRVLRSNSS+VI+LE IV+IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

A0A6J1D2U1 uncharacterized protein LOC1110164902.3e-7190.51Show/hide
Query:  MLMAMALSNKESQVQTAKAA-AAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPL
        ML AMAL N ESQ  T   A +AA AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSS+PL
Subjt:  MLMAMALSNKESQVQTAKAA-AAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPL

Query:  DETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
         ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIV+IGDS RLQRSGSCFPSLSS
Subjt:  DETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

A0A6J1EGW6 uncharacterized protein LOC1114339872.3e-6886.62Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AMAL N  SQ     AAA   AEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALA+HASFC+EFRSSNPLD
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGR+LRRSLDSPRVLRSNSS +I+LE IV IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

A0A6J1HUT0 uncharacterized protein LOC1114676611.4e-6887.9Show/hide
Query:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD
        ML+AMAL N  SQ     AAAAA AEVECVKCYSCGFTEDCTPAYISRV DRFHGRWICGLCIEAVKDEVVRSGTLISTEEALA+HASFC+EFRSSNPLD
Subjt:  MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLD

Query:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        ETEHPISAMGR+LRRSLDSPRVLRSNSSS I+LE IV IGDS RLQRSGSCFPSLSS
Subjt:  ETEHPISAMGRLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72510.1 Protein of unknown function (DUF1677)2.5e-3052.24Show/hide
Query:  EVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSN-PLDETEHPISAMGRLLRRSLDSPRVLR
        E + V C  CG TE+CT +YI  VR+R+ G+WICGLC EAVK EV+R+  L++TEEA+ARH + C +F+SS+ P + T H ISAM ++LR+SLDSPR+LR
Subjt:  EVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSN-PLDETEHPISAMGRLLRRSLDSPRVLR

Query:  S--NSSSVIELEGIVEIGDSVRLQRSGSCFPSLS
        S  NS S  + +   +   +V L RS SCF SL+
Subjt:  S--NSSSVIELEGIVEIGDSVRLQRSGSCFPSLS

AT1G72510.2 Protein of unknown function (DUF1677)2.5e-3052.24Show/hide
Query:  EVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSN-PLDETEHPISAMGRLLRRSLDSPRVLR
        E + V C  CG TE+CT +YI  VR+R+ G+WICGLC EAVK EV+R+  L++TEEA+ARH + C +F+SS+ P + T H ISAM ++LR+SLDSPR+LR
Subjt:  EVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSN-PLDETEHPISAMGRLLRRSLDSPRVLR

Query:  S--NSSSVIELEGIVEIGDSVRLQRSGSCFPSLS
        S  NS S  + +   +   +V L RS SCF SL+
Subjt:  S--NSSSVIELEGIVEIGDSVRLQRSGSCFPSLS

AT2G09970.1 Protein of unknown function (DUF1677)1.9e-2543.54Show/hide
Query:  VQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLIST-EEALARHASFCKEFRSSNPLDE-TEHPISAMGR
        + T      +  E++ V C  CG T++CT +Y   +R+R+ G+WI G C EAVK +V+R+   ++T EEA+ARH + C +F+SS+P    T H ISAM +
Subjt:  VQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLIST-EEALARHASFCKEFRSSNPLDE-TEHPISAMGR

Query:  LLRRSLDSPRVLRS--NSSSVIELEGIVEIGDSVRLQRSGSCFPSLS
        +LR+SLDSPR+LRS  NS S  + +   +   +V L RS SCF SL+
Subjt:  LLRRSLDSPRVLRS--NSSSVIELEGIVEIGDSVRLQRSGSCFPSLS

AT3G22540.1 Protein of unknown function (DUF1677)6.4e-1841.07Show/hide
Query:  EVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLDETEHPISAMGRLLRR-------SLD
        E+E V+C  CG  EDCT  YIS V+  F  +W+CGLC EAV+DEV R   + + +EA+  H SFC +F+  NP     H    M ++LRR       S  
Subjt:  EVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLDETEHPISAMGRLLRR-------SLD

Query:  SPRVLRSNSSSV
        S +  RSN++ +
Subjt:  SPRVLRSNSSSV

AT5G20670.1 Protein of unknown function (DUF1677)5.0e-3955.19Show/hide
Query:  MALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLDETEH
        M+ S+  S   T    ++++  VE V C +CGF E+CTPAYI+RV++R  G W+CGLC EAVKDEVVRS T IS EEAL RH +FC  FRS +P DE E 
Subjt:  MALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLDETEH

Query:  PISAMGRLLRRSLD-SPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS
        PIS +GR+LRRSLD SPR   + +SS   L GI  +     L RSGSCFPSLS+
Subjt:  PISAMGRLLRRSLD-SPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGATGGCCATGGCGTTATCGAACAAAGAGTCTCAGGTGCAGACGGCGAAAGCGGCGGCGGCCGCTATGGCGGAGGTCGAGTGCGTGAAATGTTACTCGTGCGGGTT
CACAGAGGACTGCACCCCTGCTTACATTTCTCGTGTTCGCGATCGCTTCCATGGCAGGTGGATCTGCGGGCTCTGCATCGAGGCGGTGAAAGACGAAGTTGTGAGATCAG
GAACGCTCATCTCCACCGAGGAAGCGCTGGCCCGGCACGCGAGTTTCTGCAAAGAGTTCAGATCTTCGAACCCTCTGGACGAAACAGAGCATCCCATCTCCGCCATGGGG
AGGCTGCTGCGGCGGAGCTTGGATTCTCCGAGAGTGCTCCGGTCGAATTCGAGCAGCGTAATCGAACTCGAAGGGATCGTGGAGATTGGCGATTCGGTGAGGCTCCAACG
TTCGGGGAGTTGCTTCCCTTCCTTGTCTAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGATGGCCATGGCGTTATCGAACAAAGAGTCTCAGGTGCAGACGGCGAAAGCGGCGGCGGCCGCTATGGCGGAGGTCGAGTGCGTGAAATGTTACTCGTGCGGGTT
CACAGAGGACTGCACCCCTGCTTACATTTCTCGTGTTCGCGATCGCTTCCATGGCAGGTGGATCTGCGGGCTCTGCATCGAGGCGGTGAAAGACGAAGTTGTGAGATCAG
GAACGCTCATCTCCACCGAGGAAGCGCTGGCCCGGCACGCGAGTTTCTGCAAAGAGTTCAGATCTTCGAACCCTCTGGACGAAACAGAGCATCCCATCTCCGCCATGGGG
AGGCTGCTGCGGCGGAGCTTGGATTCTCCGAGAGTGCTCCGGTCGAATTCGAGCAGCGTAATCGAACTCGAAGGGATCGTGGAGATTGGCGATTCGGTGAGGCTCCAACG
TTCGGGGAGTTGCTTCCCTTCCTTGTCTAGTTAA
Protein sequenceShow/hide protein sequence
MLMAMALSNKESQVQTAKAAAAAMAEVECVKCYSCGFTEDCTPAYISRVRDRFHGRWICGLCIEAVKDEVVRSGTLISTEEALARHASFCKEFRSSNPLDETEHPISAMG
RLLRRSLDSPRVLRSNSSSVIELEGIVEIGDSVRLQRSGSCFPSLSS