; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019540 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019540
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationtig00153348:622840..624693
RNA-Seq ExpressionSgr019540
SyntenySgr019540
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573252.1 hypothetical protein SDJN03_27139, partial [Cucurbita argyrosperma subsp. sororia]6.9e-6274.44Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLV+VVVFV DLVAFALA+AAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLL LFASQVI+MVASRCLCCGKA+RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + NE++SCKMLR+GVFGAGAAFI+FTCVASELFY+SFSKA +QTSS+AKDTGIRMA++
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

XP_022139805.1 uncharacterized protein LOC111010632 [Momordica charantia]8.7e-6577.78Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        MAMLVLVVVFVFDLVAFALA+AAEQRRTTA VVQSGN++FCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGK++RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKY+S + NEQ+SCKMLRKGVFGAGAAFI+FTCVASELFYVSFSKA DQTSS+AKDTGIRMANL
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

XP_022954617.1 uncharacterized protein LOC111456828 [Cucurbita moschata]6.9e-6274.44Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLV+VVVFV DLVAFALA+AAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLL LFASQVI+MVASRCLCCGKA+RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + NE++SCKMLR+GVFGAGAAFI+FTCVASELFY+SFSKA +QTSS+AKDTGIRMA++
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

XP_023542784.1 uncharacterized protein LOC111802591 [Cucurbita pepo subsp. pepo]1.5e-6173.89Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLV+VVVFV DLVAFALA+AAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLL LFA+QVI+MVASRCLCCGKA+RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + NE++SCKMLR+GVFGAGAAFI+FTCVASELFY+SFSKA +QTSS+AKDTGIRMA++
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

XP_038905949.1 uncharacterized protein LOC120091872 [Benincasa hispida]7.6e-6173.33Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLVL+VVF+FDLVAFALA+AAEQRRTTATVVQSG SKFCAYDSD+ATGLGVGSLL LFASQVI+MVASRCLCCG+ +RP    A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS L NEQ+SCKMLR+GVFGAGAAFI+FTC ASELFYVSFSKA  QTSS+AKDTGIRMA+L
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

TrEMBL top hitse value%identityAlignment
A0A0A0LV31 Uncharacterized protein3.1e-6072.22Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLVL+VVFVFDLVAFALA+AAEQRRTTA+VVQ+GNS+FCAYDSDIATGLGVGSLL LFASQVI+MVASRCLCCG+ +RP    A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + +EQ+SCKMLR+GVFGAGAAFI+FTCVASELFYVSFSKA  +TSS+AKD+GIRMANL
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

A0A1S3B5C0 uncharacterized protein LOC1034861884.1e-6071.67Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLVL+VVF+FDLVAFALA+AAEQRRTTATV QSGNS+FCAYDSDIATGLGVGS L LFASQVI+MVASRCLCCG+ +RP    A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + NEQ+SCKMLR+GVFGAGAAFI+FTCVASELFYVSFSKA  +T+S+AKDTGIRMA+L
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

A0A6J1CEZ1 uncharacterized protein LOC1110106324.2e-6577.78Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        MAMLVLVVVFVFDLVAFALA+AAEQRRTTA VVQSGN++FCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGK++RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKY+S + NEQ+SCKMLRKGVFGAGAAFI+FTCVASELFYVSFSKA DQTSS+AKDTGIRMANL
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

A0A6J1GRF5 uncharacterized protein LOC1114568283.3e-6274.44Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLV+VVVFV DLVAFALA+AAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLL LFASQVI+MVASRCLCCGKA+RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + NE++SCKMLR+GVFGAGAAFI+FTCVASELFY+SFSKA +QTSS+AKDTGIRMA++
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

A0A6J1K1A5 uncharacterized protein LOC1114901893.1e-6073.33Show/hide
Query:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        M MLV+VVVFV DLVAFALA+AAEQRRTTATVV SGNSKFCAYDSDIATGLGVGSLL LFASQVI+MVASRCLC GKA+RPS   A              
Subjt:  MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                 SVRNAYHTKYVS + NE++SCKMLR+G+FGAGAAFI+FTCVASELFYVSFSKA +QTSS+AKDTGIRMA++
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)1.0e-2338.51Show/hide
Query:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQS--GNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------
        LV ++V    LVAF  ++AAE+RR+    +Q    N+ FC YDSD+ATG GVG+ LFL +S+ ++M  ++C+C G+ + P    A               
Subjt:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQS--GNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------

Query:  ------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKA
               G+ +NAYHTKY   L+++  SC  LRKG+F AGA FI+ T V +  +Y+ F+K+
Subjt:  ------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKA

AT1G52910.1 Protein of unknown function (DUF1218)8.4e-3447.37Show/hide
Query:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKF--CAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------
        LV+++VF+ DL+A  LA+AAEQRR+   VV  G  +F  C Y SDIAT  G G+ + LF SQVIIMVASRC CCGKA++P    A               
Subjt:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKF--CAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------

Query:  ------GGSVRNAYHTKYVSFLTNEQ-LSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARD--QTSSY
               GS+RNAYHT Y      E   SC+++RKGVF AGA+F +FT + S+ +Y+S+S+ARD  QT  Y
Subjt:  ------GGSVRNAYHTKYVSFLTNEQ-LSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARD--QTSSY

AT1G61065.1 Protein of unknown function (DUF1218)1.1e-4153.33Show/hide
Query:  AMLVLVVVFVFDLVAFALALAAEQRRTTATVV-QSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------
        ++L+L++VFVFDL+AF LA+AAEQRRTT  +  +S +  +C YD DIATGLGVGS L L ASQ++IMVASRCLCCG+A+ PS   +              
Subjt:  AMLVLVVVFVFDLVAFALALAAEQRRTTATVV-QSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA--------------

Query:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL
                GSVRNAYHTKY  +  N   SC+ LRKGVFGAGAAFI+ T + SEL+YV+ S+A+D   S  +D GIRM++L
Subjt:  -------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL

AT3G15480.1 Protein of unknown function (DUF1218)5.7e-3042.69Show/hide
Query:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSK--FCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------
        LV+++VF+ DL+A  LA+AAEQRR+   V    + +  +C Y +DIAT  G G+ + LF SQV+IM ASRC CCGK++ P    A               
Subjt:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSK--FCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------

Query:  ------GGSVRNAYHTKY-VSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARD--QTSSY
                S+RNAYHT+Y   +   +  SC+++RKGVF AGAAF +FT + S+ +YV +S+ARD  Q  SY
Subjt:  ------GGSVRNAYHTKY-VSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARD--QTSSY

AT4G27435.1 Protein of unknown function (DUF1218)2.3e-3144.31Show/hide
Query:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSK--FCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------
        +V  +VFVF+L+AF LA+AAEQRR+TA VVQ    +  +C YDSD ATG GVG+ LF  ASQ++IM+ SRC CCGK ++P    A               
Subjt:  LVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSK--FCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSA---------------

Query:  ------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSS
               GSV NAYHTKY +   +    C+ LRKGVF AGA+F+ F  + S+ +Y  +  A + + S
Subjt:  ------GGSVRNAYHTKYVSFLTNEQLSCKMLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATGCTAGTTTTGGTTGTGGTGTTTGTCTTTGATTTGGTTGCTTTTGCGCTTGCTCTTGCCGCAGAGCAGAGAAGAACCACTGCCACTGTCGTTCAATCAGGCAA
TTCTAAATTCTGTGCCTATGATTCTGATATTGCAACTGGCTTAGGTGTGGGTTCACTTCTGTTCCTATTTGCTAGCCAAGTGATCATAATGGTGGCAAGTCGATGCTTAT
GCTGCGGGAAGGCTATGCGACCCAGCGATATGTCTGCTGGCGGCTCGGTGCGAAATGCCTACCATACCAAATACGTGAGTTTTCTGACAAACGAGCAACTTTCGTGCAAG
ATGCTGAGGAAGGGAGTGTTCGGGGCCGGGGCTGCTTTTATTATCTTTACATGTGTAGCATCAGAGCTCTTCTATGTTAGCTTTTCCAAGGCTCGTGACCAGACTTCCTC
CTATGCCAAAGACACTGGCATTAGAATGGCAAACCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCATGCTAGTTTTGGTTGTGGTGTTTGTCTTTGATTTGGTTGCTTTTGCGCTTGCTCTTGCCGCAGAGCAGAGAAGAACCACTGCCACTGTCGTTCAATCAGGCAA
TTCTAAATTCTGTGCCTATGATTCTGATATTGCAACTGGCTTAGGTGTGGGTTCACTTCTGTTCCTATTTGCTAGCCAAGTGATCATAATGGTGGCAAGTCGATGCTTAT
GCTGCGGGAAGGCTATGCGACCCAGCGATATGTCTGCTGGCGGCTCGGTGCGAAATGCCTACCATACCAAATACGTGAGTTTTCTGACAAACGAGCAACTTTCGTGCAAG
ATGCTGAGGAAGGGAGTGTTCGGGGCCGGGGCTGCTTTTATTATCTTTACATGTGTAGCATCAGAGCTCTTCTATGTTAGCTTTTCCAAGGCTCGTGACCAGACTTCCTC
CTATGCCAAAGACACTGGCATTAGAATGGCAAACCTATAG
Protein sequenceShow/hide protein sequence
MAMLVLVVVFVFDLVAFALALAAEQRRTTATVVQSGNSKFCAYDSDIATGLGVGSLLFLFASQVIIMVASRCLCCGKAMRPSDMSAGGSVRNAYHTKYVSFLTNEQLSCK
MLRKGVFGAGAAFIIFTCVASELFYVSFSKARDQTSSYAKDTGIRMANL