; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028550 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028550
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUPF0114 domain-containing protein
Genome locationtig00153204:2224215..2225518
RNA-Seq ExpressionSgr028550
SyntenySgr028550
Gene Ontology termsNA
InterPro domainsIPR005134 - Uncharacterised protein family UPF0114


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065266.1 UPF0114 domain-containing protein [Cucumis melo var. makuwa]7.1e-5255.51Show/hide
Query:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF
        GG EG        A   P+ VETKT ELDL SL+++LLV LKTT+GK KI+K +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGS +VAESYLQYF
Subjt:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF

Query:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG
        H L QRT QT   +  +        F  G    V  IG         +MKEKN +WIS SNLF    + +  I    E+E  +          MMILQVG
Subjt:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG

Query:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        VLEKF++IPL+SA DLACFA AVLISSASIFFLS+L +  GG+ G
Subjt:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

XP_008444667.1 PREDICTED: uncharacterized protein LOC103487936 [Cucumis melo]5.5e-5255.51Show/hide
Query:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF
        GG EG        A   P+ VETKT ELDL SL+++LLV LKTT+GK KI+K +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGS +VAESYLQYF
Subjt:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF

Query:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG
        H L QRT QT   +  +        F  G    V  IG         +MKEKN +WIS SNLF    + +  I    E+E  +          MMILQVG
Subjt:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG

Query:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        VLEKF++IPL+SA DLACFA AVLISSASIFFLS+L +  GG+ G
Subjt:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

XP_022951147.1 uncharacterized protein LOC111454079 [Cucurbita moschata]6.5e-5357.69Show/hide
Query:  AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQ-----
        AAA PE VET+TRELDL SLLANLLV LK T  K KIR+ QIQKFIEKIIIDCRFFTLFAVAGSLLGSILC+LEGS +VAESYLQYF+G+ +R+      
Subjt:  AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQ-----

Query:  ---TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLN
            +  D ++        F  G+        +M EKN RW+SGSNLF            E   V      +G  V      MMILQVGVLEKF+SIPL+
Subjt:  ---TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLN

Query:  SAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        SA DLACFA A+LISSASIFFLSRL +  GG  G
Subjt:  SAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

XP_023538418.1 uncharacterized protein LOC111799204 [Cucurbita pepo subsp. pepo]3.4e-5458.97Show/hide
Query:  AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQ-----
        AAA PE VETKTRELDL SLLANLLV LK TV K KIR+ QIQKFIEKIIIDCRFFTLFAVAGSLLGSILC+LEGS +VAESYLQYF+G+ +R+      
Subjt:  AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQ-----

Query:  ---TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLN
            +  D ++        F  G+        +M EKN RW+SGSNLF            E   V      +G  V      MMILQVGVLEKF+SIPL+
Subjt:  ---TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLN

Query:  SAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        SAADLACFAGA+LISSASIFFLSRL +  G   G
Subjt:  SAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

XP_038885641.1 uncharacterized protein LOC120075956 [Benincasa hispida]3.2e-5254.8Show/hide
Query:  VNNFWRRREKADGGRE-----GAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLM
        +NN  R     DG R+      AAA P+ VET+T EL+L SLLANLLV LKTTVGK KI++ QIQKFIEKIIIDCRFFTL AVAGSLLGSILCY+EGS +
Subjt:  VNNFWRRREKADGGRE-----GAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLM

Query:  VAESYLQYFHGLWQRTQ--------TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRT------CGMM
        VAESYLQYFHGL Q +          +  D ++        F  G+       G+MKEKN   ISGSN F    + +    V  E   +         MM
Subjt:  VAESYLQYFHGLWQRTQ--------TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRT------CGMM

Query:  ILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        ILQVGVLEKF++IPL+SA DLACFA AV++SSASIFFLS+L L  GG+ G
Subjt:  ILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

TrEMBL top hitse value%identityAlignment
A0A0A0LLC9 Uncharacterized protein1.0e-5154.73Show/hide
Query:  RREKADGGREGAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHG
        RR+        A A P+ VETKT ELDL SL+ANLL+ LK T+GK KI+K +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGS +V ESYLQYFHG
Subjt:  RREKADGGREGAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHG

Query:  LWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVGVL
        L QRT QT   +  +        F  G    V  IG         +MK+KN +W S SNLF    + +  I    E+E  +          MMILQVGVL
Subjt:  LWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVGVL

Query:  EKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        EKF++IPL+SA DLACFA AVLISSASIFFLS+L +  GG++G
Subjt:  EKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

A0A1S3BAX1 uncharacterized protein LOC1034879362.6e-5255.51Show/hide
Query:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF
        GG EG        A   P+ VETKT ELDL SL+++LLV LKTT+GK KI+K +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGS +VAESYLQYF
Subjt:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF

Query:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG
        H L QRT QT   +  +        F  G    V  IG         +MKEKN +WIS SNLF    + +  I    E+E  +          MMILQVG
Subjt:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG

Query:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        VLEKF++IPL+SA DLACFA AVLISSASIFFLS+L +  GG+ G
Subjt:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

A0A5A7VG09 UPF0114 domain-containing protein3.5e-5255.51Show/hide
Query:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF
        GG EG        A   P+ VETKT ELDL SL+++LLV LKTT+GK KI+K +IQKFIEKIIIDCRFFTL AV+GSL+GSILCY+EGS +VAESYLQYF
Subjt:  GGREG--------AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYF

Query:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG
        H L QRT QT   +  +        F  G    V  IG         +MKEKN +WIS SNLF    + +  I    E+E  +          MMILQVG
Subjt:  HGLWQRT-QTKLFDRYVPRRNCSGCFWGGIVCNVRRIG---------EMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG--------MMILQVG

Query:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        VLEKF++IPL+SA DLACFA AVLISSASIFFLS+L +  GG+ G
Subjt:  VLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

A0A6J1CTB3 uncharacterized protein LOC111014021 isoform X11.3e-5156.63Show/hide
Query:  NNFWRRREKADGGR------EGAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLM
        NN  R     DG R      + A A PE V+TKTRELDL SLLANLLV LKT VGK K     IQ FIEK IIDCRFFTLFAVAGSLLGSILCYLEGS +
Subjt:  NNFWRRREKADGGR------EGAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLM

Query:  VAESYLQYFHGLWQRT-QTKLFDRYVPRRNC----SGCFWGGIVCNVRRIG--EMKEKNSRWISGSNLFETSDVGRNGIGVGSE----VEDRT--CGMMI
        VAESYLQYFHGL Q++ Q    +  +   +     +  F  G+      +G  +MKE+N  W SGSNLF    + +    VG E    V+ +     +MI
Subjt:  VAESYLQYFHGLWQRT-QTKLFDRYVPRRNC----SGCFWGGIVCNVRRIG--EMKEKNSRWISGSNLFETSDVGRNGIGVGSE----VEDRT--CGMMI

Query:  LQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        LQVGVLEKF+SIPLNSAADLACFA AVLISSASIFFLS+L    GG  G
Subjt:  LQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

A0A6J1GGV4 uncharacterized protein LOC1114540793.1e-5357.69Show/hide
Query:  AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQ-----
        AAA PE VET+TRELDL SLLANLLV LK T  K KIR+ QIQKFIEKIIIDCRFFTLFAVAGSLLGSILC+LEGS +VAESYLQYF+G+ +R+      
Subjt:  AAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQ-----

Query:  ---TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLN
            +  D ++        F  G+        +M EKN RW+SGSNLF            E   V      +G  V      MMILQVGVLEKF+SIPL+
Subjt:  ---TKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLN

Query:  SAADLACFAGAVLISSASIFFLSRLRLNMGGAAG
        SA DLACFA A+LISSASIFFLSRL +  GG  G
Subjt:  SAADLACFAGAVLISSASIFFLSRLRLNMGGAAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19390.1 Uncharacterised protein family (UPF0114)1.9e-1025.95Show/hide
Query:  IQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQY-------------------FHGLWQRTQTKLFDRYVPRRNCSGCFWGGIVCNVRR
        +++ IEK+I  CRF T     GSLLGS+LC+++G + V +S+LQY                      +       L++ ++   + S      IV N   
Subjt:  IQKFIEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQY-------------------FHGLWQRTQTKLFDRYVPRRNCSGCFWGGIVCNVRR

Query:  IGEM--KEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRL
        +  M   ++  +W+      E   V      +G  +      +M+L +G+ +K + + + S  DL C + ++  SSA +F LSRL
Subjt:  IGEM--KEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRL

AT5G13720.1 Uncharacterised protein family (UPF0114)5.7e-0725.54Show/hide
Query:  IEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQT-KLFDRYVPRRNCSGCFWGGIVCNVRRIG-----------EMKEKNSRW
        +E+II D RF  L AV GSL GS+LC+L G + + E+Y  Y+    +   T ++  R V        +  G V  +  +G           ++  ++ R 
Subjt:  IEKIIIDCRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQT-KLFDRYVPRRNCSGCFWGGIVCNVRRIG-----------EMKEKNSRW

Query:  ISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRL
        +  S+LF            + S +      VG  +      +MIL V + E+ + + + +  DL  ++  + +SSAS++ L  L
Subjt:  ISGSNLF------------ETSDVGRNGIGVGSEVEDRTCGMMILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCACTAGATTGTTACGGTCGTTCGCCTTCTGCTTCTGTCGTGTCTTCTTCCTCCTCTCCGTCGTCGGCGACGACTGTGAGGTGTTTGAGCAAGACGGGGTTGA
ACAATGGCGAACGGTTAATAACTTTTGGCGACGGCGAGAGAAGGCAGATGGTGGCCGTGAGGGCGCAGCGGCCACTCCCGAGATTGTGGAAACTAAAACCAGAGAACTGG
ATTTGGCTTCTTTGCTGGCGAATCTACTCGTTCCATTGAAGACCACTGTGGGGAAGATGAAGATTCGGAAGCTACAGATCCAGAAGTTCATCGAAAAGATCATAATCGAC
TGCCGATTCTTCACGTTATTCGCCGTCGCCGGATCTTTACTGGGTTCGATACTCTGCTACCTCGAGGGTAGCTTGATGGTTGCAGAGTCGTATCTGCAGTATTTTCATGG
TCTCTGGCAGAGAACGCAAACGAAACTCTTTGACAGATATGTTCCTCGTCGGAACTGCTCTGGTTGTTTTTGGGGTGGGATTGTTTGCAATGTTCGTCGGATCGGAGAGA
TGAAGGAAAAAAACAGTCGTTGGATTTCTGGGTCGAACTTGTTTGAAACTTCCGACGTGGGTAGAAATGGAATCGGTGTCGGAAGCGAAGTCGAAGATCGGACATGCGGT
ATGATGATACTGCAAGTGGGGGTATTGGAGAAGTTCAGGAGTATACCTTTGAACTCTGCCGCCGACCTCGCGTGTTTCGCCGGCGCCGTTCTGATTTCCTCCGCCTCCAT
CTTCTTTCTCTCCAGACTCAGACTCAACATGGGCGGCGCCGCGGGTACAAGTGAAGTGAACCGCCCCCAATGGCGGAGCGGCATTCTTCATCGGCCTCCACAAATATATG
TATTTTATGGAAGTTTCGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCCACTAGATTGTTACGGTCGTTCGCCTTCTGCTTCTGTCGTGTCTTCTTCCTCCTCTCCGTCGTCGGCGACGACTGTGAGGTGTTTGAGCAAGACGGGGTTGA
ACAATGGCGAACGGTTAATAACTTTTGGCGACGGCGAGAGAAGGCAGATGGTGGCCGTGAGGGCGCAGCGGCCACTCCCGAGATTGTGGAAACTAAAACCAGAGAACTGG
ATTTGGCTTCTTTGCTGGCGAATCTACTCGTTCCATTGAAGACCACTGTGGGGAAGATGAAGATTCGGAAGCTACAGATCCAGAAGTTCATCGAAAAGATCATAATCGAC
TGCCGATTCTTCACGTTATTCGCCGTCGCCGGATCTTTACTGGGTTCGATACTCTGCTACCTCGAGGGTAGCTTGATGGTTGCAGAGTCGTATCTGCAGTATTTTCATGG
TCTCTGGCAGAGAACGCAAACGAAACTCTTTGACAGATATGTTCCTCGTCGGAACTGCTCTGGTTGTTTTTGGGGTGGGATTGTTTGCAATGTTCGTCGGATCGGAGAGA
TGAAGGAAAAAAACAGTCGTTGGATTTCTGGGTCGAACTTGTTTGAAACTTCCGACGTGGGTAGAAATGGAATCGGTGTCGGAAGCGAAGTCGAAGATCGGACATGCGGT
ATGATGATACTGCAAGTGGGGGTATTGGAGAAGTTCAGGAGTATACCTTTGAACTCTGCCGCCGACCTCGCGTGTTTCGCCGGCGCCGTTCTGATTTCCTCCGCCTCCAT
CTTCTTTCTCTCCAGACTCAGACTCAACATGGGCGGCGCCGCGGGTACAAGTGAAGTGAACCGCCCCCAATGGCGGAGCGGCATTCTTCATCGGCCTCCACAAATATATG
TATTTTATGGAAGTTTCGGCTGA
Protein sequenceShow/hide protein sequence
MAATRLLRSFAFCFCRVFFLLSVVGDDCEVFEQDGVEQWRTVNNFWRRREKADGGREGAAATPEIVETKTRELDLASLLANLLVPLKTTVGKMKIRKLQIQKFIEKIIID
CRFFTLFAVAGSLLGSILCYLEGSLMVAESYLQYFHGLWQRTQTKLFDRYVPRRNCSGCFWGGIVCNVRRIGEMKEKNSRWISGSNLFETSDVGRNGIGVGSEVEDRTCG
MMILQVGVLEKFRSIPLNSAADLACFAGAVLISSASIFFLSRLRLNMGGAAGTSEVNRPQWRSGILHRPPQIYVFYGSFG