; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003220 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003220
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00002024:9904..14266
RNA-Seq ExpressionSgr003220
SyntenySgr003220
Gene Ontology termsNA
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598702.1 hypothetical protein SDJN03_08480, partial [Cucurbita argyrosperma subsp. sororia]6.3e-2049.71Show/hide
Query:  SLSKLKDHAAAAMA----RSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF
        +LSKLK H AAA A    R+ N G     + RC+KHPKHKQSPGVCSLCLREKLS+L  T      +A+ T  S SSSSLSSLSSYYSSS  SS +SPYF
Subjt:  SLSKLKDHAAAAMA----RSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF

Query:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRS
         P T      S+S LFKRR +    T                        GFWSKLM+NRR K++V  +  RS
Subjt:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRS

KAG7029644.1 hypothetical protein SDJN02_07984, partial [Cucurbita argyrosperma subsp. argyrosperma]6.3e-2049.71Show/hide
Query:  SLSKLKDHAAAAMA----RSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF
        +L KLK H AAA A    R  N G G   + RC+KHPKHKQSPGVCSLCLREKLS+L  T      +A+ T  S SSSSLSSLSSYYSSS  SS +SPYF
Subjt:  SLSKLKDHAAAAMA----RSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF

Query:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRS
         P T      S+S LFKRR +    T                        GFWSKLM+NRR K++V     RS
Subjt:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRS

XP_008444863.1 PREDICTED: uncharacterized serine-rich protein C215.13-like [Cucumis melo]4.9e-3357.89Show/hide
Query:  ILSLSKLKDHAAAAMARSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY
        ++  SKLK HAAA MARSSN GG  GGG  +C+KHPKHKQSPGVCS+CLREKL +L    T  S+S  +S  + S SSSSLSSLSSYYSSSS SS SSPY
Subjt:  ILSLSKLKDHAAAAMARSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY

Query:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS---TRDHQRIAS
        F   + +K SIS   SLLFKRR SS+  ++S + T++ F      + K    DGFWSKLM+NRRGKEIVEE   R +S   T DHQ I +
Subjt:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS---TRDHQRIAS

XP_022131967.1 uncharacterized protein LOC111004952 [Momordica charantia]1.8e-2760.44Show/hide
Query:  SLSKLKDHAAAAMARSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF
        +L KLKDHAAAAMAR+     GGGG  RC+KHPKH+QSPGVCSLCLREKLS L+NT   GS+A +IA     S SSSSLSS+SS YSS SSASSCSSP  
Subjt:  SLSKLKDHAAAAMARSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF

Query:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS-TRDH
             RK SIS MS LFKRR S +N L++SRSL                 ADG WSKL+VNRRGK   E+ LRRS S T DH
Subjt:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS-TRDH

XP_031736533.1 uncharacterized serine-rich protein C215.13-like [Cucumis sativus]9.6e-2952.04Show/hide
Query:  SKLKDHAAAAMARSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS
        + LK HAAA MARSSN G         GGG G+  C+KHPKHKQSPGVCS+CLREKL +L  T + +S  +S  + S SSSSLSSLSSYYSSSS SS SS
Subjt:  SKLKDHAAAAMARSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS

Query:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDG--DRKKNKA-DGFWSKLMVNRRGKEIVEEAL----RRSTSTRDHQRIAS
        PY    + +K S+S   SLLFKRR SS+  +++ + T++ F    D    R  NK+  GFWSKLM+NRRGKEI+ E +      +++T DHQ I +
Subjt:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDG--DRKKNKA-DGFWSKLMVNRRGKEIVEEAL----RRSTSTRDHQRIAS

TrEMBL top hitse value%identityAlignment
A0A067D3G1 Uncharacterized protein (Fragment)4.8e-1849.34Show/hide
Query:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT
        I+C+KHPKHKQSPGVCSLCLR+KLS L    SS+SR    T+ SC  SS SSLSSYYSSS ASSCSSP F  +    T  +G  S+S L    R +N LT
Subjt:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT

Query:  TSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTSTRD
         SRSL S     ++     K K +G +SKL    R K+  ++ L  S + R+
Subjt:  TSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTSTRD

A0A0A0LPJ0 Uncharacterized protein4.7e-2952.04Show/hide
Query:  SKLKDHAAAAMARSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS
        + LK HAAA MARSSN G         GGG G+  C+KHPKHKQSPGVCS+CLREKL +L  T + +S  +S  + S SSSSLSSLSSYYSSSS SS SS
Subjt:  SKLKDHAAAAMARSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS

Query:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDG--DRKKNKA-DGFWSKLMVNRRGKEIVEEAL----RRSTSTRDHQRIAS
        PY    + +K S+S   SLLFKRR SS+  +++ + T++ F    D    R  NK+  GFWSKLM+NRRGKEI+ E +      +++T DHQ I +
Subjt:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDG--DRKKNKA-DGFWSKLMVNRRGKEIVEEAL----RRSTSTRDHQRIAS

A0A1S3BC83 uncharacterized serine-rich protein C215.13-like2.4e-3357.89Show/hide
Query:  ILSLSKLKDHAAAAMARSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY
        ++  SKLK HAAA MARSSN GG  GGG  +C+KHPKHKQSPGVCS+CLREKL +L    T  S+S  +S  + S SSSSLSSLSSYYSSSS SS SSPY
Subjt:  ILSLSKLKDHAAAAMARSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY

Query:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS---TRDHQRIAS
        F   + +K SIS   SLLFKRR SS+  ++S + T++ F      + K    DGFWSKLM+NRRGKEIVEE   R +S   T DHQ I +
Subjt:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS---TRDHQRIAS

A0A6J1BSJ2 uncharacterized protein LOC1110049528.8e-2860.44Show/hide
Query:  SLSKLKDHAAAAMARSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF
        +L KLKDHAAAAMAR+     GGGG  RC+KHPKH+QSPGVCSLCLREKLS L+NT   GS+A +IA     S SSSSLSS+SS YSS SSASSCSSP  
Subjt:  SLSKLKDHAAAAMARSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF

Query:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS-TRDH
             RK SIS MS LFKRR S +N L++SRSL                 ADG WSKL+VNRRGK   E+ LRRS S T DH
Subjt:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTS-TRDH

V4V5C9 Uncharacterized protein4.8e-1849.34Show/hide
Query:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT
        I+C+KHPKHKQSPGVCSLCLR+KLS L    SS+SR    T+ SC  SS SSLSSYYSSS ASSCSSP F  +    T  +G  S+S L    R +N LT
Subjt:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT

Query:  TSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTSTRD
         SRSL S     ++     K K +G +SKL    R K+  ++ L  S + R+
Subjt:  TSRSLTSSRFTDKDDGDRKKNKADGFWSKLMVNRRGKEIVEEALRRSTSTRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22470.1 unknown protein2.6e-0851.9Show/hide
Query:  CKKHPKHKQSPGVCSLCLREKLSHLVN-----TGSSASRIASATMGSCSSSSLSS--------LSSYYSSSSASSCSSP
        CKKHPKH+QSPG+CSLCL E LS L +     + S +S   + TM SCSS+S  S        +SSYY  SS SSC SP
Subjt:  CKKHPKHKQSPGVCSLCLREKLSHLVN-----TGSSASRIASATMGSCSSSSLSS--------LSSYYSSSSASSCSSP

AT1G35210.1 unknown protein1.7e-0739.86Show/hide
Query:  AIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPHTARKGSISMSLLFKRRRSSNFLTTSR
        A+ CKKHPKH+QSPGVCSLCL E+LS  +   SS  R  S  + S SSS+ SSLSS   SSS SSC SP       R+  +      +  +  +++T SR
Subjt:  AIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPHTARKGSISMSLLFKRRRSSNFLTTSR

Query:  SLTSSRFTDKDDGDRKKNKA---DGFWSKLMVNRRGKE
        S+        DD  R+K K     GF+  L++  + ++
Subjt:  SLTSSRFTDKDDGDRKKNKA---DGFWSKLMVNRRGKE

AT1G72240.1 unknown protein6.9e-0944.86Show/hide
Query:  AAMARSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASAT---MGSCSSSSLSSLSSYYSSSSASSCSSP-YFRPHTARKGSI
        A   + S+          CKKH KH+QSPG+CSLCL E+LS L       ++ A  T    GS S+SS SS+SS YSSSS SSCSSP  +R    +K   
Subjt:  AAMARSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASAT---MGSCSSSSLSSLSSYYSSSSASSCSSP-YFRPHTARKGSI

Query:  SMSLLFK
          S LF+
Subjt:  SMSLLFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGTGGAGTCAACTTTGCATTTGAAACCATCCAAAAGAGCTTCCCAGACTTCAATGTCGACTTCATCTACAATACTTTCCGTGATGATTTCAACTCCGGTGGCGA
TCATGCTGGCGAAAGCTCAGGTCCGGGCCCCACGGGCGGTCGACCCGGGCGGGCCGAGCCGGCTGAATTTCCAAAAGTGGAGGCAGTGGCCCCCACCCAAAACCCTCTCC
TCGAATCACAGCCGTTGATTTTGAGCTTGAGCAAACTAAAGGATCATGCGGCGGCGGCAATGGCAAGGAGTTCCAACCTTGGTGGTGGCGGCGGCGGCGCTATTCGATGC
AAGAAGCACCCAAAACACAAGCAATCACCGGGCGTCTGCTCGCTTTGTCTAAGAGAAAAACTCTCTCACTTGGTTAATACAGGTTCTTCCGCTAGCCGAATAGCGTCTGC
AACAATGGGTTCTTGTTCATCTTCTTCTTTATCTTCCTTATCGTCCTATTACTCTTCTTCTTCGGCTTCGTCTTGCTCTTCCCCGTATTTTCGTCCACATACTGCAAGAA
AGGGCTCGATTTCCATGTCCTTGTTGTTCAAAAGACGAAGAAGCAGTAATTTCCTGACTACAAGTAGATCTCTAACTTCTTCCAGATTTACAGACAAGGACGACGGAGAT
AGAAAGAAGAATAAAGCTGACGGGTTCTGGTCAAAGTTGATGGTGAATCGAAGAGGGAAGGAGATCGTGGAAGAAGCTCTCAGACGTTCAACTTCTACAAGAGATCATCA
GAGAATTGCTAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGTGGAGTCAACTTTGCATTTGAAACCATCCAAAAGAGCTTCCCAGACTTCAATGTCGACTTCATCTACAATACTTTCCGTGATGATTTCAACTCCGGTGGCGA
TCATGCTGGCGAAAGCTCAGGTCCGGGCCCCACGGGCGGTCGACCCGGGCGGGCCGAGCCGGCTGAATTTCCAAAAGTGGAGGCAGTGGCCCCCACCCAAAACCCTCTCC
TCGAATCACAGCCGTTGATTTTGAGCTTGAGCAAACTAAAGGATCATGCGGCGGCGGCAATGGCAAGGAGTTCCAACCTTGGTGGTGGCGGCGGCGGCGCTATTCGATGC
AAGAAGCACCCAAAACACAAGCAATCACCGGGCGTCTGCTCGCTTTGTCTAAGAGAAAAACTCTCTCACTTGGTTAATACAGGTTCTTCCGCTAGCCGAATAGCGTCTGC
AACAATGGGTTCTTGTTCATCTTCTTCTTTATCTTCCTTATCGTCCTATTACTCTTCTTCTTCGGCTTCGTCTTGCTCTTCCCCGTATTTTCGTCCACATACTGCAAGAA
AGGGCTCGATTTCCATGTCCTTGTTGTTCAAAAGACGAAGAAGCAGTAATTTCCTGACTACAAGTAGATCTCTAACTTCTTCCAGATTTACAGACAAGGACGACGGAGAT
AGAAAGAAGAATAAAGCTGACGGGTTCTGGTCAAAGTTGATGGTGAATCGAAGAGGGAAGGAGATCGTGGAAGAAGCTCTCAGACGTTCAACTTCTACAAGAGATCATCA
GAGAATTGCTAGCTAG
Protein sequenceShow/hide protein sequence
MGCGVNFAFETIQKSFPDFNVDFIYNTFRDDFNSGGDHAGESSGPGPTGGRPGRAEPAEFPKVEAVAPTQNPLLESQPLILSLSKLKDHAAAAMARSSNLGGGGGGAIRC
KKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPHTARKGSISMSLLFKRRRSSNFLTTSRSLTSSRFTDKDDGD
RKKNKADGFWSKLMVNRRGKEIVEEALRRSTSTRDHQRIAS