; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023035 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023035
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00000729:2160032..2164404
RNA-Seq ExpressionSgr023035
SyntenySgr023035
Gene Ontology termsNA
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598702.1 hypothetical protein SDJN03_08480, partial [Cucurbita argyrosperma subsp. sororia]5.3e-1948.55Show/hide
Query:  SLSKLKDHAAAAMAWSS----NLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF
        +LSKLK H AAA A +     N G     + RC+KHPKHKQSPGVCSLCLREKLS+L  T      +A+ T  S SSSSLSSLSSYYSSS  SS +SPYF
Subjt:  SLSKLKDHAAAAMAWSS----NLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF

Query:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRS
         P T      S+S LFKRR +    T                        GFWSKL++NRR K++V  +  RS
Subjt:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRS

KAG7029644.1 hypothetical protein SDJN02_07984, partial [Cucurbita argyrosperma subsp. argyrosperma]4.1e-1948.55Show/hide
Query:  SLSKLKDHAAAAMAWSS----NLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF
        +L KLK H AAA A +     N G G   + RC+KHPKHKQSPGVCSLCLREKLS+L  T      +A+ T  S SSSSLSSLSSYYSSS  SS +SPYF
Subjt:  SLSKLKDHAAAAMAWSS----NLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYF

Query:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRS
         P T      S+S LFKRR +    T                        GFWSKL++NRR K++V     RS
Subjt:  RPHTARKGSISMSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRS

XP_008444863.1 PREDICTED: uncharacterized serine-rich protein C215.13-like [Cucumis melo]1.6e-3156.32Show/hide
Query:  ILSLSKLKDHAAAAMAWSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY
        ++  SKLK HAAA MA SSN GG  GGG  +C+KHPKHKQSPGVCS+CLREKL +L    T  S+S  +S  + S SSSSLSSLSSYYSSSS SS SSPY
Subjt:  ILSLSKLKDHAAAAMAWSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY

Query:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS---TRDHQRIAS
        F   + +K SIS   SLLFKRR SS+  ++S +  ++ F      + K    DGFWSKL++NRRGKEIVEE   R +S   T DHQ I +
Subjt:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS---TRDHQRIAS

XP_022131967.1 uncharacterized protein LOC111004952 [Momordica charantia]4.1e-2759.89Show/hide
Query:  SLSKLKDHAAAAMAWSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF
        +L KLKDHAAAAMA +     GGGG  RC+KHPKH+QSPGVCSLCLREKLS L+NT   GS+A +IA     S SSSSLSS+SS YSS SSASSCSSP  
Subjt:  SLSKLKDHAAAAMAWSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF

Query:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS-TRDH
             RK SIS MS LFKRR S +N L++SRSL                 ADG WSKL+VNRRGK   E+ LRRS S T DH
Subjt:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS-TRDH

XP_031736533.1 uncharacterized serine-rich protein C215.13-like [Cucumis sativus]3.1e-2750.51Show/hide
Query:  SKLKDHAAAAMAWSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS
        + LK HAAA MA SSN G         GGG G+  C+KHPKHKQSPGVCS+CLREKL +L  T + +S  +S  + S SSSSLSSLSSYYSSSS SS SS
Subjt:  SKLKDHAAAAMAWSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS

Query:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDG--DRKKNKA-DGFWSKLIVNRRGKEIVEEAL----RRSTSTRDHQRIAS
        PY    + +K S+S   SLLFKRR SS+  +++ +  ++ F    D    R  NK+  GFWSKL++NRRGKEI+ E +      +++T DHQ I +
Subjt:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDG--DRKKNKA-DGFWSKLIVNRRGKEIVEEAL----RRSTSTRDHQRIAS

TrEMBL top hitse value%identityAlignment
A0A067D3G1 Uncharacterized protein (Fragment)2.8e-1849.34Show/hide
Query:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT
        I+C+KHPKHKQSPGVCSLCLR+KLS L    SS+SR    T+ SC  SS SSLSSYYSSS ASSCSSP F  +    T  +G  S+S L    R +N LT
Subjt:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT

Query:  TSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTSTRD
         SRSL S     ++     K K +G +SKL    R K+  ++ L  S + R+
Subjt:  TSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTSTRD

A0A0A0LPJ0 Uncharacterized protein1.5e-2750.51Show/hide
Query:  SKLKDHAAAAMAWSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS
        + LK HAAA MA SSN G         GGG G+  C+KHPKHKQSPGVCS+CLREKL +L  T + +S  +S  + S SSSSLSSLSSYYSSSS SS SS
Subjt:  SKLKDHAAAAMAWSSNLG---------GGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSS

Query:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDG--DRKKNKA-DGFWSKLIVNRRGKEIVEEAL----RRSTSTRDHQRIAS
        PY    + +K S+S   SLLFKRR SS+  +++ +  ++ F    D    R  NK+  GFWSKL++NRRGKEI+ E +      +++T DHQ I +
Subjt:  PYFRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDG--DRKKNKA-DGFWSKLIVNRRGKEIVEEAL----RRSTSTRDHQRIAS

A0A1S3BC83 uncharacterized serine-rich protein C215.13-like7.7e-3256.32Show/hide
Query:  ILSLSKLKDHAAAAMAWSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY
        ++  SKLK HAAA MA SSN GG  GGG  +C+KHPKHKQSPGVCS+CLREKL +L    T  S+S  +S  + S SSSSLSSLSSYYSSSS SS SSPY
Subjt:  ILSLSKLKDHAAAAMAWSSNLGG-GGGGAIRCKKHPKHKQSPGVCSLCLREKLSHL--VNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPY

Query:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS---TRDHQRIAS
        F   + +K SIS   SLLFKRR SS+  ++S +  ++ F      + K    DGFWSKL++NRRGKEIVEE   R +S   T DHQ I +
Subjt:  FRPHTARKGSIS--MSLLFKRRRSSNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS---TRDHQRIAS

A0A6J1BSJ2 uncharacterized protein LOC1110049522.0e-2759.89Show/hide
Query:  SLSKLKDHAAAAMAWSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF
        +L KLKDHAAAAMA +     GGGG  RC+KHPKH+QSPGVCSLCLREKLS L+NT   GS+A +IA     S SSSSLSS+SS YSS SSASSCSSP  
Subjt:  SLSKLKDHAAAAMAWSSNLGGGGGGAIRCKKHPKHKQSPGVCSLCLREKLSHLVNT---GSSASRIASATMGSCSSSSLSSLSSYYSS-SSASSCSSPYF

Query:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS-TRDH
             RK SIS MS LFKRR S +N L++SRSL                 ADG WSKL+VNRRGK   E+ LRRS S T DH
Subjt:  RPHTARKGSIS-MSLLFKRRRS-SNFLTTSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTS-TRDH

V4V5C9 Uncharacterized protein2.8e-1849.34Show/hide
Query:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT
        I+C+KHPKHKQSPGVCSLCLR+KLS L    SS+SR    T+ SC  SS SSLSSYYSSS ASSCSSP F  +    T  +G  S+S L    R +N LT
Subjt:  IRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPH----TARKGSISMSLLFKRRRSSNFLT

Query:  TSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTSTRD
         SRSL S     ++     K K +G +SKL    R K+  ++ L  S + R+
Subjt:  TSRSLASSRFTDKDDGDRKKNKADGFWSKLIVNRRGKEIVEEALRRSTSTRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22470.1 unknown protein2.6e-0851.9Show/hide
Query:  CKKHPKHKQSPGVCSLCLREKLSHLVN-----TGSSASRIASATMGSCSSSSLSS--------LSSYYSSSSASSCSSP
        CKKHPKH+QSPG+CSLCL E LS L +     + S +S   + TM SCSS+S  S        +SSYY  SS SSC SP
Subjt:  CKKHPKHKQSPGVCSLCLREKLSHLVN-----TGSSASRIASATMGSCSSSSLSS--------LSSYYSSSSASSCSSP

AT1G35210.1 unknown protein1.5e-0840.58Show/hide
Query:  AIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPHTARKGSISMSLLFKRRRSSNFLTTSR
        A+ CKKHPKH+QSPGVCSLCL E+LS  +   SS  R  S  + S SSS+ SSLSS   SSS SSC SP       R+  +      +  +  +++T SR
Subjt:  AIRCKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPHTARKGSISMSLLFKRRRSSNFLTTSR

Query:  SLASSRFTDKDDGDRKKNKA---DGFWSKLIVNRRGKE
        S+A       DD  R+K K     GF+  L++  + ++
Subjt:  SLASSRFTDKDDGDRKKNKA---DGFWSKLIVNRRGKE

AT1G72240.1 unknown protein1.4e-0951.69Show/hide
Query:  CKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASAT---MGSCSSSSLSSLSSYYSSSSASSCSSP-YFRPHTARKGSISMSLLFK
        CKKH KH+QSPG+CSLCL E+LS L       ++ A  T    GS S+SS SS+SS YSSSS SSCSSP  +R    +K     S LF+
Subjt:  CKKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASAT---MGSCSSSSLSSLSSYYSSSSASSCSSP-YFRPHTARKGSISMSLLFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGTGGAGTGAACTTTGCATTTGAAACCATCCAAAAGAGCTTCCCAGACTTCAATGTCGACTTCATCTACAATACTTTCCGTGATGATTTCAACTCCGGTGGCGA
TCGTGCTGGCGAAAGCTCAGGTCCGGGCCCCACGGGCGGTCGAGCCGGGCGGGCCGAGCCGGCTGAATTTCCAAAAGTGGAGGCAGTGGCCCCCACCCAAAACCCTCTCC
TCGAATCACAGCCGTTGATTTTGAGCTTGAGCAAACTAAAGGATCATGCGGCGGCGGCAATGGCATGGAGTTCCAACCTTGGTGGTGGCGGCGGCGGCGCTATTCGATGC
AAGAAGCACCCAAAACACAAGCAATCACCGGGCGTCTGCTCGCTTTGTCTAAGAGAAAAACTCTCTCACTTGGTTAATACAGGTTCTTCCGCTAGCCGAATAGCGTCTGC
AACAATGGGTTCTTGTTCATCTTCTTCTTTATCTTCCTTATCGTCCTATTACTCTTCTTCTTCGGCTTCGTCTTGCTCTTCCCCGTATTTTCGTCCACATACTGCAAGAA
AGGGCTCGATTTCCATGTCCTTGTTGTTCAAAAGACGAAGAAGCAGTAATTTCCTGACTACAAGTAGATCTCTAGCTTCTTCCAGATTTACAGACAAGGACGACGGAGAT
AGAAAGAAGAATAAAGCTGACGGGTTCTGGTCGAAGTTGATAGTGAATCGAAGAGGGAAGGAGATCGTGGAAGAAGCTCTCAGACGTTCAACTTCTACAAGAGATCATCA
GAGAATTGCTAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGTGGAGTGAACTTTGCATTTGAAACCATCCAAAAGAGCTTCCCAGACTTCAATGTCGACTTCATCTACAATACTTTCCGTGATGATTTCAACTCCGGTGGCGA
TCGTGCTGGCGAAAGCTCAGGTCCGGGCCCCACGGGCGGTCGAGCCGGGCGGGCCGAGCCGGCTGAATTTCCAAAAGTGGAGGCAGTGGCCCCCACCCAAAACCCTCTCC
TCGAATCACAGCCGTTGATTTTGAGCTTGAGCAAACTAAAGGATCATGCGGCGGCGGCAATGGCATGGAGTTCCAACCTTGGTGGTGGCGGCGGCGGCGCTATTCGATGC
AAGAAGCACCCAAAACACAAGCAATCACCGGGCGTCTGCTCGCTTTGTCTAAGAGAAAAACTCTCTCACTTGGTTAATACAGGTTCTTCCGCTAGCCGAATAGCGTCTGC
AACAATGGGTTCTTGTTCATCTTCTTCTTTATCTTCCTTATCGTCCTATTACTCTTCTTCTTCGGCTTCGTCTTGCTCTTCCCCGTATTTTCGTCCACATACTGCAAGAA
AGGGCTCGATTTCCATGTCCTTGTTGTTCAAAAGACGAAGAAGCAGTAATTTCCTGACTACAAGTAGATCTCTAGCTTCTTCCAGATTTACAGACAAGGACGACGGAGAT
AGAAAGAAGAATAAAGCTGACGGGTTCTGGTCGAAGTTGATAGTGAATCGAAGAGGGAAGGAGATCGTGGAAGAAGCTCTCAGACGTTCAACTTCTACAAGAGATCATCA
GAGAATTGCTAGCTAG
Protein sequenceShow/hide protein sequence
MGCGVNFAFETIQKSFPDFNVDFIYNTFRDDFNSGGDRAGESSGPGPTGGRAGRAEPAEFPKVEAVAPTQNPLLESQPLILSLSKLKDHAAAAMAWSSNLGGGGGGAIRC
KKHPKHKQSPGVCSLCLREKLSHLVNTGSSASRIASATMGSCSSSSLSSLSSYYSSSSASSCSSPYFRPHTARKGSISMSLLFKRRRSSNFLTTSRSLASSRFTDKDDGD
RKKNKADGFWSKLIVNRRGKEIVEEALRRSTSTRDHQRIAS