; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017519 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017519
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionStress induced protein
Genome locationtig00153048:818856..819302
RNA-Seq ExpressionSgr017519
SyntenySgr017519
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587852.1 hypothetical protein SDJN03_16417, partial [Cucurbita argyrosperma subsp. sororia]1.6e-6079.19Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPLNEE E L D D+AVPSNGC CF+LFGFGF+RN NYE+ NLLQQ + REEE WMV+KLKK+KEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH
        KGKKQ+NRFQYDPESYALNFDGG+D EDD PP+GFS+RFAVP+ASREQH
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH

XP_008443900.1 PREDICTED: uncharacterized protein LOC103487383 [Cucumis melo]4.0e-5676.67Show/hide
Query:  MAAHQARPTPMASYSPLN-EEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGY
        MA+HQ RP P   YSPLN ++ +DLQD DD++ SNGCGCF+LFGFG +RN NYE  NLLQQ Q REEESWMV+KLKKVKEVSEMVAGPKWK FIRKMGGY
Subjt:  MAAHQARPTPMASYSPLN-EEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGY

Query:  LKGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE
        LKGKKQ+NRFQYDPESYALNFDGG D E+D   PP+GFS+RFAVP+ASRE
Subjt:  LKGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE

XP_022927019.1 uncharacterized protein LOC111433973 [Cucurbita moschata]7.1e-6179.19Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPLNEE E L D D+A+PSNGC CF+LFGFGF+RN NYE+ NLLQQQ  REEE WMV+KLKK+KEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH
        KGKKQ+NRFQYDPESYALNFDGG+D EDD PP+GFS+RFAVP+ASREQH
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH

XP_023003195.1 uncharacterized protein LOC111496880 [Cucurbita maxima]5.4e-6179.87Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPLNEE E L D D+AVPSNGC CF+LFGFGF+RN NYE+ NLLQQQ  REEE WMV+KLKK+KEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH
        KG+KQ+NRFQYDPESYALNFDGGLD EDD PP+GFS+RFAVP+ASREQH
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH

XP_038876901.1 uncharacterized protein LOC120069255 [Benincasa hispida]3.8e-6280.79Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPLNE+ +DLQD D+AVPSNGCGCFRLFGFGF+RN NYE RNLLQQ Q REEESWMVRKLKK+KEVSEMVAGPKWK F+RKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDD--GPPLGFSARFAVPMASREQH
        KGKKQ+NRFQYDPESYALNFDGG D E+D   PP+GFS+RFAVP+ASREQH
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDD--GPPLGFSARFAVPMASREQH

TrEMBL top hitse value%identityAlignment
A0A0A0LTG0 Uncharacterized protein7.4e-5674.5Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPL ++ +DLQD DD++ SNGCGCF+LFGFG +RN NYE  NLLQQ Q REEESWMV++LKKV+EVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE
        KGKK++NRFQYDPESYALNFDGG D E+D   PP+GFS+RFAVP+ASRE
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE

A0A1S3B8N4 uncharacterized protein LOC1034873831.9e-5676.67Show/hide
Query:  MAAHQARPTPMASYSPLN-EEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGY
        MA+HQ RP P   YSPLN ++ +DLQD DD++ SNGCGCF+LFGFG +RN NYE  NLLQQ Q REEESWMV+KLKKVKEVSEMVAGPKWK FIRKMGGY
Subjt:  MAAHQARPTPMASYSPLN-EEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGY

Query:  LKGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE
        LKGKKQ+NRFQYDPESYALNFDGG D E+D   PP+GFS+RFAVP+ASRE
Subjt:  LKGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE

A0A5A7U6Q0 Uncharacterized protein1.9e-5676.67Show/hide
Query:  MAAHQARPTPMASYSPLN-EEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGY
        MA+HQ RP P   YSPLN ++ +DLQD DD++ SNGCGCF+LFGFG +RN NYE  NLLQQ Q REEESWMV+KLKKVKEVSEMVAGPKWK FIRKMGGY
Subjt:  MAAHQARPTPMASYSPLN-EEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQ-QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGY

Query:  LKGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE
        LKGKKQ+NRFQYDPESYALNFDGG D E+D   PP+GFS+RFAVP+ASRE
Subjt:  LKGKKQKNRFQYDPESYALNFDGGLDREDDG--PPLGFSARFAVPMASRE

A0A6J1EGU0 uncharacterized protein LOC1114339733.4e-6179.19Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPLNEE E L D D+A+PSNGC CF+LFGFGF+RN NYE+ NLLQQQ  REEE WMV+KLKK+KEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH
        KGKKQ+NRFQYDPESYALNFDGG+D EDD PP+GFS+RFAVP+ASREQH
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH

A0A6J1KR34 uncharacterized protein LOC1114968802.6e-6179.87Show/hide
Query:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL
        MA HQ RP P   YSPLNEE E L D D+AVPSNGC CF+LFGFGF+RN NYE+ NLLQQQ  REEE WMV+KLKK+KEVSEMVAGPKWK FIRKMGGYL
Subjt:  MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQ-VREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYL

Query:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH
        KG+KQ+NRFQYDPESYALNFDGGLD EDD PP+GFS+RFAVP+ASREQH
Subjt:  KGKKQKNRFQYDPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)6.7e-0934.91Show/hide
Query:  QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQKNR------FQYDPESYALNFDGGLDR---EDDGPPLGFS
        ++  +E W +R  ++++E SE+VAGP+WKT+IR+ G                G   G    NR      F+YD  SY+LNFD G      +D+ P   +S
Subjt:  QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMG----------------GYLKGKKQKNR------FQYDPESYALNFDGGLDR---EDDGPPLGFS

Query:  ARFAVP
         RFA P
Subjt:  ARFAVP

AT3G48020.1 unknown protein2.7e-1041.12Show/hide
Query:  HRNNNYEDRNLLQQQVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQKNRFQYDPESYALNFDGGLDREDD-----GPPLGFSARF
        HRNN+ E R            W VR   K++E SE+VAGP+WKTFIR+      +G+     ++F+YDP SY L+F+   D++DD     G    FS R+
Subjt:  HRNNNYEDRNLLQQQVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMG-GYLKGK--KQKNRFQYDPESYALNFDGGLDREDD-----GPPLGFSARF

Query:  A-VPMAS
        A VP+AS
Subjt:  A-VPMAS

AT5G14890.1 NHL domain-containing protein2.7e-1041.94Show/hide
Query:  QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQKNRFQYDPESYALNFDGGLDR---EDDGPPLGFSARFAVP
        ++  +E W V    K++E SE+VAGPKWKTFIR+ G      G + G   + +   F+YD  SY+LNFD G      ED+ P   +S RFA P
Subjt:  QVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMG------GYLKG---KKQKNRFQYDPESYALNFDGGLDR---EDDGPPLGFSARFAVP

AT5G25240.1 unknown protein6.9e-1438.21Show/hide
Query:  DADDAVPSNGCGCFRLFGFGFHRNNNYEDRN------LLQQQVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQKNRFQYDPESYALNF
        D ++     GCG FR F F   R  + E R+       LQ++ R   +W   KLK +KE+SE +AGPKWK FIR      K  ++   F YD ++Y+LNF
Subjt:  DADDAVPSNGCGCFRLFGFGFHRNNNYEDRN------LLQQQVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQKNRFQYDPESYALNF

Query:  DGGLDREDDGPPLGFSARFAVPM
        D G D +D  P      RF  P+
Subjt:  DGGLDREDDGPPLGFSARFAVPM

AT5G62865.1 unknown protein9.3e-1146.07Show/hide
Query:  EEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---KNRFQYDPESYALNFDGGLDREDDGPPLG----FSARFA-VPMAS
        +E  W +R   K++E SE+VAGP+WKTFIR+     +  +      +FQYDP SY+LNFD   D ED+   LG    FS RFA VP+ S
Subjt:  EEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQ---KNRFQYDPESYALNFDGGLDREDDGPPLG----FSARFA-VPMAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCGCATCAAGCCAGACCAACTCCCATGGCTTCTTATTCGCCACTAAACGAGGAACCAGAAGATCTGCAGGACGCCGACGACGCCGTGCCGTCAAATGGGTGCGG
TTGTTTCCGGCTATTCGGCTTCGGATTCCATCGGAACAACAATTACGAAGACAGAAATCTTCTGCAGCAACAGGTACGAGAAGAGGAATCTTGGATGGTGAGGAAGCTGA
AGAAGGTGAAGGAAGTTTCGGAGATGGTGGCCGGACCCAAATGGAAGACGTTCATCAGAAAGATGGGCGGTTATTTGAAGGGCAAGAAGCAGAAGAACAGGTTTCAGTAC
GACCCAGAAAGCTATGCTCTGAATTTCGATGGCGGTTTGGATAGAGAAGACGATGGTCCGCCGCTTGGCTTCTCTGCTAGGTTTGCTGTGCCCATGGCGTCCAGGGAACA
ACATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCGCATCAAGCCAGACCAACTCCCATGGCTTCTTATTCGCCACTAAACGAGGAACCAGAAGATCTGCAGGACGCCGACGACGCCGTGCCGTCAAATGGGTGCGG
TTGTTTCCGGCTATTCGGCTTCGGATTCCATCGGAACAACAATTACGAAGACAGAAATCTTCTGCAGCAACAGGTACGAGAAGAGGAATCTTGGATGGTGAGGAAGCTGA
AGAAGGTGAAGGAAGTTTCGGAGATGGTGGCCGGACCCAAATGGAAGACGTTCATCAGAAAGATGGGCGGTTATTTGAAGGGCAAGAAGCAGAAGAACAGGTTTCAGTAC
GACCCAGAAAGCTATGCTCTGAATTTCGATGGCGGTTTGGATAGAGAAGACGATGGTCCGCCGCTTGGCTTCTCTGCTAGGTTTGCTGTGCCCATGGCGTCCAGGGAACA
ACATTGA
Protein sequenceShow/hide protein sequence
MAAHQARPTPMASYSPLNEEPEDLQDADDAVPSNGCGCFRLFGFGFHRNNNYEDRNLLQQQVREEESWMVRKLKKVKEVSEMVAGPKWKTFIRKMGGYLKGKKQKNRFQY
DPESYALNFDGGLDREDDGPPLGFSARFAVPMASREQH