; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026555 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026555
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionEncodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).
Genome locationtig00153033:1305107..1308909
RNA-Seq ExpressionSgr026555
SyntenySgr026555
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147915.1 uncharacterized protein LOC101215701 isoform X1 [Cucumis sativus]1.0e-5779.62Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLS TK  ER+SVNEPW P    +DDNANH QKESQE KNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
        RAQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

XP_008448572.1 PREDICTED: uncharacterized protein LOC103490707 isoform X1 [Cucumis melo]2.7e-5880.25Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLSQTK  ER+SVNEPW P    +DDNANH QKESQEMKNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
         AQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

XP_008448581.1 PREDICTED: uncharacterized protein LOC103490707 isoform X2 [Cucumis melo]1.4e-5782.35Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLSQTK  ER+SVNEPW P    +DDNANH QKESQEMKNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQ
         AQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQ

XP_022151567.1 uncharacterized protein LOC111019480 [Momordica charantia]2.0e-6184.42Show/hide
Query:  MEGKKQVGSSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQRAQ
        MEGKKQ+GSSSSLT +LFGSKE+SYSSTTGIFGSIFAPSSKVLG ESLLSQ K GERDSVNEPWIP    KDD ANHRQKESQEMKNKD++SIYQEQRAQ
Subjt:  MEGKKQVGSSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQRAQ

Query:  PCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
        PCHLSSSIYYGGQD+YS  QNSHNSG NSVF K+GGEDDSGSASRGNWWQ +++
Subjt:  PCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

XP_038882075.1 uncharacterized protein LOC120073353 isoform X1 [Benincasa hispida]9.4e-5980.25Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGSKE+SYSSTTGIFGSIFAPSSKVLGR+SLLSQTK GERDSVNEPWIP    +DD ANH QKES EMKNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
        RAQPCHLSSSIYYGGQD+Y+HPQNS+NS  NS + KEGGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

TrEMBL top hitse value%identityAlignment
A0A0A0L143 Uncharacterized protein5.0e-5879.62Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLS TK  ER+SVNEPW P    +DDNANH QKESQE KNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
        RAQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

A0A1S3BK03 uncharacterized protein LOC103490707 isoform X26.6e-5882.35Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLSQTK  ER+SVNEPW P    +DDNANH QKESQEMKNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQ
         AQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQ

A0A1S3BKM1 uncharacterized protein LOC103490707 isoform X11.3e-5880.25Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLSQTK  ER+SVNEPW P    +DDNANH QKESQEMKNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
         AQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

A0A5D3DIY4 Uncharacterized protein1.3e-5880.25Show/hide
Query:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ
        MEGKK VG   SSSSLTT+LFGS E+SYSSTTGIFGSIFAPSSKVLGRESLLSQTK  ER+SVNEPW P    +DDNANH QKESQEMKNKD++SIYQ+Q
Subjt:  MEGKKQVG---SSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
         AQPCHLSSSIYYGGQD+Y+HPQNS+NSGANS + KEGGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

A0A6J1DDV5 uncharacterized protein LOC1110194809.8e-6284.42Show/hide
Query:  MEGKKQVGSSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQRAQ
        MEGKKQ+GSSSSLT +LFGSKE+SYSSTTGIFGSIFAPSSKVLG ESLLSQ K GERDSVNEPWIP    KDD ANHRQKESQEMKNKD++SIYQEQRAQ
Subjt:  MEGKKQVGSSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIP----KDDNANHRQKESQEMKNKDINSIYQEQRAQ

Query:  PCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF
        PCHLSSSIYYGGQD+YS  QNSHNSG NSVF K+GGEDDSGSASRGNWWQ +++
Subjt:  PCHLSSSIYYGGQDIYSHPQNSHNSGANSVF-KEGGEDDSGSASRGNWWQAAMF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39855.2 unknown protein9.8e-1443.14Show/hide
Query:  MEGKKQV-GSSSSLTTEL---FGSKES-SY-SSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQR
        M+ KK V GSSSS ++ L   FG + S SY SSTTG+F SIF P S V       +Q  +  R+   +      +  N R + S   KNK+  S   E+ 
Subjt:  MEGKKQV-GSSSSLTTEL---FGSKES-SY-SSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQR

Query:  AQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAA
          PC+LSSSIYYGGQD YS    S  +  ++  K+G E DS SASRGNWW+ +
Subjt:  AQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAA

AT3G55646.1 unknown protein9.2e-1236.31Show/hide
Query:  EGKKQVGSSSSLTTEL------FGSK--ESSYSSTTGIFGSIF-APSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQE
        + KK++ S+SS ++ L      FG +   SS SS TG+F SIF  PS+  LGR+           D  ++    K  + N + + S + + K   S Y E
Subjt:  EGKKQVGSSSSLTTEL------FGSK--ESSYSSTTGIFGSIF-APSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQE

Query:  QRAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAAMF
        +   PCHLSSS+YYGGQ+ YS   ++  +  ++  K+G E DS  ASRGNWW+ +++
Subjt:  QRAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAAMF

AT5G02020.1 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).4.8e-2950Show/hide
Query:  MEGKKQVG------SSSSLTTELFGSKES-SYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQ
        MEG+K+        SSSSLT+ELFGS+E+ S  S++GI GSIF P SKVLGRES+  +T  G        W  K          ++E +    +   Q+Q
Subjt:  MEGKKQVG------SSSSLTTELFGSKES-SYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAAMF
        R QPCHLSSSIYYGG D+Y  PQNS ++  N   K+GGEDDSGSASRGNWWQ +++
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAAMF

AT5G02020.2 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).1.3e-1848Show/hide
Query:  MEGKKQVG------SSSSLTTELFGSKES-SYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQ
        MEG+K+        SSSSLT+ELFGS+E+ S  S++GI GSIF P SKVLGRES+  +T  G        W  K          ++E +    +   Q+Q
Subjt:  MEGKKQVG------SSSSLTTELFGSKES-SYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQ

Query:  RAQPCHLSSSIYYGGQDIYSHPQNS
        R QPCHLSSSIYYGG D+Y  PQNS
Subjt:  RAQPCHLSSSIYYGGQDIYSHPQNS

AT5G59080.1 unknown protein5.7e-1438.12Show/hide
Query:  MEGKKQVGS----SSSLTTELFGSKE-SSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQRA
        MEGK +VGS    SSS T ELFGSK+ S  SS++GIF ++F   SK   R+                       N+ H    SQ  + + +N+  QE R 
Subjt:  MEGKKQVGS----SSSLTTELFGSKE-SSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQRA

Query:  QPCHLSSSIYYGGQDIYSH-PQNSHNSGANSVFKEGGEDDSG-----SASRGNWWQAAMF
        +PCHLSSS+YYGGQD+Y+    N       +  +  GEDD+        SRGNWWQ +++
Subjt:  QPCHLSSSIYYGGQDIYSH-PQNSHNSGANSVFKEGGEDDSG-----SASRGNWWQAAMF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGAAAGAAGCAAGTGGGATCGTCTTCTTCTCTCACTACTGAGCTGTTTGGCTCCAAGGAAAGTTCGTATTCTTCAACTACTGGAATTTTCGGCTCTATATTTGC
ACCTTCTTCGAAGGTGTTAGGGAGAGAGTCTCTGCTCTCTCAGACCAAAGTGGGAGAGAGGGATTCTGTAAATGAGCCATGGATCCCCAAAGATGATAATGCTAATCATA
GACAAAAGGAGAGTCAGGAGATGAAGAATAAAGATATCAATTCTATTTATCAGGAACAAAGAGCACAACCATGTCATCTTAGCTCATCAATCTATTATGGTGGCCAAGAC
ATTTACTCTCATCCTCAGAATTCCCACAACTCCGGCGCGAACTCGGTGTTCAAGGAGGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAATTGGTGGCAAGCTGC
CATGTTTTGTGAGCAGTCCAAGTTGGAAGTGTCATTTTCCCATCCATATCCGAGTTCAGGATATCTTGACACAGTTGCATTGCCAAACAAGTCGTTTTCTGTGAAAGAGT
TGTGGTTCTTGCTTTTTCCATGGTCGTTAAGTTTGGCCGTTAAAAATGAAAAAGATGGTGGGATGAGGACAGAGATTCTGCAGGCACAATTCTTGGCAATGTTTGATCTG
AATTTTCTTTTTTCTTTTTTGATGTTACTTTTGGTTCTCTTCTTTCCTGTCCACACCATTCATAATCCAAAAAGTAATAATGTGCAAACCAACAACATCAATAAGCAACT
TGACACATCGCCAAAGCCACCAGCAGGCCGTAAATTTCACAAACTTCATTCCCTCAAGGCAGTATCGTTTGGGGATGGTGAGGGCAATTCTGGGGGAGACGGCGGCGCTT
CCGGCGGTCGGAGAAAGCAAAGCGGCGCAGAGGCAGCTCGGTCGTTTCCCGATTTCCGCCACCCGCCGGCAGGAATAGGGCCGGGGACACTGGCTCTGGCGTTTTGCACG
GCGGTGATGCAGGAAACAAGCTTCAAGACGACGCTATCTGGAGTGGATTTCCCACATGGAGTCCGCCACTTTCAGCTGCAGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGAAAGAAGCAAGTGGGATCGTCTTCTTCTCTCACTACTGAGCTGTTTGGCTCCAAGGAAAGTTCGTATTCTTCAACTACTGGAATTTTCGGCTCTATATTTGC
ACCTTCTTCGAAGGTGTTAGGGAGAGAGTCTCTGCTCTCTCAGACCAAAGTGGGAGAGAGGGATTCTGTAAATGAGCCATGGATCCCCAAAGATGATAATGCTAATCATA
GACAAAAGGAGAGTCAGGAGATGAAGAATAAAGATATCAATTCTATTTATCAGGAACAAAGAGCACAACCATGTCATCTTAGCTCATCAATCTATTATGGTGGCCAAGAC
ATTTACTCTCATCCTCAGAATTCCCACAACTCCGGCGCGAACTCGGTGTTCAAGGAGGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAATTGGTGGCAAGCTGC
CATGTTTTGTGAGCAGTCCAAGTTGGAAGTGTCATTTTCCCATCCATATCCGAGTTCAGGATATCTTGACACAGTTGCATTGCCAAACAAGTCGTTTTCTGTGAAAGAGT
TGTGGTTCTTGCTTTTTCCATGGTCGTTAAGTTTGGCCGTTAAAAATGAAAAAGATGGTGGGATGAGGACAGAGATTCTGCAGGCACAATTCTTGGCAATGTTTGATCTG
AATTTTCTTTTTTCTTTTTTGATGTTACTTTTGGTTCTCTTCTTTCCTGTCCACACCATTCATAATCCAAAAAGTAATAATGTGCAAACCAACAACATCAATAAGCAACT
TGACACATCGCCAAAGCCACCAGCAGGCCGTAAATTTCACAAACTTCATTCCCTCAAGGCAGTATCGTTTGGGGATGGTGAGGGCAATTCTGGGGGAGACGGCGGCGCTT
CCGGCGGTCGGAGAAAGCAAAGCGGCGCAGAGGCAGCTCGGTCGTTTCCCGATTTCCGCCACCCGCCGGCAGGAATAGGGCCGGGGACACTGGCTCTGGCGTTTTGCACG
GCGGTGATGCAGGAAACAAGCTTCAAGACGACGCTATCTGGAGTGGATTTCCCACATGGAGTCCGCCACTTTCAGCTGCAGGCTTGA
Protein sequenceShow/hide protein sequence
MEGKKQVGSSSSLTTELFGSKESSYSSTTGIFGSIFAPSSKVLGRESLLSQTKVGERDSVNEPWIPKDDNANHRQKESQEMKNKDINSIYQEQRAQPCHLSSSIYYGGQD
IYSHPQNSHNSGANSVFKEGGEDDSGSASRGNWWQAAMFCEQSKLEVSFSHPYPSSGYLDTVALPNKSFSVKELWFLLFPWSLSLAVKNEKDGGMRTEILQAQFLAMFDL
NFLFSFLMLLLVLFFPVHTIHNPKSNNVQTNNINKQLDTSPKPPAGRKFHKLHSLKAVSFGDGEGNSGGDGGASGGRRKQSGAEAARSFPDFRHPPAGIGPGTLALAFCT
AVMQETSFKTTLSGVDFPHGVRHFQLQA