; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g20670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g20670
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionEncodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).
Genome locationchr1:14430798..14435327
RNA-Seq ExpressionMoc01g20670
SyntenyMoc01g20670
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448572.1 PREDICTED: uncharacterized protein LOC103490707 isoform X1 [Cucumis melo]1.7e-5981.41Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGS ETSYSSTTGIFGSIFAPSSKVLG ESLLSQ KE ER+SVNEPW PN EA+DD ANH QKESQEMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
         AQPCHLSSSIYYGGQDVY+  QNS+NSG NS +KK+GGEDDSGSASRGNWWQ  L
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

XP_008448581.1 PREDICTED: uncharacterized protein LOC103490707 isoform X2 [Cucumis melo]2.9e-5982.35Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGS ETSYSSTTGIFGSIFAPSSKVLG ESLLSQ KE ER+SVNEPW PN EA+DD ANH QKESQEMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ
         AQPCHLSSSIYYGGQDVY+  QNS+NSG NS +KK+GGEDDSGSASRGNWWQ
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ

XP_022151567.1 uncharacterized protein LOC111019480 [Momordica charantia]1.1e-7498.69Show/hide
Query:  MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQ
        MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQ
Subjt:  MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQ

Query:  PCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
        PCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ  L
Subjt:  PCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

XP_038882075.1 uncharacterized protein LOC120073353 isoform X1 [Benincasa hispida]8.2e-6283.33Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGSKETSYSSTTGIFGSIFAPSSKVLG +SLLSQ KEGERDSVNEPWIPN EA+DD ANH QKES EMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
        RAQPCHLSSSIYYGGQDVY+  QNS+NS VNS +KK+GGEDDSGSASRGNWWQ  L
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

XP_038882076.1 uncharacterized protein LOC120073353 isoform X2 [Benincasa hispida]8.5e-5981.41Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGSKETSYSSTTGIFGSIFAPSSKVLG +SLLSQ KEGERDSVNEPWIPN EA+DD ANH QKES EMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
        RAQPCHLSSSIYYGGQDVY+  QNS+    NS +KK+GGEDDSGSASRGNWWQ  L
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

TrEMBL top hitse value%identityAlignment
A0A0A0L143 Uncharacterized protein2.0e-5880.13Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGS ETSYSSTTGIFGSIFAPSSKVLG ESLLS  KE ER+SVNEPW PN  A+DD ANH QKESQE KNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
        RAQPCHLSSSIYYGGQDVY+  QNS+NSG NS +KK+GGEDDSGSASRGNWWQ  L
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

A0A1S3BK03 uncharacterized protein LOC103490707 isoform X21.4e-5982.35Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGS ETSYSSTTGIFGSIFAPSSKVLG ESLLSQ KE ER+SVNEPW PN EA+DD ANH QKESQEMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ
         AQPCHLSSSIYYGGQDVY+  QNS+NSG NS +KK+GGEDDSGSASRGNWWQ
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ

A0A1S3BKM1 uncharacterized protein LOC103490707 isoform X18.3e-6081.41Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGS ETSYSSTTGIFGSIFAPSSKVLG ESLLSQ KE ER+SVNEPW PN EA+DD ANH QKESQEMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
         AQPCHLSSSIYYGGQDVY+  QNS+NSG NS +KK+GGEDDSGSASRGNWWQ  L
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

A0A5D3DIY4 Uncharacterized protein8.3e-6081.41Show/hide
Query:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ
        MEGKK +G   SSSSLT DLFGS ETSYSSTTGIFGSIFAPSSKVLG ESLLSQ KE ER+SVNEPW PN EA+DD ANH QKESQEMKNKD+SSIYQ+Q
Subjt:  MEGKKQLG---SSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQ

Query:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
         AQPCHLSSSIYYGGQDVY+  QNS+NSG NS +KK+GGEDDSGSASRGNWWQ  L
Subjt:  RAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

A0A6J1DDV5 uncharacterized protein LOC1110194805.3e-7598.69Show/hide
Query:  MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQ
        MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQ
Subjt:  MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQ

Query:  PCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
        PCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ  L
Subjt:  PCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39855.2 unknown protein3.0e-1442.41Show/hide
Query:  MEGKKQL---GSSSSLTID-LFGSKET-SY-SSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHR--QKESQEMKNKDLSS
        M+ KK +    SSSS ++D +FG + + SY SSTTG+F SIF P S V  G +L S+               N  AK    N     +  +  KNK+  S
Subjt:  MEGKKQL---GSSSSLTID-LFGSKET-SY-SSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHR--QKESQEMKNKDLSS

Query:  IYQEQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ
           E+   PC+LSSSIYYGGQD YS S  + ++     +KKDG E DS SASRGNWW+
Subjt:  IYQEQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQ

AT3G55646.1 unknown protein2.0e-1337.5Show/hide
Query:  EGKKQLGSSSSLTIDL------FGSK--ETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSI
        + KK++ S+SS +  L      FG +   +S SS TG+F SIF P S     + L  Q+    +    +   PN  AK + +N ++K+          S 
Subjt:  EGKKQLGSSSSLTIDL------FGSK--ETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSI

Query:  YQEQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
        Y E+   PCHLSSS+YYGGQ+ YS    S  +  +  +KKDG E DS  ASRGNWW+  L
Subjt:  YQEQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

AT5G02020.1 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).4.1e-2753.12Show/hide
Query:  MEGKKQLG------SSSSLTIDLFGSKET-SYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSI
        MEG+K+        SSSSLT +LFGS+E  S  S++GI GSIF P SKVLG ES+  +   G        W   T +K      R +E QE      S  
Subjt:  MEGKKQLG------SSSSLTIDLFGSKET-SYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSI

Query:  YQEQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL
         Q+QR QPCHLSSSIYYGG DVY Q QNS +   NS  KKDGGEDDSGSASRGNWWQ  L
Subjt:  YQEQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQL

AT5G02020.2 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).7.2e-1648.84Show/hide
Query:  MEGKKQLG------SSSSLTIDLFGSKET-SYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSI
        MEG+K+        SSSSLT +LFGS+E  S  S++GI GSIF P SKVLG ES+  +   G        W   T +K      R +E QE      S  
Subjt:  MEGKKQLG------SSSSLTIDLFGSKET-SYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSI

Query:  YQEQRAQPCHLSSSIYYGGQDVYSQSQNS
         Q+QR QPCHLSSSIYYGG DVY Q QNS
Subjt:  YQEQRAQPCHLSSSIYYGGQDVYSQSQNS

AT5G59080.1 unknown protein1.7e-1237.42Show/hide
Query:  MEGKKQLGS----SSSLTIDLFGSKETS-YSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQ
        MEGK ++GS    SSS T +LFGSK+ S  SS++GIF ++F   S            K   RD  N                 +  SQ  + + L++  Q
Subjt:  MEGKKQLGS----SSSLTIDLFGSKETS-YSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQ

Query:  EQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSG-----SASRGNWWQEQL
        E R +PCHLSSS+YYGGQDVY++S  +         ++  GEDD+        SRGNWWQ  L
Subjt:  EQRAQPCHLSSSIYYGGQDVYSQSQNSHNSGVNSVFKKDGGEDDSG-----SASRGNWWQEQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGAAAAAAGCAACTGGGTTCCTCTTCTTCTCTCACCATTGACCTGTTTGGCTCCAAAGAAACTTCCTACTCCTCAACCACTGGAATTTTCGGCTCTATATTTGC
ACCTTCTTCCAAGGTGTTAGGGGGAGAGTCTCTGCTCTCTCAGATCAAAGAGGGAGAGAGGGATTCTGTAAATGAGCCATGGATCCCCAACACTGAAGCTAAAGATGATG
CTGCTAATCATAGACAAAAGGAGAGTCAGGAGATGAAGAATAAAGATCTGAGTTCCATTTATCAGGAACAAAGAGCACAACCATGTCATCTTAGCTCATCAATCTATTAT
GGTGGCCAAGATGTTTATTCTCAGTCTCAGAATTCCCATAATTCCGGGGTGAACTCGGTGTTCAAGAAGGATGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAA
TTGGTGGCAAGAACAACTCACGGCCACACTTGAAGCCGACGGGGCGGCGAGCAATGTTGCAGCGTGTGGGGATGGTGACGGCAATTCTGGGGTCGACGCCGGCGCTTCTG
GCGGTCGGAGAAAGAAAGGCGGCGCAGAGGCAGGCTGGTCGTCGCCCAATTTGGGCCAACCGCCGGCAGCAATTTCTGGGGACATTGGCTCCGGCGTTTCGCGCGGCGGT
GATGCAGGGAAAAAGCTTCACGGCGACGCTATCTGGATCGGATTTCCCACATGGACTCGACCCCGCCGCCACTTTCAGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGAAAAAAGCAACTGGGTTCCTCTTCTTCTCTCACCATTGACCTGTTTGGCTCCAAAGAAACTTCCTACTCCTCAACCACTGGAATTTTCGGCTCTATATTTGC
ACCTTCTTCCAAGGTGTTAGGGGGAGAGTCTCTGCTCTCTCAGATCAAAGAGGGAGAGAGGGATTCTGTAAATGAGCCATGGATCCCCAACACTGAAGCTAAAGATGATG
CTGCTAATCATAGACAAAAGGAGAGTCAGGAGATGAAGAATAAAGATCTGAGTTCCATTTATCAGGAACAAAGAGCACAACCATGTCATCTTAGCTCATCAATCTATTAT
GGTGGCCAAGATGTTTATTCTCAGTCTCAGAATTCCCATAATTCCGGGGTGAACTCGGTGTTCAAGAAGGATGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAA
TTGGTGGCAAGAACAACTCACGGCCACACTTGAAGCCGACGGGGCGGCGAGCAATGTTGCAGCGTGTGGGGATGGTGACGGCAATTCTGGGGTCGACGCCGGCGCTTCTG
GCGGTCGGAGAAAGAAAGGCGGCGCAGAGGCAGGCTGGTCGTCGCCCAATTTGGGCCAACCGCCGGCAGCAATTTCTGGGGACATTGGCTCCGGCGTTTCGCGCGGCGGT
GATGCAGGGAAAAAGCTTCACGGCGACGCTATCTGGATCGGATTTCCCACATGGACTCGACCCCGCCGCCACTTTCAGCCTTGA
Protein sequenceShow/hide protein sequence
MEGKKQLGSSSSLTIDLFGSKETSYSSTTGIFGSIFAPSSKVLGGESLLSQIKEGERDSVNEPWIPNTEAKDDAANHRQKESQEMKNKDLSSIYQEQRAQPCHLSSSIYY
GGQDVYSQSQNSHNSGVNSVFKKDGGEDDSGSASRGNWWQEQLTATLEADGAASNVAACGDGDGNSGVDAGASGGRRKKGGAEAGWSSPNLGQPPAAISGDIGSGVSRGG
DAGKKLHGDAIWIGFPTWTRPRRHFQP