CuGenDBv2

MS018403 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS018403
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Genome location: scaffold342:690606..693249
RNA-Seq Expression: MS018403
Synteny: MS018403
Gene Ontology terms: NA
InterPro domains: NA
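
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold342:690606..693249")
# Assumption: coordinates are 1-based and inclusive (not confirmed by the page).
span_length = end - start + 1
print(scaffold, start, end, span_length)
```

Under that assumption the gene span is longer than the CDS listed further down, which would be consistent with introns or untranslated regions, although the page itself does not say so.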


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008451370.1 PREDICTED: uncharacterized protein LOC103492680 [Cucumis melo] | 4.0e-71 | 66.24
Query:  ELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ------QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR
        +L +EEAMPWLPSQVLDEACDIK                         VYMRQ+      QQ P+LHRQR H RPL SP    FAL  ++KSKY N+ +R
Subjt:  ELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ------QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR

Query:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPR--GAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGREC-NSTVKN
        P+Q+QK A+NWTAGG GMQAIFLDSGRQLGGTGVFLPR  G ++ YQPN+KP CS+VL+PARVV+ALNLDVQALG QISPRK+   NQKGREC NS VKN
Subjt:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPR--GAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGREC-NSTVKN

Query:  KKGKDVTSTNCSFMSQNQTNSSQE-IFLPKEWTY
        KKGKD+TST+CSFMSQNQTNSSQ+ IFLPKEWTY
Subjt:  KKGKDVTSTNCSFMSQNQTNSSQE-IFLPKEWTY

XP_022150263.1 uncharacterized protein LOC111018471 [Momordica charantia] | 3.5e-107 | 88.7
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR
        MAADDLELAVEEAMPWLPSQVLDEACDIK                         VYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR

Query:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG
        PHQ+QKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG
Subjt:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG

Query:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
Subjt:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

XP_022953989.1 uncharacterized protein LOC111456382 isoform X1 [Cucurbita moschata] | 6.9e-71 | 68.35
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV
        MAA+  EL VEEA+PWLPSQVLDEACDIK                         VYMRQ   +QQ  +L RQRRHDRPL S  S  FAL  +QKSK+ NV
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV

Query:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNS
        S+ PHQRQK   SAANWTAGG GMQAIFLD GRQLGGTGVFLPRG  +T YQPNKKP CS+VL+PARVVQALNLDVQALG QISPRKDP  NQ GRECNS
Subjt:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNS

Query:  TVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
         VKNKKGKDV    CSF+S+NQ +SSQEIFLPKEWTY
Subjt:  TVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

XP_038899697.1 uncharacterized protein LOC120086955 isoform X1 [Benincasa hispida] | 6.6e-74 | 70.59
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ----QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGN
        MAAD  EL VEEAMPWLP+QVLDEACDIK                         VYM+Q     QQ  +LH  RRHDRPL SP    F+L P+QKSKY N
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ----QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGN

Query:  VSARPHQRQKSA---ANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECN
        V +RPHQRQK A   ANWTAGG GMQAIFLDSGRQLGGTGVFLPRG   T YQPNKKP CS+VL+PARVVQALNLDVQALG QIS RK+P  NQKGRECN
Subjt:  VSARPHQRQKSA---ANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECN

Query:  STVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        S VKNKKGKDVTSTNCS MSQNQTNSSQEIFLPKEWTY
Subjt:  STVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

XP_038899698.1 uncharacterized protein LOC120086955 isoform X2 [Benincasa hispida] | 6.2e-72 | 70.17
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ----QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGN
        MAAD  EL VEEAMPWLP+QVLDEACDIK                         VYM+Q     QQ  +LH  RRHDRPL SP    F+L P  KSKY N
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ----QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGN

Query:  VSARPHQRQKSA---ANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECN
        V +RPHQRQK A   ANWTAGG GMQAIFLDSGRQLGGTGVFLPRG   T YQPNKKP CS+VL+PARVVQALNLDVQALG QIS RK+P  NQKGRECN
Subjt:  VSARPHQRQKSA---ANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECN

Query:  STVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        S VKNKKGKDVTSTNCS MSQNQTNSSQEIFLPKEWTY
Subjt:  STVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3BS46 uncharacterized protein LOC103492680 | 1.9e-71 | 66.24
Query:  ELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ------QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR
        +L +EEAMPWLPSQVLDEACDIK                         VYMRQ+      QQ P+LHRQR H RPL SP    FAL  ++KSKY N+ +R
Subjt:  ELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQ------QQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR

Query:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPR--GAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGREC-NSTVKN
        P+Q+QK A+NWTAGG GMQAIFLDSGRQLGGTGVFLPR  G ++ YQPN+KP CS+VL+PARVV+ALNLDVQALG QISPRK+   NQKGREC NS VKN
Subjt:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPR--GAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGREC-NSTVKN

Query:  KKGKDVTSTNCSFMSQNQTNSSQE-IFLPKEWTY
        KKGKD+TST+CSFMSQNQTNSSQ+ IFLPKEWTY
Subjt:  KKGKDVTSTNCSFMSQNQTNSSQE-IFLPKEWTY

A0A6J1D8Z6 uncharacterized protein LOC111018471 | 1.7e-107 | 88.7
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR
        MAADDLELAVEEAMPWLPSQVLDEACDIK                         VYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSAR

Query:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG
        PHQ+QKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG
Subjt:  PHQRQKSAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG

Query:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
Subjt:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

A0A6J1GR68 uncharacterized protein LOC111456382 isoform X2 | 3.1e-69 | 67.93
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV
        MAA+  EL VEEA+PWLPSQVLDEACDIK                         VYMRQ   +QQ  +L RQRRHDRPL S  S  FAL    KSK+ NV
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV

Query:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNS
        S+ PHQRQK   SAANWTAGG GMQAIFLD GRQLGGTGVFLPRG  +T YQPNKKP CS+VL+PARVVQALNLDVQALG QISPRKDP  NQ GRECNS
Subjt:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNS

Query:  TVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
         VKNKKGKDV    CSF+S+NQ +SSQEIFLPKEWTY
Subjt:  TVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

A0A6J1GRJ3 uncharacterized protein LOC111456382 isoform X1 | 3.3e-71 | 68.35
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV
        MAA+  EL VEEA+PWLPSQVLDEACDIK                         VYMRQ   +QQ  +L RQRRHDRPL S  S  FAL  +QKSK+ NV
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV

Query:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNS
        S+ PHQRQK   SAANWTAGG GMQAIFLD GRQLGGTGVFLPRG  +T YQPNKKP CS+VL+PARVVQALNLDVQALG QISPRKDP  NQ GRECNS
Subjt:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNS

Query:  TVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
         VKNKKGKDV    CSF+S+NQ +SSQEIFLPKEWTY
Subjt:  TVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

A0A6J1JXV2 uncharacterized protein LOC111488842 isoform X1 | 1.4e-69 | 67.78
Query:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV
        MAA+  EL VEEAMPWLPSQVLDEACDIK                         VYMRQ   +QQ  +L RQR HDRPL S  S  FAL  +QKSK+ NV
Subjt:  MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQ---QQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNV

Query:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQA--LGFQISPRKDPNGNQKGREC
        S+ PHQRQK   SAANWTAGG GMQAIFLD GRQLGGTGVFLPRG   T YQPNKKP CS+VL+PARVVQALNLDVQA  LG QISPRKDP  N+ GREC
Subjt:  SARPHQRQK---SAANWTAGGPGMQAIFLDSGRQLGGTGVFLPRG-AATGYQPNKKPVCSIVLLPARVVQALNLDVQA--LGFQISPRKDPNGNQKGREC

Query:  NSTVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        NS VKNKKGKDV   NCSF+S+NQ +SSQEIFLPKEWTY
Subjt:  NSTVKNKKGKDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G02830.1 unknown protein | 9.7e-15 | 41.54
Query:  RQKSAANW--TAGGPGMQAIFL-DSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG
        RQK  +NW  + G   MQA FL   GR   GTGVFLP  A   + P KK  CS VLLP RVVQALNL++   G  ISPR +   N            KK 
Subjt:  RQKSAANW--TAGGPGMQAIFL-DSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG

Query:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        + + +T     ++N  +S +++ LP+EW Y
Subjt:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

AT4G02830.2 unknown protein | 9.7e-15 | 41.54
Query:  RQKSAANW--TAGGPGMQAIFL-DSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG
        RQK  +NW  + G   MQA FL   GR   GTGVFLP  A   + P KK  CS VLLP RVVQALNL++   G  ISPR +   N            KK 
Subjt:  RQKSAANW--TAGGPGMQAIFL-DSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKG

Query:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY
        + + +T     ++N  +S +++ LP+EW Y
Subjt:  KDVTSTNCSFMSQNQTNSSQEIFLPKEWTY

AT5G59050.1 unknown protein | 3.0e-08 | 34.53
Query:  HQRQKSAANWTAGGPGMQAIFLD-SGRQL--GGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKD-PNGNQKGRECNSTVKN
        HQ Q+  +       G++A+F+D SG +   GGTGVFLPRG  T  +  KK  CS V++PARVV+AL +    LG   +   D P  +       +  K 
Subjt:  HQRQKSAANWTAGGPGMQAIFLD-SGRQL--GGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKD-PNGNQKGRECNSTVKN

Query:  KKGKDVTSTNCSFMSQNQTNSSQEIF------LPKEWTY
        K  K+ + +     S  +   S E        LP+EWTY
Subjt:  KKGKDVTSTNCSFMSQNQTNSSQEIF------LPKEWTY


Sequences
CDS sequence
ATGGCTGCCGACGACCTCGAGCTTGCTGTAGAAGAAGCAATGCCATGGCTTCCGAGTCAAGTTCTCGACGAGGCTTGCGATATCAAGGTTCTACTTCTAATTCTACTTCT
TCATCCATCGATAAATCGATCCATAGATCGCCTGATCTCTTTTAAATTTCAGGTGTATATGCGGCAACAACAGCAGAAGCCTTACCTCCACCGGCAACGGCGCCATGATC
GTCCTCTGTCATCGCCGCCGTCGGATCATTTCGCGCTTCCACCGCATCAGAAATCGAAGTACGGTAATGTCTCTGCTCGGCCGCATCAGAGGCAGAAATCGGCGGCCAAT
TGGACGGCCGGAGGACCTGGAATGCAAGCGATTTTCCTCGACTCCGGCCGGCAATTAGGCGGCACCGGCGTTTTTCTTCCTCGAGGAGCAGCCACTGGTTACCAACCAAA
CAAGAAGCCAGTTTGCTCCATCGTTCTTCTCCCTGCTCGTGTTGTTCAAGCTCTTAATCTCGACGTTCAAGCATTAGGATTTCAAATTTCTCCTCGAAAAGATCCCAACG
GCAACCAAAAGGGTAGAGAGTGCAACTCAACTGTAAAAAACAAGAAGGGCAAAGATGTAACATCCACAAATTGCTCTTTCATGTCCCAAAATCAAACCAATTCATCCCAA
GAGATATTTCTTCCCAAGGAATGGACATAT
mRNA sequence
ATGGCTGCCGACGACCTCGAGCTTGCTGTAGAAGAAGCAATGCCATGGCTTCCGAGTCAAGTTCTCGACGAGGCTTGCGATATCAAGGTTCTACTTCTAATTCTACTTCT
TCATCCATCGATAAATCGATCCATAGATCGCCTGATCTCTTTTAAATTTCAGGTGTATATGCGGCAACAACAGCAGAAGCCTTACCTCCACCGGCAACGGCGCCATGATC
GTCCTCTGTCATCGCCGCCGTCGGATCATTTCGCGCTTCCACCGCATCAGAAATCGAAGTACGGTAATGTCTCTGCTCGGCCGCATCAGAGGCAGAAATCGGCGGCCAAT
TGGACGGCCGGAGGACCTGGAATGCAAGCGATTTTCCTCGACTCCGGCCGGCAATTAGGCGGCACCGGCGTTTTTCTTCCTCGAGGAGCAGCCACTGGTTACCAACCAAA
CAAGAAGCCAGTTTGCTCCATCGTTCTTCTCCCTGCTCGTGTTGTTCAAGCTCTTAATCTCGACGTTCAAGCATTAGGATTTCAAATTTCTCCTCGAAAAGATCCCAACG
GCAACCAAAAGGGTAGAGAGTGCAACTCAACTGTAAAAAACAAGAAGGGCAAAGATGTAACATCCACAAATTGCTCTTTCATGTCCCAAAATCAAACCAATTCATCCCAA
GAGATATTTCTTCCCAAGGAATGGACATAT
Protein sequence
MAADDLELAVEEAMPWLPSQVLDEACDIKVLLLILLLHPSINRSIDRLISFKFQVYMRQQQQKPYLHRQRRHDRPLSSPPSDHFALPPHQKSKYGNVSARPHQRQKSAAN
WTAGGPGMQAIFLDSGRQLGGTGVFLPRGAATGYQPNKKPVCSIVLLPARVVQALNLDVQALGFQISPRKDPNGNQKGRECNSTVKNKKGKDVTSTNCSFMSQNQTNSSQ
EIFLPKEWTY
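
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used, and the names `CODON_TABLE` and `translate` are illustrative rather than part of any database tooling.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence into one-letter amino acids."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 21 bases and last 12 bases of the CDS listed above.
print(translate("ATGGCTGCCGACGACCTCGAG"))  # MAADDLE
print(translate("GAATGGACATAT"))           # EWTY
```

Both fragments agree with the start (MAADDLE) and end (EWTY) of the protein sequence listed above. The listed CDS ends without a stop codon, so no trailing `*` is expected when the full sequence is translated.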