; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019411 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019411
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold28:210395..210868
RNA-Seq ExpressionMS019411
SyntenyMS019411
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043783.1 uncharacterized protein E6C27_scaffold236G001150 [Cucumis melo var. makuwa]1.7e-6077.99Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE
        MSN IM RPP S RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VV+N LVLA+YK+PAGLCRRALRTKRR  LKKKG +P+RRGR+S  G DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
        TDIQILSAGK+L+SSEP+ Q++EETERKVMELEKEMWEIFY TGFWRSPS+R+Q   S+
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

XP_011652052.1 uncharacterized protein LOC105434984 [Cucumis sativus]2.2e-6078.62Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE
        MSN IMFRPP S RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VV+N LVLA+YK+PAGLCRRALRTKRR  LKKKG  P+RRGRFS  G DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
        TDIQILSAGK ++SSEP+ Q++ ETERKVMELEKEMWEIFY TGFWRSPS+RDQ   S+
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

XP_016899649.1 PREDICTED: uncharacterized protein LOC103486710 [Cucumis melo]8.1e-5574.05Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDET
        MSN IM RPP S RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VV+N LVLA+YK+PAGLCRRALRTKRR  LKKK            G DET
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDET

Query:  DIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
        DIQILSAGK+L+SSEP+ Q++EETERKVMELEKEMWEIFY TGFWRSPS+R+Q   S+
Subjt:  DIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

XP_022955715.1 uncharacterized protein LOC111457605 [Cucurbita moschata]5.2e-5472.78Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSS-GLDE
        MSN +MFRPP S R QPLL S+S  GS+RFAEVAGGT AECAAVCCCCP  VL+ LVLA+YK+PAGLCRRALRT+RRH+L +KG LPSRRGR S+   DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSS-GLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS
         D+QIL+ GK +H+SE K +KS+ETERKVMELE EMWE FYGTGFWRSPSQRDQ+  S
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS

XP_038905288.1 uncharacterized protein LOC120091362 [Benincasa hispida]2.0e-6180.89Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE
        MSN IM RPP + RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VVLN LVLA+YK+PAGLCRRALRTKRR  LKKKG +P+RR RFS  G DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQS
        TDIQILS GKT++SSEPK QKSEETER+VMELEKEMWEIFY TGFWRSPSQRDQ+ S
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQS

TrEMBL top hitse value%identityAlignment
A0A0A0LGV0 Uncharacterized protein1.1e-6078.62Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE
        MSN IMFRPP S RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VV+N LVLA+YK+PAGLCRRALRTKRR  LKKKG  P+RRGRFS  G DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
        TDIQILSAGK ++SSEP+ Q++ ETERKVMELEKEMWEIFY TGFWRSPS+RDQ   S+
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

A0A1S4DUJ2 uncharacterized protein LOC1034867103.9e-5574.05Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDET
        MSN IM RPP S RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VV+N LVLA+YK+PAGLCRRALRTKRR  LKKK            G DET
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDET

Query:  DIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
        DIQILSAGK+L+SSEP+ Q++EETERKVMELEKEMWEIFY TGFWRSPS+R+Q   S+
Subjt:  DIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

A0A5D3DQ13 Uncharacterized protein8.1e-6177.99Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE
        MSN IM RPP S RRQPLL+S+S SGSVRFAEVAGGT AECAAVCCCCP VV+N LVLA+YK+PAGLCRRALRTKRR  LKKKG +P+RRGR+S  G DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFS-SGLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
        TDIQILSAGK+L+SSEP+ Q++EETERKVMELEKEMWEIFY TGFWRSPS+R+Q   S+
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

A0A6J1GVU4 uncharacterized protein LOC1114576052.5e-5472.78Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSS-GLDE
        MSN +MFRPP S R QPLL S+S  GS+RFAEVAGGT AECAAVCCCCP  VL+ LVLA+YK+PAGLCRRALRT+RRH+L +KG LPSRRGR S+   DE
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSS-GLDE

Query:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS
         D+QIL+ GK +H+SE K +KS+ETERKVMELE EMWE FYGTGFWRSPSQRDQ+  S
Subjt:  TDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS

A0A6J1J6H8 uncharacterized protein LOC1114817887.4e-4665.41Show/hide
Query:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNL-KKKGQLPSRRGRFSSG-LD
        MSNA+MFRP  S  RQ L+ S+S S +VRFAEVAGGT AECAAV CCCP + +N LVLA+YKLPAGLCRRALRTK R N+ KKKG  P+R GRF  G LD
Subjt:  MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNL-KKKGQLPSRRGRFSSG-LD

Query:  ETDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS
        + D+Q+++  KT++        SEE ERKV+ELEKEMWE+FY TGFWRSPSQRDQ+ SS
Subjt:  ETDIQILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27180.1 unknown protein9.0e-2038.56Show/hide
Query:  SIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALR-------TKRRHNLKKKGQLPSRRGRFSSGLDETDIQI
        ++R  P  ++       +  EVAGG AAECAAV CCCP  V+N++VLAVYK+PA +C++A R       T++RH L              + L+E D+  
Subjt:  SIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALR-------TKRRHNLKKKGQLPSRRGRFSSGLDETDIQI

Query:  LSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS
             T      +C  S      V+ LE EM + FYG GFWRSPSQ+D    S
Subjt:  LSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSS

AT3G11690.1 unknown protein1.4e-2540.66Show/hide
Query:  PPPSIRRQPLLSSQSGSGSVRF-------AEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPS-------------
        P  S RRQPLL     S S R        AE  GGT A CAAV CCCP  ++N+LVLA+YK+P G+CRRA+R++RR  L K G LP              
Subjt:  PPPSIRRQPLLSSQSGSGSVRF-------AEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPS-------------

Query:  -RRGRFS-SGLDETDI----------QILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
         +   F+   LD  D+           +   GK++ +     ++++E +  V+ LEKEMW  FYG GFWRSPSQR+ V S +
Subjt:  -RRGRFS-SGLDETDI----------QILSAGKTLHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK

AT5G06380.1 unknown protein1.9e-2245.27Show/hide
Query:  PSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDETDIQILSAGKT
        PS+RR     S  G GS   AE  GGT A CAA+C C P  V+N++VLAVYKLP GLCRRA+R  RR  L KK  + S R     G  +  +  L     
Subjt:  PSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDETDIQILSAGKT

Query:  LHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK
           S  + ++ EE +  V+ LEKEMW  FY  GFWRS SQ +   S K
Subjt:  LHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACGCGATTATGTTCCGGCCGCCGCCCTCGATCCGGCGGCAACCTCTGCTATCGAGCCAATCCGGCTCCGGCAGCGTCCGGTTCGCGGAGGTGGCCGGTGGAAC
CGCCGCCGAGTGCGCCGCCGTGTGCTGCTGTTGCCCGTTCGTGGTCCTCAACGTCCTCGTGCTCGCCGTTTACAAGCTTCCGGCCGGCCTCTGCCGCCGCGCCCTGAGAA
CGAAGCGCCGCCATAATCTGAAGAAGAAGGGGCAGCTTCCGTCGCGGCGCGGCCGGTTCTCGAGCGGCCTCGACGAGACGGATATTCAGATTCTGTCCGCCGGAAAAACG
CTGCACTCGTCGGAGCCAAAATGCCAGAAATCGGAGGAGACGGAGAGGAAAGTGATGGAGCTGGAGAAGGAGATGTGGGAGATTTTCTACGGCACCGGGTTCTGGAGAAG
TCCGTCGCAGAGAGATCAAGTTCAATCTTCAAAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAACGCGATTATGTTCCGGCCGCCGCCCTCGATCCGGCGGCAACCTCTGCTATCGAGCCAATCCGGCTCCGGCAGCGTCCGGTTCGCGGAGGTGGCCGGTGGAAC
CGCCGCCGAGTGCGCCGCCGTGTGCTGCTGTTGCCCGTTCGTGGTCCTCAACGTCCTCGTGCTCGCCGTTTACAAGCTTCCGGCCGGCCTCTGCCGCCGCGCCCTGAGAA
CGAAGCGCCGCCATAATCTGAAGAAGAAGGGGCAGCTTCCGTCGCGGCGCGGCCGGTTCTCGAGCGGCCTCGACGAGACGGATATTCAGATTCTGTCCGCCGGAAAAACG
CTGCACTCGTCGGAGCCAAAATGCCAGAAATCGGAGGAGACGGAGAGGAAAGTGATGGAGCTGGAGAAGGAGATGTGGGAGATTTTCTACGGCACCGGGTTCTGGAGAAG
TCCGTCGCAGAGAGATCAAGTTCAATCTTCAAAA
Protein sequenceShow/hide protein sequence
MSNAIMFRPPPSIRRQPLLSSQSGSGSVRFAEVAGGTAAECAAVCCCCPFVVLNVLVLAVYKLPAGLCRRALRTKRRHNLKKKGQLPSRRGRFSSGLDETDIQILSAGKT
LHSSEPKCQKSEETERKVMELEKEMWEIFYGTGFWRSPSQRDQVQSSK