CuGenDBv2

MS013685 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013685
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function, DUF538
Genome location: scaffold263:163246..163704
RNA-Seq Expression: MS013685
Synteny: MS013685
Gene Ontology terms: NA
InterPro domains: IPR007493 - Protein of unknown function DUF538
                  IPR036758 - At5g01610-like superfamily
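
The genome location field can be split into a scaffold name and a coordinate range. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the record); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption about this record).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

print(parse_location("scaffold263:163246..163704"))
# ('scaffold263', 163246, 163704, 459)
```

Under that assumption the gene spans 459 bp on scaffold263.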


Homology
GenBank top hits: e value, % identity, alignment
KAG7030212.1 hypothetical protein SDJN02_08559, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.1e-59 | % identity: 76.97
Query:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE
        MS +AA P+  NGE+ SVYDILREFNFPIGLLPEG VGC LDR TGKLEAYL  ACQF P++ Y+L+YK+TISGQIS+NRLTDLKGV+VKFMFFWVNIVE
Subjt:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE

Query:  VVRVGDDLEFSVGMATASFPVENFSEC-PQCGCGIDCKNGKVRKINAKSLIS
        VVR GDDL FSVG+ATASFPV+NFSEC P  GCG DCKNGK  KI  KSL+S
Subjt:  VVRVGDDLEFSVGMATASFPVENFSEC-PQCGCGIDCKNGKVRKINAKSLIS

XP_004136167.1 uncharacterized protein LOC101222381 [Cucumis sativus] | e value: 4.6e-63 | % identity: 79.74
Query:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV
        MSP+  A PS  NG++ SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL  +C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV

Query:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        EVVR GDDLEFS+GMATASFPV+NFSECP  GCG+DC +GKVRKI AKSL+SS
Subjt:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS

XP_008451557.1 PREDICTED: uncharacterized protein LOC103492802 [Cucumis melo] | e value: 9.2e-64 | % identity: 80.39
Query:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV
        MSP+  ALPS  NGE+ SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL  +C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV

Query:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        EVVR GDDLEFS+GMATASFPV+NFSECP   CG+DC +GKVRKI AKSL+SS
Subjt:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS

XP_022151830.1 uncharacterized protein LOC111019713 [Momordica charantia] | e value: 2.9e-81 | % identity: 99.35
Query:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE
        MSPSAALPSAGNGES SVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE
Subjt:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE

Query:  VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV
        VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV
Subjt:  VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV

XP_038896189.1 uncharacterized protein LOC120084473 [Benincasa hispida] | e value: 1.3e-62 | % identity: 79.74
Query:  MSPSA-ALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV
        MSP+  A PS  + E+LSVYDILREFNFPIGLLPEG VG +LDR TGKLEAYL G+C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPSA-ALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV

Query:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        EVVR GDDL+FSVGMATASFPV+NFSECP  GCG+DC +GKVRKI AKSL+SS
Subjt:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
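
The % identity values listed for each hit can be approximated from the middle line of an alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a rough sketch, assuming the middle line is exactly as long as the aligned region and has not lost leading or trailing spaces during extraction; the function name `percent_identity` is hypothetical, and the database's own calculation may differ.

```python
def percent_identity(midline_segments):
    """Estimate % identity from BLAST-style alignment middle lines.

    Letters count as identical positions; '+' and ' ' count as non-identical.
    Assumes each segment covers the full aligned width (an assumption).
    """
    midline = "".join(midline_segments)
    if not midline:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)

# Middle lines of the XP_022151830.1 alignment, copied from the record above.
segments = [
    "MSPSAALPSAGNGES SVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE",
    "VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV",
]
print(round(percent_identity(segments), 2))
```

For this hit the middle line differs from the query at a single position, so the result should land close to the listed 99.35.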

TrEMBL top hits: e value, % identity, alignment
A0A0A0K643 Uncharacterized protein | e value: 2.2e-63 | % identity: 79.74
Query:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV
        MSP+  A PS  NG++ SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL  +C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV

Query:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        EVVR GDDLEFS+GMATASFPV+NFSECP  GCG+DC +GKVRKI AKSL+SS
Subjt:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS

A0A1S3BR55 uncharacterized protein LOC103492802 | e value: 4.5e-64 | % identity: 80.39
Query:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV
        MSP+  ALPS  NGE+ SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL  +C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV

Query:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        EVVR GDDLEFS+GMATASFPV+NFSECP   CG+DC +GKVRKI AKSL+SS
Subjt:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS

A0A5A7VQJ7 Uncharacterized protein | e value: 4.5e-64 | % identity: 80.39
Query:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV
        MSP+  ALPS  NGE+ SVYDILREFNFPIGLLPEG VGC+LDR TGKLEAYL  +C F PDE Y+LKYKSTISG ISRNRLT+LKGVSVKFMFFWVNIV
Subjt:  MSPS-AALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIV

Query:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        EVVR GDDLEFS+GMATASFPV+NFSECP   CG+DC +GKVRKI AKSL+SS
Subjt:  EVVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS

A0A6J1DCA2 uncharacterized protein LOC111019713 | e value: 1.4e-81 | % identity: 99.35
Query:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE
        MSPSAALPSAGNGES SVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE
Subjt:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE

Query:  VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV
        VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV
Subjt:  VVRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV

A0A6J1KX02 uncharacterized protein LOC111497901 | e value: 5.1e-60 | % identity: 76.97
Query:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE
        MS +AA P+  NGE+ SVYDILREFNFPIGLLPEG VGC LDR TGKLEAYL  ACQF P++ Y+L+YK+TISGQIS+NRLTDLKGV+VKFMFFWVNIVE
Subjt:  MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVE

Query:  VVRVGDDLEFSVGMATASFPVENFSEC-PQCGCGIDCKNGKVRKINAKSLIS
        VVR GDDL FSVG+ATASFPV+NFSEC P  GCG DCKNGK  KI  KSL+S
Subjt:  VVRVGDDLEFSVGMATASFPVENFSEC-PQCGCGIDCKNGKVRKINAKSLIS

SwissProt top hits: e value, % identity, alignment
Q9M015 Uncharacterized protein At5g01610 | e value: 1.8e-09 | % identity: 28.42
Query:  DILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLEFSVGM
        ++L+E++ PIG+ P  A     D  T KL   +   C+    ++  LK+ +T++G + + +LTD++G+  K M  WV +  +      + F+ GM
Subjt:  DILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLEFSVGM

Arabidopsis top hits: e value, % identity, alignment
AT1G02813.1 Protein of unknown function, DUF538 | e value: 4.3e-27 | % identity: 46.09
Query:  SAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDL
        S    +  SVY +L  +  P G+LPEG     L+R TG  +      CQF  D +Y +KYK  ISG I+R R+  L GVSVK +FFW+NI EV R GDD+
Subjt:  SAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDL

Query:  EFSVGMATASFPVENFSECPQCGCGIDC
        EF VG A+  F  + F + P+CGCG +C
Subjt:  EFSVGMATASFPVENFSECPQCGCGIDC

AT1G02816.1 Protein of unknown function, DUF538 | e value: 1.3e-39 | % identity: 47.68
Query:  PSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVV
        PS  + +A + +  + Y +L+ +NFP+G+LP+G V   LD++TG+  AY   +C F    +Y L YKSTISG IS N++T L GV VK +F W+NIVEV+
Subjt:  PSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVV

Query:  RVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV
        R GD+LEFSVG+ +A+F ++ F E PQCGCG DCK  K + +     +SSV
Subjt:  RVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV

AT4G02360.1 Protein of unknown function, DUF538 | e value: 2.5e-35 | % identity: 53.08
Query:  AGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLE
        A +G+  + YD ++ +N P G+LP+G V   L+  TG  + Y    C+F   ++Y LKYKSTISG IS   + +LKGVSVK +FFWVNI EV   G DL+
Subjt:  AGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLE

Query:  FSVGMATASFPVENFSECPQCGCGIDCKNG
        FSVG+A+ASFP  NF E PQCGCG DC NG
Subjt:  FSVGMATASFPVENFSECPQCGCGIDCKNG

AT4G02370.1 Protein of unknown function, DUF538 | e value: 3.8e-39 | % identity: 47.68
Query:  SPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEV
        S +AA+ +A   ++ + Y +L+ +NFP+G+LP+G V   LD  TGK  AY   +C F    +Y L YKSTISG IS N+L  L GV VK +F W+NIVEV
Subjt:  SPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEV

Query:  VRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS
        +R GD++EFSVG+ +A+F ++ F E PQCGCG +CK+ K+  I     +SS
Subjt:  VRVGDDLEFSVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISS

AT5G19590.1 Protein of unknown function, DUF538 | e value: 1.6e-13 | % identity: 33.64
Query:  LREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFR-PDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLEFSVGMATASFP
        L    FPIGLLP       L++ +G    +L GAC+   P + Y   Y + ++G+IS+ ++ +L+G+ V+  F   +I  +   GD+L F V   TA +P
Subjt:  LREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFR-PDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLEFSVGMATASFP

Query:  VENFSECPQC
         +NF E   C
Subjt:  VENFSECPQC


Sequences
CDS sequence
ATGTCGCCCTCCGCCGCATTGCCCTCCGCTGGCAACGGCGAATCACTGTCGGTTTACGACATTCTCCGGGAATTCAACTTCCCGATCGGTCTCCTCCCAGAGGGTGCAGT
GGGCTGCCGTCTGGACCGAGCCACCGGGAAATTGGAGGCTTATCTGGGCGGAGCTTGCCAATTCAGACCGGACGAGACATATGACCTGAAATATAAATCTACCATTAGTG
GGCAGATCTCCAGGAATAGGCTTACAGATCTGAAGGGTGTGAGTGTGAAGTTCATGTTCTTCTGGGTCAACATTGTGGAGGTGGTGAGAGTAGGCGACGATCTGGAGTTC
TCGGTGGGGATGGCGACGGCGTCGTTTCCGGTGGAGAATTTCTCCGAGTGCCCACAATGTGGGTGTGGAATCGATTGCAAGAATGGGAAAGTGAGGAAAATTAATGCCAA
ATCTCTTATTTCATCCGTT
mRNA sequence
ATGTCGCCCTCCGCCGCATTGCCCTCCGCTGGCAACGGCGAATCACTGTCGGTTTACGACATTCTCCGGGAATTCAACTTCCCGATCGGTCTCCTCCCAGAGGGTGCAGT
GGGCTGCCGTCTGGACCGAGCCACCGGGAAATTGGAGGCTTATCTGGGCGGAGCTTGCCAATTCAGACCGGACGAGACATATGACCTGAAATATAAATCTACCATTAGTG
GGCAGATCTCCAGGAATAGGCTTACAGATCTGAAGGGTGTGAGTGTGAAGTTCATGTTCTTCTGGGTCAACATTGTGGAGGTGGTGAGAGTAGGCGACGATCTGGAGTTC
TCGGTGGGGATGGCGACGGCGTCGTTTCCGGTGGAGAATTTCTCCGAGTGCCCACAATGTGGGTGTGGAATCGATTGCAAGAATGGGAAAGTGAGGAAAATTAATGCCAA
ATCTCTTATTTCATCCGTT
Protein sequence
MSPSAALPSAGNGESLSVYDILREFNFPIGLLPEGAVGCRLDRATGKLEAYLGGACQFRPDETYDLKYKSTISGQISRNRLTDLKGVSVKFMFFWVNIVEVVRVGDDLEF
SVGMATASFPVENFSECPQCGCGIDCKNGKVRKINAKSLISSV
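
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies and that the CDS is given in frame from its first base; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon.

    Any trailing bases that do not form a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases of the CDS listed above.
cds_start = "ATGTCGCCCTCCGCCGCATTGCCCTCCGCT"
print(translate(cds_start))  # MSPSAALPSA
```

Joining the wrapped CDS lines into one string before calling `translate` gives the full product; if the record is internally consistent, the result should agree with the protein sequence shown above.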