CuGenDBv2

MC04g1991 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1991
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Myosin heavy chain, striated muscle
Genome location: MC04:26672131..26675518
RNA-Seq Expression: MC04g1991
Synteny: MC04g1991
Gene Ontology terms: NA
InterPro domains: NA
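The genome location field can be split into chromosome, start, and end for downstream use. The following is a minimal sketch, assuming the `chrom:start..end` layout shown above and 1-based inclusive coordinates (the record does not state the coordinate convention); the function name `parse_location` is illustrative and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC04:26672131..26675518")
# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3,388 bp on MC04.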


Homology
GenBank top hits | e value | % identity | Alignment
KAG7015899.1 hypothetical protein SDJN02_21002 [Cucurbita argyrosperma subsp. argyrosperma] | 1.47e-125 | 80.08
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        LDAAKSELK+LKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQTLELIQQEKVSLGAKIIEKS YY K +E+I+LKFQD QDWVNANMI  E EE
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
        H LVK +TA+R SETEGS DT+ GISGT+IYC+P NLVEE+KDLLGKLESAK KLSQV+KMKCAV+LENSKI QSIEEVK+ LN+F  +LR MD VTL+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD
        E KALLSDKAGET+YS+SLQDQIAKLK ISRVIKCTCG+EYKAG+ 
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD

XP_022152474.1 uncharacterized protein LOC111020195 [Momordica charantia] | 3.60e-152 | 86.83
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L +AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDV------------
        HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDV            
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDV------------

Query:  ----------------------KLREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
                              +LREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
Subjt:  ----------------------KLREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

XP_023550047.1 uncharacterized protein LOC111808353 isoform X1 [Cucurbita pepo subsp. pepo] | 4.91e-128 | 81.3
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L AAKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQTLELIQQEKVSLGAKIIEKS YY K +E+I+LKFQD QDWVNANMIR E EE
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
        HELV  +TA+R SETEGS DT+ GISGT+IYC+P NLVEE+KDLLGKLESAKAKLSQV+KMKCAV+LEN KI QSIEEVK+ LN+F  +LR MD VTL+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD
        E KALLSDKAGETEYSQSLQDQIAKLK ISRVIKCTCG+EYKAG+ 
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD

XP_023550050.1 uncharacterized protein LOC111808353 isoform X3 [Cucurbita pepo subsp. pepo] | 1.46e-127 | 81.15
Query:  AAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHE
        +AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQTLELIQQEKVSLGAKIIEKS YY K +E+I+LKFQD QDWVNANMIR E EEHE
Subjt:  AAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHE

Query:  LVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDEEY
        LV  +TA+R SETEGS DT+ GISGT+IYC+P NLVEE+KDLLGKLESAKAKLSQV+KMKCAV+LEN KI QSIEEVK+ LN+F  +LR MD VTL+EE 
Subjt:  LVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDEEY

Query:  KALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD
        KALLSDKAGETEYSQSLQDQIAKLK ISRVIKCTCG+EYKAG+ 
Subjt:  KALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD

XP_038885088.1 myosin heavy chain, striated muscle isoform X1 [Benincasa hispida] | 2.98e-129 | 81.78
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L +AKSELKQL EDAE+MMRAKGEIC QIL KQRKIASLESDI+TLSQTL+LIQQEKVSLGAKIIEKS YY K AEEI+LKFQD QDWVNANMIRREV E
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
        HELVKL+TA+RASETEG SDT+ GISGT+IYCNP NLVEE+KDLLGKLESA+AKLSQV K KCA++LE SKI QSIEE+K+ LN+F  +LR MD VTL+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        E KALLSD+AGETEYS+SLQDQIAKLKGISRVIKCTCG+EY AGV L
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

TrEMBL top hits | e value | % identity | Alignment
A0A1S3B2U9 uncharacterized protein LOC103485401 isoform X1 | 7.25e-125 | 79.76
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L++AKSELKQL EDAE+MM+AKGEICSQIL KQRKIASLESD+STLSQTLELIQQEKVSLGAKIIEKS YYTK AE+INLKFQD QDWVNANMIR EVEE
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
         +LVKL+ A++ASETEG SD + GISGT+IY NPKNLVE + DLLGKLESA+AKLS+V+K KCAV+LE SKI QSIEE+K+ LN+F  +LR MD VTL+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        EYKALLSD+AGETEYS+SLQD+IAKLKGIS VIKCTCG+EYKAGV L
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

A0A6J1DG39 uncharacterized protein LOC111020195 | 1.74e-152 | 86.83
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L +AKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDV------------
        HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDV            
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDV------------

Query:  ----------------------KLREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
                              +LREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
Subjt:  ----------------------KLREMDVVTLDEEYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

A0A6J1FHU2 uncharacterized protein LOC111445570 isoform X3 | 5.07e-122 | 78.86
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L AAKSELKQ KEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQTLELIQQEKVSLGAKIIEKS YY K +E+I+LKFQD QDWVNANMIR E EE
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
        H LVK +TA+R SETEGS DT+ GISGT+IYC+P NLVEE+KDL    ESAK KLSQV+KMKCAV+LENSKI QSIEEVK+ LN+F  +LR MD VTL+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD
        E KALLSDKAGETEYS+SLQDQIAKLK ISRVIKCTCG+EYKAG+ 
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVD

A0A6J1JXU6 uncharacterized protein LOC111489811 isoform X1 | 1.30e-125 | 79.76
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L AAKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQTLELIQQEKVSLGAKIIEKS YY K +E+I+LKFQD QDWVNANMIR E E 
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
        HELVK +TA+R SETEGS DT+ GISGT+IYCN   +VEE+KDLLGKLESAKAKLSQV+KMKCAV+LENSKI QSIEEVK+ LN+F  +LR MD VTL+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        E KALLSDKAGETEYS+SLQDQIAKLK IS VIKCTCG+EYK G+ L
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

A0A6J1K3U6 uncharacterized protein LOC111489811 isoform X3 | 3.87e-125 | 79.59
Query:  AAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHE
        +AKSELKQLKEDAE+MMRAKGEICSQIL +QRKI SLE DI TLSQTLELIQQEKVSLGAKIIEKS YY K +E+I+LKFQD QDWVNANMIR E E HE
Subjt:  AAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHE

Query:  LVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDEEY
        LVK +TA+R SETEGS DT+ GISGT+IYCN   +VEE+KDLLGKLESAKAKLSQV+KMKCAV+LENSKI QSIEEVK+ LN+F  +LR MD VTL+EE 
Subjt:  LVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDEEY

Query:  KALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL
        KALLSDKAGETEYS+SLQDQIAKLK IS VIKCTCG+EYK G+ L
Subjt:  KALLSDKAGETEYSQSLQDQIAKLKGISRVIKCTCGEEYKAGVDL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G33500.1 unknown protein | 3.8e-33 | 41.41
Query:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE
        L+ A SE K+LKE+ +Q  R +GEICS IL KQRKI+S+ESD   ++Q+LELI QE+ SL AK++ K   Y K AEE   K ++ + W  ++M       
Subjt:  LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEE

Query:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE
                         S++T  G  G        +  E + +L+   +SA+AKL Q   M+  ++ ENSKI  SIE VK ++NEF  +L  +D+  L+E
Subjt:  HELVKLDTAERASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDE

Query:  EYKALLSDKAGETEYSQSLQDQIAKLK
        EY ALLSD++GE EY  SLQ Q  KLK
Subjt:  EYKALLSDKAGETEYSQSLQDQIAKLK
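The % identity column can be related to the alignment blocks through the middle line of each block, where a letter marks an identical residue and `+` marks a conservative substitution. The sketch below counts both for the last Arabidopsis block only; `midline_stats` is a hypothetical helper, and it assumes the midline has been copied without its leading indentation. BLAST reports percent identity over the full alignment length, so a single block will not reproduce the tabulated value.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, block length) for one alignment block."""
    # Pad the midline in case trailing spaces were stripped during copying.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "EYKALLSDKAGETEYSQSLQDQIAKLK"
midline = "EY ALLSD++GE EY  SLQ Q  KLK"
identities, positives, length = midline_stats(query, midline)
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))
```

Summing identities and lengths over all blocks of a hit, then dividing once, gives the figure comparable to the table.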


Sequences
CDS sequence
CTGGATGCAGCAAAAAGTGAGTTAAAACAACTCAAAGAGGATGCTGAGCAAATGATGCGGGCAAAGGGCGAAATATGCTCCCAGATATTAGCAAAACAAAGAAAAATAGC
CTCTTTGGAGTCTGACATATCTACACTTTCACAGACACTCGAGCTCATTCAACAAGAAAAGGTCAGTTTAGGCGCCAAAATCATTGAGAAGAGTATTTATTATACCAAAG
CTGCTGAGGAGATCAATCTGAAATTTCAAGATCTACAGGACTGGGTAAATGCTAACATGATTCGCAGAGAAGTGGAAGAGCACGAATTGGTTAAACTTGACACCGCTGAG
CGAGCAAGTGAAACTGAAGGATCTTCTGATACAATTCGAGGGATCTCTGGCACTCAGATTTACTGCAATCCAAAAAATCTGGTGGAAGAGAAGAAAGATCTATTGGGCAA
GTTGGAATCTGCTAAAGCCAAACTTAGTCAAGTTACGAAAATGAAATGTGCAGTTATTTTGGAGAATTCCAAGATTGCACAGTCAATTGAGGAAGTTAAGAGCAGATTAA
ATGAGTTCGATGTTAAACTCAGGGAAATGGATGTCGTTACATTGGACGAAGAGTACAAGGCTCTCTTATCAGATAAAGCTGGAGAAACGGAGTACTCACAGTCCCTTCAA
GACCAAATTGCAAAACTGAAGGGAATTTCCCGTGTGATTAAATGCACTTGTGGAGAGGAATACAAGGCTGGAGTAGACCTA
mRNA sequence
CTGGATGCAGCAAAAAGTGAGTTAAAACAACTCAAAGAGGATGCTGAGCAAATGATGCGGGCAAAGGGCGAAATATGCTCCCAGATATTAGCAAAACAAAGAAAAATAGC
CTCTTTGGAGTCTGACATATCTACACTTTCACAGACACTCGAGCTCATTCAACAAGAAAAGGTCAGTTTAGGCGCCAAAATCATTGAGAAGAGTATTTATTATACCAAAG
CTGCTGAGGAGATCAATCTGAAATTTCAAGATCTACAGGACTGGGTAAATGCTAACATGATTCGCAGAGAAGTGGAAGAGCACGAATTGGTTAAACTTGACACCGCTGAG
CGAGCAAGTGAAACTGAAGGATCTTCTGATACAATTCGAGGGATCTCTGGCACTCAGATTTACTGCAATCCAAAAAATCTGGTGGAAGAGAAGAAAGATCTATTGGGCAA
GTTGGAATCTGCTAAAGCCAAACTTAGTCAAGTTACGAAAATGAAATGTGCAGTTATTTTGGAGAATTCCAAGATTGCACAGTCAATTGAGGAAGTTAAGAGCAGATTAA
ATGAGTTCGATGTTAAACTCAGGGAAATGGATGTCGTTACATTGGACGAAGAGTACAAGGCTCTCTTATCAGATAAAGCTGGAGAAACGGAGTACTCACAGTCCCTTCAA
GACCAAATTGCAAAACTGAAGGGAATTTCCCGTGTGATTAAATGCACTTGTGGAGAGGAATACAAGGCTGGAGTAGACCTA
Protein sequence
LDAAKSELKQLKEDAEQMMRAKGEICSQILAKQRKIASLESDISTLSQTLELIQQEKVSLGAKIIEKSIYYTKAAEEINLKFQDLQDWVNANMIRREVEEHELVKLDTAE
RASETEGSSDTIRGISGTQIYCNPKNLVEEKKDLLGKLESAKAKLSQVTKMKCAVILENSKIAQSIEEVKSRLNEFDVKLREMDVVTLDEEYKALLSDKAGETEYSQSLQ
DQIAKLKGISRVIKCTCGEEYKAGVDL
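The CDS and protein sequences above can be cross-checked by translating the nucleotides. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base; the function name `translate` is illustrative and not part of CuGenDB. Only the first 48 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 48 nucleotides of the CDS sequence listed in this record.
cds_start = "CTGGATGCAGCAAAAAGTGAGTTAAAACAACTCAAAGAGGATGCTGAG"
print(translate(cds_start))
```

The output can be compared with the first 16 residues of the protein sequence. Pasting the full 741-nucleotide CDS into `translate` would extend the same check to all 247 residues.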