; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004588 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004588
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein MODIFYING WALL LIGNIN-2-like
Genome locationscaffold995:513070..514598
RNA-Seq ExpressionMS004588
SyntenyMS004588
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139149.1 uncharacterized protein LOC111010123 isoform X1 [Momordica charantia]1.3e-9399.44Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKL+GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
        ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG

XP_022139157.1 uncharacterized protein LOC111010123 isoform X2 [Momordica charantia]1.2e-9198.88Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRT KKDLKL+GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
        ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG

XP_022937014.1 uncharacterized protein LOC111443438 [Cucurbita moschata]2.0e-8388.76Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVA +VCL+MA IIGNTIICH+YWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI
        ISWVSFGIAVAMM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALLVLINGGSTIGSAAIGRR    GP+Q+HAQI
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI

XP_022976861.1 uncharacterized protein LOC111477105 [Cucurbita maxima]1.6e-8087.08Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        ME  PS F ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVA +VCL+MA IIGNTIICH+YWPKE RKSCSVKRPLL TTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI
        ISWVSFGIAVAM+ GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALLVLINGGSTIGSAAIGRR    GP+Q+HAQI
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI

XP_023535227.1 uncharacterized protein LOC111796718 [Cucurbita pepo subsp. pepo]5.3e-8488.83Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVA +VCL+MA IIGNTIICH+YWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
        ISWVSFGIAVAMM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALLVLINGGSTIGSAAIGRR    GP+Q+HAQIG
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG

TrEMBL top hitse value%identityAlignment
A0A0A0L099 Uncharacterized protein3.8e-8070.46Show/hide
Query:  KSQLPQVKRENDGFSTTNLLESFV-FQYHDLKSTSSSSSFLATM----------EKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCF
        KS++ QVKREN  FSTTNL  SF+ FQ HDL ++SSSSS  ++               S F ISFS+VA LTL SFASCMAAEFNRTKK+DLKLN + CF
Subjt:  KSQLPQVKRENDGFSTTNLLESFV-FQYHDLKSTSSSSSFLATM----------EKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCF

Query:  LPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLLISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAAL
        LPESEAFKLG+  L+CL+MAQIIG T+ICHSYWPKE RKSCSVK+PLLS  LLISWVSF IAV M+SGATSMSRRQEY KGWVEGECY+VKDGIFV AA+
Subjt:  LPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLLISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAAL

Query:  LVLINGGSTIGSAAIG----RRSHV-TGPSQIHAQIG
        LVLINGGSTI SAAIG    R +HV   P+QIHAQIG
Subjt:  LVLINGGSTIGSAAIG----RRSHV-TGPSQIHAQIG

A0A6J1CC41 uncharacterized protein LOC111010123 isoform X16.1e-9499.44Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKL+GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
        ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG

A0A6J1CD79 uncharacterized protein LOC111010123 isoform X25.7e-9298.88Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRT KKDLKL+GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
        ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQIG

A0A6J1F9Y8 uncharacterized protein LOC1114434389.7e-8488.76Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVA +VCL+MA IIGNTIICH+YWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI
        ISWVSFGIAVAMM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALLVLINGGSTIGSAAIGRR    GP+Q+HAQI
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI

A0A6J1II23 uncharacterized protein LOC1114771057.7e-8187.08Show/hide
Query:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL
        ME  PS F ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVA +VCL+MA IIGNTIICH+YWPKE RKSCSVKRPLL TTLL
Subjt:  MEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI
        ISWVSFGIAVAM+ GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALLVLINGGSTIGSAAIGRR    GP+Q+HAQI
Subjt:  ISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIHAQI

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-11.8e-2640.13Show/hide
Query:  LVSFASCMAAEFNRTKK----------KDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLLISWVSFGIA
        L +F  C++AEF + K           KDLK +G  C+LPE+ AF LG+A+LVC+ +AQI+GN +IC  +   +K ++           LL SWV+F +A
Subjt:  LVSFASCMAAEFNRTKK----------KDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLLISWVSFGIA

Query:  VAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAA
        V ++S   SM+R Q YGKGW+  ECY+VKDG+F  +  L +    + +G+ A
Subjt:  VAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAA

O65708 Protein MODIFYING WALL LIGNIN-24.0e-2641.45Show/hide
Query:  FSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTT--LLISWVSFGI
        +S+V  L LVSF +C AAEF RT+K+D++ +  R C++P S AF LG A+++C  +AQI+GN ++  ++  + KR+       L   T  LL+SW +F +
Subjt:  FSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTT--LLISWVSFGI

Query:  AVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSA
         V ++S A SMSR Q YG+GW++ +CY+VKDG+F  +  L ++  G+   SA
Subjt:  AVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSA

Arabidopsis top hitse value%identityAlignment
AT1G31720.1 Protein of unknown function (DUF1218)9.8e-2838.32Show/hide
Query:  PSGFAISFSIVAFLTLVSFASCMAAEFNRTKK----------KDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPL
        P  F + F  +    L +F  C++AEF + K           KDLK +G  C+LPE+ AF LG+A+LVC+ +AQI+GN +IC  +   +K ++       
Subjt:  PSGFAISFSIVAFLTLVSFASCMAAEFNRTKK----------KDLKLNGRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPL

Query:  LSTTLLISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAA
            LL SWV+F +AV ++S   SM+R Q YGKGW+  ECY+VKDG+F  +  L +    + +G+ A
Subjt:  LSTTLLISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAA

AT4G19370.1 Protein of unknown function (DUF1218)2.8e-2741.45Show/hide
Query:  FSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTT--LLISWVSFGI
        +S+V  L LVSF +C AAEF RT+K+D++ +  R C++P S AF LG A+++C  +AQI+GN ++  ++  + KR+       L   T  LL+SW +F +
Subjt:  FSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRFCFLPESEAFKLGVASLVCLVMAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTT--LLISWVSFGI

Query:  AVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSA
         V ++S A SMSR Q YG+GW++ +CY+VKDG+F  +  L ++  G+   SA
Subjt:  AVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTGGAACAAAGTCCCAACTTCCACAAGTCAAGAGAGAAAATGATGGATTTTCCACCACTAACCTTCTAGAGTCTTTCGTTTTCCAATACCATGATTTAAAATCCACTTC
TTCTTCTTCGAGTTTCCTGGCCACCATGGAAAAGCACCCATCGGGCTTCGCAATAAGCTTTTCCATTGTCGCATTCCTCACCCTCGTCTCCTTTGCATCATGTATGGCTG
CTGAATTCAACAGAACAAAAAAGAAAGACCTGAAGTTGAACGGCAGATTCTGTTTCCTGCCTGAAAGTGAAGCATTCAAATTGGGAGTTGCAAGTTTGGTCTGTTTGGTG
ATGGCTCAGATCATTGGAAACACCATAATCTGCCATAGCTATTGGCCAAAAGAGAAAAGAAAGAGTTGCAGCGTCAAAAGGCCTCTGCTTTCAACTACCCTTCTCATCTC
TTGGGTCAGCTTCGGAATTGCGGTGGCGATGATGAGTGGAGCAACCAGCATGAGCAGGAGGCAGGAGTATGGGAAGGGGTGGGTGGAGGGAGAATGCTATGTGGTCAAAG
ATGGAATATTTGTTGGCGCAGCCTTATTGGTTCTCATTAATGGAGGGTCCACCATAGGCTCGGCCGCCATTGGAAGGAGGAGCCACGTCACTGGGCCCAGTCAAATACAT
GCACAAATTGGA
mRNA sequenceShow/hide mRNA sequence
ATTGGAACAAAGTCCCAACTTCCACAAGTCAAGAGAGAAAATGATGGATTTTCCACCACTAACCTTCTAGAGTCTTTCGTTTTCCAATACCATGATTTAAAATCCACTTC
TTCTTCTTCGAGTTTCCTGGCCACCATGGAAAAGCACCCATCGGGCTTCGCAATAAGCTTTTCCATTGTCGCATTCCTCACCCTCGTCTCCTTTGCATCATGTATGGCTG
CTGAATTCAACAGAACAAAAAAGAAAGACCTGAAGTTGAACGGCAGATTCTGTTTCCTGCCTGAAAGTGAAGCATTCAAATTGGGAGTTGCAAGTTTGGTCTGTTTGGTG
ATGGCTCAGATCATTGGAAACACCATAATCTGCCATAGCTATTGGCCAAAAGAGAAAAGAAAGAGTTGCAGCGTCAAAAGGCCTCTGCTTTCAACTACCCTTCTCATCTC
TTGGGTCAGCTTCGGAATTGCGGTGGCGATGATGAGTGGAGCAACCAGCATGAGCAGGAGGCAGGAGTATGGGAAGGGGTGGGTGGAGGGAGAATGCTATGTGGTCAAAG
ATGGAATATTTGTTGGCGCAGCCTTATTGGTTCTCATTAATGGAGGGTCCACCATAGGCTCGGCCGCCATTGGAAGGAGGAGCCACGTCACTGGGCCCAGTCAAATACAT
GCACAAATTGGA
Protein sequenceShow/hide protein sequence
IGTKSQLPQVKRENDGFSTTNLLESFVFQYHDLKSTSSSSSFLATMEKHPSGFAISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRFCFLPESEAFKLGVASLVCLV
MAQIIGNTIICHSYWPKEKRKSCSVKRPLLSTTLLISWVSFGIAVAMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLVLINGGSTIGSAAIGRRSHVTGPSQIH
AQIG