; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G17370 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G17370
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein MODIFYING WALL LIGNIN-2-like
Genome locationClcChr11:28140343..28143548
RNA-Seq ExpressionClc11G17370
SyntenyClc11G17370
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444240.1 PREDICTED: uncharacterized protein LOC103487629 [Cucumis melo]4.0e-8285.79Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEKP  S FVISFSIVA+LTLASFASC+AAEFNRTKK+DLKLNG+FCFLPESEAF+LG+GGLVCLIMAQIIG+T IYHSYWPKEHRKSCSVK PLLSI L
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSH--VKGPNQVHAQIG
        LISWVSF IAV+MMSGATSMSRRQEY +GWVEGECY+VKDGIFVGAALL LINGGSTIGSAAIGRR  +  VK PNQ+HAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSH--VKGPNQVHAQIG

XP_022139149.1 uncharacterized protein LOC111010123 isoform X1 [Momordica charantia]8.3e-8087.29Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEK P S F ISFSIVA LTL SFASC+AAEFNRTKKKDLKL+GRFCFLPESEAF+LGV  LVCL+MAQIIGNT I HSYWPKE RKSCSVK PLLS TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG
        LISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGSTIGSAAIGRR SHV GP+Q+HAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG

XP_022937014.1 uncharacterized protein LOC111443438 [Cucurbita moschata]2.0e-8188.33Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEKPP S FVISFSIVAVLTLASFASC+AAEFNRTKKKDLKLNGRFCFLPESEAF+LGV G+VCLIMA IIGNT I H+YWPKEHRKSCSVK PLLS TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQI
        LISWVSFGIAV MM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGSTIGSAAIGRR    KGPNQVHAQI
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQI

XP_023535227.1 uncharacterized protein LOC111796718 [Cucurbita pepo subsp. pepo]4.0e-8288.4Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEKPP S FVISFSIVAVLTLASFASC+AAEFNRTKKKDLKLNGRFCFLPESEAF+LGV G+VCLIMA IIGNT I H+YWPKEHRKSCSVK PLLS TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG
        LISWVSFGIAV MM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGSTIGSAAIGRR    KGPNQVHAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG

XP_038896171.1 protein MODIFYING WALL LIGNIN-1 [Benincasa hispida]5.9e-8689.5Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEKPP   FVISFSIVAVLT+ASFASC+AAEFNRTKK+DLKLNGR CFLPESEAF+LGVGGLVCLIMAQIIGN  I HSYWPKEHRKSCSVK P+LSI L
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG
        LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAA+L LINGGSTI SAAIGRRT+HVKGPNQ+HAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG

TrEMBL top hitse value%identityAlignment
A0A1S3B9G0 uncharacterized protein LOC1034876291.9e-8285.79Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEKP  S FVISFSIVA+LTLASFASC+AAEFNRTKK+DLKLNG+FCFLPESEAF+LG+GGLVCLIMAQIIG+T IYHSYWPKEHRKSCSVK PLLSI L
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSH--VKGPNQVHAQIG
        LISWVSF IAV+MMSGATSMSRRQEY +GWVEGECY+VKDGIFVGAALL LINGGSTIGSAAIGRR  +  VK PNQ+HAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSH--VKGPNQVHAQIG

A0A6J1CC41 uncharacterized protein LOC111010123 isoform X14.0e-8087.29Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEK P S F ISFSIVA LTL SFASC+AAEFNRTKKKDLKL+GRFCFLPESEAF+LGV  LVCL+MAQIIGNT I HSYWPKE RKSCSVK PLLS TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG
        LISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGSTIGSAAIGRR SHV GP+Q+HAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG

A0A6J1CD79 uncharacterized protein LOC111010123 isoform X23.8e-7886.74Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEK P S F ISFSIVA LTL SFASC+AAEFNRT KKDLKL+GRFCFLPESEAF+LGV  LVCL+MAQIIGNT I HSYWPKE RKSCSVK PLLS TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG
        LISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGSTIGSAAIGRR SHV GP+Q+HAQIG
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG

A0A6J1F9Y8 uncharacterized protein LOC1114434389.6e-8288.33Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        MEKPP S FVISFSIVAVLTLASFASC+AAEFNRTKKKDLKLNGRFCFLPESEAF+LGV G+VCLIMA IIGNT I H+YWPKEHRKSCSVK PLLS TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQI
        LISWVSFGIAV MM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGSTIGSAAIGRR    KGPNQVHAQI
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQI

A0A6J1II23 uncharacterized protein LOC1114771053.4e-7986.67Show/hide
Query:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL
        ME PP S FVISFSIVAVLTLASFASC+AAEFNRTKKKDLKLNGRFCFLPESEAF+LGV G+VCLIMA IIGNT I H+YWPKEHRKSCSVK PLL+ TL
Subjt:  MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQI
        LISWVSFGIAV M+ GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGSTIGSAAIGRR    KGPNQVHAQI
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQI

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-11.9e-2638.24Show/hide
Query:  LASFASCIAAEFNRTKK----------KDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITLLISWVSFGIA
        LA+F  C++AEF + K           KDLK +G  C+LPE+ AF LG+  LVC+ +AQI+GN  I   +   +  ++    T    I LL SWV+F +A
Subjt:  LASFASCIAAEFNRTKK----------KDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITLLISWVSFGIA

Query:  VVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTS-HVKGPNQVHAQ
        V ++S   SM+R Q YGKGW+  ECY+VKDG+F  +  L++    + +G+ A   + S  V+  ++ H Q
Subjt:  VVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTS-HVKGPNQVHAQ

O65708 Protein MODIFYING WALL LIGNIN-21.1e-2640.13Show/hide
Query:  FSIVAVLTLASFASCIAAEFNRTKKKDLKLN-GRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLL--SITLLISWVSFGI
        +S+V  L L SF +C AAEF RT+K+D++ +  R C++P S AF LG   ++C  +AQI+GN  ++ ++  +  R+     T L   ++ LL+SW +F +
Subjt:  FSIVAVLTLASFASCIAAEFNRTKKKDLKLN-GRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLL--SITLLISWVSFGI

Query:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSA
         V+++S A SMSR Q YG+GW++ +CY+VKDG+F  +  LA++  G+   SA
Subjt:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSA

Arabidopsis top hitse value%identityAlignment
AT1G31720.1 Protein of unknown function (DUF1218)7.9e-2837.08Show/hide
Query:  FSIVAVLTLASFASCIAAEFNRTKK----------KDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITLLI
        F  + +  LA+F  C++AEF + K           KDLK +G  C+LPE+ AF LG+  LVC+ +AQI+GN  I   +   +  ++    T    I LL 
Subjt:  FSIVAVLTLASFASCIAAEFNRTKK----------KDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITLLI

Query:  SWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTS-HVKGPNQVHAQ
        SWV+F +AV ++S   SM+R Q YGKGW+  ECY+VKDG+F  +  L++    + +G+ A   + S  V+  ++ H Q
Subjt:  SWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTS-HVKGPNQVHAQ

AT4G19370.1 Protein of unknown function (DUF1218)7.9e-2840.13Show/hide
Query:  FSIVAVLTLASFASCIAAEFNRTKKKDLKLN-GRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLL--SITLLISWVSFGI
        +S+V  L L SF +C AAEF RT+K+D++ +  R C++P S AF LG   ++C  +AQI+GN  ++ ++  +  R+     T L   ++ LL+SW +F +
Subjt:  FSIVAVLTLASFASCIAAEFNRTKKKDLKLN-GRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLL--SITLLISWVSFGI

Query:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSA
         V+++S A SMSR Q YG+GW++ +CY+VKDG+F  +  LA++  G+   SA
Subjt:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGCCCCCTTTTTCTAGGTTCGTGATAAGCTTTTCCATTGTCGCCGTCCTTACCCTCGCCTCCTTCGCATCCTGTATAGCTGCTGAATTCAACAGAACAAAAAA
AAAGGACCTGAAATTGAACGGCAGATTCTGCTTCCTGCCTGAAAGTGAAGCATTCCAATTGGGAGTCGGAGGTTTGGTGTGTTTAATAATGGCTCAGATCATCGGAAACA
CCACAATCTACCATAGCTATTGGCCAAAAGAGCATAGAAAAAGTTGCAGTGTTAAAACGCCTCTGCTTTCAATAACCCTTCTCATCTCTTGGGTGAGTTTCGGAATTGCG
GTGGTAATGATGAGTGGAGCAACCAGCATGAGCAGGAGACAAGAGTATGGAAAAGGATGGGTGGAAGGAGAATGCTATGTGGTCAAAGACGGAATATTCGTTGGCGCTGC
CTTATTGGCTCTCATTAATGGAGGCTCCACCATAGGCTCCGCCGCCATTGGGAGGAGGACGAGCCACGTTAAAGGGCCCAATCAAGTACATGCTCAAATTGGATAA
mRNA sequenceShow/hide mRNA sequence
GGTAAACTGGTATCTGGGAAAGGGTGTACCCAATAAAATTATTCCAAAATATTTCGACTCTAACCAACTTAAACATAACTTAACTGGTTAAAGATCAGATATTCAAATTC
TCGCCAGCTATCTAAAATGTTTTTTATTATTATCATTATTAAGAAAAATTGTAAATTCAGTCTTCAATCTGTCTGTCTGTCTCTAGGTATTTGACTAGTGATGAAGATGA
AGTCTCAAGTTTTACAAGTCAAGAGAGAAAATGATGAAATTTCCACCACTAACCTCCAAGAGTCGTTCCTTTTTTCCAATACCATGATTTAAACTCCCACTTCTTCCTTC
GATCTTCTTCAGCACGTCATGGAAAAGCCCCCTTTTTCTAGGTTCGTGATAAGCTTTTCCATTGTCGCCGTCCTTACCCTCGCCTCCTTCGCATCCTGTATAGCTGCTGA
ATTCAACAGAACAAAAAAAAAGGACCTGAAATTGAACGGCAGATTCTGCTTCCTGCCTGAAAGTGAAGCATTCCAATTGGGAGTCGGAGGTTTGGTGTGTTTAATAATGG
CTCAGATCATCGGAAACACCACAATCTACCATAGCTATTGGCCAAAAGAGCATAGAAAAAGTTGCAGTGTTAAAACGCCTCTGCTTTCAATAACCCTTCTCATCTCTTGG
GTGAGTTTCGGAATTGCGGTGGTAATGATGAGTGGAGCAACCAGCATGAGCAGGAGACAAGAGTATGGAAAAGGATGGGTGGAAGGAGAATGCTATGTGGTCAAAGACGG
AATATTCGTTGGCGCTGCCTTATTGGCTCTCATTAATGGAGGCTCCACCATAGGCTCCGCCGCCATTGGGAGGAGGACGAGCCACGTTAAAGGGCCCAATCAAGTACATG
CTCAAATTGGATAACAAAAGAATTTACACATCATCAAAATATATGTATTTATATATATATTGTATTTATTTGGATGATGAATAATGAAAATGTCACACATTACTACTTAT
ATACTTCTTGATTTATTT
Protein sequenceShow/hide protein sequence
MEKPPFSRFVISFSIVAVLTLASFASCIAAEFNRTKKKDLKLNGRFCFLPESEAFQLGVGGLVCLIMAQIIGNTTIYHSYWPKEHRKSCSVKTPLLSITLLISWVSFGIA
VVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSTIGSAAIGRRTSHVKGPNQVHAQIG