CuGenDBv2

Tan0007517 (gene) of Snake gourd v1 genome

Gene ID: Tan0007517
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein MODIFYING WALL LIGNIN-2-like
Genome location: LG01:105664990..105668345
RNA-Seq Expression: Tan0007517
Synteny: Tan0007517
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
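The genome location field can be turned into a locus span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention); `parse_location` and `span_length` are hypothetical helpers for illustration, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'SEQ:START..END' string into (seq_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(location):
    """Length of the locus, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1

print(span_length("LG01:105664990..105668345"))  # 3356 under the stated assumption
```

If the coordinates were instead 0-based and half-open, the `+ 1` would need to be dropped.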


Homology
GenBank top hits (e value, % identity, alignment)
XP_022139149.1 uncharacterized protein LOC111010123 isoform X1 [Momordica charantia] | e value: 5.0e-82 | % identity: 91.86
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNR KKKDLKL+GRFCFLPESEAFKLGVA LVCL+MAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGS IGSAAIGRRSHV GPS
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

XP_022139157.1 uncharacterized protein LOC111010123 isoform X2 [Momordica charantia] | e value: 4.6e-80 | % identity: 91.28
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNR  KKDLKL+GRFCFLPESEAFKLGVA LVCL+MAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGS IGSAAIGRRSHV GPS
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

XP_022937014.1 uncharacterized protein LOC111443438 [Cucurbita moschata] | e value: 8.4e-82 | % identity: 90.12
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKPPSSFVISFSIVA+LTLASFASCMAAEFNR KKKDLKLNGRFCFLPESEAFKLGVAG+VCLIMA IIGNTIICH+YWPKE+RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGS IGSAAIGRR    GP+
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

XP_022976861.1 uncharacterized protein LOC111477105 [Cucurbita maxima] | e value: 6.7e-79 | % identity: 88.37
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        ME PPSSFVISFSIVA+LTLASFASCMAAEFNR KKKDLKLNGRFCFLPESEAFKLGVAG+VCLIMA IIGNTIICH+YWPKE+RKSCSVKRPLL TTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV M+ GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGS IGSAAIGRR    GP+
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

XP_023535227.1 uncharacterized protein LOC111796718 [Cucurbita pepo subsp. pepo] | e value: 8.4e-82 | % identity: 90.12
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKPPSSFVISFSIVA+LTLASFASCMAAEFNR KKKDLKLNGRFCFLPESEAFKLGVAG+VCLIMA IIGNTIICH+YWPKE+RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGS IGSAAIGRR    GP+
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CC41 uncharacterized protein LOC111010123 isoform X1 | e value: 2.4e-82 | % identity: 91.86
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNR KKKDLKL+GRFCFLPESEAFKLGVA LVCL+MAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGS IGSAAIGRRSHV GPS
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

A0A6J1CD79 uncharacterized protein LOC111010123 isoform X2 | e value: 2.2e-80 | % identity: 91.28
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F ISFSIVA LTL SFASCMAAEFNR  KKDLKL+GRFCFLPESEAFKLGVA LVCL+MAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALL LINGGS IGSAAIGRRSHV GPS
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

A0A6J1F9Y8 uncharacterized protein LOC111443438 | e value: 4.1e-82 | % identity: 90.12
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKPPSSFVISFSIVA+LTLASFASCMAAEFNR KKKDLKLNGRFCFLPESEAFKLGVAG+VCLIMA IIGNTIICH+YWPKE+RKSCSVKRPLLSTTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV MM GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGS IGSAAIGRR    GP+
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

A0A6J1FQT7 uncharacterized protein LOC111446405 | e value: 4.2e-79 | % identity: 89.6
Query:  MEKPPSS-FVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTL
        MEKPPSS FVI FSIVA LTLASFASCMAAEFNR  KKDLKLNGRFCFLPESEAFKLGVAGL+CLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTL
Subjt:  MEKPPSS-FVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTL

Query:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        LISW SFGIAVVMMSGA SMS RQEYGKGWVEGECYVVKD IFVGAALL LING S I SAAIGR+SH  GP+
Subjt:  LISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

A0A6J1II23 uncharacterized protein LOC111477105 | e value: 3.2e-79 | % identity: 88.37
Query:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        ME PPSSFVISFSIVA+LTLASFASCMAAEFNR KKKDLKLNGRFCFLPESEAFKLGVAG+VCLIMA IIGNTIICH+YWPKE+RKSCSVKRPLL TTLL
Subjt:  MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
        ISWVSFGIAV M+ GATSMSRRQEYGKGWVEGECY+VKDG+FVGAALL LINGGS IGSAAIGRR    GP+
Subjt:  ISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS

SwissProt top hits (e value, % identity, alignment)
A2RVU1 Protein MODIFYING WALL LIGNIN-1 | e value: 7.3e-28 | % identity: 41.4
Query:  VAILTLASFASCMAAEFNRAKK----------KDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLLISWV
        + +  LA+F  C++AEF +AK           KDLK +G  C+LPE+ AF LG+A LVC+ +AQI+GN +IC  +  K ++   ++   +L   LL SWV
Subjt:  VAILTLASFASCMAAEFNRAKK----------KDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLLISWV

Query:  SFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAA
        +F +AV ++S   SM+R Q YGKGW+  ECY+VKDG+F  +  L++    +I+G+ A
Subjt:  SFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAA

O65708 Protein MODIFYING WALL LIGNIN-2 | e value: 1.5e-25 | % identity: 40.13
Query:  FSIVAILTLASFASCMAAEFNRAKKKDLKLN-GRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLISWVSFGI
        +S+V  L L SF +C AAEF R +K+D++ +  R C++P S AF LG A ++C  +AQI+GN ++  ++  +  R+       L   T  LL+SW +F +
Subjt:  FSIVAILTLASFASCMAAEFNRAKKKDLKLN-GRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLISWVSFGI

Query:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSA
         V+++S A SMSR Q YG+GW++ +CY+VKDG+F  +  LA++  G++  SA
Subjt:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSA

Arabidopsis top hits (e value, % identity, alignment)
AT1G31720.1 Protein of unknown function (DUF1218) | e value: 6.1e-30 | % identity: 41.32
Query:  PSSFVISFSIVAILTLASFASCMAAEFNRAKK----------KDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPL
        P SF+  F  + +  LA+F  C++AEF +AK           KDLK +G  C+LPE+ AF LG+A LVC+ +AQI+GN +IC  +  K ++   ++   +
Subjt:  PSSFVISFSIVAILTLASFASCMAAEFNRAKK----------KDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPL

Query:  LSTTLLISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAA
        L   LL SWV+F +AV ++S   SM+R Q YGKGW+  ECY+VKDG+F  +  L++    +I+G+ A
Subjt:  LSTTLLISWVSFGIAVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAA

AT4G19370.1 Protein of unknown function (DUF1218) | e value: 1.1e-26 | % identity: 40.13
Query:  FSIVAILTLASFASCMAAEFNRAKKKDLKLN-GRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLISWVSFGI
        +S+V  L L SF +C AAEF R +K+D++ +  R C++P S AF LG A ++C  +AQI+GN ++  ++  +  R+       L   T  LL+SW +F +
Subjt:  FSIVAILTLASFASCMAAEFNRAKKKDLKLN-GRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLISWVSFGI

Query:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSA
         V+++S A SMSR Q YG+GW++ +CY+VKDG+F  +  LA++  G++  SA
Subjt:  AVVMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSA


Sequences
CDS sequence
ATGGAAAAGCCTCCATCCAGTTTCGTGATAAGCTTTTCCATTGTCGCCATCCTCACCCTCGCCTCCTTCGCATCATGTATGGCTGCTGAATTCAACAGAGCAAAAAAAAA
GGACCTGAAATTGAACGGCAGATTCTGTTTTCTGCCTGAAAGTGAAGCATTCAAATTGGGAGTCGCAGGTTTGGTCTGTTTAATAATGGCTCAGATCATCGGAAACACCA
TAATCTGCCATAGCTATTGGCCCAAGGAGAATAGAAAGAGTTGCAGTGTCAAAAGGCCTCTGCTTTCAACCACCCTTCTCATCTCCTGGGTGAGCTTCGGAATTGCGGTG
GTAATGATGAGTGGAGCAACCAGCATGAGCAGGAGACAAGAGTATGGGAAGGGATGGGTGGAAGGAGAATGCTATGTGGTCAAAGACGGAATATTTGTTGGCGCGGCCTT
ATTGGCTCTCATTAATGGAGGCTCCATCATTGGCTCGGCCGCCATTGGAAGGAGGAGCCACGTCAATGGGCCCAGTTAA
mRNA sequence
CTTCAAGTTAGAAAAACAAATAAATTATTGTCAAATATTCTGAGTCCACCACCTAATTAAGTGCTCACTTAATTGGTTAACGACATTACTTGTCTCTCTAATTAAAGGTT
AGATATTCAAAAAAAACAAAAATCGTTAATTTAAATTATAATATCTGCTTCAATTTTCCCCCTTCCTCTGTATTTGAACAGTGATGAAGATGAAAAACGAGAGATTTGAC
TCAAAGAAAGTCTCAAATTTCACAAGTCAAGAGCGAAAATGATGATTTTTCACCACTAACCTCTAATAGCCTTTCCCTTTCCAATTCCATGATTTAAACAAACTCCCACT
TCTTCTTCGTTCTTCGATCTTCTAGCCGCCATGGAAAAGCCTCCATCCAGTTTCGTGATAAGCTTTTCCATTGTCGCCATCCTCACCCTCGCCTCCTTCGCATCATGTAT
GGCTGCTGAATTCAACAGAGCAAAAAAAAAGGACCTGAAATTGAACGGCAGATTCTGTTTTCTGCCTGAAAGTGAAGCATTCAAATTGGGAGTCGCAGGTTTGGTCTGTT
TAATAATGGCTCAGATCATCGGAAACACCATAATCTGCCATAGCTATTGGCCCAAGGAGAATAGAAAGAGTTGCAGTGTCAAAAGGCCTCTGCTTTCAACCACCCTTCTC
ATCTCCTGGGTGAGCTTCGGAATTGCGGTGGTAATGATGAGTGGAGCAACCAGCATGAGCAGGAGACAAGAGTATGGGAAGGGATGGGTGGAAGGAGAATGCTATGTGGT
CAAAGACGGAATATTTGTTGGCGCGGCCTTATTGGCTCTCATTAATGGAGGCTCCATCATTGGCTCGGCCGCCATTGGAAGGAGGAGCCACGTCAATGGGCCCAGTTAAA
TACATGCACAAATTGGATGACAAGTTATATTATACTATACCATAACATCATCAAAATAATTTCTTCTTCTTTTTTTCCTGCATTTATTCGATGGTCAATAAAGAAAATGC
CCCAAATCCCCACTCTCTTCTAATTTATTTGTATAGTACCCC
Protein sequence
MEKPPSSFVISFSIVAILTLASFASCMAAEFNRAKKKDLKLNGRFCFLPESEAFKLGVAGLVCLIMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLLISWVSFGIAV
VMMSGATSMSRRQEYGKGWVEGECYVVKDGIFVGAALLALINGGSIIGSAAIGRRSHVNGPS
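
The protein sequence should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is an illustrative helper rather than a database function, and only the first and last codons of the CDS listed above are used as input.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate line-wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein

print(translate("ATGGAAAAGCCTCCATCC"))  # first six codons of the CDS: MEKPPS
print(translate("GGGCCCAGTTAA"))        # last four codons, including the stop: GPS
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence would complete the check.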