CuGenDBv2

Tan0003495 (gene) of Snake gourd v1 genome

Gene ID: Tan0003495
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein MODIFYING WALL LIGNIN-2-like
Genome location: LG01:10007725..10008725
RNA-Seq Expression: Tan0003495
Synteny: Tan0003495
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
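
The genome location above uses a `sequence:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the locus span computed. The function name `parse_location` is hypothetical and is not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG01:10007725..10008725")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus spans 1001 positions on LG01.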


Homology
GenBank top hits | e value | % identity | Alignment
KAG6600430.1 Protein MODIFYING WALL LIGNIN-1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.7e-73 | % identity: 84.41
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL CLV+AQVIGN+L CTS    SR KKSNDQ  P+RSNLA
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA

Query:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
         I L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVSTCS V S VAVLRKSLQIDESRKTSTLPK
Subjt:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

KAG7031081.1 hypothetical protein SDJN02_05120, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.7e-70 | % identity: 83.98
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL CLV+AQVIGN+L CTS    SR KKSNDQ  P+RSNLA
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA

Query:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKT
         I L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVSTCS V S VAVLRKSLQIDESRKT
Subjt:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKT

XP_022941598.1 uncharacterized protein LOC111446907 [Cucurbita moschata] | e value: 8.5e-72 | % identity: 83.87
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL CLV+AQVIGN+L CTS    SR KKSNDQ  P+RS LA
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA

Query:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
         I L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVSTCS V S VAVLRKSLQIDESRKTSTLPK
Subjt:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

XP_022982923.1 uncharacterized protein LOC111481621 [Cucurbita maxima] | e value: 2.3e-69 | % identity: 80.11
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL C V+AQVIGN+L CT+    SR KK+NDQ P R ++  
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT

Query:  I--LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
        I  L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVST S V S VAVLRKSLQIDESRKTSTLPK
Subjt:  I--LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

XP_023543619.1 uncharacterized protein LOC111803449 [Cucurbita pepo subsp. pepo] | e value: 6.5e-72 | % identity: 82.26
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL CLV+AQVIGN+L CTS    SR KKSNDQ P R ++  
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT

Query:  I--LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
        I  L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVSTCS V S VAVLRKSLQIDESRKTSTLPK
Subjt:  I--LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BTM2 uncharacterized protein LOC103493027 | e value: 4.0e-67 | % identity: 75.14
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT
        M K CFGYA SLSIVVS AL+A VSC+AAELHRTKTKDL+LDGKLCYLPESQAFGYGVAAL CLVMAQVIGNILLCTS    SREKK ++Q PKR NLAT
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT

Query:  -ILIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
          L++SWASFTVVIL+L  A+SMSRQQ YA GWL GECYLVK GVY+AAAILIL+S C+TVGS V V     +++ESRK++TLPK
Subjt:  -ILIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

A0A5A7T9R1 DUF1218 domain-containing protein | e value: 4.0e-67 | % identity: 75.14
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT
        M K CFGYA SLSIVVS AL+A VSC+AAELHRTKTKDL+LDGKLCYLPESQAFGYGVAAL CLVMAQVIGNILLCTS    SREKK ++Q PKR NLAT
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT

Query:  -ILIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
          L++SWASFTVVIL+L  A+SMSRQQ YA GWL GECYLVK GVY+AAAILIL+S C+TVGS V V     +++ESRK++TLPK
Subjt:  -ILIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

A0A6J1C3S2 uncharacterized protein LOC111008116 | e value: 2.3e-67 | % identity: 76.76
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT
        MEKR  GY  +LSIVVS ALVA VSCIAAELHRTK KDL+LDGK CYLPE++AFGYGVAALVCLVMAQVIGNILLC SFKF SREKK ++QS KR NLA 
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT

Query:  I-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
        I L++SWASFTVVI++LS A+SMSR+Q YAAGWL GECYLVK GV++A+AILILV+  STVGS V +LRKSLQIDES KTS  PK
Subjt:  I-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

A0A6J1FSJ4 uncharacterized protein LOC111446907 | e value: 4.1e-72 | % identity: 83.87
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL CLV+AQVIGN+L CTS    SR KKSNDQ  P+RS LA
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQS-PKRSNLA

Query:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
         I L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVSTCS V S VAVLRKSLQIDESRKTSTLPK
Subjt:  TI-LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

A0A6J1J6B1 uncharacterized protein LOC111481621 | e value: 1.1e-69 | % identity: 80.11
Query:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT
        MEKR F YA SLSIVVSFALVA+VSCIAAELHRTKTKDLRLDGKLCYLPESQAF YGVAAL C V+AQVIGN+L CT+    SR KK+NDQ P R ++  
Subjt:  MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLAT

Query:  I--LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
        I  L+ISWASFTVVIL+LS ATSMSRQQ Y AGWL GECYLVK GVYIAAAILILVST S V S VAVLRKSLQIDESRKTSTLPK
Subjt:  I--LIISWASFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK

SwissProt top hits | e value | % identity | Alignment
A2RVU1 Protein MODIFYING WALL LIGNIN-1 | e value: 5.1e-27 | % identity: 36.78
Query:  FALVALVSCIAAELHRTKT----------KDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATILIISWA
        F L A   C++AE  + K           KDL+ DG+ CYLPE++AFG G+AALVC+ +AQ++GN+++C  F          D++        +L+ SW 
Subjt:  FALVALVSCIAAELHRTKT----------KDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATILIISWA

Query:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTST
        +F V + ++S   SM+R+Q Y  GWL  ECYLVKDGV+ A+  L + +  + +G+    ++ SLQ++   K  T
Subjt:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTST

O65708 Protein MODIFYING WALL LIGNIN-2 | e value: 9.5e-26 | % identity: 41.21
Query:  FSLSIVVSFALVALVSCIAAELHRTKTKDLRLD-GKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATI-LIISWA
        F  S+V S  LV+ ++C AAE  RT+ +D+R D  + CY+P S AFG G AA++C  +AQ++GNI++  +   ++R K+ +        L T+ L++SW+
Subjt:  FSLSIVVSFALVALVSCIAAELHRTKTKDLRLD-GKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATI-LIISWA

Query:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQ
        +F VV+LILS A SMSR Q+Y  GWL  +CYLVKDGV+ A+  L ++   +   S   +  K  Q
Subjt:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQ

Arabidopsis top hits | e value | % identity | Alignment
AT1G31720.1 Protein of unknown function (DUF1218) | e value: 3.6e-28 | % identity: 36.78
Query:  FALVALVSCIAAELHRTKT----------KDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATILIISWA
        F L A   C++AE  + K           KDL+ DG+ CYLPE++AFG G+AALVC+ +AQ++GN+++C  F          D++        +L+ SW 
Subjt:  FALVALVSCIAAELHRTKT----------KDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATILIISWA

Query:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTST
        +F V + ++S   SM+R+Q Y  GWL  ECYLVKDGV+ A+  L + +  + +G+    ++ SLQ++   K  T
Subjt:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTST

AT4G19370.1 Protein of unknown function (DUF1218) | e value: 6.8e-27 | % identity: 41.21
Query:  FSLSIVVSFALVALVSCIAAELHRTKTKDLRLD-GKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATI-LIISWA
        F  S+V S  LV+ ++C AAE  RT+ +D+R D  + CY+P S AFG G AA++C  +AQ++GNI++  +   ++R K+ +        L T+ L++SW+
Subjt:  FSLSIVVSFALVALVSCIAAELHRTKTKDLRLD-GKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATI-LIISWA

Query:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQ
        +F VV+LILS A SMSR Q+Y  GWL  +CYLVKDGV+ A+  L ++   +   S   +  K  Q
Subjt:  SFTVVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQ


Sequences
CDS sequence
ATGGAAAAACGCTGTTTCGGCTATGCCTTTAGCCTCTCGATCGTTGTCTCATTTGCGCTCGTAGCCTTGGTATCGTGTATAGCTGCAGAATTACACAGAACAAAGACCAA
AGACCTCAGGTTGGATGGGAAACTGTGCTATTTGCCAGAAAGTCAAGCATTTGGATATGGAGTTGCAGCTTTGGTGTGTTTGGTTATGGCTCAAGTTATTGGGAATATTT
TACTTTGCACAAGTTTCAAGTTCAAGTCAAGAGAGAAGAAGAGCAATGATCAATCTCCTAAAAGATCAAACTTAGCCACAATTCTTATTATTTCTTGGGCAAGCTTTACC
GTGGTGATATTGATACTGAGCGGGGCGACGAGTATGAGCAGACAGCAGTCGTACGCGGCGGGCTGGTTGCGCGGCGAGTGCTATTTGGTGAAGGACGGCGTATACATCGC
CGCAGCAATTCTGATCCTCGTTTCGACATGCTCGACCGTCGGCTCCGTCGTAGCAGTCTTGAGGAAGAGCCTTCAGATCGATGAATCCAGAAAAACTAGTACACTGCCGA
AATGA
mRNA sequence
ATGGAAAAACGCTGTTTCGGCTATGCCTTTAGCCTCTCGATCGTTGTCTCATTTGCGCTCGTAGCCTTGGTATCGTGTATAGCTGCAGAATTACACAGAACAAAGACCAA
AGACCTCAGGTTGGATGGGAAACTGTGCTATTTGCCAGAAAGTCAAGCATTTGGATATGGAGTTGCAGCTTTGGTGTGTTTGGTTATGGCTCAAGTTATTGGGAATATTT
TACTTTGCACAAGTTTCAAGTTCAAGTCAAGAGAGAAGAAGAGCAATGATCAATCTCCTAAAAGATCAAACTTAGCCACAATTCTTATTATTTCTTGGGCAAGCTTTACC
GTGGTGATATTGATACTGAGCGGGGCGACGAGTATGAGCAGACAGCAGTCGTACGCGGCGGGCTGGTTGCGCGGCGAGTGCTATTTGGTGAAGGACGGCGTATACATCGC
CGCAGCAATTCTGATCCTCGTTTCGACATGCTCGACCGTCGGCTCCGTCGTAGCAGTCTTGAGGAAGAGCCTTCAGATCGATGAATCCAGAAAAACTAGTACACTGCCGA
AATGA
Protein sequence
MEKRCFGYAFSLSIVVSFALVALVSCIAAELHRTKTKDLRLDGKLCYLPESQAFGYGVAALVCLVMAQVIGNILLCTSFKFKSREKKSNDQSPKRSNLATILIISWASFT
VVILILSGATSMSRQQSYAAGWLRGECYLVKDGVYIAAAILILVSTCSTVGSVVAVLRKSLQIDESRKTSTLPK
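
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not a tool provided by the database: the names `CODON_TABLE` and `translate` are hypothetical, and the standard (not an organellar) code is assumed. It translates the first 30 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGGAAAAACGCTGTTTCGGCTATGCCTTT"
print(translate(cds_start))  # MEKRCFGYAF
```

The result matches the first ten residues of the protein sequence. Passing the full CDS (with its line breaks joined) to the same function would allow the whole translation to be compared against the listed protein.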