; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028038 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028038
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein MODIFYING WALL LIGNIN-2-like
Genome locationtig00153056:2826124..2828061
RNA-Seq ExpressionSgr028038
SyntenySgr028038
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139149.1 uncharacterized protein LOC111010123 isoform X1 [Momordica charantia]6.3e-8893.3Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKHPSGF ISFSIVAFLTLVSFASCMAAEFNRTKKKDLKL+GR CFLP+SEAFKLGVA LVCLVMAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG
        +SWVSFGIAVAMMSGATS+SRRQEYGKGW+EGECYVVKDGIFVGAALLVLING STIGSAAIGRRS V GPSQIHAQIG
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG

XP_022139157.1 uncharacterized protein LOC111010123 isoform X2 [Momordica charantia]5.9e-8692.74Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKHPSGF ISFSIVAFLTLVSFASCMAAEFNRT KKDLKL+GR CFLP+SEAFKLGVA LVCLVMAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG
        +SWVSFGIAVAMMSGATS+SRRQEYGKGW+EGECYVVKDGIFVGAALLVLING STIGSAAIGRRS V GPSQIHAQIG
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG

XP_022937014.1 uncharacterized protein LOC111443438 [Cucurbita moschata]1.4e-8286.52Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F+ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGR CFLP+SEAFKLGVAG+VCL+MA IIGNTIICH+YWPKE+RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI
        +SWVSFGIAVAMM GATS+SRRQEYGKGW+EGECY+VKDG+FVGAALLVLING STIGSAAIGRR R  GP+Q+HAQI
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI

XP_022976861.1 uncharacterized protein LOC111477105 [Cucurbita maxima]1.1e-7984.83Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        ME  PS F+ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGR CFLP+SEAFKLGVAG+VCL+MA IIGNTIICH+YWPKE+RKSCSVKRPLL TTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI
        +SWVSFGIAVAM+ GATS+SRRQEYGKGW+EGECY+VKDG+FVGAALLVLING STIGSAAIGRR R  GP+Q+HAQI
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI

XP_023535227.1 uncharacterized protein LOC111796718 [Cucurbita pepo subsp. pepo]3.6e-8386.59Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F+ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGR CFLP+SEAFKLGVAG+VCL+MA IIGNTIICH+YWPKE+RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG
        +SWVSFGIAVAMM GATS+SRRQEYGKGW+EGECY+VKDG+FVGAALLVLING STIGSAAIGRR R  GP+Q+HAQIG
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG

TrEMBL top hitse value%identityAlignment
A0A6J1CC41 uncharacterized protein LOC111010123 isoform X13.0e-8893.3Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKHPSGF ISFSIVAFLTLVSFASCMAAEFNRTKKKDLKL+GR CFLP+SEAFKLGVA LVCLVMAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG
        +SWVSFGIAVAMMSGATS+SRRQEYGKGW+EGECYVVKDGIFVGAALLVLING STIGSAAIGRRS V GPSQIHAQIG
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG

A0A6J1CD79 uncharacterized protein LOC111010123 isoform X22.8e-8692.74Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEKHPSGF ISFSIVAFLTLVSFASCMAAEFNRT KKDLKL+GR CFLP+SEAFKLGVA LVCLVMAQIIGNTIICHSYWPKE RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG
        +SWVSFGIAVAMMSGATS+SRRQEYGKGW+EGECYVVKDGIFVGAALLVLING STIGSAAIGRRS V GPSQIHAQIG
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG

A0A6J1F9Y8 uncharacterized protein LOC1114434386.6e-8386.52Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        MEK PS F+ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGR CFLP+SEAFKLGVAG+VCL+MA IIGNTIICH+YWPKE+RKSCSVKRPLLSTTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI
        +SWVSFGIAVAMM GATS+SRRQEYGKGW+EGECY+VKDG+FVGAALLVLING STIGSAAIGRR R  GP+Q+HAQI
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI

A0A6J1FQT7 uncharacterized protein LOC1114464058.3e-7883.8Show/hide
Query:  MEKHPSG-FLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTL
        MEK PS  F+I FSIVA LTL SFASCMAAEFNRT KKDLKLNGR CFLP+SEAFKLGVAGL+CL+MAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTL
Subjt:  MEKHPSG-FLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTL

Query:  LLSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI
        L+SW SFGIAV MMSGA S+S RQEYGKGW+EGECYVVKD IFVGAALLVLING STI SAAIGR+S   GP+QI++QI
Subjt:  LLSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI

A0A6J1II23 uncharacterized protein LOC1114771055.2e-8084.83Show/hide
Query:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL
        ME  PS F+ISFSIVA LTL SFASCMAAEFNRTKKKDLKLNGR CFLP+SEAFKLGVAG+VCL+MA IIGNTIICH+YWPKE+RKSCSVKRPLL TTLL
Subjt:  MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLL

Query:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI
        +SWVSFGIAVAM+ GATS+SRRQEYGKGW+EGECY+VKDG+FVGAALLVLING STIGSAAIGRR R  GP+Q+HAQI
Subjt:  LSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQI

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-11.6e-2539.47Show/hide
Query:  LVSFASCMAAEFNRTKK----------KDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLLLSWVSFGIA
        L +F  C++AEF + K           KDLK +G  C+LP++ AF LG+A LVC+ +AQI+GN +IC  +  K ++   ++   +L   LL SWV+F +A
Subjt:  LVSFASCMAAEFNRTKK----------KDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLLLSWVSFGIA

Query:  VAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAA
        V ++S   S++R Q YGKGW+  ECY+VKDG+F  +  L +    + +G+ A
Subjt:  VAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAA

O65708 Protein MODIFYING WALL LIGNIN-21.2e-2540.51Show/hide
Query:  LISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLLSWVS
        L  +S+V  L LVSF +C AAEF RT+K+D++ +  R C++P S AF LG A ++C  +AQI+GN ++  ++  +  R+       L   T  LLLSW +
Subjt:  LISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLLSWVS

Query:  FGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLIN-GVSTIGSAAI
        F + V ++S A S+SR Q YG+GW++ +CY+VKDG+F  +  L ++  G  TI +  I
Subjt:  FGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLIN-GVSTIGSAAI

Arabidopsis top hitse value%identityAlignment
AT1G31720.1 Protein of unknown function (DUF1218)2.3e-2738.32Show/hide
Query:  PSGFLISFSIVAFLTLVSFASCMAAEFNRTKK----------KDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPL
        P  FL  F  +    L +F  C++AEF + K           KDLK +G  C+LP++ AF LG+A LVC+ +AQI+GN +IC  +  K ++   ++   +
Subjt:  PSGFLISFSIVAFLTLVSFASCMAAEFNRTKK----------KDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPL

Query:  LSTTLLLSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAA
        L   LL SWV+F +AV ++S   S++R Q YGKGW+  ECY+VKDG+F  +  L +    + +G+ A
Subjt:  LSTTLLLSWVSFGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAA

AT4G19370.1 Protein of unknown function (DUF1218)8.6e-2740.51Show/hide
Query:  LISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLLSWVS
        L  +S+V  L LVSF +C AAEF RT+K+D++ +  R C++P S AF LG A ++C  +AQI+GN ++  ++  +  R+       L   T  LLLSW +
Subjt:  LISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLN-GRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTT--LLLSWVS

Query:  FGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLIN-GVSTIGSAAI
        F + V ++S A S+SR Q YG+GW++ +CY+VKDG+F  +  L ++  G  TI +  I
Subjt:  FGIAVAMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLIN-GVSTIGSAAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGCATCCATCCGGCTTCCTAATAAGCTTTTCCATTGTCGCCTTCCTCACCCTCGTCTCCTTCGCATCATGTATGGCTGCTGAATTCAACAGAACAAAAAAAAA
GGACCTGAAGTTGAACGGCAGACTCTGTTTCCTGCCTCAAAGTGAAGCGTTCAAATTGGGAGTCGCAGGTTTGGTCTGTTTGGTAATGGCTCAGATCATTGGAAACACCA
TAATCTGCCATAGCTATTGGCCAAAAGAGAATAGGAAGAGTTGCAGTGTCAAAAGGCCTCTGCTTTCAACCACCCTTCTCCTCTCTTGGGTGAGCTTTGGAATTGCGGTG
GCAATGATGAGTGGAGCAACCAGCATAAGCAGGAGACAGGAGTATGGGAAGGGGTGGATGGAGGGGGAATGCTATGTGGTCAAAGATGGAATATTCGTTGGGGCAGCCCT
ATTGGTTCTCATCAATGGAGTGTCCACCATAGGCTCGGCCGCCATTGGAAGGAGGAGCCGCGTCAATGGGCCCAGTCAAATACATGCACAAATTGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGCATCCATCCGGCTTCCTAATAAGCTTTTCCATTGTCGCCTTCCTCACCCTCGTCTCCTTCGCATCATGTATGGCTGCTGAATTCAACAGAACAAAAAAAAA
GGACCTGAAGTTGAACGGCAGACTCTGTTTCCTGCCTCAAAGTGAAGCGTTCAAATTGGGAGTCGCAGGTTTGGTCTGTTTGGTAATGGCTCAGATCATTGGAAACACCA
TAATCTGCCATAGCTATTGGCCAAAAGAGAATAGGAAGAGTTGCAGTGTCAAAAGGCCTCTGCTTTCAACCACCCTTCTCCTCTCTTGGGTGAGCTTTGGAATTGCGGTG
GCAATGATGAGTGGAGCAACCAGCATAAGCAGGAGACAGGAGTATGGGAAGGGGTGGATGGAGGGGGAATGCTATGTGGTCAAAGATGGAATATTCGTTGGGGCAGCCCT
ATTGGTTCTCATCAATGGAGTGTCCACCATAGGCTCGGCCGCCATTGGAAGGAGGAGCCGCGTCAATGGGCCCAGTCAAATACATGCACAAATTGGATAA
Protein sequenceShow/hide protein sequence
MEKHPSGFLISFSIVAFLTLVSFASCMAAEFNRTKKKDLKLNGRLCFLPQSEAFKLGVAGLVCLVMAQIIGNTIICHSYWPKENRKSCSVKRPLLSTTLLLSWVSFGIAV
AMMSGATSISRRQEYGKGWMEGECYVVKDGIFVGAALLVLINGVSTIGSAAIGRRSRVNGPSQIHAQIG