; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020411 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020411
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein MODIFYING WALL LIGNIN-2-like
Genome locationtig00153490:927382..928409
RNA-Seq ExpressionSgr020411
SyntenySgr020411
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600430.1 Protein MODIFYING WALL LIGNIN-1, partial [Cucurbita argyrosperma subsp. sororia]1.3e-6777.42Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPS-SKRPNLA
        MEKRRF YALSLSIVV  ALVA VSCIAAELHRTKTKDL+LDGKLCYLPES+AF YGVAAL CLV+AQVIGN+L C S   NSR KKSN+    +R NLA
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPS-SKRPNLA

Query:  TILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
         ILLV+SWASFT+VI+LLSAA+SMSR+Q Y  GWL GECYLVK GVY+A+AILILV+TCS V SAV VLRKSLQIDESRKT T PK
Subjt:  TILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

XP_008451867.1 PREDICTED: uncharacterized protein LOC103493027 [Cucumis melo]4.4e-6877.3Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        M K  FGYALSLSIVV LAL+AFVSC+AAELHRTKTKDLKLDGKLCYLPES+AFGYGVAAL CLVMAQVIGNILLC S   NSREKK +    KRPNLAT
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
          LVVSWASFT+VI+LL AASSMSR+QPYATGWL GECYLVK GVYVA+AILIL++ C+TVGSAVTV     +++ESRK+ T PK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

XP_011653242.1 protein MODIFYING WALL LIGNIN-2 [Cucumis sativus]6.1e-7078.92Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        M K RFGYALSLSIVV LAL+AFVSC+AAELHRTKTKDLKLDGKLCYLPES+AFGYGVAAL CLVMAQVIGNILLC S   NSREKK +    KRPNLAT
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
          LVVSWASFT+VI+LLS ASSMSR+QPYATGWL GECYLVK GVYVA+AILIL++ CSTVGSAVTV     +I+ESRK+ T PK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

XP_022136395.1 uncharacterized protein LOC111008116 [Momordica charantia]4.2e-7987.57Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        MEKRR GY L+LSIVV LALVAFVSCIAAELHRTK KDLKLDGK CYLPE+RAFGYGVAALVCLVMAQVIGNILLCRSF+FNSREKK +N SSKRPNLA 
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
        ILLVVSWASFT+VIVLLSAASSMSRRQPYA GWL GECYLVK GV+VASAILILVT  STVGSAVT+LRKSLQIDES KT  QPK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

XP_038895935.1 protein MODIFYING WALL LIGNIN-2-like [Benincasa hispida]5.2e-6977.3Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        MEK    Y LSLSI+V LAL+AFVSC+AAELHRTKT DLKLDGKLCYLPESRAFGYGVAAL CLVMAQVIGNILLC S  FN R+KKS+  S KRPNLAT
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
        I L+VSWASFT+VI+LLSAASSMSRRQPYA GWL GECYLVK GVYVA+A+LIL++TCSTVGSAVTV     QI++SRK+   PK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

TrEMBL top hitse value%identityAlignment
A0A0A0KYL9 Uncharacterized protein3.0e-7078.92Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        M K RFGYALSLSIVV LAL+AFVSC+AAELHRTKTKDLKLDGKLCYLPES+AFGYGVAAL CLVMAQVIGNILLC S   NSREKK +    KRPNLAT
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
          LVVSWASFT+VI+LLS ASSMSR+QPYATGWL GECYLVK GVYVA+AILIL++ CSTVGSAVTV     +I+ESRK+ T PK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

A0A1S3BTM2 uncharacterized protein LOC1034930272.1e-6877.3Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        M K  FGYALSLSIVV LAL+AFVSC+AAELHRTKTKDLKLDGKLCYLPES+AFGYGVAAL CLVMAQVIGNILLC S   NSREKK +    KRPNLAT
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
          LVVSWASFT+VI+LL AASSMSR+QPYATGWL GECYLVK GVYVA+AILIL++ C+TVGSAVTV     +++ESRK+ T PK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

A0A5A7T9R1 DUF1218 domain-containing protein2.1e-6877.3Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        M K  FGYALSLSIVV LAL+AFVSC+AAELHRTKTKDLKLDGKLCYLPES+AFGYGVAAL CLVMAQVIGNILLC S   NSREKK +    KRPNLAT
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
          LVVSWASFT+VI+LL AASSMSR+QPYATGWL GECYLVK GVYVA+AILIL++ C+TVGSAVTV     +++ESRK+ T PK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

A0A6J1C3S2 uncharacterized protein LOC1110081162.0e-7987.57Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT
        MEKRR GY L+LSIVV LALVAFVSCIAAELHRTK KDLKLDGK CYLPE+RAFGYGVAALVCLVMAQVIGNILLCRSF+FNSREKK +N SSKRPNLA 
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
        ILLVVSWASFT+VIVLLSAASSMSRRQPYA GWL GECYLVK GV+VASAILILVT  STVGSAVT+LRKSLQIDES KT  QPK
Subjt:  ILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

A0A6J1FSJ4 uncharacterized protein LOC1114469076.8e-6776.88Show/hide
Query:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPS-SKRPNLA
        MEKRRF YALSLSIVV  ALVA VSCIAAELHRTKTKDL+LDGKLCYLPES+AF YGVAAL CLV+AQVIGN+L C S   NSR KKSN+    +R  LA
Subjt:  MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPS-SKRPNLA

Query:  TILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK
         ILLV+SWASFT+VI+LLSAA+SMSR+Q Y  GWL GECYLVK GVY+A+AILILV+TCS V SAV VLRKSLQIDESRKT T PK
Subjt:  TILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-11.7e-3039.66Show/hide
Query:  VVLLALVAFVSCIAAELHRTKT----------KDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLV
        + L  L AF  C++AE  + K           KDLK DG+ CYLPE+RAFG G+AALVC+ +AQ++GN+++CR F        +    ++      ILL+
Subjt:  VVLLALVAFVSCIAAELHRTKT----------KDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLV

Query:  VSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQ
         SW +F + + L+S  +SM+R Q Y  GWL+ ECYLVK GV+ AS  L + T  + +G+    ++ SLQ++   K  TQ
Subjt:  VSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQ

O65708 Protein MODIFYING WALL LIGNIN-23.5e-2842.59Show/hide
Query:  SIVVLLALVAFVSCIAAELHRTKTKDLKLD-GKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLVVSWASFT
        S+V  L LV+F++C AAE  RT+ +D++ D  + CY+P S AFG G AA++C  +AQ++GNI++ R+ R  +R K+ +        L T+LL++SW++F 
Subjt:  SIVVLLALVAFVSCIAAELHRTKTKDLKLD-GKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLVVSWASFT

Query:  IVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQ
        +V+++LS A SMSR Q Y  GWLD +CYLVK GV+ AS  L ++   +   SA  +  K  Q
Subjt:  IVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQ

Arabidopsis top hitse value%identityAlignment
AT1G31720.1 Protein of unknown function (DUF1218)1.6e-3139.13Show/hide
Query:  LSLSIVVLLALVAFVSCIAAELHRTKT----------KDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLA
        L    + L  L AF  C++AE  + K           KDLK DG+ CYLPE+RAFG G+AALVC+ +AQ++GN+++CR F        +    ++     
Subjt:  LSLSIVVLLALVAFVSCIAAELHRTKT----------KDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLA

Query:  TILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQ
         ILL+ SW +F + + L+S  +SM+R Q Y  GWL+ ECYLVK GV+ AS  L + T  + +G+    ++ SLQ++   K  TQ
Subjt:  TILLVVSWASFTIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQ

AT4G19370.1 Protein of unknown function (DUF1218)2.5e-2942.59Show/hide
Query:  SIVVLLALVAFVSCIAAELHRTKTKDLKLD-GKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLVVSWASFT
        S+V  L LV+F++C AAE  RT+ +D++ D  + CY+P S AFG G AA++C  +AQ++GNI++ R+ R  +R K+ +        L T+LL++SW++F 
Subjt:  SIVVLLALVAFVSCIAAELHRTKTKDLKLD-GKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLVVSWASFT

Query:  IVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQ
        +V+++LS A SMSR Q Y  GWLD +CYLVK GV+ AS  L ++   +   SA  +  K  Q
Subjt:  IVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQ

AT4G21310.1 Protein of unknown function (DUF1218)6.2e-0431.78Show/hide
Query:  RRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNIL---LCRSFRFNSREKKSNNPSSKRPNLAT
        R  G+ + + +++ + + A +  I AE+ + K K LK+    C  P   AF YG+AA + LV+A V  N L   LC + R    EK S N       LA 
Subjt:  RRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNIL---LCRSFRFNSREKKSNNPSSKRPNLAT

Query:  ILLVVSWASFTIVIVLL---SAASSMSRR
          L+ +W    I   +L   + A+S SR+
Subjt:  ILLVVSWASFTIVIVLL---SAASSMSRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAACGCCGTTTCGGCTACGCCTTAAGCCTCTCCATTGTTGTCTTACTCGCGCTTGTTGCCTTTGTCTCATGTATAGCCGCTGAATTACACAGAACAAAGACCAA
AGACCTCAAATTGGATGGGAAGCTGTGTTATTTACCAGAAAGTCGAGCATTTGGGTATGGAGTTGCAGCTTTGGTGTGTTTGGTAATGGCTCAAGTTATTGGGAATATTT
TACTCTGCCGAAGTTTCAGGTTCAATTCAAGAGAGAAGAAGAGCAACAATCCATCTTCTAAAAGACCAAACTTAGCCACAATTCTTCTTGTTGTTTCTTGGGCAAGCTTT
ACCATTGTGATCGTGTTGCTGAGCGCGGCGTCGAGTATGAGTAGACGGCAGCCGTACGCGACGGGCTGGTTAGACGGCGAGTGCTACTTGGTGAAGGGCGGCGTATACGT
CGCCTCAGCAATTCTAATCCTCGTCACGACATGCTCAACCGTCGGCTCCGCCGTAACAGTCCTGAGGAAGAGCCTTCAGATCGACGAATCCAGAAAAACTTGCACACAGC
CAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAACGCCGTTTCGGCTACGCCTTAAGCCTCTCCATTGTTGTCTTACTCGCGCTTGTTGCCTTTGTCTCATGTATAGCCGCTGAATTACACAGAACAAAGACCAA
AGACCTCAAATTGGATGGGAAGCTGTGTTATTTACCAGAAAGTCGAGCATTTGGGTATGGAGTTGCAGCTTTGGTGTGTTTGGTAATGGCTCAAGTTATTGGGAATATTT
TACTCTGCCGAAGTTTCAGGTTCAATTCAAGAGAGAAGAAGAGCAACAATCCATCTTCTAAAAGACCAAACTTAGCCACAATTCTTCTTGTTGTTTCTTGGGCAAGCTTT
ACCATTGTGATCGTGTTGCTGAGCGCGGCGTCGAGTATGAGTAGACGGCAGCCGTACGCGACGGGCTGGTTAGACGGCGAGTGCTACTTGGTGAAGGGCGGCGTATACGT
CGCCTCAGCAATTCTAATCCTCGTCACGACATGCTCAACCGTCGGCTCCGCCGTAACAGTCCTGAGGAAGAGCCTTCAGATCGACGAATCCAGAAAAACTTGCACACAGC
CAAAATGA
Protein sequenceShow/hide protein sequence
MEKRRFGYALSLSIVVLLALVAFVSCIAAELHRTKTKDLKLDGKLCYLPESRAFGYGVAALVCLVMAQVIGNILLCRSFRFNSREKKSNNPSSKRPNLATILLVVSWASF
TIVIVLLSAASSMSRRQPYATGWLDGECYLVKGGVYVASAILILVTTCSTVGSAVTVLRKSLQIDESRKTCTQPK