; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG03G005140 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG03G005140
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF1218)
Genome locationCG_Chr03:5366798..5369124
RNA-Seq ExpressionClCG03G005140
SyntenyClCG03G005140
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573220.1 hypothetical protein SDJN03_27107, partial [Cucurbita argyrosperma subsp. sororia]2.3e-7784.36Show/hide
Query:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF
        MARNYGFLVCILVM +DAVAG+L I+AEKAQN+V L S S+WV EC RKPRDDAFSQGLAATILLGLAH IAKVLGGCI IRN QHFQ+S+AN+RLGLLF
Subjt:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF

Query:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
        MILSWITLAIG S+L+AGTVDNSK KNSCEISSHGLFL GGIVCFIHGLSTVAYYVSATAAYREE+RK K  PS PQHV
Subjt:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

XP_022139993.1 uncharacterized protein LOC111010766 [Momordica charantia]6.1e-8379.33Show/hide
Query:  FPLFHLPCLQFRALNPFPAITNLGESSESATEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLA
        +P+F  P       N F +I NLGESSESA EMA+NYGFLVCILVM +DAVAGILGI+AEKAQNRVVL SVS+WV EC RKPRDDAFSQGLA TILLGLA
Subjt:  FPLFHLPCLQFRALNPFPAITNLGESSESATEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLA

Query:  HVIAKVLGGCICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRK
        H IAKVLG CICIR+KQHFQESSANKRLGL FMILSWITLAIG S+L+AGTVDNS WKNSCEISS GLFL GGIVCF HGL TVAYYVSATAA REEQRK
Subjt:  HVIAKVLGGCICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRK

Query:  PKANPSEP
           N S P
Subjt:  PKANPSEP

XP_022994470.1 uncharacterized protein LOC111490180 [Cucurbita maxima]2.5e-7682.32Show/hide
Query:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL
        T MARNYGFLVCILVM +DAVAG+L I+AEKAQN+V L S S+W  EC RKPRDDAFSQGLAATILLGLAH IAKVLGGCI IRN QHFQ+S+AN+RLGL
Subjt:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL

Query:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
        LFMILSWITLAIG S+L+AGTVDNSK KNSC+ISSHGLFL GGIVCFIHGL TVAYYVSATAAYREE+RK K  PS PQHV
Subjt:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

XP_023542772.1 uncharacterized protein LOC111802580 [Cucurbita pepo subsp. pepo]8.6e-7782.87Show/hide
Query:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL
        T MA+NYGFLVCILVM +DAVAG+L I+AEKAQN+V L S S+WV EC RKPRDDAFSQGLAATILLGLAH IAKVLGGCI IRN QHFQ+S+AN+RLGL
Subjt:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL

Query:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
        LFMILSWITLAIG S+L+AGTVDNSK KNSCEISSHGLFL GGIVCFIHGL TVAYYVSATAAYREE+RK K  PS PQHV
Subjt:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

XP_038894860.1 uncharacterized protein LOC120083260 [Benincasa hispida]6.8e-9095.53Show/hide
Query:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF
        MARNYGFLVCILVM IDAVAGILGIQAEKAQNRVVL SVSIWVR C RKPRDDAFSQGLAATILLG+AHVIAKVLGGCICIRNKQHFQES+ANKRLGLLF
Subjt:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF

Query:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
        MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGL TVAYYVSATAAYREEQRKPKA PSEPQHV
Subjt:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ0 Uncharacterized protein2.8e-8979.64Show/hide
Query:  ILCKPGFPLFHLPCLQFRALNPFPAITNLGESSESAT--EMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLA
        + CKPG PLFHLPCLQF  L+PFP ITNL ESSESA   EM RNYGFLVCILV+ IDAVAG+LGI+AEKAQNRVVL S+SI + EC RKPRDDAFS+GLA
Subjt:  ILCKPGFPLFHLPCLQFRALNPFPAITNLGESSESAT--EMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLA

Query:  ATILLGLAHVIAKVLGG--CICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSA
        A+ILLGLAHVIAKVLGG  CICIRNKQ+ QE SAN+ LG LFMILSWITLAIG S+L+A T+DNSKWKNSCEISSHGLFLGGGIVCF HGL TVAYYVSA
Subjt:  ATILLGLAHVIAKVLGG--CICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSA

Query:  TAAYREEQRKPKANPSEPQHV
        TAAYREEQR  K  P EPQ V
Subjt:  TAAYREEQRKPKANPSEPQHV

A0A1S3B4I4 uncharacterized protein LOC1034857083.8e-7079.56Show/hide
Query:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCIC--IRNKQHFQESSANKRLGL
        M RNYGFLVCILVM ID VAG+LGI+AEKAQNRVVL S+SI V EC RKPRDDAFS+GLAA ILLGLAHVIA VLGGC C  I NKQ+ Q+ SAN+ LGL
Subjt:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCIC--IRNKQHFQESSANKRLGL

Query:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
         FMILSWITL IG SLL+A T+DNSKWKNSCEISSHGLFLGGGIVCF+HGL TVAYYVSATAAYREEQR  K +P EPQ V
Subjt:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

A0A6J1CGZ6 uncharacterized protein LOC1110107663.0e-8379.33Show/hide
Query:  FPLFHLPCLQFRALNPFPAITNLGESSESATEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLA
        +P+F  P       N F +I NLGESSESA EMA+NYGFLVCILVM +DAVAGILGI+AEKAQNRVVL SVS+WV EC RKPRDDAFSQGLA TILLGLA
Subjt:  FPLFHLPCLQFRALNPFPAITNLGESSESATEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLA

Query:  HVIAKVLGGCICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRK
        H IAKVLG CICIR+KQHFQESSANKRLGL FMILSWITLAIG S+L+AGTVDNS WKNSCEISS GLFL GGIVCF HGL TVAYYVSATAA REEQRK
Subjt:  HVIAKVLGGCICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRK

Query:  PKANPSEP
           N S P
Subjt:  PKANPSEP

A0A6J1GRM8 uncharacterized protein LOC1114568413.9e-7582.58Show/hide
Query:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL
        T MARNYGFLVCILVM +DAVAG+L I+AEKAQN+V L S S+WV EC RKPRDDAFSQGLAATILLGLAH IAKVLGGCI IRN QHFQ+S+AN+RLGL
Subjt:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL

Query:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEP
        LFMILSWITLAIG S+L+AGTVDNSK KNSC+ISSHGLFL GGIVCFIHGL TVAYYVSATAAYREE+RK K  PS P
Subjt:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEP

A0A6J1K198 uncharacterized protein LOC1114901801.2e-7682.32Show/hide
Query:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL
        T MARNYGFLVCILVM +DAVAG+L I+AEKAQN+V L S S+W  EC RKPRDDAFSQGLAATILLGLAH IAKVLGGCI IRN QHFQ+S+AN+RLGL
Subjt:  TEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGL

Query:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
        LFMILSWITLAIG S+L+AGTVDNSK KNSC+ISSHGLFL GGIVCFIHGL TVAYYVSATAAYREE+RK K  PS PQHV
Subjt:  LFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)3.2e-1330.9Show/hide
Query:  LVCI-LVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANK-----RLGLLFM
        +VCI L + +D VAG +G+QA+ AQ  V  + +     EC + P   AF  G+ A   L  AHV A V+ GC      Q       NK      +  LF+
Subjt:  LVCI-LVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANK-----RLGLLFM

Query:  ILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV
        I  W+    G  +L  G   N++ +  C  +++ +F  GG VCF+H + +  YY+S+  A      + K N ++P  +
Subjt:  ILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQRKPKANPSEPQHV

AT1G11500.1 Protein of unknown function (DUF1218)1.6e-3343.53Show/hide
Query:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRE----CRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRL
        M    GFLV ++++  D  A +LGI+AE AQ++   +      R     CRR P D AF++G+AA +LL + HV+A VLGGC  IR+KQ F+ ++ANK L
Subjt:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRE----CRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRL

Query:  GLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQ
         + F++LSWI   +  S L+ GT+ NS+    C +     FL GGI C  HG+ T AYYVSA AA +E++
Subjt:  GLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREEQ

AT2G32280.1 Protein of unknown function (DUF1218)9.3e-3744.51Show/hide
Query:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF
        M +  G LVC++++ +D  A ILGIQAE AQN+V    + +W+ EC R+P  DAF  GL A  +L +AHV+  ++GGC+CI ++  FQ SS+ +++ +  
Subjt:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF

Query:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYRE
        ++L+WI  A+G   ++ GT+ NSK ++SC  + H     GGI+CF+H L  VAYYVSATAA  E
Subjt:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYRE

AT4G21310.1 Protein of unknown function (DUF1218)2.1e-4453.94Show/hide
Query:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF
        MARN GF +CIL++A+D  AGILGI+AE AQN+V    + +W+ EC R P   AF  GLAA ILL LAHV A  LGGC+C+ ++Q  ++SSANK+L +  
Subjt:  MARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDDAFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLF

Query:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREE
        +I +WI LAI  S+LI GT+ NS+ + +C IS H +   GGI+CF+HGL  VAYY+SATA+ RE+
Subjt:  MILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGCCTCTCATCTCTGTAACTCCACATCGCCGCGGCTTCTCGGGTCGATCTTCATCCTCTGCAAGCCTGGGTTTCCTTTATTCCATCTTCCCTGTCTCCAATTCAG
AGCTTTAAACCCTTTTCCCGCAATTACAAATCTCGGGGAAAGCTCTGAATCCGCTACAGAAATGGCGCGAAACTATGGCTTTCTAGTCTGCATTTTGGTCATGGCAATCG
ACGCTGTTGCCGGAATACTTGGCATTCAAGCTGAAAAGGCTCAGAATCGGGTGGTACTAAATTCGGTGAGTATATGGGTGCGTGAATGCCGGAGAAAGCCGAGAGACGAT
GCTTTTAGTCAGGGGCTGGCTGCAACCATTCTCCTTGGCCTTGCTCATGTCATTGCTAAAGTACTTGGTGGATGCATTTGCATTAGGAATAAGCAACATTTCCAAGAATC
ATCTGCAAACAAGCGATTGGGATTGCTCTTCATGATTCTCTCATGGATTACTTTGGCTATTGGGTTATCATTGTTGATAGCTGGGACGGTGGACAATTCCAAGTGGAAAA
ACTCGTGTGAGATATCAAGTCATGGCCTGTTTTTGGGTGGTGGGATAGTGTGTTTCATTCATGGGCTCTCTACTGTCGCTTATTACGTTTCTGCAACAGCAGCTTATAGA
GAAGAACAGAGGAAGCCCAAAGCAAATCCTTCTGAACCACAACATGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGCCTCTCATCTCTGTAACTCCACATCGCCGCGGCTTCTCGGGTCGATCTTCATCCTCTGCAAGCCTGGGTTTCCTTTATTCCATCTTCCCTGTCTCCAATTCAG
AGCTTTAAACCCTTTTCCCGCAATTACAAATCTCGGGGAAAGCTCTGAATCCGCTACAGAAATGGCGCGAAACTATGGCTTTCTAGTCTGCATTTTGGTCATGGCAATCG
ACGCTGTTGCCGGAATACTTGGCATTCAAGCTGAAAAGGCTCAGAATCGGGTGGTACTAAATTCGGTGAGTATATGGGTGCGTGAATGCCGGAGAAAGCCGAGAGACGAT
GCTTTTAGTCAGGGGCTGGCTGCAACCATTCTCCTTGGCCTTGCTCATGTCATTGCTAAAGTACTTGGTGGATGCATTTGCATTAGGAATAAGCAACATTTCCAAGAATC
ATCTGCAAACAAGCGATTGGGATTGCTCTTCATGATTCTCTCATGGATTACTTTGGCTATTGGGTTATCATTGTTGATAGCTGGGACGGTGGACAATTCCAAGTGGAAAA
ACTCGTGTGAGATATCAAGTCATGGCCTGTTTTTGGGTGGTGGGATAGTGTGTTTCATTCATGGGCTCTCTACTGTCGCTTATTACGTTTCTGCAACAGCAGCTTATAGA
GAAGAACAGAGGAAGCCCAAAGCAAATCCTTCTGAACCACAACATGTATAA
Protein sequenceShow/hide protein sequence
MRASHLCNSTSPRLLGSIFILCKPGFPLFHLPCLQFRALNPFPAITNLGESSESATEMARNYGFLVCILVMAIDAVAGILGIQAEKAQNRVVLNSVSIWVRECRRKPRDD
AFSQGLAATILLGLAHVIAKVLGGCICIRNKQHFQESSANKRLGLLFMILSWITLAIGLSLLIAGTVDNSKWKNSCEISSHGLFLGGGIVCFIHGLSTVAYYVSATAAYR
EEQRKPKANPSEPQHV