; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G22880 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G22880
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1218)
Genome locationClcChr01:33840726..33842493
RNA-Seq ExpressionClc01G22880
SyntenyClc01G22880
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647674.1 hypothetical protein Csa_003691 [Cucumis sativus]1.3e-8389.83Show/hide
Query:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA
        K+TYG VVCIL+II+DVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILLALAHAIANLVGGCI VRSA++YKGL+ANKQLAVGSL+FA
Subjt:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA

Query:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        WIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TAG RE+ KPPPQ NPA ATGHV
Subjt:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

XP_004134744.2 uncharacterized protein LOC101214369 [Cucumis sativus]2.6e-8485.71Show/hide
Query:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA
        S EL   + +  K+TYG VVCIL+II+DVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILLALAHAIANLVGGCI VRSA++YKGL+A
Subjt:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA

Query:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        NKQLAVGSL+FAWIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TAG RE+ KPPPQ NPA ATGHV
Subjt:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

XP_008439942.1 PREDICTED: uncharacterized protein LOC103484578 [Cucumis melo]2.0e-8485.19Show/hide
Query:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA
        S EL   + +  K+TYG VVCIL+II+DVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSA++YKGL+A
Subjt:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA

Query:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        NKQLA GSL+FAWIAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TAG RE+ KPPPQ NPA ATGHV
Subjt:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

XP_022977369.1 uncharacterized protein LOC111477717 [Cucurbita maxima]2.7e-8187.01Show/hide
Query:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA
        ++  GFVVCILVI+MDVTAGILGIQAE+AQNKVNH KMWIFEC+DPSYNAFKLGLAAAILL LAHAIANLVGGC++VRSA+EYKGLT+NKQLA GSLVFA
Subjt:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA

Query:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        WIALVVGFSLLISGAMYNTRSRKSCGLAH+ LLSIGGIVCFVHGLFAVAYYVS TAGHRE  +PPPQ +PA ATGHV
Subjt:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

XP_038881391.1 uncharacterized protein LOC120072925 [Benincasa hispida]5.5e-8289.27Show/hide
Query:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA
        ++TYGF+VC+L+IIMDVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILL LAHAIANL GG IFVRSA EYKGLTANKQLAVGSLVFA
Subjt:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA

Query:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        WIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA HREE KPPPQ NPA A G V
Subjt:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

TrEMBL top hitse value%identityAlignment
A0A0A0KLA9 Uncharacterized protein1.3e-8485.71Show/hide
Query:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA
        S EL   + +  K+TYG VVCIL+II+DVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILLALAHAIANLVGGCI VRSA++YKGL+A
Subjt:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA

Query:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        NKQLAVGSL+FAWIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TAG RE+ KPPPQ NPA ATGHV
Subjt:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

A0A1S3AZY5 uncharacterized protein LOC1034845789.8e-8585.19Show/hide
Query:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA
        S EL   + +  K+TYG VVCIL+II+DVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSA++YKGL+A
Subjt:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA

Query:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        NKQLA GSL+FAWIAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TAG RE+ KPPPQ NPA ATGHV
Subjt:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

A0A5D3CMA3 Uncharacterized protein9.8e-8585.19Show/hide
Query:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA
        S EL   + +  K+TYG VVCIL+II+DVTAGILGIQAEIAQNKVNH KMWIFEC+DPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSA++YKGL+A
Subjt:  SKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTA

Query:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        NKQLA GSL+FAWIAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TAG RE+ KPPPQ NPA ATGHV
Subjt:  NKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

A0A6J1GE80 uncharacterized protein LOC1114533742.3e-8186.44Show/hide
Query:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA
        ++  GFVVCILVI+MDVTAGILGIQAE+AQNKVNH KMWIFEC+DPSYNAFKLGLAAAILL LAHAIANLVGGC++VRSA+EYKGLT+NKQLA GSLVFA
Subjt:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA

Query:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        WIA+VVGFSLLISGAMYNTRSRKSCGLAHN LLSIGGI CFVHGLFAVAYYVS TAG+REE +PPPQ +PA ATGHV
Subjt:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

A0A6J1IR67 uncharacterized protein LOC1114777171.3e-8187.01Show/hide
Query:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA
        ++  GFVVCILVI+MDVTAGILGIQAE+AQNKVNH KMWIFEC+DPSYNAFKLGLAAAILL LAHAIANLVGGC++VRSA+EYKGLT+NKQLA GSLVFA
Subjt:  KRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFA

Query:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV
        WIALVVGFSLLISGAMYNTRSRKSCGLAH+ LLSIGGIVCFVHGLFAVAYYVS TAGHRE  +PPPQ +PA ATGHV
Subjt:  WIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPAPATGHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05291.1 Protein of unknown function (DUF1218)1.3e-2035.59Show/hide
Query:  VVCILVII-MDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANK---QLAVGSLVFAWI
        +VCI++ + +D+ AG +G+QA+ AQ  V H K+   EC+ PS  AF LG+ A   LA AH  AN++ GC      +    L  NK      +  L   W+
Subjt:  VVCILVII-MDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANK---QLAVGSLVFAWI

Query:  ALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSN---TAGHREEAKP----PPQVNPAP
          + G  +L +G   NT SR  C   +N + SIGG VCF+H + +  YY+S+    A H    KP    P ++ P P
Subjt:  ALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSN---TAGHREEAKP----PPQVNPAP

AT1G11500.1 Protein of unknown function (DUF1218)7.3e-3243.1Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKV------NHLKMWIFEC-RDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSL
        GF+V +++I  D+TA +LGI+AEIAQ+K        H +     C R PS  AF  G+AA +LL + H +AN++GGC ++RS +++K  TANK LAV  L
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKV------NHLKMWIFEC-RDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSL

Query:  VFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPA
        V +WI  VV +S L+ G + N+R+ + C L H     IGGI C  HG+   AYYVS  A  +E+ +   Q N A
Subjt:  VFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAKPPPQVNPA

AT2G32280.1 Protein of unknown function (DUF1218)1.4e-4651.57Show/hide
Query:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFAWIAL
        G +VC++++ +DV A ILGIQAE+AQN+V H+++W+FECR+PS +AF+LGL AA +L +AH + NLVGGC+ + S +E++  ++ +Q+++  LV  WI  
Subjt:  GFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFAWIAL

Query:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAK
         VGF  ++ G M N++SR SCG  H+  LSIGGI+CF+H LF VAYYVS TA  ++EAK
Subjt:  VVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREEAK

AT4G21310.1 Protein of unknown function (DUF1218)1.1e-5967.5Show/hide
Query:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFAW
        R  GF +CIL++ MDV+AGILGI+AEIAQNKV HLKMWIFECRDPSY AFK GLAA ILL LAH  AN +GGC+ V S ++ +  +ANKQLAV SL+F W
Subjt:  RTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKMWIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFAW

Query:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREE
        I L + FS+LI G M N+RSRK+CG++H+R+LSIGGI+CFVHGLFAVAYY+S TA  RE+
Subjt:  IALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSNTAGHREE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAGGCTGCAGCATATGAGCCCTTCCTCCTTTGCTCGGCTTCATCCTCCACCAAGCAGAATCTCGCACTCATGCTATTCTTCAACTTTAATTCTCATTGGTTCTAT
TTATGATGATCACAACACAAACAGTGGGATAAAAAAGAAAAGAAACCCTGGTCTTTTAACTGTGTTATATTCCAAAGAGCTTTTTGATGGATTAAAAAGCTTGAAGAAGA
GAACTTATGGTTTTGTTGTCTGCATCTTGGTTATTATAATGGATGTTACAGCCGGAATTCTTGGAATTCAAGCTGAAATAGCGCAAAACAAGGTGAACCATTTAAAAATG
TGGATATTTGAGTGTAGAGACCCAAGCTATAATGCTTTCAAGCTAGGCTTGGCTGCAGCCATACTGCTCGCTCTTGCTCACGCCATTGCCAACCTGGTTGGTGGGTGCAT
TTTTGTTCGATCTGCTGAAGAATACAAAGGATTAACAGCTAACAAGCAACTTGCTGTGGGTTCACTCGTCTTTGCATGGATTGCACTAGTGGTTGGATTCTCCTTGCTTA
TTTCCGGGGCAATGTATAACACAAGGTCGAGAAAATCATGCGGGCTTGCTCACAATCGGCTACTATCTATAGGGGGGATTGTGTGCTTCGTGCACGGTCTGTTTGCAGTT
GCATATTATGTTTCAAACACAGCAGGACACAGGGAGGAGGCGAAGCCACCACCACAGGTGAATCCTGCCCCAGCTACAGGCCACGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAGGCTGCAGCATATGAGCCCTTCCTCCTTTGCTCGGCTTCATCCTCCACCAAGCAGAATCTCGCACTCATGCTATTCTTCAACTTTAATTCTCATTGGTTCTAT
TTATGATGATCACAACACAAACAGTGGGATAAAAAAGAAAAGAAACCCTGGTCTTTTAACTGTGTTATATTCCAAAGAGCTTTTTGATGGATTAAAAAGCTTGAAGAAGA
GAACTTATGGTTTTGTTGTCTGCATCTTGGTTATTATAATGGATGTTACAGCCGGAATTCTTGGAATTCAAGCTGAAATAGCGCAAAACAAGGTGAACCATTTAAAAATG
TGGATATTTGAGTGTAGAGACCCAAGCTATAATGCTTTCAAGCTAGGCTTGGCTGCAGCCATACTGCTCGCTCTTGCTCACGCCATTGCCAACCTGGTTGGTGGGTGCAT
TTTTGTTCGATCTGCTGAAGAATACAAAGGATTAACAGCTAACAAGCAACTTGCTGTGGGTTCACTCGTCTTTGCATGGATTGCACTAGTGGTTGGATTCTCCTTGCTTA
TTTCCGGGGCAATGTATAACACAAGGTCGAGAAAATCATGCGGGCTTGCTCACAATCGGCTACTATCTATAGGGGGGATTGTGTGCTTCGTGCACGGTCTGTTTGCAGTT
GCATATTATGTTTCAAACACAGCAGGACACAGGGAGGAGGCGAAGCCACCACCACAGGTGAATCCTGCCCCAGCTACAGGCCACGTCTAGCCCTTTGATCTTCATGTTCT
AAGAAAATGCAATGAAATTCACTTTCTTTTTCAGTTCTTCTGCCGTGAGATTTTGTGCTTAATTGTTATCAACTTCCGAATGTTGCATCAATTCGTAGCTAGTACTACTC
CCAAATTCGGGAAAAAGTTCTCTCCTTACAAATATAAAATCATATTCCTCCT
Protein sequenceShow/hide protein sequence
MARLQHMSPSSFARLHPPPSRISHSCYSSTLILIGSIYDDHNTNSGIKKKRNPGLLTVLYSKELFDGLKSLKKRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHLKM
WIFECRDPSYNAFKLGLAAAILLALAHAIANLVGGCIFVRSAEEYKGLTANKQLAVGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAV
AYYVSNTAGHREEAKPPPQVNPAPATGHV