; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G12140 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G12140
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1118)
Genome locationClcChr04:25687173..25688570
RNA-Seq ExpressionClc04G12140
SyntenyClc04G12140
Gene Ontology termsGO:0010027 - thylakoid membrane organization (biological process)
GO:0010196 - nonphotochemical quenching (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0090391 - granum assembly (biological process)
GO:0009515 - granal stacked thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009500 - Protein of unknown function DUF1118


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137438.1 uncharacterized protein LOC101204037 [Cucumis sativus]4.3e-8291.88Show/hide
Query:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
        MEVTSFCSSS RAASF+YSYPS TTS  K+ LH++SMA EK  PSAAKTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
Subjt:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL

Query:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        SAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQVAVALVS+VGGSAAFAASNLVSNLQRSN
Subjt:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

XP_008451074.1 PREDICTED: uncharacterized protein LOC103492437 [Cucumis melo]2.3e-8392.39Show/hide
Query:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
        MEVTSFCSSS RAASF+YSYPS TTS  K+ LH++SMAAEK PPSA+KTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
Subjt:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL

Query:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        SAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQVAVALVS+VGGSAAFAASNLVSNLQRSN
Subjt:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

XP_022137623.1 uncharacterized protein LOC111009021 [Momordica charantia]4.4e-7989.55Show/hide
Query:  MEVTSFCSSSRAASFVYSYPSKT------TSTKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK
        MEVTSFCSSSR  SFVYSYP KT       ST   L V SMAAEK  PSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSP IKLLTRVEQLKLLSKAEK
Subjt:  MEVTSFCSSSRAASFVYSYPSKT------TSTKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK

Query:  AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRS
        AGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQ AVALVSVVGGSAAFAASNLVSNLQRS
Subjt:  AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRS

Query:  N
        N
Subjt:  N

XP_023520287.1 uncharacterized protein LOC111783600 [Cucurbita pepo subsp. pepo]2.1e-8190.82Show/hide
Query:  MEVTSFCSSSRAASFVYSYPSKTTSTKQSLHVYSMAAEKSPPSAAKTVGS-KKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLS
        MEVTSFC SSRAASFVYSYPSKT S KQSL + SMA +K PPSAAKTV S KK+NSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLS
Subjt:  MEVTSFCSSSRAASFVYSYPSKTTSTKQSLHVYSMAAEKSPPSAAKTVGS-KKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLS

Query:  AAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        AAEKAGLSLSSIEKLG LSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPE++VWEI LQ AVALVS+VGGSAAFAASNLVSNLQRSN
Subjt:  AAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

XP_038893804.1 uncharacterized protein LOC120082623 [Benincasa hispida]1.8e-8593.33Show/hide
Query:  MEVTSFCSSSRAASFVYSYPSKTTSTKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSA
        MEV+SFCSSSRAASFV+SYPSKT S K+SLH++SMAAEK PPSAAKTVGSKKINSTVFPLGEKGPR+SISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSA
Subjt:  MEVTSFCSSSRAASFVYSYPSKTTSTKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSA

Query:  AEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        AEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VW+I LQV+VALVSVVGGSAAFAASNLVSNLQRSN
Subjt:  AEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

TrEMBL top hitse value%identityAlignment
A0A0A0LQ98 Uncharacterized protein2.1e-8291.88Show/hide
Query:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
        MEVTSFCSSS RAASF+YSYPS TTS  K+ LH++SMA EK  PSAAKTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
Subjt:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL

Query:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        SAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQVAVALVS+VGGSAAFAASNLVSNLQRSN
Subjt:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

A0A1S3BQ40 uncharacterized protein LOC1034924371.1e-8392.39Show/hide
Query:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
        MEVTSFCSSS RAASF+YSYPS TTS  K+ LH++SMAAEK PPSA+KTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
Subjt:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL

Query:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        SAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQVAVALVS+VGGSAAFAASNLVSNLQRSN
Subjt:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

A0A5D3BGQ3 Uncharacterized protein1.1e-8392.39Show/hide
Query:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
        MEVTSFCSSS RAASF+YSYPS TTS  K+ LH++SMAAEK PPSA+KTVGSKKIN+TVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL
Subjt:  MEVTSFCSSS-RAASFVYSYPSKTTS-TKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLL

Query:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        SAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQVAVALVS+VGGSAAFAASNLVSNLQRSN
Subjt:  SAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN

A0A6J1CAU4 uncharacterized protein LOC1110090212.1e-7989.55Show/hide
Query:  MEVTSFCSSSRAASFVYSYPSKT------TSTKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK
        MEVTSFCSSSR  SFVYSYP KT       ST   L V SMAAEK  PSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSP IKLLTRVEQLKLLSKAEK
Subjt:  MEVTSFCSSSRAASFVYSYPSKT------TSTKQSLHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK

Query:  AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRS
        AGLLSAAEKAGLSLSSIEKLGLLSKAEELG+LSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED+VWEIVLQ AVALVSVVGGSAAFAASNLVSNLQRS
Subjt:  AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRS

Query:  N
        N
Subjt:  N

A0A6J1K230 uncharacterized protein LOC1114903506.2e-7987.06Show/hide
Query:  MEVTSFCSSSRAASFVYSYPSKTTSTKQS------LHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK
        MEV S  SS+RAASF+YSYP KT S  Q+      LHV++MAAEK P SA KT+GSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK
Subjt:  MEVTSFCSSSRAASFVYSYPSKTTSTKQS------LHVYSMAAEKSPPSAAKTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEK

Query:  AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRS
        AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPED VWEIVLQ A ALVSV+GGSAAFAASNLVSNLQRS
Subjt:  AGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRS

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G74730.1 Protein of unknown function (DUF1118)7.9e-1038.02Show/hide
Query:  IKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATD-PGT-PGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVS
        + +  ++E+ K+LS  EK+GLLS AE  GL+LSS+EKL + SKAE+LG+LS   +  GT P  L S +L  L      V L+P+D+   +V Q  +A   
Subjt:  IKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATD-PGT-PGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVS

Query:  VVGGSAAFAASNLVSNLQRSN
         + G      S ++  LQ ++
Subjt:  VVGGSAAFAASNLVSNLQRSN

AT5G08050.1 Protein of unknown function (DUF1118)2.4e-4363.1Show/hide
Query:  SLHVYSMAAEKSPPSAA-KTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGILS
        SL V+SMA +K  PSAA +T+ SKK  +T                  P +KLLTRVEQLKLL+KAEKAGLLS AEK+G SLS+IE+LGLL+KAEE G+LS
Subjt:  SLHVYSMAAEKSPPSAA-KTVGSKKINSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGILS

Query:  AATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN
        AAT+P TPG L +LSLGLLLLGP   Y+VPED  WE+V+QV VAL+SV+GGSAAFAAS  VSNLQ+S+
Subjt:  AATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWEIVLQVAVALVSVVGGSAAFAASNLVSNLQRSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAACACGTGGAAAATCCAATATCTCAATTCTGAAAACTAATCCAACTACCTACCTTTTCCATCATTCATTTCCATGGTGCGGGCCTCAGCAATTCTGTCATTCAGA
TTCTGCCCTAACCATCGCCATTCTTCACTCTTCACACACACTCTGCTCGAGATACACGGATATGGAGGTCACTTCTTTCTGTAGCAGCAGCAGAGCAGCTTCGTTCGTCT
ACTCCTATCCATCCAAAACGACTTCAACAAAACAATCTCTTCATGTATACTCCATGGCCGCCGAGAAATCTCCCCCGTCTGCCGCTAAAACCGTCGGCTCCAAGAAGATA
AACTCCACGGTGTTCCCTCTCGGCGAAAAAGGACCGAGGAGCAGCATCTCGCTTTCGACCTCGCCGCCGATTAAGCTACTGACGAGAGTGGAGCAATTGAAGCTACTGAG
CAAGGCGGAGAAGGCCGGTCTGCTGTCTGCCGCGGAGAAAGCCGGGTTGTCTCTGTCGTCGATTGAGAAATTAGGGCTTCTGTCGAAGGCTGAGGAATTAGGGATTTTAT
CGGCGGCGACAGATCCAGGAACTCCCGGAGCGCTACTGAGCCTGAGCTTAGGGCTGTTGCTATTGGGGCCTTCGTGTGTGTATTTGGTACCGGAGGACAATGTATGGGAG
ATCGTATTGCAGGTGGCGGTGGCTCTGGTCTCCGTCGTGGGCGGCTCTGCGGCGTTTGCTGCGTCGAATTTGGTGTCGAATTTGCAGAGATCGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAACACGTGGAAAATCCAATATCTCAATTCTGAAAACTAATCCAACTACCTACCTTTTCCATCATTCATTTCCATGGTGCGGGCCTCAGCAATTCTGTCATTCAGA
TTCTGCCCTAACCATCGCCATTCTTCACTCTTCACACACACTCTGCTCGAGATACACGGATATGGAGGTCACTTCTTTCTGTAGCAGCAGCAGAGCAGCTTCGTTCGTCT
ACTCCTATCCATCCAAAACGACTTCAACAAAACAATCTCTTCATGTATACTCCATGGCCGCCGAGAAATCTCCCCCGTCTGCCGCTAAAACCGTCGGCTCCAAGAAGATA
AACTCCACGGTGTTCCCTCTCGGCGAAAAAGGACCGAGGAGCAGCATCTCGCTTTCGACCTCGCCGCCGATTAAGCTACTGACGAGAGTGGAGCAATTGAAGCTACTGAG
CAAGGCGGAGAAGGCCGGTCTGCTGTCTGCCGCGGAGAAAGCCGGGTTGTCTCTGTCGTCGATTGAGAAATTAGGGCTTCTGTCGAAGGCTGAGGAATTAGGGATTTTAT
CGGCGGCGACAGATCCAGGAACTCCCGGAGCGCTACTGAGCCTGAGCTTAGGGCTGTTGCTATTGGGGCCTTCGTGTGTGTATTTGGTACCGGAGGACAATGTATGGGAG
ATCGTATTGCAGGTGGCGGTGGCTCTGGTCTCCGTCGTGGGCGGCTCTGCGGCGTTTGCTGCGTCGAATTTGGTGTCGAATTTGCAGAGATCGAATTGAATTTCAGCGTT
GGAGTTTTCCAATGGCAGCCATGGGAGTTGTAGTGATCAGTATCAACTTTCCTTCTCCTTCCATGTCCTCGTTTTGTCTGCTGCCATTATTGAAATTTATCATTTTCCGG
CGAACTGAAATGGCGATATGTGAACGGAAAATTCTCTGCCCTCAATTTTATACCAATGCCGATACAAAATTATAATATTAACGCATACGTTATATAACAACGTTATATAT
TCAAACCTACTTTTATCAAATCATTACGTTATGCATTTTTGTTAATTTAATTTGTTTAAACAAATTAAATTTATTTTCGTAATAAATCTTTTCCATTTTTCAACTCAAAT
TGATTATAATAGAATGTAGAAAATAGTATTGTGTTTGAATCTGCATCTTTGATTTCCACATTCTATTACTTAGAAATAGATGATGATCGGTTTCATAATTTTTTCTTCTT
CTTTTCAATGGCTTCATTATGGGAGGTTGGACTTCCCAATAGTCTTGTTGTGATTCCCAAATAGGGTCTTATCATGTATGGTAACCCTTCACGCTGAAGCGAGAATAGTT
TAA
Protein sequenceShow/hide protein sequence
MRTRGKSNISILKTNPTTYLFHHSFPWCGPQQFCHSDSALTIAILHSSHTLCSRYTDMEVTSFCSSSRAASFVYSYPSKTTSTKQSLHVYSMAAEKSPPSAAKTVGSKKI
NSTVFPLGEKGPRSSISLSTSPPIKLLTRVEQLKLLSKAEKAGLLSAAEKAGLSLSSIEKLGLLSKAEELGILSAATDPGTPGALLSLSLGLLLLGPSCVYLVPEDNVWE
IVLQVAVALVSVVGGSAAFAASNLVSNLQRSN