; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G012900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G012900
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionHTH-type transcriptional regulator protein ptxE
Genome locationCG_Chr02:26650774..26651247
RNA-Seq ExpressionClCG02G012900
SyntenyClCG02G012900
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149511.1 uncharacterized protein LOC111017924 [Momordica charantia]4.4e-6184.71Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NVATAKLILTDGTL+E+SYPVKVSYVLQK PA FICNSDEMDFDDVV A+DD DELQLGQLYFALPLDRLNQPL AEEMAALAVKAS+AL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGG   EKCGSRRTAISP  FSDEEF KV R   KK  GSGR RKFTAKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_022928071.1 uncharacterized protein LOC111434966 [Cucurbita moschata]6.8e-6281.53Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLILTDGTL+EFSYPVKVS++L KHPA FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GG EK GSRRTA+SP+AFSDEEFRK P R L+KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_022971678.1 uncharacterized protein LOC111470350 [Cucurbita maxima]4.3e-6484.08Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLILTDGTL+EFSYPVKVS++L KHPA FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GG EK GSRRTA+SPVAFSDEEFRK PRRGL+K  GSGR RKF AKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_023512544.1 uncharacterized protein LOC111777254 [Cucurbita pepo subsp. pepo]1.0e-6282.17Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLILTDGTL+EFSYPVKVS++L KHPA FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GG EK GSRRTA+SP+AFSDEEFRK PRR L+KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida]1.1e-7294.27Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGICVSSD+INVATAKLILTDGTLVEFSYPVKVSY+LQKHPA FICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGGGG EKCGSRRTAISPV FSDEEFRK PR+GLKK  GSGRSRKFTAKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

TrEMBL top hitse value%identityAlignment
A0A6J1D8L2 uncharacterized protein LOC1110179242.1e-6184.71Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NVATAKLILTDGTL+E+SYPVKVSYVLQK PA FICNSDEMDFDDVV A+DD DELQLGQLYFALPLDRLNQPL AEEMAALAVKAS+AL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGG   EKCGSRRTAISP  FSDEEF KV R   KK  GSGR RKFTAKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC1114349663.3e-6281.53Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLILTDGTL+EFSYPVKVS++L KHPA FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GG EK GSRRTA+SP+AFSDEEFRK P R L+KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC1114514546.9e-6079.62Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NV+TAKLIL+DGTL+E+SYPVKVSYVL K PA FICNSD+MDF+DVV AVDDDDELQLGQLYFALPL++LN+PL AE+MAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGGGG EKCGSRR     V FS+EE RK PR+G+KKG GSG SRKFTAKL AIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC1114703502.1e-6484.08Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLILTDGTL+EFSYPVKVS++L KHPA FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GG EK GSRRTA+SPVAFSDEEFRK PRRGL+K  GSGR RKF AKLSAIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC1114999286.9e-6079.62Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NV+TAKLIL+DGTL+E+SYPVKVSYVLQK PA FICNSD+MDF+DVV AVDD+DELQLGQLYFALPL++LN+PL AE+MAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGGGG EKCGSRR    PV FS+EE RK PRRG+KKG  +G SRKFTAKL AIPE
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein1.7e-1539.71Show/hide
Query:  MGICVSSDS----INVATAKLILTDGTLVEFSYPVKVSYVLQKH----------PARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQA
        MG+CVS +      +  TAK++  +G L E+  PV  S VL+             + F+CNSD + +DD + A++ D+ LQ  Q+YF LP+ +    L A
Subjt:  MGICVSSDS----INVATAKLILTDGTLVEFSYPVKVSYVLQKH----------PARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQA

Query:  EEMAALAVKASSALMKAGGGGPEKCGSRRTA-ISPV
         +MAALAVKAS A+ KA G   +K   RR+  ISPV
Subjt:  EEMAALAVKASSALMKAGGGGPEKCGSRRTA-ISPV

AT2G23690.1 unknown protein7.5e-4359.51Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC S +S  VATAKLIL DG ++EF+ PVKV YVLQK+P  FICNSD+MDFD+VV A+  D+E QLGQLYFALPL  L+  L+AEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGG-GPEKCGSRRTAISPVAFSDEEFRKV-----PRRGLKKGMGSGRSRKFTAKLSAIPE
        M++GG  G +KC  RR  +SPV FS      V      R G ++G G    RK+ AKLS I E
Subjt:  MKAGGG-GPEKCGSRRTAISPVAFSDEEFRKV-----PRRGLKKGMGSGRSRKFTAKLSAIPE

AT3G50800.1 unknown protein3.4e-3553.33Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MG C S +S    TAKLIL DGTL EFS PVKV  +LQK+P  F+CNSD+MDFDD V AV   ++L+ G+LYF LPL  LN PL+A+EMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEE-----FRKVPRRGL-KKGMGSGRS--RKFTAKLSAIPE
         K+GGGG             ++++DE+      R+V R G   +G G G    RKFTA+LS+I E
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEE-----FRKVPRRGL-KKGMGSGRS--RKFTAKLSAIPE

AT4G37240.1 unknown protein5.6e-3860.96Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC SS+S  VATAKLIL DG ++EF+ PVKV YVL K+P  FICNSD+MDFDD V A+  D+ELQLGQ+YFALPL  L QPL+AEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSR
        M+ GGG     G RR  + P+  SD+   +V       G GSGR +
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSR

AT5G66580.1 unknown protein3.4e-3554.32Show/hide
Query:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MG C S +S+   +AKLIL DGTL EFS PVKV  +LQK+P  F+CNSDEMDFDD V AV  ++EL+ GQLYF LPL  LN PL+AEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGR-----SRKFTAKLSAIPE
         K+GG G        +    V  S++ ++K    G+K   G GR      R+FTA LS I E
Subjt:  MKAGGGGPEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGR-----SRKFTAKLSAIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATTTGCGTGTCGTCAGATTCGATCAATGTTGCTACAGCTAAATTGATTCTTACAGATGGAACTTTGGTCGAATTCTCTTACCCAGTTAAGGTTTCTTACGTGCT
ACAAAAACATCCGGCGAGATTTATCTGCAACTCCGACGAGATGGACTTTGACGACGTCGTTTATGCCGTTGACGACGACGATGAGCTCCAACTCGGGCAGCTTTACTTTG
CGTTGCCGTTGGACAGGCTGAACCAGCCGCTTCAGGCGGAGGAAATGGCGGCATTGGCCGTCAAGGCCAGTTCGGCGCTTATGAAGGCCGGCGGTGGAGGGCCGGAAAAA
TGTGGATCTAGGCGGACGGCGATTTCTCCGGTGGCGTTTTCCGATGAGGAGTTTAGGAAGGTTCCAAGAAGGGGATTGAAGAAGGGAATGGGGAGTGGTAGAAGTAGAAA
ATTTACTGCGAAATTGAGTGCAATTCCGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCATTTGCGTGTCGTCAGATTCGATCAATGTTGCTACAGCTAAATTGATTCTTACAGATGGAACTTTGGTCGAATTCTCTTACCCAGTTAAGGTTTCTTACGTGCT
ACAAAAACATCCGGCGAGATTTATCTGCAACTCCGACGAGATGGACTTTGACGACGTCGTTTATGCCGTTGACGACGACGATGAGCTCCAACTCGGGCAGCTTTACTTTG
CGTTGCCGTTGGACAGGCTGAACCAGCCGCTTCAGGCGGAGGAAATGGCGGCATTGGCCGTCAAGGCCAGTTCGGCGCTTATGAAGGCCGGCGGTGGAGGGCCGGAAAAA
TGTGGATCTAGGCGGACGGCGATTTCTCCGGTGGCGTTTTCCGATGAGGAGTTTAGGAAGGTTCCAAGAAGGGGATTGAAGAAGGGAATGGGGAGTGGTAGAAGTAGAAA
ATTTACTGCGAAATTGAGTGCAATTCCGGAATAA
Protein sequenceShow/hide protein sequence
MGICVSSDSINVATAKLILTDGTLVEFSYPVKVSYVLQKHPARFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSALMKAGGGGPEK
CGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE