CuGenDBv2

Clc01G16110 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G16110
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: HTH-type transcriptional regulator protein ptxE
Genome location: ClcChr01:28938527..28939152
RNA-Seq Expression: Clc01G16110
Synteny: Clc01G16110
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
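
The genome location field can be unpacked into a chromosome name and a coordinate pair. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str):
    """Split 'Chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc: str) -> int:
    """Length of the genomic span under the 1-based inclusive assumption."""
    _, start, end = parse_location(loc)
    return end - start + 1


print(parse_location("ClcChr01:28938527..28939152"))
print(span_length("ClcChr01:28938527..28939152"))  # 626
```

Under that assumption the gene spans 626 bp on ClcChr01.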


Homology
GenBank top hits | e value | % identity | Alignment
XP_004141948.1 uncharacterized protein LOC101203564 [Cucumis sativus] | 1.6e-71 | 92.86
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES AVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+DDEELQLGQLYFALPL+RLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKC GG+KC H RRSVSPVVFTVEELK RKR AAGRGGAGGR+KFAANLMAIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

XP_008440178.1 PREDICTED: uncharacterized protein LOC103484721 [Cucumis melo] | 2.1e-71 | 92.21
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+DDEELQ+GQLYFALPL+RL+QPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKC GG+KC H RRSVSPVVFTVEELK RKR AAGRGGAGGR+KFAANLMAIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

XP_022963140.1 uncharacterized protein LOC111463438 [Cucurbita moschata] | 7.3e-69 | 90.26
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES AVATAKLIL+DGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+D+EELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKCG G+KCSH RRSVSP+VFTVEELK RKR +AGR GAGGR+KFAANL AIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

XP_023518464.1 uncharacterized protein LOC111781952 [Cucurbita pepo subsp. pepo] | 1.1e-69 | 92.21
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES AVATAKLIL+DGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKCG G+KCSH RRSVSPVVFTVEELK RKR +AGR GAGGR KFAANL AIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

XP_038883003.1 uncharacterized protein LOC120074082 [Benincasa hispida] | 2.9e-73 | 95.45
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQ NPSCFICNSDEMDFDDAL AISDDEELQLGQLYFALPL+RLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKCGGG+KCSH RRSVSPVVFTVEELK RKR AAGRGGAGGRRKFAANLMAIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KJB4 Uncharacterized protein | 7.7e-72 | 92.86
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES AVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+DDEELQLGQLYFALPL+RLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKC GG+KC H RRSVSPVVFTVEELK RKR AAGRGGAGGR+KFAANLMAIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

A0A1S3B0H3 uncharacterized protein LOC103484721 | 1.0e-71 | 92.21
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+DDEELQ+GQLYFALPL+RL+QPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKC GG+KC H RRSVSPVVFTVEELK RKR AAGRGGAGGR+KFAANLMAIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

A0A5D3CRH4 HTH-type transcriptional regulator protein ptxE | 1.0e-71 | 92.21
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+DDEELQ+GQLYFALPL+RL+QPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKC GG+KC H RRSVSPVVFTVEELK RKR AAGRGGAGGR+KFAANLMAIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

A0A6J1HH54 uncharacterized protein LOC111463438 | 3.6e-69 | 90.26
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES AVATAKLIL+DGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDAL AI+D+EELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKCG G+KCSH RRSVSP+VFTVEELK RKR +AGR GAGGR+KFAANL AIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

A0A6J1KPP1 uncharacterized protein LOC111497150 | 1.8e-68 | 88.96
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES AVATAKLIL+DG+LQEFSYPVKVSY+LQKNPSCFICNSDEMDFDDAL AISD+EELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
        MKCG G+KCSH RRSVSPVVF +EELK RKR +AGR GAGGR KFAANL AIPE
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G76600.1 unknown protein | 2.6e-16 | 37.2
Query:  MGICSSSES----AAVATAKLILHDGSLQEFSYPVKVSYVLQ----------KNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQA
        MG+C S       ++  TAK++  +G L+E+  PV  S VL+           + S F+CNSD + +DD + AI  DE LQ  Q+YF LP+S+ +  L A
Subjt:  MGICSSSES----AAVATAKLILHDGSLQEFSYPVKVSYVLQ----------KNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQA

Query:  EEMAALAVKANSALMKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLM
         +MAALAVKA+ A+ K  G +        +SPVV T+ +     R AA     GG    A N+M
Subjt:  EEMAALAVKANSALMKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLM

AT2G23690.1 unknown protein | 9.3e-46 | 63.19
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSS ES  VATAKLILHDG + EF+ PVKV YVLQKNP CFICNSD+MDFD+ + AIS DEE QLGQLYFALPLS L   L+AEEMAALAVKA+SAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGG---EKCSHHRRSVSPVVFTVEELKA-----RKRAAAGRGGAG-GRRKFAANLMAIPE
        M+ GG    +KC   R+ VSPV+F+   + A       R    RGG G GRRK+AA L  I E
Subjt:  MKCGGG---EKCSHHRRSVSPVVFTVEELKA-----RKRAAAGRGGAG-GRRKFAANLMAIPE

AT3G50800.1 unknown protein | 8.7e-36 | 56.13
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MG C+S ES    TAKLIL DG+LQEFS PVKV  +LQKNP+ F+CNSD+MDFDDA+LA+   E+L+ G+LYF LPL+ L  PL+A+EMAALAVKA+SAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAG-GRRKFAANLMAIPE
         K GGG   S++   V      V  +K       G GG G GRRKF A L +I E
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGRGGAG-GRRKFAANLMAIPE

AT4G37240.1 unknown protein | 2.6e-40 | 64.63
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MGICSSSES  VATAKLIL DG + EF+ PVKV YVL K P CFICNSD+MDFDDA+ AIS DEELQLGQ+YFALPL  L+QPL+AEEMAALAVKA+SAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGR---GGAGGRRK
        M+ GGG      R+ V P+V      K R R  +G    G   GRRK
Subjt:  MKCGGGEKCSHHRRSVSPVVFTVEELKARKRAAAGR---GGAGGRRK

AT5G66580.1 unknown protein | 5.7e-35 | 54.49
Query:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL
        MG C+S ES    +AKLIL DG+LQEFS PVKV  +LQKNP+ F+CNSDEMDFDDA+ A++ +EEL+ GQLYF LPL+ L  PL+AEEMAALAVKA+SAL
Subjt:  MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSAL

Query:  MKCGG-GEKCSHHRRSVSPVVFTVEELKARK-RAAAGRGGAGGRRKFAANLMAIPE
         K GG G        + S   +  + +   K     GRG   G+R+F ANL  I E
Subjt:  MKCGG-GEKCSHHRRSVSPVVFTVEELKARK-RAAAGRGGAGGRRKFAANLMAIPE


Sequences
CDS sequence
ATGGGTATTTGCAGTTCTTCGGAATCTGCCGCCGTTGCCACCGCTAAGTTGATCCTTCACGACGGAAGCCTGCAGGAATTTTCGTATCCGGTTAAGGTTTCGTATGTTCT
TCAAAAGAATCCGTCGTGCTTTATATGCAACTCCGACGAGATGGATTTCGACGATGCCTTGTTGGCCATTAGCGACGACGAGGAGCTTCAACTCGGACAGCTTTATTTTG
CGCTGCCGTTGAGTAGGCTGAAGCAGCCGCTTCAGGCCGAGGAAATGGCCGCATTGGCTGTCAAGGCTAACTCTGCGCTAATGAAATGTGGCGGCGGAGAGAAATGTAGC
CACCACCGGAGATCGGTGTCTCCGGTGGTTTTCACGGTTGAGGAACTCAAGGCTCGTAAACGAGCGGCCGCCGGCCGTGGTGGCGCCGGTGGAAGAAGAAAGTTCGCGGC
GAATTTGATGGCGATTCCTGAGTAG
mRNA sequence
CAAATCCCTAACGCCTCCAAATCCGCCACCCGCCGTCCTCTCTTCCCTCCGCCGTGCCTTGCCGCCGTTAATGGGTATTTGCAGTTCTTCGGAATCTGCCGCCGTTGCCA
CCGCTAAGTTGATCCTTCACGACGGAAGCCTGCAGGAATTTTCGTATCCGGTTAAGGTTTCGTATGTTCTTCAAAAGAATCCGTCGTGCTTTATATGCAACTCCGACGAG
ATGGATTTCGACGATGCCTTGTTGGCCATTAGCGACGACGAGGAGCTTCAACTCGGACAGCTTTATTTTGCGCTGCCGTTGAGTAGGCTGAAGCAGCCGCTTCAGGCCGA
GGAAATGGCCGCATTGGCTGTCAAGGCTAACTCTGCGCTAATGAAATGTGGCGGCGGAGAGAAATGTAGCCACCACCGGAGATCGGTGTCTCCGGTGGTTTTCACGGTTG
AGGAACTCAAGGCTCGTAAACGAGCGGCCGCCGGCCGTGGTGGCGCCGGTGGAAGAAGAAAGTTCGCGGCGAATTTGATGGCGATTCCTGAGTAGGGTCATTTTTGCAAA
ATGGGATGGGTTTGTTGTGTAAATAACGGGAAAATAGGGGCTGGTTGGTTTAGATCTTTTGATTTGTTTTTTCTTT
Protein sequence
MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSALMKCGGGEKCS
HHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE
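
As a consistency check, the CDS can be translated and compared with the protein sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `translate`, `CODON_TABLE`, `CDS`, and `PROTEIN` are introduced here for illustration, and the sequence strings are copied from the record above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


CDS = (
    "ATGGGTATTTGCAGTTCTTCGGAATCTGCCGCCGTTGCCACCGCTAAGTTGATCCTTCACGACGGAAGCCTGCAGGAATTTTCGTATCCGGTTAAGGTTTCGTATGTTCT"
    "TCAAAAGAATCCGTCGTGCTTTATATGCAACTCCGACGAGATGGATTTCGACGATGCCTTGTTGGCCATTAGCGACGACGAGGAGCTTCAACTCGGACAGCTTTATTTTG"
    "CGCTGCCGTTGAGTAGGCTGAAGCAGCCGCTTCAGGCCGAGGAAATGGCCGCATTGGCTGTCAAGGCTAACTCTGCGCTAATGAAATGTGGCGGCGGAGAGAAATGTAGC"
    "CACCACCGGAGATCGGTGTCTCCGGTGGTTTTCACGGTTGAGGAACTCAAGGCTCGTAAACGAGCGGCCGCCGGCCGTGGTGGCGCCGGTGGAAGAAGAAAGTTCGCGGC"
    "GAATTTGATGGCGATTCCTGAGTAG"
)
PROTEIN = (
    "MGICSSSESAAVATAKLILHDGSLQEFSYPVKVSYVLQKNPSCFICNSDEMDFDDALLAISDDEELQLGQLYFALPLSRLKQPLQAEEMAALAVKANSALMKCGGGEKCS"
    "HHRRSVSPVVFTVEELKARKRAAAGRGGAGGRRKFAANLMAIPE"
)

translated = translate(CDS)
# The CDS ends in a stop codon, so the translation carries a trailing '*'.
print(translated == PROTEIN + "*")
```

The comparison appends `*` to the protein because the listed CDS includes its terminal TAG stop codon, while the protein sequence does not show it.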