; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G005800 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G005800
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHTH-type transcriptional regulator protein ptxE
Genome locationchr10:8418467..8418940
RNA-Seq ExpressionLsi10G005800
SyntenyLsi10G005800
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149511.1 uncharacterized protein LOC111017924 [Momordica charantia]1.3e-6084.71Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NVATAKLIL DGTL+E+SYPVKVSYVLQK PASFICNSDEMDFDDVV A+DD DELQLGQLYFALPLDRLNQPL AEEMAALAVKAS+AL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGG   EKCGSRRTAISP  FSDEEF KV R   KK  GSGR RKFTAKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_022928071.1 uncharacterized protein LOC111434966 [Cucurbita moschata]8.9e-6281.53Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLIL DGTL+EFSYPVKVS++L KHPA+FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GGTEK GSRRTA+SP+AFSDEEFRK P R L+KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_022971678.1 uncharacterized protein LOC111470350 [Cucurbita maxima]5.6e-6484.08Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLIL DGTL+EFSYPVKVS++L KHPA+FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GGTEK GSRRTA+SPVAFSDEEFRK PRRGL+K  GSGR RKF AKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_023512544.1 uncharacterized protein LOC111777254 [Cucurbita pepo subsp. pepo]1.4e-6282.17Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLIL DGTL+EFSYPVKVS++L KHPA+FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GGTEK GSRRTA+SP+AFSDEEFRK PRR L+KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida]6.5e-7394.9Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGICVSSD+INVATAKLIL DGTLVEFSYPVKVSY+LQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGGGGTEKCGSRRTAISPV FSDEEFRK PR+GLKK  GSGRSRKFTAKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

TrEMBL top hitse value%identityAlignment
A0A6J1D8L2 uncharacterized protein LOC1110179246.2e-6184.71Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NVATAKLIL DGTL+E+SYPVKVSYVLQK PASFICNSDEMDFDDVV A+DD DELQLGQLYFALPLDRLNQPL AEEMAALAVKAS+AL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGG   EKCGSRRTAISP  FSDEEF KV R   KK  GSGR RKFTAKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC1114349664.3e-6281.53Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLIL DGTL+EFSYPVKVS++L KHPA+FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GGTEK GSRRTA+SP+AFSDEEFRK P R L+KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC1114514545.2e-6080.25Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NV+TAKLIL DGTL+E+SYPVKVSYVL K PASFICNSD+MDF+DVV AVDDDDELQLGQLYFALPL++LN+PL AE+MAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGGGG+EKCGSRR     V FS+EE RK PR+G+KKG GSG SRKFTAKL AIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC1114703502.7e-6484.08Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS+NVATAKLIL DGTL+EFSYPVKVS++L KHPA+FICNSD+MDFDD VYAV DDD LQLG LYFALPLDRLNQPL  EEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        +KAG GGTEK GSRRTA+SPVAFSDEEFRK PRRGL+K  GSGR RKF AKLSAIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC1114999285.2e-6080.25Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC+SSDS NV+TAKLIL DGTL+E+SYPVKVSYVLQK PASFICNSD+MDF+DVV AVDD+DELQLGQLYFALPL++LN+PL AE+MAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE
        MKAGGGG+EKCGSRR    PV FS+EE RK PRRG+KKG  +G SRKFTAKL AIPE
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein2.1e-1640.74Show/hide
Query:  MGICVSSDS----INVATAKLILIDGTLVEFSYPVKVSYVLQKHPAS----------FICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQA
        MG+CVS +      +  TAK++ I+G L E+  PV  S VL+    S          F+CNSD + +DD + A++ D+ LQ  Q+YF LP+ +    L A
Subjt:  MGICVSSDS----INVATAKLILIDGTLVEFSYPVKVSYVLQKHPAS----------FICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQA

Query:  EEMAALAVKASSALMKAGGGGTEKCGSRRTAISPV
         +MAALAVKAS A+ KA G    +  S R  ISPV
Subjt:  EEMAALAVKASSALMKAGGGGTEKCGSRRTAISPV

AT2G23690.1 unknown protein1.3e-4259.51Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC S +S  VATAKLIL DG ++EF+ PVKV YVLQK+P  FICNSD+MDFD+VV A+  D+E QLGQLYFALPL  L+  L+AEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGG-GTEKCGSRRTAISPVAFSDEEFRKV-----PRRGLKKGMGSGRSRKFTAKLSAIPE
        M++GG  G +KC  RR  +SPV FS      V      R G ++G G    RK+ AKLS I E
Subjt:  MKAGGG-GTEKCGSRRTAISPVAFSDEEFRKV-----PRRGLKKGMGSGRSRKFTAKLSAIPE

AT3G50800.1 unknown protein3.4e-3553.94Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MG C S +S    TAKLIL DGTL EFS PVKV  +LQK+P SF+CNSD+MDFDD V AV   ++L+ G+LYF LPL  LN PL+A+EMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEE-----FRKVPRRGL-KKGMGSGRS--RKFTAKLSAIPE
         K+GGGG             ++++DE+      R+V R G   +G G G    RKFTA+LS+I E
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEE-----FRKVPRRGL-KKGMGSGRS--RKFTAKLSAIPE

AT4G37240.1 unknown protein1.2e-3760.96Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MGIC SS+S  VATAKLIL DG ++EF+ PVKV YVL K+P  FICNSD+MDFDD V A+  D+ELQLGQ+YFALPL  L QPL+AEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSR
        M+ GGG     G RR  + P+  SD+   +V       G GSGR +
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSR

AT5G66580.1 unknown protein4.0e-3654.94Show/hide
Query:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL
        MG C S +S+   +AKLIL+DGTL EFS PVKV  +LQK+P SF+CNSDEMDFDD V AV  ++EL+ GQLYF LPL  LN PL+AEEMAALAVKASSAL
Subjt:  MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGR-----SRKFTAKLSAIPE
         K+GG G        +    V  S++ ++K    G+K   G GR      R+FTA LS I E
Subjt:  MKAGGGGTEKCGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGR-----SRKFTAKLSAIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATTTGCGTGTCGTCCGATTCGATCAATGTTGCTACAGCGAAATTGATTCTTATTGATGGAACTTTGGTGGAATTCTCTTACCCAGTAAAAGTTTCTTACGTACT
ACAGAAACATCCGGCGAGTTTTATCTGCAACTCCGACGAGATGGATTTTGACGACGTCGTTTACGCCGTTGACGACGACGATGAGCTCCAACTTGGGCAGCTTTACTTTG
CCTTGCCGTTGGACAGACTGAACCAGCCGCTGCAGGCGGAGGAAATGGCGGCATTGGCCGTCAAGGCTAGCTCAGCGCTTATGAAGGCTGGCGGCGGAGGGACGGAGAAA
TGTGGATCTAGACGAACGGCGATTTCACCGGTGGCATTTTCCGATGAGGAGTTTAGGAAGGTTCCGAGAAGGGGATTGAAGAAGGGAATGGGGAGTGGTAGAAGTAGAAA
ATTTACGGCGAAATTGAGTGCAATTCCGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCATTTGCGTGTCGTCCGATTCGATCAATGTTGCTACAGCGAAATTGATTCTTATTGATGGAACTTTGGTGGAATTCTCTTACCCAGTAAAAGTTTCTTACGTACT
ACAGAAACATCCGGCGAGTTTTATCTGCAACTCCGACGAGATGGATTTTGACGACGTCGTTTACGCCGTTGACGACGACGATGAGCTCCAACTTGGGCAGCTTTACTTTG
CCTTGCCGTTGGACAGACTGAACCAGCCGCTGCAGGCGGAGGAAATGGCGGCATTGGCCGTCAAGGCTAGCTCAGCGCTTATGAAGGCTGGCGGCGGAGGGACGGAGAAA
TGTGGATCTAGACGAACGGCGATTTCACCGGTGGCATTTTCCGATGAGGAGTTTAGGAAGGTTCCGAGAAGGGGATTGAAGAAGGGAATGGGGAGTGGTAGAAGTAGAAA
ATTTACGGCGAAATTGAGTGCAATTCCGGAATAA
Protein sequenceShow/hide protein sequence
MGICVSSDSINVATAKLILIDGTLVEFSYPVKVSYVLQKHPASFICNSDEMDFDDVVYAVDDDDELQLGQLYFALPLDRLNQPLQAEEMAALAVKASSALMKAGGGGTEK
CGSRRTAISPVAFSDEEFRKVPRRGLKKGMGSGRSRKFTAKLSAIPE