; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003912 (gene) of Snake gourd v1 genome

Gene IDTan0003912
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHTH-type transcriptional regulator protein ptxE
Genome locationLG07:35697967..35698724
RNA-Seq ExpressionTan0003912
SyntenyTan0003912
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605242.1 hypothetical protein SDJN03_02559, partial [Cucurbita argyrosperma subsp. sororia]3.0e-6283.44Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVL KDPA+FICNSD+MDF+DVV A+DDDDELQLG LYFALPLE+LN+PL AE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGGSEKCGSRR    PVVF EEE RK PR+GVKKG GSG SRKF AKL AIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

XP_022947655.1 uncharacterized protein LOC111451454 [Cucurbita moschata]6.1e-6384.71Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVL KDPASFICNSD+MDF+DVV A+DDDDELQLGQLYFALPLE+LN+PL AE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGGSEKCGSRR     VVFSEEE RK PR+GVKKG GSG SRKF AKL AIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

XP_023007426.1 uncharacterized protein LOC111499928 [Cucurbita maxima]6.1e-6384.71Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVLQKDPASFICNSD+MDF+DVV A+DD+DELQLGQLYFALPLE+LN+PL AE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGGSEKCGSRR    PVVFSEEE RK PRRGVKKG  +G SRKF AKL AIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

XP_023533314.1 uncharacterized protein LOC111795245 isoform X1 [Cucurbita pepo subsp. pepo]3.0e-6284.08Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVL KDPASFICNSD+MDF+DVV A+DDDDELQLGQLYFALPLE+LN+PL AE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGGSEKCGSRR    PVVF EEE RK PR+GVKKG  SG SRKF AKL AIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

XP_038902080.1 uncharacterized protein LOC120088720 [Benincasa hispida]1.3e-6888.54Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGIC+SSD+ NVATAKLILTDGTL+E+SYPVKVSY+LQK PASFICNSDEMDFDDVVYA+DDDDELQLGQLYFALPL+RLNQPLQAEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGG+EKCGSRRTAISPV FS+EEFRK PR+G+KK  GSGRSRKF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

TrEMBL top hitse value%identityAlignment
A0A6J1D8L2 uncharacterized protein LOC1110179245.6e-6286.62Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVV AIDD DELQLGQLYFALPL+RLNQPL AEEMAALAVKAS+AL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGG   EKCGSRRTAISP +FS+EEF K  R   KK  GSGR RKF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

A0A6J1EJ90 uncharacterized protein LOC1114349667.6e-5978.34Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD VYA+ DDD LQLG LYFALPL+RLNQPL  EEMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        +KAG GG+EK GSRRTA+SP+ FS+EEFRK P R ++KG G G  RKF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

A0A6J1G7I2 uncharacterized protein LOC1114514543.0e-6384.71Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVL KDPASFICNSD+MDF+DVV A+DDDDELQLGQLYFALPLE+LN+PL AE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGGSEKCGSRR     VVFSEEE RK PR+GVKKG GSG SRKF AKL AIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

A0A6J1I7K7 uncharacterized protein LOC1114703504.7e-6180.89Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NVATAKLILTDGTLLE+SYPVKVS++L K PA+FICNSD+MDFDD VYA+ DDD LQLG LYFALPL+RLNQPL  EEMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        +KAG GG+EK GSRRTA+SPV FS+EEFRK PRRG++K  GSGR RKF AKLSAIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

A0A6J1L0H7 uncharacterized protein LOC1114999283.0e-6384.71Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGICISSDS NV+TAKLIL+DGTLLEYSYPVKVSYVLQKDPASFICNSD+MDF+DVV A+DD+DELQLGQLYFALPLE+LN+PL AE+MAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        MKAGGGGSEKCGSRR    PVVFSEEE RK PRRGVKKG  +G SRKF AKL AIPE
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76600.1 unknown protein1.9e-1741.18Show/hide
Query:  MGICISSDS----TNVATAKLILTDGTLLEYSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQA
        MG+C+S +     ++  TAK++  +G L EY  PV  S VL+ +  S          F+CNSD + +DD + AI+ D+ LQ  Q+YF LP+ +    L A
Subjt:  MGICISSDS----TNVATAKLILTDGTLLEYSYPVKVSYVLQKDPAS----------FICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQA

Query:  EEMAALAVKASSALMKAGGGGSEKCGSRRTAISPVV
         +MAALAVKAS A+ KA G  + +  S R  ISPVV
Subjt:  EEMAALAVKASSALMKAGGGGSEKCGSRRTAISPVV

AT2G23690.1 unknown protein8.9e-4462.28Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGIC S +ST VATAKLIL DG ++E++ PVKV YVLQK+P  FICNSD+MDFD+VV AI  D+E QLGQLYFALPL  L+  L+AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGG-GSEKCGSRRTAISPVVFSEE---------EFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE
        M++GG  G +KC  RR  +SPV+FS           E R   RRG   G GSGR RK+AAKLS I E
Subjt:  MKAGGG-GSEKCGSRRTAISPVVFSEE---------EFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE

AT3G50800.1 unknown protein1.4e-3351.2Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MG C S +S    TAKLIL DGTL E+S PVKV  +LQK+P SF+CNSD+MDFDD V A+   ++L+ G+LYF LPL  LN PL+A+EMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKK---------GMGSGRSRKFAAKLSAIPE
         K+GGGG             + +++E+  +   R VK+         G G GR RKF A+LS+I E
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKK---------GMGSGRSRKFAAKLSAIPE

AT4G37240.1 unknown protein6.6e-3956.07Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MGIC SS+ST VATAKLIL DG ++E++ PVKV YVL K P  FICNSD+MDFDD V AI  D+ELQLGQ+YFALPL  L QPL+AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFR----------KAPRRGVKKGMGSG------RSRKFAAKLSAIPE
        M+ GGG     G RR  + P+V  +   R           + RR V+ G G G      R + +AA+LS I E
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFR----------KAPRRGVKKGMGSG------RSRKFAAKLSAIPE

AT5G66580.1 unknown protein1.7e-3454.32Show/hide
Query:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL
        MG C S +S    +AKLIL DGTL E+S PVKV  +LQK+P SF+CNSDEMDFDD V A+  ++EL+ GQLYF LPL  LN PL+AEEMAALAVKASSAL
Subjt:  MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSAL

Query:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGR-----SRKFAAKLSAIPE
         K+GG G        +    V  SE+ ++K    GVK   G GR      R+F A LS I E
Subjt:  MKAGGGGSEKCGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGR-----SRKFAAKLSAIPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCATTTGCATTTCCTCCGATTCCACCAATGTTGCTACGGCCAAACTGATTTTAACGGACGGAACGCTGCTCGAATACTCGTACCCAGTCAAAGTCTCTTATGTGTT
ACAGAAAGATCCGGCGAGTTTTATCTGTAACTCCGACGAGATGGACTTCGACGACGTCGTATACGCCATTGACGACGACGACGAGCTCCAACTCGGGCAGTTGTACTTTG
CCCTGCCGTTGGAGAGGCTGAACCAGCCGCTGCAAGCGGAGGAGATGGCCGCATTGGCTGTGAAGGCCAGCTCGGCGCTCATGAAGGCCGGCGGCGGCGGGAGTGAGAAA
TGTGGGTCACGGCGTACGGCGATCTCGCCGGTGGTGTTTTCCGAGGAGGAGTTTAGGAAGGCGCCGAGAAGGGGTGTGAAGAAGGGGATGGGGAGTGGCAGAAGTAGAAA
ATTTGCGGCGAAATTGAGCGCAATTCCGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATCAAATCTCAAATCACAATTCACAACTCGCTCTCGTCTGACTCACTGAGTTCACCCAAAACAAAACGACAGAACACAAAAATGGGCATTTGCATTTCCTCCGATTCCAC
CAATGTTGCTACGGCCAAACTGATTTTAACGGACGGAACGCTGCTCGAATACTCGTACCCAGTCAAAGTCTCTTATGTGTTACAGAAAGATCCGGCGAGTTTTATCTGTA
ACTCCGACGAGATGGACTTCGACGACGTCGTATACGCCATTGACGACGACGACGAGCTCCAACTCGGGCAGTTGTACTTTGCCCTGCCGTTGGAGAGGCTGAACCAGCCG
CTGCAAGCGGAGGAGATGGCCGCATTGGCTGTGAAGGCCAGCTCGGCGCTCATGAAGGCCGGCGGCGGCGGGAGTGAGAAATGTGGGTCACGGCGTACGGCGATCTCGCC
GGTGGTGTTTTCCGAGGAGGAGTTTAGGAAGGCGCCGAGAAGGGGTGTGAAGAAGGGGATGGGGAGTGGCAGAAGTAGAAAATTTGCGGCGAAATTGAGCGCAATTCCGG
AATGATGTGGATGTGGGTTTCTTTTTTTCTCTATAGGGCTAAAATTGTAAATATGTTCTTTAGGGTTTTGAAGATGGAGAGGTTGCTTTCTGTGGATGTTTTGTTTCTTC
CGCATCCCTTGGCAATGGCATGAGATTCTTTCTTAGTCCACTTTTACATTTTTACCCCTCATTACTAAAAAGTTCTCAAAATAAAATCCTATATTTTG
Protein sequenceShow/hide protein sequence
MGICISSDSTNVATAKLILTDGTLLEYSYPVKVSYVLQKDPASFICNSDEMDFDDVVYAIDDDDELQLGQLYFALPLERLNQPLQAEEMAALAVKASSALMKAGGGGSEK
CGSRRTAISPVVFSEEEFRKAPRRGVKKGMGSGRSRKFAAKLSAIPE