CuGenDBv2

Lsi04G000220 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G000220
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Trihelix transcription factor
Genome location: chr04:326814..329563
RNA-Seq Expression: Lsi04G000220
Synteny: Lsi04G000220
Gene Ontology terms:
    GO:0005634 - nucleus (cellular component)
    GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domains:
    IPR044822 - Myb/SANT-like DNA-binding domain 4
    IPR044823 - Trihelix transcription factor ASIL1/2-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601124.1 Trihelix transcription factor ASIL2, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.6e-66; % identity: 75
Query:  MEMKPSPSSPPTSQ-TSPSLLINHHLQLH----------LPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSS
        ME+KP+PSSP  S+ TSPSLL NHH   H           P  KK P ST   GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSS
Subjt:  MEMKPSPSSPPTSQ-TSPSLLINHHLQLH----------LPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSS

Query:  RANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSN
        RA+FTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGN TLTPPPPPP P     PP             PPPP+ PPFLPPQNSHGSN
Subjt:  RANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSN

Query:  GVDRINPK
        GVDRINPK
Subjt:  GVDRINPK

XP_022957512.1 trihelix transcription factor ASIL2 [Cucurbita moschata]; e value: 2.5e-64; % identity: 73.91
Query:  MEMKPSPSSPPTSQ-TSPSLLINHH---------LQLHLPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSR
        ME+KP+PSSP  S+ TSPSLL NHH              P  KK P ST   GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSSR
Subjt:  MEMKPSPSSPPTSQ-TSPSLLINHH---------LQLHLPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSR

Query:  ANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNG
        A+FTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGN TLTPPPPPP P     PP             PPPP+ PPFLP QNSHGSNG
Subjt:  ANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNG

Query:  VDRINPK
         DRINPK
Subjt:  VDRINPK

XP_022988920.1 trihelix transcription factor ASIL1-like isoform X1 [Cucurbita maxima]; e value: 6.9e-59; % identity: 80.95
Query:  PDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYH
        P  KK P ST   GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSSRA+FTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYH
Subjt:  PDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYH

Query:  RLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPK
        RLHLLLRGN TLTPPPPPP P     PP             PPPP+ PPFLP QNSHGSNGVDRINPK
Subjt:  RLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPK

XP_023526851.1 trihelix transcription factor ASIL1 [Cucurbita pepo subsp. pepo]; e value: 1.9e-64; % identity: 74.76
Query:  MEMKPSPSSPPTSQ-TSPSLLINHHLQL--------HLPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRA
        ME+KP+PSSP  S+ TSPSLL NHH  L          P  KK P ST   GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSSRA
Subjt:  MEMKPSPSSPPTSQ-TSPSLLINHHLQL--------HLPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRA

Query:  NFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGV
        +FTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGN TLTPPPPPP P     PP             PPPP+ PPFLP QNS GSNGV
Subjt:  NFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGV

Query:  DRINPK
        DRINPK
Subjt:  DRINPK

XP_038893233.1 trihelix transcription factor ASIL1 [Benincasa hispida]; e value: 4.8e-68; % identity: 78.5
Query:  MEMKPSPSSPPTSQTSPSLLINHHLQLHLPDDKKN--PVST----GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSP
        ME+KPSPSSP T Q SPSL+ NH+L     DD K    VST    GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSSR+NFTKSP
Subjt:  MEMKPSPSSPPTSQTSPSLLINHHLQLHLPDDKKN--PVST----GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSP

Query:  KTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPK
        KTQTQCKNKIESMKKRYRSESAS AS+WPLY+RLHLLLRGNTTLTPPPPP    PPPPP     PPPP     PPPPS PPFLP QNSHGSNG+DRINPK
Subjt:  KTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KRQ8 Uncharacterized protein; e value: 3.4e-51; % identity: 61.24
Query:  MEMKPSPSSPPT--------SQTSPSLLINHHLQLHLPDD----KKNPVSTGD----RLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHV
        ME+KP+PSSPPT         QTSPSLL NHH Q  LPDD    K NPVS G     RLKRDEWSEGAVS L+ AYESKW LRNRAKLKGHDWEDVARHV
Subjt:  MEMKPSPSSPPT--------SQTSPSLLINHHLQLHLPDD----KKNPVSTGD----RLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHV

Query:  SSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHG
        SSR+N TKS KT TQCKNKIESMKK+ R E A   SSWPLYHR+  L+ GNT LTP PPP                                LPPQNSHG
Subjt:  SSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHG

Query:  SNGVDRINP
        SNGVD INP
Subjt:  SNGVDRINP

A0A1S3BGZ0 LOW QUALITY PROTEIN: trihelix transcription factor ASIL2; e value: 5.7e-59; % identity: 66.82
Query:  EMKPSPSSPPT-------SQTSPSLLINHHLQLHLPDD----KKNPVST-----GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVS
        E+K +PSSPPT       +QTSPSLL NHH    LPDD    K+N VST     G RLKRDEWS+GAVSTLL AYESKW+LRNRAKLKGHDWEDVARHVS
Subjt:  EMKPSPSSPPT-------SQTSPSLLINHHLQLHLPDD----KKNPVST-----GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVS

Query:  SRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLI-LVD--PPPPSSPPFLPPQNS
        SR+NFTKS KT TQCKNKIESMKK  R+  ASAASSWP  HR+   L GN+ LTPPP               SPPPPL+ LVD  PPPPS PPFL PQNS
Subjt:  SRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLI-LVD--PPPPSSPPFLPPQNS

Query:  HGSNGVDRINP
        HGSNGVD INP
Subjt:  HGSNGVDRINP

A0A2N9HTC7 Uncharacterized protein; e value: 8.8e-52; % identity: 64.45
Query:  SQTSPSLLINHHLQLHLPDDKKNP-------------VSTGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQ
        ++ SPSLL NH       D  + P             V + DRLKRDEWSEGAVS+LL+AYE+KWVLRNRAKLKGHDWEDVARHVS RAN TKSPKTQTQ
Subjt:  SQTSPSLLINHHLQLHLPDDKKNP-------------VSTGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQ

Query:  CKNKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDP-------PPPSSPPFLPP------QNSHGS
        CKNKIESMKKRYRSESA+A ASSWPLY RL LLLRG+  +  PPPPPPPPPPPPPP    PPPPL+L++P       PPP +PP  PP      QNSHGS
Subjt:  CKNKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDP-------PPPSSPPFLPP------QNSHGS

Query:  NGVDRINPKFG
        NGVDR+  + G
Subjt:  NGVDRINPKFG

A0A6J1GZB5 trihelix transcription factor ASIL2; e value: 1.2e-64; % identity: 73.91
Query:  MEMKPSPSSPPTSQ-TSPSLLINHH---------LQLHLPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSR
        ME+KP+PSSP  S+ TSPSLL NHH              P  KK P ST   GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSSR
Subjt:  MEMKPSPSSPPTSQ-TSPSLLINHH---------LQLHLPDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSR

Query:  ANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNG
        A+FTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGN TLTPPPPPP P     PP             PPPP+ PPFLP QNSHGSNG
Subjt:  ANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNG

Query:  VDRINPK
         DRINPK
Subjt:  VDRINPK

A0A6J1JKX8 trihelix transcription factor ASIL1-like isoform X1; e value: 3.4e-59; % identity: 80.95
Query:  PDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYH
        P  KK P ST   GDRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKGHDWEDVARHVSSRA+FTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYH
Subjt:  PDDKKNPVST---GDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYH

Query:  RLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPK
        RLHLLLRGN TLTPPPPPP P     PP             PPPP+ PPFLP QNSHGSNGVDRINPK
Subjt:  RLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPK

SwissProt top hits (e value, % identity, alignment)
Q9LJG8 Trihelix transcription factor ASIL2; e value: 1.9e-11; % identity: 34.04
Query:  SPP--TSQTSPSLLINHHLQLHLPDDKKNPVS--TGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKI
        +PP  +SQ  P  L    L +       N     TG   + D WSE A + L+ A+  +++  +R  LK   W++VA  VSSR ++ K PKT  QCKN+I
Subjt:  SPP--TSQTSPSLLINHHLQLHLPDDKKNPVS--TGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKI

Query:  ESMKKRYRSESASAA-----SSWPLYHRLHLLLRGNTTLTP
        +++KK+Y+ E    A     S W  + +L  L+ G+T   P
Subjt:  ESMKKRYRSESASAA-----SSWPLYHRLHLLLRGNTTLTP

Arabidopsis top hits (e value, % identity, alignment)
AT3G10030.1 aspartate/glutamate/uridylate kinase family protein; e value: 1.7e-15; % identity: 40.78
Query:  RLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYR------SESASAASSWPLYHRLHLLLRGN
        R  R+EWS+ A++ LL AY  K+   NR  L+G DWE+VA  VS R    K  K+  QCKNKI+++KKRY+      S   +AAS WP + ++  ++  +
Subjt:  RLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYR------SESASAASSWPLYHRLHLLLRGN

Query:  TTL
          L
Subjt:  TTL

AT3G10030.2 aspartate/glutamate/uridylate kinase family protein; e value: 1.7e-15; % identity: 40.78
Query:  RLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYR------SESASAASSWPLYHRLHLLLRGN
        R  R+EWS+ A++ LL AY  K+   NR  L+G DWE+VA  VS R    K  K+  QCKNKI+++KKRY+      S   +AAS WP + ++  ++  +
Subjt:  RLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYR------SESASAASSWPLYHRLHLLLRGN

Query:  TTL
          L
Subjt:  TTL

AT3G54390.1 sequence-specific DNA binding transcription factors; e value: 3.1e-41; % identity: 62.92
Query:  INHHLQLHLPDDKKNPVSTGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASA-A
        +NH   L  P      V   DRLKRDEWSEGAVSTLL+AYESKWVLRNRAKLKG DWEDVA+HVSSRA  TKSPKTQTQCKNKIESMKKRYRSESA+A  
Subjt:  INHHLQLHLPDDKKNPVSTGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASA-A

Query:  SSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPKFGF
        SSWPLY RL  LLRG          P P P    P + S   PL+L++PP P+     PPQ S+GSNGV +I  + GF
Subjt:  SSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPKFGF

AT5G05550.1 sequence-specific DNA binding transcription factors; e value: 1.9e-14; % identity: 40
Query:  KRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESAS-AASSWPLYHRLHLLL
        + D WSE A +TL++A+ +++V  N   L+ +DW+DVA  V+SR       KT  QCKN+++++KK+Y++E A  + S+W  Y+RL +L+
Subjt:  KRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESAS-AASSWPLYHRLHLLL

AT5G05550.2 sequence-specific DNA binding transcription factors; e value: 1.9e-14; % identity: 40
Query:  KRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESAS-AASSWPLYHRLHLLL
        + D WSE A +TL++A+ +++V  N   L+ +DW+DVA  V+SR       KT  QCKN+++++KK+Y++E A  + S+W  Y+RL +L+
Subjt:  KRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESAS-AASSWPLYHRLHLLL


Sequences
CDS sequence
ATGGAGATGAAACCCAGTCCCTCATCCCCACCAACCTCACAAACCTCCCCTTCTCTTCTAATTAACCACCACCTTCAGCTTCACCTTCCCGACGACAAAAAAAACCCTGT
TTCCACCGGTGACCGCCTAAAACGAGATGAATGGAGCGAAGGAGCGGTTTCCACCCTCCTACAAGCCTACGAATCAAAATGGGTTCTACGAAACAGAGCCAAATTGAAAG
GCCATGATTGGGAAGATGTGGCCCGCCATGTCTCTTCAAGAGCTAATTTTACCAAATCTCCCAAAACTCAAACTCAGTGTAAGAATAAAATTGAGTCCATGAAAAAAAGG
TACCGTTCTGAATCTGCCTCCGCCGCCTCTTCCTGGCCTTTGTACCACCGTCTTCATCTCTTGCTCAGGGGAAACACTACGCTCACTCCACCCCCACCCCCACCCCCACC
ACCACCGCCGCCGCCGCCCCCATCCTCTCACTCTCCACCACCGCCACTTATTCTCGTCGATCCTCCACCTCCCTCTTCACCGCCGTTTCTTCCGCCCCAAAATTCCCATG
GATCCAATGGTGTTGATAGGATTAATCCTAAGTTTGGATTTTGGGAGATGGAGAGATAA
mRNA sequence
ATGGAGATGAAACCCAGTCCCTCATCCCCACCAACCTCACAAACCTCCCCTTCTCTTCTAATTAACCACCACCTTCAGCTTCACCTTCCCGACGACAAAAAAAACCCTGT
TTCCACCGGTGACCGCCTAAAACGAGATGAATGGAGCGAAGGAGCGGTTTCCACCCTCCTACAAGCCTACGAATCAAAATGGGTTCTACGAAACAGAGCCAAATTGAAAG
GCCATGATTGGGAAGATGTGGCCCGCCATGTCTCTTCAAGAGCTAATTTTACCAAATCTCCCAAAACTCAAACTCAGTGTAAGAATAAAATTGAGTCCATGAAAAAAAGG
TACCGTTCTGAATCTGCCTCCGCCGCCTCTTCCTGGCCTTTGTACCACCGTCTTCATCTCTTGCTCAGGGGAAACACTACGCTCACTCCACCCCCACCCCCACCCCCACC
ACCACCGCCGCCGCCGCCCCCATCCTCTCACTCTCCACCACCGCCACTTATTCTCGTCGATCCTCCACCTCCCTCTTCACCGCCGTTTCTTCCGCCCCAAAATTCCCATG
GATCCAATGGTGTTGATAGGATTAATCCTAAGTTTGGATTTTGGGAGATGGAGAGATAA
Protein sequence
MEMKPSPSSPPTSQTSPSLLINHHLQLHLPDDKKNPVSTGDRLKRDEWSEGAVSTLLQAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKR
YRSESASAASSWPLYHRLHLLLRGNTTLTPPPPPPPPPPPPPPPSSHSPPPPLILVDPPPPSSPPFLPPQNSHGSNGVDRINPKFGFWEMER
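
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code, and the helper name `translate` is hypothetical. The two short fragments used in the example are copied from the start and end of the CDS sequence above.

```python
from itertools import product

# Standard genetic code (assumed), written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as they appear on the page.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt and last 15 nt of the CDS sequence listed above.
    print(translate("ATGGAGATGAAACCCAGTCCCTCATCCCCA"))
    print(translate("GAGATGGAGAGATAA"))
```

To check the whole record, paste the full CDS sequence into `translate` and compare the result with the protein sequence after joining its wrapped lines.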