CuGenDBv2

Lag0008522 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008522
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: transcription factor bHLH149-like
Genome location: chr9:24359791..24363957
RNA-Seq Expression: Lag0008522
Synteny: Lag0008522
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044660 - Transcription factor IBH1-like
IPR044730 - Ribonuclease H-like domain, plant type
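The genome location field above uses a `seqid:start..end` string. A minimal sketch of how such a string could be split into its parts is shown here; the names `Region` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(seqid, start, end)


region = parse_location("chr9:24359791..24363957")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the locus spans 4167 bp on chr9.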


Homology
GenBank top hits | e value | % identity | Alignment
KAG6586210.1 Transcription factor basic helix-loop-helix 149, partial [Cucurbita argyrosperma subsp. sororia] | 2.6e-15 | 59.18
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILASRF QRL RR+RT KL  +   S  A  K+EK+ KL +VQRK KIL +L+PGCRKV   N LEETTDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

KAG7021037.1 Transcription factor bHLH, partial [Cucurbita argyrosperma subsp. argyrosperma] | 8.8e-16 | 61.22
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILASRF QRL RR+RT KL  +   S  A  K+EK+ KL AVQRK KIL +LIPGCRKV   N LEETTDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

XP_022937811.1 transcription factor bHLH149-like [Cucurbita moschata] | 1.2e-15 | 60.2
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILASRF QRL RR+RT KL  +   S  A  K+EK+ KL AVQRK KIL +L+PGCRKV   N LEETTDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

XP_022965958.1 transcription factor bHLH149-like [Cucurbita maxima] | 8.8e-16 | 59.18
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILASRF QRL RR+RT KL  +    + A  K+EK+ KL AVQRK KIL +L+PGCRKV   N LEETTDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

XP_038890546.1 transcription factor bHLH149 [Benincasa hispida] | 1.8e-16 | 62.63
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLT-CKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL AT KGRT   + ILASRF QRL RR+RT KL  CKP  S  A AK EK  KL AVQRK KIL +L+PGCRK+   N LEE TDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLT-CKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LJ14 Uncharacterized protein | 7.5e-13 | 54.64
Query:  TAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        TA  VL AT KGRT   + ILA+RF Q L RR+R  K   K L + +   K EK  KL AVQRK KIL +L+PGCRK+   N LEE TDYI  LEM+
Subjt:  TAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

A0A6J1DBJ7 uncharacterized protein LOC111018973 | 4.4e-13 | 38.21
Query:  MAGIAPLQSCLDVDLAKGRAVRQGVHMALDLGFCNLIVEVDSLRVYRVLQREMDDISELGNLLSSIPKF--DHLGREIRFHFIPHEGNKAAHGLARLASE
        +  I  L    DVD  +G AV +G+ +A++ GF    +E DSLR++ +L  +  D SE+G L S I  F   H  R + F F    GN  AH LA+LA  
Subjt:  MAGIAPLQSCLDVDLAKGRAVRQGVHMALDLGFCNLIVEVDSLRVYRVLQREMDDISELGNLLSSIPKF--DHLGREIRFHFIPHEGNKAAHGLARLASE

Query:  VNKEWVWIEDWPSEIADVVSADC
             +W+E+WP EI+ V++ DC
Subjt:  VNKEWVWIEDWPSEIADVVSADC

A0A6J1FBF1 transcription factor bHLH149-like | 5.6e-16 | 60.2
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILASRF QRL RR+RT KL  +   S  A  K+EK+ KL AVQRK KIL +L+PGCRKV   N LEETTDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

A0A6J1HQH2 transcription factor bHLH149-like | 4.3e-16 | 59.18
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILASRF QRL RR+RT KL  +    + A  K+EK+ KL AVQRK KIL +L+PGCRKV   N LEETTDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

A0A6J1KCR3 transcription factor bHLH149-like | 1.7e-12 | 54.08
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        ETA  VL A  KGRT   + ILA+RF QRL RR+RT K   +    VE      K+ KL AVQRK KIL +L+PGCRK+   N LEE TDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

SwissProt top hits | e value | % identity | Alignment
O80482 Transcription factor bHLH149 | 1.3e-09 | 43
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLT--CKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        +TA  VL A+ +G T   + ILASR   +L + ++  K T  CK    +    +     KL AV+RK KIL +L+PGCRKV + N L+E TDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLT--CKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

Q9C8Z9 Transcription factor bHLH148 | 6.2e-04 | 29.23
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGK------------LSAVQRKAKILDQLIPGCRKVYLSNFLEET
        E A   L  + +GRT   + ILA+R   +  +++R       P  +   ++ + +S K            +  V RK ++L +L+PGC K  +   LEE 
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGK------------LSAVQRKAKILDQLIPGCRKVYLSNFLEET

Query:  TDYIPTLEMK-HPQSPLGSLQAKSPQLPPP
        TDYI  LEM+    + L  L +     PPP
Subjt:  TDYIPTLEMK-HPQSPLGSLQAKSPQLPPP

Q9LSN7 Transcription factor bHLH147 | 1.0e-06 | 36.72
Query:  ETAYGVLTATTKGRTQ-----LILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAE-------KSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDY
        E A   L    +G+T      L  A +   R  +RQR +  T   L +    +K +       K+  L AVQRK K+L +L+PGCRK  L   LEETTDY
Subjt:  ETAYGVLTATTKGRTQ-----LILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAE-------KSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDY

Query:  IPTLEMKHPQSPLGSLQAKSPQLPPPPP
        I  +EM+  ++    L A S   PPP P
Subjt:  IPTLEMKHPQSPLGSLQAKSPQLPPPPP

Q9M9L6 Transcription factor bHLH150 | 1.2e-07 | 38.78
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        +TA  VL  T +G T   + IL SRF   L RR+R  K       ++  +  + +  KLSAV  + ++L  L+PGCR+  L   L+ET DYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

Arabidopsis top hits | e value | % identity | Alignment
AT1G09250.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | 9.1e-11 | 43
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLT--CKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        +TA  VL A+ +G T   + ILASR   +L + ++  K T  CK    +    +     KL AV+RK KIL +L+PGCRKV + N L+E TDYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLT--CKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

AT3G05800.1 AtBS1 (activation-tagged BRI1 suppressor 1)-interacting factor 1 | 8.5e-09 | 38.78
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK
        +TA  VL  T +G T   + IL SRF   L RR+R  K       ++  +  + +  KLSAV  + ++L  L+PGCR+  L   L+ET DYI  LEM+
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYIPTLEMK

AT3G06590.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | 4.4e-05 | 29.23
Query:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGK------------LSAVQRKAKILDQLIPGCRKVYLSNFLEET
        E A   L  + +GRT   + ILA+R   +  +++R       P  +   ++ + +S K            +  V RK ++L +L+PGC K  +   LEE 
Subjt:  ETAYGVLTATTKGRT---QLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGK------------LSAVQRKAKILDQLIPGCRKVYLSNFLEET

Query:  TDYIPTLEMK-HPQSPLGSLQAKSPQLPPP
        TDYI  LEM+    + L  L +     PPP
Subjt:  TDYIPTLEMK-HPQSPLGSLQAKSPQLPPP

AT3G17100.1 sequence-specific DNA binding transcription factors | 7.2e-08 | 36.72
Query:  ETAYGVLTATTKGRTQ-----LILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAE-------KSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDY
        E A   L    +G+T      L  A +   R  +RQR +  T   L +    +K +       K+  L AVQRK K+L +L+PGCRK  L   LEETTDY
Subjt:  ETAYGVLTATTKGRTQ-----LILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAE-------KSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDY

Query:  IPTLEMKHPQSPLGSLQAKSPQLPPPPP
        I  +EM+  ++    L A S   PPP P
Subjt:  IPTLEMKHPQSPLGSLQAKSPQLPPPPP

AT3G17100.2 sequence-specific DNA binding transcription factors | 7.2e-08 | 36.72
Query:  ETAYGVLTATTKGRTQ-----LILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAE-------KSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDY
        E A   L    +G+T      L  A +   R  +RQR +  T   L +    +K +       K+  L AVQRK K+L +L+PGCRK  L   LEETTDY
Subjt:  ETAYGVLTATTKGRTQ-----LILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAE-------KSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDY

Query:  IPTLEMKHPQSPLGSLQAKSPQLPPPPP
        I  +EM+  ++    L A S   PPP P
Subjt:  IPTLEMKHPQSPLGSLQAKSPQLPPPPP
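
The % identity values in the hit tables above are the share of alignment columns in which query and subject carry the same residue. In BLAST-style output the middle line of each alignment block prints a letter at identical columns, `+` at conservative substitutions, and a space elsewhere. The sketch below counts identities from such a middle line. The function name `identity_from_midline` is hypothetical, and the input is only the first 15 columns of the KAG6586210.1 alignment, so the result is not expected to reproduce the reported whole-alignment figure.

```python
def identity_from_midline(midline: str, aligned_length: int) -> tuple[int, float]:
    """Count identical columns in a BLAST-style midline.

    Letters mark identities; '+' and spaces are not identities.
    aligned_length is the number of alignment columns, gaps included.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(ch.isalpha() for ch in midline)
    return identical, 100.0 * identical / aligned_length


# First 15 alignment columns of the top GenBank hit (fragment only).
query_fragment = "ETAYGVLTATTKGRT"
midline_fragment = "ETA  VL A  KGRT"

identical, percent = identity_from_midline(midline_fragment, len(query_fragment))
print(f"{identical} identical of {len(query_fragment)} columns ({percent:.2f}%)")
```

Whitespace in the middle line is significant, so any copy of the alignment that collapses or trims spaces would need the column count taken from the query line, as done here.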


Sequences
CDS sequence
ATGGCGGGTATTGCTCCTCTCCAATCCTGTCTTGATGTTGATCTAGCTAAAGGACGGGCAGTGAGGCAGGGAGTGCATATGGCTCTAGACCTGGGGTTTTGCAACCTGAT
TGTTGAAGTGGACTCCCTGCGGGTCTATCGCGTTCTTCAGAGAGAAATGGATGATATTTCAGAGTTGGGGAACTTGTTGTCCAGCATTCCAAAATTTGACCATTTGGGGA
GGGAAATTAGATTCCATTTTATCCCGCATGAAGGTAACAAGGCTGCTCACGGACTCGCTCGACTAGCTTCTGAGGTGAACAAGGAGTGGGTATGGATTGAAGATTGGCCA
AGTGAGATTGCTGATGTAGTTTCTGCTGATTGTGTGCTCTTTACTAAATTTTTCAGCTTTCTTGAAACCGCATATGGCGTCCTCACCGCCACGACCAAGGGGAGGACACA
ATTGATTCTGGCGAGTCGATTCCACCAGAGGCTTGTGCGACGTCAGAGGACGAATAAATTGACCTGTAAACCGTTGCACTCGGTGGAGGCGAATGCGAAGGCTGAGAAAA
GTGGGAAGTTGTCGGCGGTGCAGAGGAAGGCAAAAATTCTCGACCAATTGATTCCGGGTTGCCGGAAAGTCTATCTCTCAAACTTTCTGGAAGAAACGACCGATTATATA
CCGACTTTAGAGATGAAGCATCCACAAAGCCCTCTGGGATCTCTTCAAGCCAAATCCCCTCAACTTCCTCCTCCACCGCCAACCTTGTTGTCTGCTGTTTCAAGAGATTG
CTTCGTTGACAAAAAGAGAACCTCGCCGCTTGATGTCATGGAGAATATGTTAGTACGAGATTTGAAATCTTTCCCGACTAATTGGAGTTTGAAAGCAAGTGTTCTTCGTA
AAGCTTCCTCCGACAGCCTCAACGCACGAATCAAAACCCACAAATCAAGACTCACAAGCATCCAAGCAACTCACGAATCGAAACCCACAACTCACGAATCGAGACCCAGA
TCTGGACGAGCAACTCACAAATCGAAACCCACAACTCACGAATCGAGCCCCAGATCAGGACGAGCATCCGAGCAAGGTCTCCGACGAGTAACGTCTTCGATTGCAAGAAA
CTCAAAGAAGAAAGAAACCCGCGTCGGACTGCTGCTGCTCGTGATTGCTAAGTCGCCGGAGTTCCACTTTCGACAGATCCGGCTGGAACGCTGCGTCATCGACGGCTCGG
AGTTGTCCCTCTCCAACGTGTTTCTCGACACATCCGGCTGGAAGTTTTAG
mRNA sequence
ATGGCGGGTATTGCTCCTCTCCAATCCTGTCTTGATGTTGATCTAGCTAAAGGACGGGCAGTGAGGCAGGGAGTGCATATGGCTCTAGACCTGGGGTTTTGCAACCTGAT
TGTTGAAGTGGACTCCCTGCGGGTCTATCGCGTTCTTCAGAGAGAAATGGATGATATTTCAGAGTTGGGGAACTTGTTGTCCAGCATTCCAAAATTTGACCATTTGGGGA
GGGAAATTAGATTCCATTTTATCCCGCATGAAGGTAACAAGGCTGCTCACGGACTCGCTCGACTAGCTTCTGAGGTGAACAAGGAGTGGGTATGGATTGAAGATTGGCCA
AGTGAGATTGCTGATGTAGTTTCTGCTGATTGTGTGCTCTTTACTAAATTTTTCAGCTTTCTTGAAACCGCATATGGCGTCCTCACCGCCACGACCAAGGGGAGGACACA
ATTGATTCTGGCGAGTCGATTCCACCAGAGGCTTGTGCGACGTCAGAGGACGAATAAATTGACCTGTAAACCGTTGCACTCGGTGGAGGCGAATGCGAAGGCTGAGAAAA
GTGGGAAGTTGTCGGCGGTGCAGAGGAAGGCAAAAATTCTCGACCAATTGATTCCGGGTTGCCGGAAAGTCTATCTCTCAAACTTTCTGGAAGAAACGACCGATTATATA
CCGACTTTAGAGATGAAGCATCCACAAAGCCCTCTGGGATCTCTTCAAGCCAAATCCCCTCAACTTCCTCCTCCACCGCCAACCTTGTTGTCTGCTGTTTCAAGAGATTG
CTTCGTTGACAAAAAGAGAACCTCGCCGCTTGATGTCATGGAGAATATGTTAGTACGAGATTTGAAATCTTTCCCGACTAATTGGAGTTTGAAAGCAAGTGTTCTTCGTA
AAGCTTCCTCCGACAGCCTCAACGCACGAATCAAAACCCACAAATCAAGACTCACAAGCATCCAAGCAACTCACGAATCGAAACCCACAACTCACGAATCGAGACCCAGA
TCTGGACGAGCAACTCACAAATCGAAACCCACAACTCACGAATCGAGCCCCAGATCAGGACGAGCATCCGAGCAAGGTCTCCGACGAGTAACGTCTTCGATTGCAAGAAA
CTCAAAGAAGAAAGAAACCCGCGTCGGACTGCTGCTGCTCGTGATTGCTAAGTCGCCGGAGTTCCACTTTCGACAGATCCGGCTGGAACGCTGCGTCATCGACGGCTCGG
AGTTGTCCCTCTCCAACGTGTTTCTCGACACATCCGGCTGGAAGTTTTAG
Protein sequence
MAGIAPLQSCLDVDLAKGRAVRQGVHMALDLGFCNLIVEVDSLRVYRVLQREMDDISELGNLLSSIPKFDHLGREIRFHFIPHEGNKAAHGLARLASEVNKEWVWIEDWP
SEIADVVSADCVLFTKFFSFLETAYGVLTATTKGRTQLILASRFHQRLVRRQRTNKLTCKPLHSVEANAKAEKSGKLSAVQRKAKILDQLIPGCRKVYLSNFLEETTDYI
PTLEMKHPQSPLGSLQAKSPQLPPPPPTLLSAVSRDCFVDKKRTSPLDVMENMLVRDLKSFPTNWSLKASVLRKASSDSLNARIKTHKSRLTSIQATHESKPTTHESRPR
SGRATHKSKPTTHESSPRSGRASEQGLRRVTSSIARNSKKKETRVGLLLLVIAKSPEFHFRQIRLERCVIDGSELSLSNVFLDTSGWKF
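
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 24 nt of the CDS sequence listed above.
cds_start = "ATGGCGGGTATTGCTCCTCTCCAATCCTGT"
cds_end = "GACACATCCGGCTGGAAGTTTTAG"

print(translate(cds_start))  # MAGIAPLQSC
print(translate(cds_end))    # DTSGWKF
```

Both fragments agree with the start (`MAGIAPLQSC`) and end (`DTSGWKF`) of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the same comparison to the whole protein.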