; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018794 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018794
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTranscription factor Pur-alpha 1
Genome locationchr5:34569994..34571594
RNA-Seq ExpressionLag0018794
SyntenyLag0018794
Gene Ontology termsGO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054521.1 transcription elongation factor B polypeptide 3 isoform X2 [Cucumis melo var. makuwa]3.1e-0944.83Show/hide
Query:  HLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSS
        H   SE GRK+ IEER G R+ +L++++GT AW+RDCL+A  +  NP  FWRRR LE  ++      N  GR  L +  S  GR  S
Subjt:  HLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSS

RVW29265.1 hypothetical protein CK203_113585 [Vitis vinifera]3.5e-0526.32Show/hide
Query:  MDRWKKVNIDWKVF--HLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGR
        M R  KV+++ K+F   L    DG+   + E        L  +     W+ + L    K ++  GF+R+ + ++++   +V  N  GRF  LS  ++  +
Subjt:  MDRWKKVNIDWKVF--HLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGR

Query:  KSSICIPEGQNRRGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKP-----NSFAKIVR-NGPKKGNEM-IEMLAEKCG
         + + IPEG+  RGW  L+ A++ +L  P +        + EKG +  + E  + K       SFA +VR  G ++G  + +    E CG
Subjt:  KSSICIPEGQNRRGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKP-----NSFAKIVR-NGPKKGNEM-IEMLAEKCG

TYK14183.1 hypothetical protein E5676_scaffold8046G00070 [Cucumis melo var. makuwa]1.2e-2139.31Show/hide
Query:  KVNIDWKVFHLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIP
        K+ ID K F +      RK+ IEER G    +L++D GT AW+RDCL+    + N   FW+RR LE A+IFFQVL N+KGRF +LSLES + RK+ I IP
Subjt:  KVNIDWKVFHLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIP

Query:  EGQNRRGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKPNSFAKIVRNGPKKGNEMIEMLAE
        EG    GW SL   ++ LLP         +G + +K     Q  +         K+VR    +  + +E   E
Subjt:  EGQNRRGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKPNSFAKIVRNGPKKGNEMIEMLAE

XP_042413382.1 transcription factor Pur-alpha 1-like isoform X1 [Zingiber officinale]3.5e-0526.45Show/hide
Query:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI
        K +  + K+F+    E+  GR + I E+  +  S + V V   AW  +       ++  + F +  KL++   FF +  NK+GRF  +S  S    +S+I
Subjt:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI

Query:  CIPEGQNRRGWGSLEMAMTEL
         +P G +  GW +    + E+
Subjt:  CIPEGQNRRGWGSLEMAMTEL

XP_042418063.1 transcription factor Pur-alpha 1-like isoform X1 [Zingiber officinale]3.5e-0526.45Show/hide
Query:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI
        K +  + K+F+    E+  GR + I E+  +  S + V V   AW  +       ++  + F +  KL++   FF +  NK+GRF  +S  S    +S+I
Subjt:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI

Query:  CIPEGQNRRGWGSLEMAMTEL
         +P G +  GW +    + E+
Subjt:  CIPEGQNRRGWGSLEMAMTEL

TrEMBL top hitse value%identityAlignment
A0A0A0KB99 Uncharacterized protein9.3e-1245.98Show/hide
Query:  LDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIPEGQNRRGWGSLEMAMTELL
        +++D GT A +RDCL     + NP  FW RR L+ A+IFFQVL N +GRF +LSLES +G+K+ I I +G   +G  S    ++ LL
Subjt:  LDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIPEGQNRRGWGSLEMAMTELL

A0A0A0KGZ5 Uncharacterized protein4.8e-0844.87Show/hide
Query:  GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRK
        GRK+ IEE  G R  +L++++G  AW+ DCL+A  +  NP GFW RR LE   IFFQ            SLE  RG K
Subjt:  GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRK

A0A0A0LY03 Uncharacterized protein1.3e-1640.71Show/hide
Query:  RKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIPEGQNRRGWGS-----LE
        RK+ IEER G +  +LD+D+GT  W+ D L A   +    G W+R+KL+  +IFFQVL N   RF ++SL+S +GRK+ + I EG    GW S     L 
Subjt:  RKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIPEGQNRRGWGS-----LE

Query:  MAMTE----LLPPPKAPKRLENGLLKEKGS-----EGDQL
        M   E    LL P  A ++  N +   KGS     EG QL
Subjt:  MAMTE----LLPPPKAPKRLENGLLKEKGS-----EGDQL

A0A5A7ULV4 Transcription elongation factor B polypeptide 3 isoform X21.5e-0944.83Show/hide
Query:  HLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSS
        H   SE GRK+ IEER G R+ +L++++GT AW+RDCL+A  +  NP  FWRRR LE  ++      N  GR  L +  S  GR  S
Subjt:  HLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSS

A0A5D3CQQ0 Uncharacterized protein5.8e-2239.31Show/hide
Query:  KVNIDWKVFHLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIP
        K+ ID K F +      RK+ IEER G    +L++D GT AW+RDCL+    + N   FW+RR LE A+IFFQVL N+KGRF +LSLES + RK+ I IP
Subjt:  KVNIDWKVFHLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIP

Query:  EGQNRRGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKPNSFAKIVRNGPKKGNEMIEMLAE
        EG    GW SL   ++ LLP         +G + +K     Q  +         K+VR    +  + +E   E
Subjt:  EGQNRRGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKPNSFAKIVRNGPKKGNEMIEMLAE

SwissProt top hitse value%identityAlignment
Q9SKZ1 Transcription factor Pur-alpha 19.6e-0624.59Show/hide
Query:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI
        K + ++ K+F+    E+  GR + I E+  +  S + V     +W  D       SE    F +  +L++ + +F +  N++GRF  +S  S    +S+I
Subjt:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI

Query:  CIPEGQN-RRGWGSLEMAMTEL
         +P G +   GW +    + E+
Subjt:  CIPEGQN-RRGWGSLEMAMTEL

Arabidopsis top hitse value%identityAlignment
AT2G32080.1 purin-rich alpha 16.8e-0724.59Show/hide
Query:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI
        K + ++ K+F+    E+  GR + I E+  +  S + V     +W  D       SE    F +  +L++ + +F +  N++GRF  +S  S    +S+I
Subjt:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI

Query:  CIPEGQN-RRGWGSLEMAMTEL
         +P G +   GW +    + E+
Subjt:  CIPEGQN-RRGWGSLEMAMTEL

AT2G32080.2 purin-rich alpha 16.8e-0724.59Show/hide
Query:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI
        K + ++ K+F+    E+  GR + I E+  +  S + V     +W  D       SE    F +  +L++ + +F +  N++GRF  +S  S    +S+I
Subjt:  KKVNIDWKVFHLCKSED--GRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSI

Query:  CIPEGQN-RRGWGSLEMAMTEL
         +P G +   GW +    + E+
Subjt:  CIPEGQN-RRGWGSLEMAMTEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCGGTGGAAGAAGGTGAACATCGACTGGAAGGTGTTTCACCTTTGCAAGAGTGAGGATGGGAGGAAGGTACTTATTGAGGAGAGGCGTGGATCGAGAACGAGTCG
CCTGGACGTGGACGTGGGTACTTTCGCTTGGATTAGGGATTGTCTTATGGCAGTGACGAAGTCAGAGAACCCAAATGGATTCTGGAGAAGGAGGAAACTAGAGGCTGCGT
TAATATTCTTTCAGGTGCTATCCAACAAGAAGGGAAGGTTCGGGCTTCTTTCTCTAGAATCGAATAGGGGAAGGAAATCTTCTATCTGCATCCCCGAGGGTCAGAACCGC
AGAGGTTGGGGGTCGCTTGAGATGGCCATGACAGAATTGCTCCCCCCTCCAAAGGCACCCAAACGGCTCGAAAATGGGTTACTTAAAGAGAAAGGAAGTGAAGGAGACCA
GCTTGAGGAAGGGGTTCGAAAACCCAACTCTTTTGCGAAAATAGTGAGAAATGGACCTAAGAAAGGAAACGAGATGATCGAGATGCTAGCTGAAAAGTGCGGTGGGCTTA
CAGAGGAAGGTGAAATTAGAGGGAGGGAGCTACTTTCCGGTGAGGTTTGTGGCCGGAAACGAAAATATCGAAAAGAACTTCGGCGAGATTTCAGCAGCCCCATGAGCACC
GGCGAGGGTTGTCCTTCAGCTGCCAAAGTCGGCAGTCTTTCACGGATTGATGGGTCGGCTGGGCCTGAGAAAGTCACCCTCAAATTGACAGAAAAGCCTTTTCTCCCTAT
AACATACTCGTGTAGGGAACCTTTTAAGGATTACTTACCCGCGGGGATTAATAATATCGAGCCTGAGATGAATGGGCTGGATGGGCGTGGGGCTTTATCTTCACATAGGT
CCAGTAAAGATAAAGGGAAGGTTGTGTATAATTATGATAGTGAGAATCTCGGGCTGGAAAATGAGCATCCAATTACAAACATCACTCTGATGCAAGCAGAATTCGTGGAG
GTAAGGGGGAAGAAGAATAGGACTTGGAATGGACAACGAACTTCAATCCAATTGAGGAAGAATTTGGCTCAGACCCATCTATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCGGTGGAAGAAGGTGAACATCGACTGGAAGGTGTTTCACCTTTGCAAGAGTGAGGATGGGAGGAAGGTACTTATTGAGGAGAGGCGTGGATCGAGAACGAGTCG
CCTGGACGTGGACGTGGGTACTTTCGCTTGGATTAGGGATTGTCTTATGGCAGTGACGAAGTCAGAGAACCCAAATGGATTCTGGAGAAGGAGGAAACTAGAGGCTGCGT
TAATATTCTTTCAGGTGCTATCCAACAAGAAGGGAAGGTTCGGGCTTCTTTCTCTAGAATCGAATAGGGGAAGGAAATCTTCTATCTGCATCCCCGAGGGTCAGAACCGC
AGAGGTTGGGGGTCGCTTGAGATGGCCATGACAGAATTGCTCCCCCCTCCAAAGGCACCCAAACGGCTCGAAAATGGGTTACTTAAAGAGAAAGGAAGTGAAGGAGACCA
GCTTGAGGAAGGGGTTCGAAAACCCAACTCTTTTGCGAAAATAGTGAGAAATGGACCTAAGAAAGGAAACGAGATGATCGAGATGCTAGCTGAAAAGTGCGGTGGGCTTA
CAGAGGAAGGTGAAATTAGAGGGAGGGAGCTACTTTCCGGTGAGGTTTGTGGCCGGAAACGAAAATATCGAAAAGAACTTCGGCGAGATTTCAGCAGCCCCATGAGCACC
GGCGAGGGTTGTCCTTCAGCTGCCAAAGTCGGCAGTCTTTCACGGATTGATGGGTCGGCTGGGCCTGAGAAAGTCACCCTCAAATTGACAGAAAAGCCTTTTCTCCCTAT
AACATACTCGTGTAGGGAACCTTTTAAGGATTACTTACCCGCGGGGATTAATAATATCGAGCCTGAGATGAATGGGCTGGATGGGCGTGGGGCTTTATCTTCACATAGGT
CCAGTAAAGATAAAGGGAAGGTTGTGTATAATTATGATAGTGAGAATCTCGGGCTGGAAAATGAGCATCCAATTACAAACATCACTCTGATGCAAGCAGAATTCGTGGAG
GTAAGGGGGAAGAAGAATAGGACTTGGAATGGACAACGAACTTCAATCCAATTGAGGAAGAATTTGGCTCAGACCCATCTATTGTGA
Protein sequenceShow/hide protein sequence
MDRWKKVNIDWKVFHLCKSEDGRKVLIEERRGSRTSRLDVDVGTFAWIRDCLMAVTKSENPNGFWRRRKLEAALIFFQVLSNKKGRFGLLSLESNRGRKSSICIPEGQNR
RGWGSLEMAMTELLPPPKAPKRLENGLLKEKGSEGDQLEEGVRKPNSFAKIVRNGPKKGNEMIEMLAEKCGGLTEEGEIRGRELLSGEVCGRKRKYRKELRRDFSSPMST
GEGCPSAAKVGSLSRIDGSAGPEKVTLKLTEKPFLPITYSCREPFKDYLPAGINNIEPEMNGLDGRGALSSHRSSKDKGKVVYNYDSENLGLENEHPITNITLMQAEFVE
VRGKKNRTWNGQRTSIQLRKNLAQTHLL