CuGenDBv2

Lag0013606 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0013606
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DNA binding;sequence-specific DNA binding transcription factors
Genome location: chr1:51566478..51566975
RNA-Seq Expression: Lag0013606
Synteny: Lag0013606
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAA0025593.1 hypothetical protein E6C27_scaffold253G00830 [Cucumis melo var. makuwa] | 5.5e-30 | 62.99
Query:  YQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGASSTSADAA-LTIGT-AFTELSGQL
        Y QS   EIAN  KRQH     +E S   SPG  FD D LSFGVFDFPW KD LI SKS+DWKF+DVFFTS ++GASST   A  LTIGT AFTE     
Subjt:  YQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGASSTSADAA-LTIGT-AFTELSGQL

Query:  VWCNLPDPWEKEYEAHA----APPLDGGGP-AEGMDCIWSSLLNQPLQQGSSAL
            LPDPWEK+YEA A     P LDGG    EGMDCIW S+LNQPLQQGSSAL
Subjt:  VWCNLPDPWEKEYEAHA----APPLDGGGP-AEGMDCIWSSLLNQPLQQGSSAL

KAG6594718.1 hypothetical protein SDJN03_11271, partial [Cucurbita argyrosperma subsp. sororia] | 3.6e-45 | 63.89
Query:  MDRPSSKRPRE----SSSSAAALSADVTYQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFD
        MDR ++KRPRE    SSSS+++ SA V  +QS   EI NGEKRQH++G           GRLFDLDLSFGVFDFPW KDSLI SK EDWK +DVFFTSFD
Subjt:  MDRPSSKRPRE----SSSSAAALSADVTYQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFD

Query:  GASSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL-------DGGGPA-EGMDCIWSSLLNQPLQQGSSA
        G  ST+AD AL      TE S QLVWCNLPDPWE+EYEAH AP         DG GPA EGMDC+W+SLLNQPLQQGSS+
Subjt:  GASSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL-------DGGGPA-EGMDCIWSSLLNQPLQQGSSA

KAG6603846.1 hypothetical protein SDJN03_04455, partial [Cucurbita argyrosperma subsp. sororia] | 1.2e-37 | 61.18
Query:  MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFDGASSTSA
        MDR S+KRPRESS+  ++L         EI NG K+Q L GG+ + +    P   FD DLSFGVFDFPW K+SLI S+SEDWK DDVFFTSF    ST A
Subjt:  MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFDGASSTSA

Query:  DAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL--DGGG--PAEGMDCIWSSLLNQPLQQGSSAL
        DA  TIGT  +  SGQLVW NLPDPWE EYEA   P    DGGG    EG+DCIWSSLLNQPLQQGS AL
Subjt:  DAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL--DGGG--PAEGMDCIWSSLLNQPLQQGSSAL

KGN49015.1 hypothetical protein Csa_004415 [Cucumis sativus] | 8.8e-36 | 59.66
Query:  MDRPSSKRPRESSSSAAALSADVTYQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGA
        M+R ++KRPRESSS ++    D  Y QS   EIAN  KRQH    A+E    +SPG  FD D LSFGVFDFPW KD LI SKS+DWKF+DVFFTS ++GA
Subjt:  MDRPSSKRPRESSSSAAALSADVTYQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGA

Query:  SSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPP----LDGGGP-AEGMDCIWSSLLNQPLQQGSSAL
        S+      LTIGTAFTE         LPDPWEK+YEA A PP    LDGG    E MDCIW S+LNQPLQQGSSAL
Subjt:  SSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPP----LDGGGP-AEGMDCIWSSLLNQPLQQGSSAL

XP_022977867.1 uncharacterized protein LOC111478029 [Cucurbita maxima] | 1.0e-36 | 60
Query:  MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFDGASSTSA
        M+R S+KRPRESS+  ++L         EI NG K+Q L GG+ + +    P   +D DLSFGVFDFPW K+SLI S+SEDWK DDVFFTSF    ST A
Subjt:  MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFDGASSTSA

Query:  DAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL--DGGG--PAEGMDCIWSSLLNQPLQQGSSAL
        DA  +IGT  +  SGQLVW NLPDPWE EYEA   P    DGGG    EGMDCIWSSLLNQPLQQGS AL
Subjt:  DAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL--DGGG--PAEGMDCIWSSLLNQPLQQGSSAL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KH39 Uncharacterized protein | 4.3e-36 | 59.66
Query:  MDRPSSKRPRESSSSAAALSADVTYQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGA
        M+R ++KRPRESSS ++    D  Y QS   EIAN  KRQH    A+E    +SPG  FD D LSFGVFDFPW KD LI SKS+DWKF+DVFFTS ++GA
Subjt:  MDRPSSKRPRESSSSAAALSADVTYQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGA

Query:  SSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPP----LDGGGP-AEGMDCIWSSLLNQPLQQGSSAL
        S+      LTIGTAFTE         LPDPWEK+YEA A PP    LDGG    E MDCIW S+LNQPLQQGSSAL
Subjt:  SSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPP----LDGGGP-AEGMDCIWSSLLNQPLQQGSSAL

A0A5A7SLQ6 Uncharacterized protein | 2.7e-30 | 62.99
Query:  YQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGASSTSADAA-LTIGT-AFTELSGQL
        Y QS   EIAN  KRQH     +E S   SPG  FD D LSFGVFDFPW KD LI SKS+DWKF+DVFFTS ++GASST   A  LTIGT AFTE     
Subjt:  YQQS---EIANGEKRQHLDGGAREPSNNNSPGRLFDLD-LSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTS-FDGASSTSADAA-LTIGT-AFTELSGQL

Query:  VWCNLPDPWEKEYEAHA----APPLDGGGP-AEGMDCIWSSLLNQPLQQGSSAL
            LPDPWEK+YEA A     P LDGG    EGMDCIW S+LNQPLQQGSSAL
Subjt:  VWCNLPDPWEKEYEAHA----APPLDGGGP-AEGMDCIWSSLLNQPLQQGSSAL

A0A6J1BUL5 uncharacterized protein LOC111005508 | 1.4e-26 | 58.39
Query:  MDRPSSKRPRESSSSAAALSAD-VTYQQS-----EIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGA
        MDR ++KR RES S++AA S D V+Y++S     EI N  K+QH DGG        + GR  DLD    VFDFPW KDSLISKSEDWKF+DVFFTS D  
Subjt:  MDRPSSKRPRESSSSAAALSAD-VTYQQS-----EIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGA

Query:  SSTSADAALTIGTAFTELSGQLVWC---NLPDPWEKE
         +++AD  LTIGTAFTELSGQL+WC   +LPDP+E E
Subjt:  SSTSADAALTIGTAFTELSGQLVWC---NLPDPWEKE

A0A6J1IJM6 uncharacterized protein LOC111478029 | 5.0e-37 | 60
Query:  MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFDGASSTSA
        M+R S+KRPRESS+  ++L         EI NG K+Q L GG+ + +    P   +D DLSFGVFDFPW K+SLI S+SEDWK DDVFFTSF    ST A
Subjt:  MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLI-SKSEDWKFDDVFFTSFDGASSTSA

Query:  DAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL--DGGG--PAEGMDCIWSSLLNQPLQQGSSAL
        DA  +IGT  +  SGQLVW NLPDPWE EYEA   P    DGGG    EGMDCIWSSLLNQPLQQGS AL
Subjt:  DAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHAAPPL--DGGG--PAEGMDCIWSSLLNQPLQQGSSAL

A0A6P9ENN9 uncharacterized protein LOC108996550 isoform X2 | 3.5e-14 | 41.1
Query:  YQQSEIANGEKRQHLDGG-AREPSNNNSPGRLFD--LDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGASSTSADAALTIGTAFTELSGQLVWCNL
        +Q     N  K+Q L G   RE ++  S  +  D  + L  GVFDFPW KD +ISKSEDW+F+D F +S     +++  AA+       E SGQ + C  
Subjt:  YQQSEIANGEKRQHLDGG-AREPSNNNSPGRLFD--LDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGASSTSADAALTIGTAFTELSGQLVWCNL

Query:  PD--PW--EKEYEAHA--APPLDGGGPAEGMDCIWSSLLNQPLQQG
        P+  P+  E +++     A P   G   EG+DCIWSSLL+QPLQQG
Subjt:  PD--PW--EKEYEAHA--APPLDGGGPAEGMDCIWSSLLNQPLQQG

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G64800.1 DNA binding;sequence-specific DNA binding transcription factors | 1.1e-04 | 27.87
Query:  EKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGASSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHA
        +KRQH+   A    N     R    + S GVF+FPW K+S+IS S DW   +  F   D  +             F E+S           +  E+EA  
Subjt:  EKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGASSTSADAALTIGTAFTELSGQLVWCNLPDPWEKEYEAHA

Query:  APPLDGGGPAEGMDCIWSSLLN
                     +CIW+S+L+
Subjt:  APPLDGGGPAEGMDCIWSSLLN


Sequences
CDS sequence
ATGGATCGCCCCAGCTCCAAGCGGCCAAGAGAATCCTCATCATCAGCAGCGGCGTTATCGGCCGATGTAACATACCAGCAATCGGAGATCGCTAACGGAGAGAAGCGCCA
GCACCTCGACGGCGGCGCCCGAGAACCTTCTAATAATAATTCTCCTGGAAGATTGTTTGATTTGGATCTGTCGTTCGGCGTCTTTGATTTTCCCTGGTCCAAGGACAGCC
TGATTTCCAAATCTGAGGACTGGAAATTTGACGACGTTTTCTTCACGTCCTTCGATGGAGCGTCGTCCACTTCCGCCGATGCTGCGCTTACGATTGGAACGGCGTTTACC
GAGTTGTCGGGGCAATTGGTGTGGTGTAATCTGCCGGATCCTTGGGAGAAGGAGTACGAGGCACATGCGGCGCCGCCGTTGGATGGCGGAGGGCCGGCGGAGGGGATGGA
TTGTATTTGGAGCTCTCTGCTAAATCAACCGCTTCAACAAGGGAGTAGCGCGTTGTAG
mRNA sequence
ATGGATCGCCCCAGCTCCAAGCGGCCAAGAGAATCCTCATCATCAGCAGCGGCGTTATCGGCCGATGTAACATACCAGCAATCGGAGATCGCTAACGGAGAGAAGCGCCA
GCACCTCGACGGCGGCGCCCGAGAACCTTCTAATAATAATTCTCCTGGAAGATTGTTTGATTTGGATCTGTCGTTCGGCGTCTTTGATTTTCCCTGGTCCAAGGACAGCC
TGATTTCCAAATCTGAGGACTGGAAATTTGACGACGTTTTCTTCACGTCCTTCGATGGAGCGTCGTCCACTTCCGCCGATGCTGCGCTTACGATTGGAACGGCGTTTACC
GAGTTGTCGGGGCAATTGGTGTGGTGTAATCTGCCGGATCCTTGGGAGAAGGAGTACGAGGCACATGCGGCGCCGCCGTTGGATGGCGGAGGGCCGGCGGAGGGGATGGA
TTGTATTTGGAGCTCTCTGCTAAATCAACCGCTTCAACAAGGGAGTAGCGCGTTGTAG
Protein sequence
MDRPSSKRPRESSSSAAALSADVTYQQSEIANGEKRQHLDGGAREPSNNNSPGRLFDLDLSFGVFDFPWSKDSLISKSEDWKFDDVFFTSFDGASSTSADAALTIGTAFT
ELSGQLVWCNLPDPWEKEYEAHAAPPLDGGGPAEGMDCIWSSLLNQPLQQGSSAL