; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030123 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030123
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTranscription factor UPBEAT1-like
Genome locationchr8:44775599..44777052
RNA-Seq ExpressionLag0030123
SyntenyLag0030123
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044660 - Transcription factor IBH1-like
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039441.1 transcription factor UPBEAT1-like [Cucumis melo var. makuwa]6.2e-3270.87Show/hide
Query:  MSSKTNTLPNPLRNL-RRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLL
        MSSKTNTLPNPLRN  RRRTKRSL  SRRRFHRRH  A  RP      GDST      S+VS KL+ALQ+LIP QSAA HG+ RQ ++LFK+TADYIV+L
Subjt:  MSSKTNTLPNPLRNL-RRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLL

Query:  KTQVVILQKLVDFFGSGCSDTENAVVS
        KTQVVILQKLVDFFGS CSD+ +AVVS
Subjt:  KTQVVILQKLVDFFGSGCSDTENAVVS

KAG6578552.1 hypothetical protein SDJN03_23000, partial [Cucurbita argyrosperma subsp. sororia]7.9e-2772.88Show/hide
Query:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCS---SDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIV
        MS KTNT    L NLR  TKRSLPDSRRRFHRRH  +A R P+P  NG +TSS   S    DVSAKL+ALQ+LIPAQSA    E RQTEQLFKETADYIV
Subjt:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCS---SDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIV

Query:  LLKTQVVILQKLVDFFGS
        LLKTQVVILQKLVDFFGS
Subjt:  LLKTQVVILQKLVDFFGS

KAG6607102.1 hypothetical protein SDJN03_00444, partial [Cucurbita argyrosperma subsp. sororia]7.9e-1160.87Show/hide
Query:  CSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAV
        CSSDVSA+L+ALQ+LIP Q+    G  R++E+LF+ETADYIVLLK QV +LQ+L++F+GS   + ENAV
Subjt:  CSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAV

KAG7016112.1 hypothetical protein SDJN02_21216, partial [Cucurbita argyrosperma subsp. argyrosperma]4.6e-2772.88Show/hide
Query:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCS---SDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIV
        MS KTNT    L NLR  TKRSLPDSRRRFHRRH  +A R P+P  NG++TSS   S    DVSAKL+ALQ+LIPAQSA    E RQTEQLFKETADYIV
Subjt:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCS---SDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIV

Query:  LLKTQVVILQKLVDFFGS
        LLKTQVVILQKLVDFFGS
Subjt:  LLKTQVVILQKLVDFFGS

XP_022993697.1 uncharacterized protein LOC111489609 [Cucurbita maxima]2.2e-2164.23Show/hide
Query:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSS--------DVSAKLDALQTLIPAQSAAAHGEARQTEQLFKET
        M+ KT T    L NLR RTKRSLPDS           A R P+P   G++TSS  CSS        DVSAKL+ALQ+LIPAQSA    EARQTEQLFKET
Subjt:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSS--------DVSAKLDALQTLIPAQSAAAHGEARQTEQLFKET

Query:  ADYIVLLKTQVVILQKLVDFFGS
        ADYIVLLKTQVVILQKLVDFFGS
Subjt:  ADYIVLLKTQVVILQKLVDFFGS

TrEMBL top hitse value%identityAlignment
A0A2N9EMQ7 Uncharacterized protein4.2e-1044.72Show/hide
Query:  SKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQ
        S T   P    +  RR     P SRRR   R C A        NNG      C  S+VS KL+AL+ LIPA     + E  + +QLFKETADYIVLL+TQ
Subjt:  SKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQ

Query:  VVILQKLVDFFGSGCSDTENAVV
        VVILQ+LV+ +GS  + ++N V+
Subjt:  VVILQKLVDFFGSGCSDTENAVV

A0A2P5AW87 Transcription factor UPBEAT-like protein8.5e-1146.77Show/hide
Query:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLK
        MSS T      +R  RRR +++   SR    RR C          NNG  T    CSS VS KL+AL+ LIP       GE  +T+QLF++TADYIVLL+
Subjt:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLK

Query:  TQVVILQKLVDFFGSGCSDTENAV
        TQVV+LQKL++F+GS  +D+E+AV
Subjt:  TQVVILQKLVDFFGSGCSDTENAV

A0A5D3BNP5 Transcription factor UPBEAT1-like3.0e-3270.87Show/hide
Query:  MSSKTNTLPNPLRNL-RRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLL
        MSSKTNTLPNPLRN  RRRTKRSL  SRRRFHRRH  A  RP      GDST      S+VS KL+ALQ+LIP QSAA HG+ RQ ++LFK+TADYIV+L
Subjt:  MSSKTNTLPNPLRNL-RRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLL

Query:  KTQVVILQKLVDFFGSGCSDTENAVVS
        KTQVVILQKLVDFFGS CSD+ +AVVS
Subjt:  KTQVVILQKLVDFFGSGCSDTENAVVS

A0A6I9UIV7 uncharacterized protein LOC1051773601.9e-1046.79Show/hide
Query:  RTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGC
        + + S P + RR  RR   +  RP      G + SS      VS KL+AL++LIP+Q A       + EQLFKETADYIVLLKTQV++LQKLVDF+G+  
Subjt:  RTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGC

Query:  SDTENAVVS
            +AV+S
Subjt:  SDTENAVVS

A0A6J1JTK8 uncharacterized protein LOC1114896091.1e-2164.23Show/hide
Query:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSS--------DVSAKLDALQTLIPAQSAAAHGEARQTEQLFKET
        M+ KT T    L NLR RTKRSLPDS           A R P+P   G++TSS  CSS        DVSAKL+ALQ+LIPAQSA    EARQTEQLFKET
Subjt:  MSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGDSTSSPCCSS--------DVSAKLDALQTLIPAQSAAAHGEARQTEQLFKET

Query:  ADYIVLLKTQVVILQKLVDFFGS
        ADYIVLLKTQVVILQKLVDFFGS
Subjt:  ADYIVLLKTQVVILQKLVDFFGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29370.1 unknown protein9.6e-0742.31Show/hide
Query:  CC---SSD----VSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAVV
        CC   SSD    V  KL AL++L+P       GE   TE+LF+ETA+YIV L+TQVV+L+KL++ + +     ++ V+
Subjt:  CC---SSD----VSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAVV

AT5G39240.1 unknown protein1.9e-0742.67Show/hide
Query:  SAKLDALQTLIPAQSAAAH-----------GEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAV
        S KL AL++L+P  S                   +TEQLF+ETADYIV L+ QVV+LQKL++ +GS     +N V
Subjt:  SAKLDALQTLIPAQSAAAH-----------GEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGCAGCTCAGTCAGTGGGTCCATTTTTGGGATCGATCGAAATGGCAGAAGCTCTAGCCATCCGTGAAGGAGTAGAATTAGCAAAATCAACGGGCATCCATCCAAT
TTGGGTGGAAACCGATTCGCTCCTAGTGTGGCAACTGTTTCAAGGAATCGAAAGCTATGTGAATGAAACTCAGAATATTATTGAGGACATCAAATTCTATAGAAATTCAG
GGGACTTGAAAGGGTTGCTTCTTACCTCCAGAGAGTCGAATCAATCTGCCAACGCGCTTGCCGTTCATGCCCGAAGCAATCGAGTCGATGGTGTCTGGATTGAAGAAGCC
CCCTCCTGGGCGATGCCGTTCGTCAGGTTCGACGGGGAGAAGGAAGAGCGGAAGACTGATGGTAAAATATTGAAAATCTCTTTCGTATATTTTCCTAGAGGATGTAGCAC
TGTAGCAGACGAATTGCAAAATTTGCTAGACAATCAGGCGACCAAATTCGCTCCGAGTTTTTGGAGACAAACAATGAGCTCCAAAACCAACACACTCCCGAATCCTCTCA
GAAACCTCAGAAGAAGAACCAAACGATCTCTACCCGATTCCCGCCGACGCTTCCACCGCCGACACTGCCCCGCCGCCGCCCGACCTCCCAGTCCCCTCAACAATGGCGAT
TCCACGTCGTCCCCCTGCTGCTCCTCCGACGTCTCCGCCAAGTTGGACGCCCTGCAGACCCTCATTCCGGCCCAATCCGCCGCCGCCCATGGCGAGGCCCGCCAAACGGA
GCAGTTGTTCAAGGAAACCGCCGATTACATTGTTCTATTGAAAACCCAGGTCGTCATTCTGCAGAAGCTGGTCGATTTTTTTGGATCCGGCTGCAGCGACACGGAAAACG
CCGTCGTTTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTAGCAGCTCAGTCAGTGGGTCCATTTTTGGGATCGATCGAAATGGCAGAAGCTCTAGCCATCCGTGAAGGAGTAGAATTAGCAAAATCAACGGGCATCCATCCAAT
TTGGGTGGAAACCGATTCGCTCCTAGTGTGGCAACTGTTTCAAGGAATCGAAAGCTATGTGAATGAAACTCAGAATATTATTGAGGACATCAAATTCTATAGAAATTCAG
GGGACTTGAAAGGGTTGCTTCTTACCTCCAGAGAGTCGAATCAATCTGCCAACGCGCTTGCCGTTCATGCCCGAAGCAATCGAGTCGATGGTGTCTGGATTGAAGAAGCC
CCCTCCTGGGCGATGCCGTTCGTCAGGTTCGACGGGGAGAAGGAAGAGCGGAAGACTGATGGTAAAATATTGAAAATCTCTTTCGTATATTTTCCTAGAGGATGTAGCAC
TGTAGCAGACGAATTGCAAAATTTGCTAGACAATCAGGCGACCAAATTCGCTCCGAGTTTTTGGAGACAAACAATGAGCTCCAAAACCAACACACTCCCGAATCCTCTCA
GAAACCTCAGAAGAAGAACCAAACGATCTCTACCCGATTCCCGCCGACGCTTCCACCGCCGACACTGCCCCGCCGCCGCCCGACCTCCCAGTCCCCTCAACAATGGCGAT
TCCACGTCGTCCCCCTGCTGCTCCTCCGACGTCTCCGCCAAGTTGGACGCCCTGCAGACCCTCATTCCGGCCCAATCCGCCGCCGCCCATGGCGAGGCCCGCCAAACGGA
GCAGTTGTTCAAGGAAACCGCCGATTACATTGTTCTATTGAAAACCCAGGTCGTCATTCTGCAGAAGCTGGTCGATTTTTTTGGATCCGGCTGCAGCGACACGGAAAACG
CCGTCGTTTCATAG
Protein sequenceShow/hide protein sequence
MVAAQSVGPFLGSIEMAEALAIREGVELAKSTGIHPIWVETDSLLVWQLFQGIESYVNETQNIIEDIKFYRNSGDLKGLLLTSRESNQSANALAVHARSNRVDGVWIEEA
PSWAMPFVRFDGEKEERKTDGKILKISFVYFPRGCSTVADELQNLLDNQATKFAPSFWRQTMSSKTNTLPNPLRNLRRRTKRSLPDSRRRFHRRHCPAAARPPSPLNNGD
STSSPCCSSDVSAKLDALQTLIPAQSAAAHGEARQTEQLFKETADYIVLLKTQVVILQKLVDFFGSGCSDTENAVVS