; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026327 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026327
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontranscription elongation factor B polypeptide 3
Genome locationchr10:34889362..34889943
RNA-Seq ExpressionLag0026327
SyntenyLag0026327
Gene Ontology termsGO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0070449 - elongin complex (cellular component)
GO:0035529 - NADH pyrophosphatase activity (molecular function)
GO:0047631 - ADP-ribose diphosphatase activity (molecular function)
GO:0051287 - NAD binding (molecular function)
InterPro domainsIPR010684 - RNA polymerase II transcription factor SIII, subunit A


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582214.1 hypothetical protein SDJN03_22216, partial [Cucurbita argyrosperma subsp. sororia]1.3e-4357.74Show/hide
Query:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES
        +N+AIDS+RF+GDVGQ DL+FLEHILPHCTVDQLMH+E CSKGRDL+PVTNKLWK FYE+KFG+D  + V+E    KE+FRWKQ+YE KMKEL++K  E 
Subjt:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES

Query:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG
         +R IQN + EK RKQSRQ++I +I                 + KK K + ++C+VSS +N KRS+ G
Subjt:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG

KAG6582215.1 40S ribosomal protein S21-1, partial [Cucurbita argyrosperma subsp. sororia]8.7e-4359.17Show/hide
Query:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES
        +N+AIDS+R +GDVGQTDL+FLEHILPHCTVDQLMH+E CSKGRDL+PVTNKLWK FYERKFG D  + V+E    KE+FRWKQ+YE KMK L++K  E 
Subjt:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES

Query:  GERLIQNYQKEKARKQSRQIQILDI-PSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG
         +R IQN + EKARKQSRQ++I +I PS               + KK +   +VC+VSS +N KRS+ G
Subjt:  GERLIQNYQKEKARKQSRQIQILDI-PSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG

KGN49034.1 hypothetical protein Csa_004217 [Cucumis sativus]4.2e-4557.98Show/hide
Query:  MSQVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEKETFRWKQLYEA
        MS+V+K IP L++D  +N+AIDSV+F+GDVG TDL  LE IL HCT DQL+H+E CSKGRDL+P+TNKLWKNFYERKFGKDD D VV+ ETF+W  LY A
Subjt:  MSQVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEKETFRWKQLYEA

Query:  KMKELDKKERESGERLIQNYQKEKARKQSRQIQIL-DIPSLRNKKRRNFAKTGGNLS--------KKAKREAQVCEVSSLSNKKRSFE
        KMKEL+ + ++  +R+IQ+YQKEKARKQSRQI       SL + K     +T G  S        KKAKRE  V +VS+ SNK+ + E
Subjt:  KMKELDKKERESGERLIQNYQKEKARKQSRQIQIL-DIPSLRNKKRRNFAKTGGNLS--------KKAKREAQVCEVSSLSNKKRSFE

XP_022979571.1 uncharacterized protein LOC111479252 [Cucurbita maxima]1.1e-4560.12Show/hide
Query:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES
        +N+AIDS+RF+GDVGQTDL+FLEHILPHCTVDQLMH+E CSKGRDL+P+TNKLWK FYERKFGKDD + VV+    KE+FRWKQ+YE K+KEL+ K  E 
Subjt:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES

Query:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG
         +R I+N Q EKARKQSRQ++I +I                 + KK K   +VC+VSS +N KRS+ G
Subjt:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG

XP_038885873.1 uncharacterized protein LOC120076176 [Benincasa hispida]1.3e-4358.56Show/hide
Query:  QVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVV-----EKETFRWKQL
        +V+KI  S  +   INEAIDS+RF+GDVG TDL  LE ILPHCTVDQLMH+E  SKGRDL+PVT+KLWKNFYE+KFGK+D+D V+     +KE+F+WKQL
Subjt:  QVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVV-----EKETFRWKQL

Query:  YEAKMKELDKKERESGERLIQNYQKEKARKQSRQIQIL-DIPSLRNKKRRNFA-------KTGGNLSKKAKREAQVCEVSS
        YEAKM+ L+KK  E   R  QN QKE ARKQSR+I    D+ S  NKKRR+          T   + KK  REAQ+C+VSS
Subjt:  YEAKMKELDKKERESGERLIQNYQKEKARKQSRQIQIL-DIPSLRNKKRRNFA-------KTGGNLSKKAKREAQVCEVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KJR8 Nudix hydrolase domain-containing protein2.0e-4557.98Show/hide
Query:  MSQVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEKETFRWKQLYEA
        MS+V+K IP L++D  +N+AIDSV+F+GDVG TDL  LE IL HCT DQL+H+E CSKGRDL+P+TNKLWKNFYERKFGKDD D VV+ ETF+W  LY A
Subjt:  MSQVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEKETFRWKQLYEA

Query:  KMKELDKKERESGERLIQNYQKEKARKQSRQIQIL-DIPSLRNKKRRNFAKTGGNLS--------KKAKREAQVCEVSSLSNKKRSFE
        KMKEL+ + ++  +R+IQ+YQKEKARKQSRQI       SL + K     +T G  S        KKAKRE  V +VS+ SNK+ + E
Subjt:  KMKELDKKERESGERLIQNYQKEKARKQSRQIQIL-DIPSLRNKKRRNFAKTGGNLS--------KKAKREAQVCEVSSLSNKKRSFE

A0A1S4E1U0 transcription elongation factor B polypeptide 3 isoform X26.9e-3855.95Show/hide
Query:  IPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE-----KETFRWKQLYEAKM
        IPSL  D  +N+AID++RF+GDVG+TD+  LE ILPHCTVDQLMHVEK S+GRDL+PVT+KLWK FYER+FGK+ T  V+E     +  FRW QLYEAKM
Subjt:  IPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE-----KETFRWKQLYEAKM

Query:  KELDKKERESGERLIQNYQKEKARKQSRQIQILD-IPSLRNKKR-------RNFAKTGGNLSKKAKRE
        ++++K E ++ +R+ Q+Y KE ARKQSRQIQI   +P   NK+         N A T   + KKAK E
Subjt:  KELDKKERESGERLIQNYQKEKARKQSRQIQILD-IPSLRNKKR-------RNFAKTGGNLSKKAKRE

A0A5D3BGR1 Transcription elongation factor B polypeptide 3 isoform X26.9e-3855.95Show/hide
Query:  IPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE-----KETFRWKQLYEAKM
        IPSL  D  +N+AID++RF+GDVG+TD+  LE ILPHCTVDQLMHVEK S+GRDL+PVT+KLWK FYER+FGK+ T  V+E     +  FRW QLYEAKM
Subjt:  IPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE-----KETFRWKQLYEAKM

Query:  KELDKKERESGERLIQNYQKEKARKQSRQIQILD-IPSLRNKKR-------RNFAKTGGNLSKKAKRE
        ++++K E ++ +R+ Q+Y KE ARKQSRQIQI   +P   NK+         N A T   + KKAK E
Subjt:  KELDKKERESGERLIQNYQKEKARKQSRQIQILD-IPSLRNKKR-------RNFAKTGGNLSKKAKRE

A0A6J1IR58 uncharacterized protein LOC1114792525.3e-4660.12Show/hide
Query:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES
        +N+AIDS+RF+GDVGQTDL+FLEHILPHCTVDQLMH+E CSKGRDL+P+TNKLWK FYERKFGKDD + VV+    KE+FRWKQ+YE K+KEL+ K  E 
Subjt:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES

Query:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG
         +R I+N Q EKARKQSRQ++I +I                 + KK K   +VC+VSS +N KRS+ G
Subjt:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG

A0A6J1ITM2 uncharacterized protein LOC1114792433.6e-4257.74Show/hide
Query:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES
        +N+AIDS+RF+GDVGQTDL+FLEHILPHCTV+QLM +E  SKGRDL+PVTNKLWK FYERKFGKD  + V+E    KE+F+WKQ+YE K+KEL++K  E 
Subjt:  INEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVE----KETFRWKQLYEAKMKELDKKERES

Query:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG
         +R IQN Q EKARKQSRQ++I +I                 + KK+K   + C+VSS  N KRS+ G
Subjt:  GERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G42780.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684); Has 187 Blast hits to 186 proteins in 77 species: Archae - 0; Bacteria - 0; Metazoa - 104; Fungi - 29; Plants - 38; Viruses - 0; Other Eukaryotes - 16 (source: NCBI BLink).1.1e-2438.1Show/hide
Query:  QVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEK------ETFRWKQ
        ++TK  PSL  D  + +AID+V+++G VG  D   LE IL HCT++QL H+E  +   DLSP+T+K WK FY++ +G++D   ++E         F+W+ 
Subjt:  QVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEK------ETFRWKQ

Query:  LYEAKMKELDKKERESGERLIQNYQKEKARKQSRQIQILD--IPSLR-----NKKRRNFAKTGGNLSKKAKRE-AQVCEVSSLSNKKRS
        LYE K+  + +KE+E G RL + Y+ E  RKQSRQ ++     PS R     +    N      N+ KKAK +  +  EV +L+  KR+
Subjt:  LYEAKMKELDKKERESGERLIQNYQKEKARKQSRQIQILD--IPSLR-----NKKRRNFAKTGGNLSKKAKRE-AQVCEVSSLSNKKRS

AT2G42780.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684).1.1e-2438.1Show/hide
Query:  QVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEK------ETFRWKQ
        ++TK  PSL  D  + +AID+V+++G VG  D   LE IL HCT++QL H+E  +   DLSP+T+K WK FY++ +G++D   ++E         F+W+ 
Subjt:  QVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEK------ETFRWKQ

Query:  LYEAKMKELDKKERESGERLIQNYQKEKARKQSRQIQILD--IPSLR-----NKKRRNFAKTGGNLSKKAKRE-AQVCEVSSLSNKKRS
        LYE K+  + +KE+E G RL + Y+ E  RKQSRQ ++     PS R     +    N      N+ KKAK +  +  EV +L+  KR+
Subjt:  LYEAKMKELDKKERESGERLIQNYQKEKARKQSRQIQILD--IPSLR-----NKKRRNFAKTGGNLSKKAKRE-AQVCEVSSLSNKKRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGTAACTAAAATAATTCCTTCCTTACATAACGATGCATGGATTAATGAAGCCATCGATAGTGTGAGGTTTATGGGGGATGTTGGCCAAACCGATTTAGATTT
TCTAGAGCACATTTTGCCACATTGCACTGTTGACCAATTGATGCATGTCGAGAAATGTTCAAAGGGCAGAGATCTCAGTCCAGTAACCAACAAATTGTGGAAAAATTTCT
ACGAAAGAAAATTCGGCAAAGATGATACTGATTTTGTAGTTGAAAAAGAGACATTTCGATGGAAGCAATTGTATGAGGCGAAGATGAAAGAGTTGGACAAAAAGGAGAGG
GAGAGTGGAGAGCGATTGATCCAAAACTACCAGAAAGAAAAGGCTCGAAAACAAAGTCGTCAAATACAGATTCTTGATATTCCCTCTTTGAGGAATAAGAAACGAAGAAA
CTTTGCAAAAACAGGTGGCAACCTCTCCAAGAAGGCAAAAAGAGAAGCACAAGTTTGTGAGGTTTCCTCTTTGAGTAATAAGAAACGAAGCTTTGAAGGAAGTGTTGGAT
TTGGATGTAATACGAAAAGAGCAAGTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAAGTAACTAAAATAATTCCTTCCTTACATAACGATGCATGGATTAATGAAGCCATCGATAGTGTGAGGTTTATGGGGGATGTTGGCCAAACCGATTTAGATTT
TCTAGAGCACATTTTGCCACATTGCACTGTTGACCAATTGATGCATGTCGAGAAATGTTCAAAGGGCAGAGATCTCAGTCCAGTAACCAACAAATTGTGGAAAAATTTCT
ACGAAAGAAAATTCGGCAAAGATGATACTGATTTTGTAGTTGAAAAAGAGACATTTCGATGGAAGCAATTGTATGAGGCGAAGATGAAAGAGTTGGACAAAAAGGAGAGG
GAGAGTGGAGAGCGATTGATCCAAAACTACCAGAAAGAAAAGGCTCGAAAACAAAGTCGTCAAATACAGATTCTTGATATTCCCTCTTTGAGGAATAAGAAACGAAGAAA
CTTTGCAAAAACAGGTGGCAACCTCTCCAAGAAGGCAAAAAGAGAAGCACAAGTTTGTGAGGTTTCCTCTTTGAGTAATAAGAAACGAAGCTTTGAAGGAAGTGTTGGAT
TTGGATGTAATACGAAAAGAGCAAGTTGTTGA
Protein sequenceShow/hide protein sequence
MSQVTKIIPSLHNDAWINEAIDSVRFMGDVGQTDLDFLEHILPHCTVDQLMHVEKCSKGRDLSPVTNKLWKNFYERKFGKDDTDFVVEKETFRWKQLYEAKMKELDKKER
ESGERLIQNYQKEKARKQSRQIQILDIPSLRNKKRRNFAKTGGNLSKKAKREAQVCEVSSLSNKKRSFEGSVGFGCNTKRASC