CuGenDBv2

Clc04G01540 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G01540
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: transcription elongation factor B polypeptide 3
Genome location: ClcChr04:4607357..4608083
RNA-Seq Expression: Clc04G01540
Synteny: Clc04G01540
Gene Ontology terms:
    GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
    GO:0070449 - elongin complex (cellular component)
    GO:0035529 - NADH pyrophosphatase activity (molecular function)
    GO:0047631 - ADP-ribose diphosphatase activity (molecular function)
    GO:0051287 - NAD binding (molecular function)
InterPro domains: IPR010684 - RNA polymerase II transcription factor SIII, subunit A
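The genome location above follows a `SEQID:START..END` pattern. As a minimal sketch, the record's location string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("ClcChr04:4607357..4608083")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```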


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582092.1 hypothetical protein SDJN03_22094, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.1e-46 | identity: 53.57%
Query:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSDF---VIERMKHKKESFKWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG++D N L RILPHCT++QLM IEN SKGRDLTPVT+KLWK FYE+K   D    V+E MKH KESFKWKQ+YE K++ELE+KA +
Subjt:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSDF---VIERMKHKKESFKWKQVYEAKMEELEKKAKK

Query:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK
        IE RYIQN + EKA+K+SRQ+  C            E  P+        +K+LKK K   +IC+V  + NN K+   GGT              SKI KK
Subjt:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK

Query:  AKREALGCIETKNLIAFRRNVMQK
        A++E L  IETKN IAFRRN +QK
Subjt:  AKREALGCIETKNLIAFRRNVMQK
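In BLAST-style pairwise output, the middle line of each alignment block conventionally shows a letter where the residues are identical, `+` where the substitution scores as positive, and a space otherwise. As a rough sketch under that convention, the last block of the alignment above can be tallied as follows. The function name `midline_stats` is hypothetical, and the midline string is copied from the block with its leading padding removed.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative substitutions
    return identities, positives, len(midline)


# Midline of the final block (query: AKREALGCIETKNLIAFRRNVMQK), padding stripped.
midline = "A++E L  IETKN IAFRRN +QK"
identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Summing these counts over every block of a hit and dividing identities by total columns gives a percent identity for that hit. Trailing spaces in a midline must be preserved for the column count to be right.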

KAG6582214.1 hypothetical protein SDJN03_22216, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.9e-48 | identity: 55.36%
Query:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSD---FVIERMKHKKESFKWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG  D N L  ILPHCT+DQLMHIEN SKGRDLTPVT+KLWK FYEKK   D    V+ERMKH KESF+WKQ+YE KM+ELE+KA +
Subjt:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSD---FVIERMKHKKESFKWKQVYEAKMEELEKKAKK

Query:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK
        IE RYIQN R EK +K+SRQ+  C            E  P+        +K+LKK K + +IC+V    NNK+  S+GG T             SKI KK
Subjt:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK

Query:  AKREALGCIETKNLIAFRRNVMQK
        A++E    IETKNLIAFRRN +QK
Subjt:  AKREALGCIETKNLIAFRRNVMQK

XP_022979571.1 uncharacterized protein LOC111479252 [Cucurbita maxima] | e value: 4.2e-47 | identity: 52.23%
Query:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG +D N L  ILPHCT+DQLMHIEN SKGRDLTP+T+KLWK FYE+   K+D + V++RMKH KESF+WKQ+YE K++ELE KA +
Subjt:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKKAKK

Query:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK
        +E RYI+N + EKA+K+SRQ+  C            E  P+        +K+LKK K   ++C+V  + NN K+   GGT              SKI KK
Subjt:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK

Query:  AKREALGCIETKNLIAFRRNVMQK
        A++E L  IETKNLIAFRRN +QK
Subjt:  AKREALGCIETKNLIAFRRNVMQK

XP_038885873.1 uncharacterized protein LOC120076176 [Benincasa hispida] | e value: 1.6e-75 | identity: 69.42%
Query:  MCEEVSKITTSFFNNLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKW
        MCEEVSKI +SF ++ SINEAIDSL+FLGDVGD+D ++L RILPHCT+DQLMHIENSSKGRDLTPVTDKLWKNFYEK   KNDSD VI++MK+KKESFKW
Subjt:  MCEEVSKITTSFFNNLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKW

Query:  KQVYEAKMEELEKKAKKIEARYIQNRRKEKAQKESRQIIFCGG-SSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTK
        KQ+YEAKME LEKKA +IEARY QN +KE A+K+SR+IIFC   SS  NKKRR EG      CNT E+KILKK  RE Q+C+V          S GGTTK
Subjt:  KQVYEAKMEELEKKAKKIEARYIQNRRKEKAQKESRQIIFCGG-SSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTK

Query:  PRQDTKSSKTKPSKIWKKAKREALGCIETKNLIAFRRNVMQK
        P       +TK SKI KKAKREAL CIETKN+IAFRRN MQK
Subjt:  PRQDTKSSKTKPSKIWKKAKREALGCIETKNLIAFRRNVMQK

XP_038885879.1 uncharacterized protein LOC120076185 [Benincasa hispida] | e value: 9.3e-71 | identity: 66.94%
Query:  MCEEVSKITTSFFNNLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKW
        MCEEVSKI +SF ++ SINEAID+L+FL DVGD+D ++L RILPHCT+DQL+HIENSSKGRDLT VTDKLWKNFY K   KND D  IERMK+KKESFKW
Subjt:  MCEEVSKITTSFFNNLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKW

Query:  KQVYEAKMEELEKKAKKIEARYIQNRRKEKAQKESRQIIFCGG-SSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTK
        KQ+YEAKME LEKKA +IEARY QN +KE A+K+SR+IIFC   SS  NKK R EG      CNT E+KILKK  RE Q+C+V          S GGTTK
Subjt:  KQVYEAKMEELEKKAKKIEARYIQNRRKEKAQKESRQIIFCGG-SSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTK

Query:  PRQDTKSSKTKPSKIWKKAKREALGCIETKNLIAFRRNVMQK
        P       +TK SKI KKAKREAL CIETKN+IAFRRN MQK
Subjt:  PRQDTKSSKTKPSKIWKKAKREALGCIETKNLIAFRRNVMQK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KJR8 Nudix hydrolase domain-containing protein | e value: 1.9e-37 | identity: 53.89%
Query:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKK
        N  +N+AIDS+KFLGDVGD+D   L  IL HCT DQL+HIEN SKGRDLTP+T+KLWKNFYE+   K+D D V+     K E+FKW  +Y AKM+ELE +
Subjt:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKK

Query:  AKKIEARYIQNRRKEKAQKESRQIIFCGG-SSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQS
        AKKIE R IQ+ +KEKA+K+SRQI+FCG   S ++ K     +   F  NT ++  LKK+KRE  + +V  T +NK+  S
Subjt:  AKKIEARYIQNRRKEKAQKESRQIIFCGG-SSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQS

A0A1S4E1U0 transcription elongation factor B polypeptide 3 isoform X2 | e value: 1.6e-36 | identity: 53.09%
Query:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKK
        +L +N+AID+++FLGDVG++D +LL RILPHCT+DQLMH+E SS+GRDLTPVTDKLWK FYE+   K  +  VIERM+ K+ +F+W Q+YEAKM+++EK 
Subjt:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKK

Query:  AKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKRE
          K   R  Q+  KE A+K+SRQI  C    P + KR F G    +     + KILKK+K E
Subjt:  AKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKRE

A0A5D3BGR1 Transcription elongation factor B polypeptide 3 isoform X2 | e value: 1.6e-36 | identity: 53.09%
Query:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKK
        +L +N+AID+++FLGDVG++D +LL RILPHCT+DQLMH+E SS+GRDLTPVTDKLWK FYE+   K  +  VIERM+ K+ +F+W Q+YEAKM+++EK 
Subjt:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKK

Query:  AKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKRE
          K   R  Q+  KE A+K+SRQI  C    P + KR F G    +     + KILKK+K E
Subjt:  AKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKRE

A0A6J1IR58 uncharacterized protein LOC111479252 | e value: 2.0e-47 | identity: 52.23%
Query:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG +D N L  ILPHCT+DQLMHIEN SKGRDLTP+T+KLWK FYE+   K+D + V++RMKH KESF+WKQ+YE K++ELE KA +
Subjt:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMKHKKESFKWKQVYEAKMEELEKKAKK

Query:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK
        +E RYI+N + EKA+K+SRQ+  C            E  P+        +K+LKK K   ++C+V  + NN K+   GGT              SKI KK
Subjt:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK

Query:  AKREALGCIETKNLIAFRRNVMQK
        A++E L  IETKNLIAFRRN +QK
Subjt:  AKREALGCIETKNLIAFRRNVMQK

A0A6J1ITM2 uncharacterized protein LOC111479243 | e value: 3.8e-46 | identity: 53.12%
Query:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSDF---VIERMKHKKESFKWKQVYEAKMEELEKKAKK
        +N+AIDSL+FLGDVG +D N L  ILPHCT++QLM IENSSKGRDLTPVT+KLWK FYE+K   D    V+E MKH KESFKWKQ+YE K++ELE+KA +
Subjt:  INEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSDF---VIERMKHKKESFKWKQVYEAKMEELEKKAKK

Query:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK
        IE RYIQN + EKA+K+SRQ+  C            E  P+        +K+LKKSK   + C+V  + +N K+   GGT              SKI KK
Subjt:  IEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKK

Query:  AKREALGCIETKNLIAFRRNVMQK
        A++E L  IETKN+IAFRRN +QK
Subjt:  AKREALGCIETKNLIAFRRNVMQK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G42780.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684); Has 187 Blast hits to 186 proteins in 77 species: Archae - 0; Bacteria - 0; Metazoa - 104; Fungi - 29; Plants - 38; Viruses - 0; Other Eukaryotes - 16 (source: NCBI BLink). | e value: 5.5e-21 | identity: 32.05%
Query:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMK-HKKESFKWKQVYEAKMEELEK
        +L + +AID++K++G VG  DF LL +IL HCT++QL HIE+++   DL+P+TDK WK FY+K   + D   +IE ++ +K   FKW+ +YE K+  +++
Subjt:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMK-HKKESFKWKQVYEAKMEELEK

Query:  KAKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETK--ILKKSK----REEQICEVPFTINNKKKQSFGGTTKPRQDTKSS
        K K++  R  +  + E  +K+SRQ   C  + P   KR F G  +  G N    K  I+KK+K    + +++  +     N  ++SF  +T  +    ++
Subjt:  KAKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETK--ILKKSK----REEQICEVPFTINNKKKQSFGGTTKPRQDTKSS

Query:  KTKPSKIWKKAKREALGCIETKNLIAFRRNVMQK
            S+      R         NL   ++N +QK
Subjt:  KTKPSKIWKKAKREALGCIETKNLIAFRRNVMQK

AT2G42780.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684). | e value: 9.4e-21 | identity: 33.5%
Query:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMK-HKKESFKWKQVYEAKMEELEK
        +L + +AID++K++G VG  DF LL +IL HCT++QL HIE+++   DL+P+TDK WK FY+K   + D   +IE ++ +K   FKW+ +YE K+  +++
Subjt:  NLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEK---KNDSDFVIERMK-HKKESFKWKQVYEAKMEELEK

Query:  KAKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETK--ILKKSK----REEQICEVPFTINNKKKQSFGGTTKPRQDTKSS
        K K++  R  +  + E  +K+SRQ   C  + P   KR F G  +  G N    K  I+KK+K    + +++  +     N  ++SF  +   R    ++
Subjt:  KAKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETK--ILKKSK----REEQICEVPFTINNKKKQSFGGTTKPRQDTKSS

Query:  KTKPSK
            S+
Subjt:  KTKPSK


Sequences
CDS sequence
ATGTGTGAAGAAGTAAGTAAAATAACTACCTCCTTTTTTAATAATCTTTCCATTAACGAAGCTATAGATAGTTTGAAGTTTCTTGGAGATGTTGGGGATTCTGATTTCAA
TCTTCTAAACCGTATTTTGCCACATTGTACTATTGACCAATTGATGCATATAGAGAACTCTTCCAAAGGAAGAGATCTCACACCTGTAACTGACAAGTTGTGGAAAAACT
TCTATGAAAAAAAAAACGATTCTGATTTTGTGATTGAGAGGATGAAACATAAGAAAGAATCATTTAAATGGAAGCAAGTGTATGAAGCAAAGATGGAAGAGTTAGAAAAG
AAGGCCAAGAAAATTGAGGCTCGATATATACAAAACCGTCGAAAGGAAAAAGCTCAAAAAGAAAGCCGTCAAATAATATTTTGTGGGGGTTCTTCTCCAATCAATAAGAA
ACGAAGATTTGAAGGAAAACCGAATGAGTTTGGATGCAATACCAACGAGACCAAGATTTTGAAGAAGTCCAAGAGAGAAGAACAAATTTGTGAAGTTCCATTTACGATCA
ATAATAAGAAGAAACAAAGCTTTGGAGGGACAACCAAACCTAGACAAGATACTAAGTCAAGCAAGACTAAGCCAAGCAAGATATGGAAGAAGGCAAAGAGAGAAGCGTTG
GGGTGTATAGAGACGAAGAACCTAATAGCTTTTCGAAGAAATGTGATGCAAAAGTAG
mRNA sequence
ATGTGTGAAGAAGTAAGTAAAATAACTACCTCCTTTTTTAATAATCTTTCCATTAACGAAGCTATAGATAGTTTGAAGTTTCTTGGAGATGTTGGGGATTCTGATTTCAA
TCTTCTAAACCGTATTTTGCCACATTGTACTATTGACCAATTGATGCATATAGAGAACTCTTCCAAAGGAAGAGATCTCACACCTGTAACTGACAAGTTGTGGAAAAACT
TCTATGAAAAAAAAAACGATTCTGATTTTGTGATTGAGAGGATGAAACATAAGAAAGAATCATTTAAATGGAAGCAAGTGTATGAAGCAAAGATGGAAGAGTTAGAAAAG
AAGGCCAAGAAAATTGAGGCTCGATATATACAAAACCGTCGAAAGGAAAAAGCTCAAAAAGAAAGCCGTCAAATAATATTTTGTGGGGGTTCTTCTCCAATCAATAAGAA
ACGAAGATTTGAAGGAAAACCGAATGAGTTTGGATGCAATACCAACGAGACCAAGATTTTGAAGAAGTCCAAGAGAGAAGAACAAATTTGTGAAGTTCCATTTACGATCA
ATAATAAGAAGAAACAAAGCTTTGGAGGGACAACCAAACCTAGACAAGATACTAAGTCAAGCAAGACTAAGCCAAGCAAGATATGGAAGAAGGCAAAGAGAGAAGCGTTG
GGGTGTATAGAGACGAAGAACCTAATAGCTTTTCGAAGAAATGTGATGCAAAAGTAG
Protein sequence
MCEEVSKITTSFFNNLSINEAIDSLKFLGDVGDSDFNLLNRILPHCTIDQLMHIENSSKGRDLTPVTDKLWKNFYEKKNDSDFVIERMKHKKESFKWKQVYEAKMEELEK
KAKKIEARYIQNRRKEKAQKESRQIIFCGGSSPINKKRRFEGKPNEFGCNTNETKILKKSKREEQICEVPFTINNKKKQSFGGTTKPRQDTKSSKTKPSKIWKKAKREAL
GCIETKNLIAFRRNVMQK
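
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1), is shown below. The function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed in this record.
print(translate("ATGTGTGAAGAAGTAAGTAAAATAACTACC"))  # MCEEVSKITT
```

Passing the full CDS, with its line breaks, and comparing the result with the protein sequence joined into one string would extend the same check to the whole record.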