CuGenDBv2

Tan0009611 (gene) of Snake gourd v1 genome

Gene ID: Tan0009611
Organism: Trichosanthes anguina (Snake gourd v1)
Description: transcription elongation factor B polypeptide 3 isoform X2
Genome location: LG06:17928090..17928629
RNA-Seq Expression: Tan0009611
Synteny: Tan0009611
Gene Ontology terms:
GO:0006368 - transcription elongation from RNA polymerase II promoter (biological process)
GO:0070449 - elongin complex (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR010684 - RNA polymerase II transcription factor SIII, subunit A
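
The genome location field can be turned into a span length with a few lines of Python. This is only a sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string and return (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span length
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("LG06:17928090..17928629")
print(chrom, length)  # LG06 540
```

Under that assumption the span is 540 bp, which equals 180 codons (179 residues plus a stop codon).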


Homology
GenBank top hits | e value | % identity
KAG6604740.1 hypothetical protein SDJN03_02057, partial [Cucurbita argyrosperma subsp. sororia] | 5.6e-20 | 50
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM VEKSS+GRDLT +T+ LWK+ YER+FGR  TN VSE     K  F W QLY+AKMKDIE    E+ +R  Q+Y KE ARK+SRQI+ C  V  P 
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS
        + K+ F    G     TK+K  ++ +I  + S
Subjt:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS

KGN49034.1 hypothetical protein Csa_004217 [Cucumis sativus] | 2.7e-22 | 44.5
Query:  MSQVSS----LNGDVLVNKPTDSLKCLEN-----------------ADQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSENKGPFTWKQLYD
        MS+VS     LN D  VNK  DS+K L +                 ADQL+ +E  SKGRDLT IT  LWK  YERKFG++  + V +N+  F W  LY 
Subjt:  MSQVSS----LNGDVLVNKPTDSLKCLEN-----------------ADQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSENKGPFTWKQLYD

Query:  AKMKDIENKIRESENRFIQNYQKEKARKESRQIKFC---EVVLSPRNKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFE
        AKMK++EN+ ++ E+R IQ+YQKEKARK+SRQI FC   + +LS  NK  +     G K NTTKS    K KR++ + +V++ S K+ + E
Subjt:  AKMKDIENKIRESENRFIQNYQKEKARKESRQIKFC---EVVLSPRNKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFE

XP_022970778.1 transcription elongation factor B polypeptide 3 isoform X2 [Cucurbita maxima] | 5.6e-20 | 50
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM VEKSS+GRDLT +T+ LWK+ YER+FGR  TN VSE     K  F W QLY+AKMKDIE    E+ +R  Q+Y KE ARK+SRQI+ C  V  P 
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS
        + K+ F    G     TK+K  ++ +I  + S
Subjt:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS

XP_038885873.1 uncharacterized protein LOC120076176 [Benincasa hispida] | 4.6e-22 | 47.83
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM +E SSKGRDLT +T+ LWK  YE+KFG+N ++ V +     K  F WKQLY+AKM+ +E K  E E R+ QN QKE ARK+SR+I FCE V S  
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFEATTNNKKNKQLKKTER
        NKK+R EG    + NTT+S    K  R+ Q+ +V+S    K         K++K LKK +R
Subjt:  NKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFEATTNNKKNKQLKKTER

XP_038885879.1 uncharacterized protein LOC120076185 [Benincasa hispida] | 4.3e-20 | 46.58
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQL+ +E SSKGRDLT +T+ LWK  Y +KFG+N  +   E     K  F WKQLY+AKM+ +E K  E E R+ QN QKE ARK+SR+I FCE V S  
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFEATTNNKKNKQLKKTER
        NKK R EG    + NTT+S    K  R+ Q+ +V+S    K         K++K LKK +R
Subjt:  NKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFEATTNNKKNKQLKKTER

TrEMBL top hits | e value | % identity
A0A0A0KJR8 Nudix hydrolase domain-containing protein | 1.3e-22 | 44.5
Query:  MSQVSS----LNGDVLVNKPTDSLKCLEN-----------------ADQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSENKGPFTWKQLYD
        MS+VS     LN D  VNK  DS+K L +                 ADQL+ +E  SKGRDLT IT  LWK  YERKFG++  + V +N+  F W  LY 
Subjt:  MSQVSS----LNGDVLVNKPTDSLKCLEN-----------------ADQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSENKGPFTWKQLYD

Query:  AKMKDIENKIRESENRFIQNYQKEKARKESRQIKFC---EVVLSPRNKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFE
        AKMK++EN+ ++ E+R IQ+YQKEKARK+SRQI FC   + +LS  NK  +     G K NTTKS    K KR++ + +V++ S K+ + E
Subjt:  AKMKDIENKIRESENRFIQNYQKEKARKESRQIKFC---EVVLSPRNKKQRFEGRTGCKYNTTKS----KTKRQVQIREVASLSGKKRSFE

A0A6J1G8K1 transcription elongation factor B polypeptide 3 isoform X2 | 2.7e-20 | 50
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM VEKSS+GRDLT +T+ LWK+ YER+FGR  TN VSE     K  F W QLY+AKMKDIE    E+ +R  Q+Y KE ARK+SRQI+ C  V  P 
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS
        + K+ F    G     TK+K  ++ +I  + S
Subjt:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS

A0A6J1G8W9 transcription elongation factor B polypeptide 3 isoform X1 | 4.6e-20 | 49.24
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM VEKSS+GRDLT +T+ LWK+ YER+FGR  TN VSE     K  F W QLY+AKMKDIE    E+ +R  Q+Y KE ARK+SRQI+ C  V    
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS
        NK+       G     TK+K  ++ +I  + S
Subjt:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS

A0A6J1I1I3 transcription elongation factor B polypeptide 3 isoform X1 | 4.6e-20 | 49.24
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM VEKSS+GRDLT +T+ LWK+ YER+FGR  TN VSE     K  F W QLY+AKMKDIE    E+ +R  Q+Y KE ARK+SRQI+ C  V    
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS
        NK+       G     TK+K  ++ +I  + S
Subjt:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS

A0A6J1I4W9 transcription elongation factor B polypeptide 3 isoform X2 | 2.7e-20 | 50
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR
        DQLM VEKSS+GRDLT +T+ LWK+ YER+FGR  TN VSE     K  F W QLY+AKMKDIE    E+ +R  Q+Y KE ARK+SRQI+ C  V  P 
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSPR

Query:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS
        + K+ F    G     TK+K  ++ +I  + S
Subjt:  NKKQRFEGRTGCKYNTTKSKTKRQVQIREVAS

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT2G42780.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684); Has 187 Blast hits to 186 proteins in 77 species: Archae - 0; Bacteria - 0; Metazoa - 104; Fungi - 29; Plants - 38; Viruses - 0; Other Eukaryotes - 16 (source: NCBI BLink). | 9.9e-07 | 30.63
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NK-GPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSP
        +QL  +E ++   DL+ IT+  WK  Y++ +G      + E    NK   F W+ LY+ K+  ++ K +E   R  + Y+ E  RK+SRQ K C    +P
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NK-GPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSP

Query:  RNKKQRFEGRTGCKYNT--TKSKTKRQVQI-----REVASLSGKKR-----SFEATTNNK
         +K+  F G +   YN    KS   ++ +I     +EV +L+  KR     SF  +T  K
Subjt:  RNKKQRFEGRTGCKYNT--TKSKTKRQVQI-----REVASLSGKKR-----SFEATTNNK

AT2G42780.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: regulation of transcription; LOCATED IN: integral to membrane, nucleus; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: RNA polymerase II transcription factor SIII, subunit A (InterPro:IPR010684). | 7.6e-07 | 30.61
Query:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NK-GPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSP
        +QL  +E ++   DL+ IT+  WK  Y++ +G      + E    NK   F W+ LY+ K+  ++ K +E   R  + Y+ E  RK+SRQ K C    +P
Subjt:  DQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSE----NK-GPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESRQIKFCEVVLSP

Query:  RNKKQRFEGRTGCKYNT--TKSKTKRQVQI-----REVASLSGKKRS
         +K+  F G +   YN    KS   ++ +I     +EV +L+  KR+
Subjt:  RNKKQRFEGRTGCKYNT--TKSKTKRQVQI-----REVASLSGKKRS


Sequences
CDS sequence
ATGTCTCAAGTAAGTTCGTTGAATGGTGATGTTTTGGTTAACAAACCTACAGATAGTTTGAAGTGTCTTGAAAACGCTGACCAATTGATGCAAGTGGAGAAAAGTTCTAA
AGGAAGAGATCTCACCTCCATAACCGAAACATTGTGGAAAGAACAATACGAAAGAAAGTTTGGTAGAAATTATACTAATTATGTAAGTGAGAATAAAGGGCCATTTACAT
GGAAGCAGTTATATGATGCCAAAATGAAGGATATAGAAAACAAGATAAGGGAAAGTGAGAATCGATTTATACAAAATTACCAAAAGGAAAAAGCTCGAAAAGAAAGTCGC
CAAATAAAGTTTTGTGAGGTAGTTTTGTCTCCGAGGAATAAGAAGCAGAGATTTGAAGGAAGAACCGGATGCAAATATAACACCACCAAGAGCAAGACAAAAAGACAAGT
TCAAATTCGTGAAGTTGCCTCTTTAAGCGGTAAGAAACGAAGTTTTGAAGCAACAACTAATAATAAGAAGAACAAACAGTTAAAGAAGACAGAAAGATGA
mRNA sequence
ATGTCTCAAGTAAGTTCGTTGAATGGTGATGTTTTGGTTAACAAACCTACAGATAGTTTGAAGTGTCTTGAAAACGCTGACCAATTGATGCAAGTGGAGAAAAGTTCTAA
AGGAAGAGATCTCACCTCCATAACCGAAACATTGTGGAAAGAACAATACGAAAGAAAGTTTGGTAGAAATTATACTAATTATGTAAGTGAGAATAAAGGGCCATTTACAT
GGAAGCAGTTATATGATGCCAAAATGAAGGATATAGAAAACAAGATAAGGGAAAGTGAGAATCGATTTATACAAAATTACCAAAAGGAAAAAGCTCGAAAAGAAAGTCGC
CAAATAAAGTTTTGTGAGGTAGTTTTGTCTCCGAGGAATAAGAAGCAGAGATTTGAAGGAAGAACCGGATGCAAATATAACACCACCAAGAGCAAGACAAAAAGACAAGT
TCAAATTCGTGAAGTTGCCTCTTTAAGCGGTAAGAAACGAAGTTTTGAAGCAACAACTAATAATAAGAAGAACAAACAGTTAAAGAAGACAGAAAGATGA
Protein sequence
MSQVSSLNGDVLVNKPTDSLKCLENADQLMQVEKSSKGRDLTSITETLWKEQYERKFGRNYTNYVSENKGPFTWKQLYDAKMKDIENKIRESENRFIQNYQKEKARKESR
QIKFCEVVLSPRNKKQRFEGRTGCKYNTTKSKTKRQVQIREVASLSGKKRSFEATTNNKKNKQLKKTER
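
The relationship between the CDS and the protein sequence above can be checked with a small translation routine. The following is a minimal sketch, not the database's own method: it assumes the standard genetic code (NCBI translation table 1), reads from the first base in frame, and stops at the first stop codon. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    Trailing bases that do not form a full codon are ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGTCTCAAGTAAGTTCGTTGAATGGTGAT"))  # MSQVSSLNGD
```

Pasting the full CDS into `translate` (line breaks are stripped by the function) should allow a comparison against the protein sequence listed on this page.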