; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g09120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g09120
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRNA polymerase II transcription factor B subunit 4
Genome locationchr4:6713303..6720688
RNA-Seq ExpressionMoc04g09120
SyntenyMoc04g09120
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0070816 - phosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR004600 - TFIIH subunit Tfb4/GTF2H3
IPR036465 - von Willebrand factor A-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578972.1 LysM domain-containing GPI-anchored protein 1, partial [Cucurbita argyrosperma subsp. sororia]4.1e-14381.16Show/hide
Query:  DKNDAELRSEGEVRMKKDENWAFFSVANSETATMASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASC
        ++ D +LR    +R +        SVANSETA MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASC
Subjt:  DKNDAELRSEGEVRMKKDENWAFFSVANSETATMASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASC

Query:  KYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEFVIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAI
        KYLYNSSS+SNR LEDGRMPALCTRLL NLEEF+I DEQS+KEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMN+I
Subjt:  KYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEFVIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAI

Query:  FSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHK
        FSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSR FLQLPKSVGVDFRASCFCHKKTIDMG+VCSVCLSIFCKHHK
Subjt:  FSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHK

Query:  KCSTCGSVFGETPVDVDSVSKQKRKTPET
        KCSTCGSVFGETPV VDSVSK KRKTPET
Subjt:  KCSTCGSVFGETPVDVDSVSKQKRKTPET

XP_004152842.1 general transcription and DNA repair factor IIH subunit TFB4 [Cucumis sativus]2.1e-13986.1Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SN GLEDGRMPALCTRLLKNLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        VI DEQS+KEDP+GGTMSSLLSGSLSMALCYIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPE
        YLKPQQMDGLFQYLSTVF TDLHSR FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV++DSVSK KRKTPE
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPE

XP_022141943.1 RNA polymerase II transcription factor B subunit 4 isoform X1 [Momordica charantia]4.7e-14790.54Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MASVPSKLYADD                            VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
        YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET

XP_022993825.1 RNA polymerase II transcription factor B subunit 4 [Cucurbita maxima]1.8e-13885.47Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SNR LEDGRMPALCTRLL NLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        +I DEQS+KEDPR GTMSSLLSGSLSMALCY+QKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
        YLKPQQMDGLFQYLSTVFATDLHSR FLQLPKSVGVDFRASCFCHKKTIDMG+VCSVCLSIFCKHHKKCSTCGSVFGETPV VDSVSK KRKTPET
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET

XP_023550583.1 RNA polymerase II transcription factor B subunit 4 [Cucurbita pepo subsp. pepo]1.2e-13986.15Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SNR LEDGRMPALCTRLL NLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        +I DEQS+KEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
        YLKPQQMDGLFQYLSTVFATDLHSR FLQLPKSVGVDFRASCFCHKKTIDMG+VCSVCLSIFCKHHKKCSTCGSVFGETPV VDSVSK KRKTPET
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET

TrEMBL top hitse value%identityAlignment
A0A0A0LMM2 Uncharacterized protein1.0e-13986.1Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SN GLEDGRMPALCTRLLKNLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        VI DEQS+KEDP+GGTMSSLLSGSLSMALCYIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPE
        YLKPQQMDGLFQYLSTVF TDLHSR FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV++DSVSK KRKTPE
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPE

A0A1S3B4H7 LOW QUALITY PROTEIN: RNA polymerase II transcription factor B subunit 42.1e-13784.75Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SN GLEDGRMPALCTRLLKNLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        VI DEQS+KEDP+GGTMSSLLSGSLSMALCYIQKVFRSGSLHP PRILCLQGSPDGPEQYVAIMNAIFSAQRSMV IDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPE
        YLKPQQMDGLFQYL+TVF TDLHSR FLQLPKSVGVDFRASCFCH KTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV++DS+SK KRKTPE
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPE

A0A6J1CM09 RNA polymerase II transcription factor B subunit 4 isoform X12.3e-14790.54Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MASVPSKLYADD                            VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
        YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET

A0A6J1FJQ6 RNA polymerase II transcription factor B subunit 44.3e-13885.52Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SNR LEDGRMPALCTRLL NLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        +I DEQS++EDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDS-VSKQKRKTPET
        YLKPQQMDGLFQYLSTVFATDLHSR FLQLPKSVGVDFRASCFCHKKTIDMG+VCSVCLSIFCKHHKKCSTCGSVFGETPV VDS VSK KRKTPET
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDS-VSKQKRKTPET

A0A6J1JZK7 RNA polymerase II transcription factor B subunit 48.6e-13985.47Show/hide
Query:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF
        MAS PSKLYADD                            VLAFLNSIL LNQLNEVVVIGTGYASCKYLYNSSS+SNR LEDGRMPALCTRLL NLEEF
Subjt:  MASVPSKLYADD----------------------------VLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEF

Query:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
        +I DEQS+KEDPR GTMSSLLSGSLSMALCY+QKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMN+IFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV
Subjt:  VIADEQSVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGV

Query:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET
        YLKPQQMDGLFQYLSTVFATDLHSR FLQLPKSVGVDFRASCFCHKKTIDMG+VCSVCLSIFCKHHKKCSTCGSVFGETPV VDSVSK KRKTPET
Subjt:  YLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET

SwissProt top hitse value%identityAlignment
Q05B56 General transcription factor IIH subunit 35.7e-3938.91Show/hide
Query:  DDVLAFLNSILDLNQLNEVVVIGTGYASCKYLY----------------NSSSHSNRGLEDGRMPALCTRLLKNLEEFVIADE---QSVKEDPRGGTMSS
        D V+   NS L +N+ N++ VI +     ++LY                 SS  +  G +DG+       LL    E VIA+E      K D  G    +
Subjt:  DDVLAFLNSILDLNQLNEVVVIGTGYASCKYLY----------------NSSSHSNRGLEDGRMPALCTRLLKNLEEFVIADE---QSVKEDPRGGTMSS

Query:  LLSGSLSMALCYIQKVFR--SGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTV
        LL+GSL+ ALCYI ++ +    +     RIL ++ + D   QY+  MN IF+AQ+  + ID+C + S +S  LQQA  ITGG+YLK  QM  L QYL  V
Subjt:  LLSGSLSMALCYIQKVFR--SGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTV

Query:  FATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRK
        F  D   R+ L LP  V VD+RA+CFCH+  I++GYVCSVCLSIFC     C+TC + F    + +  V K K+K
Subjt:  FATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRK

Q13889 General transcription factor IIH subunit 31.2e-3937.36Show/hide
Query:  DDVLAFLNSILDLNQLNEVVVIGTGYASCKYLY---------------NSSSHSNRGLEDGRMPALCTRLLKNLEEFVIAD--EQSVKEDPRGGTMSSLL
        D V+   NS L +N+ N++ VI +     ++LY               N    +  G +DG+       LL +  E ++ +  +   K D +G    +LL
Subjt:  DDVLAFLNSILDLNQLNEVVVIGTGYASCKYLY---------------NSSSHSNRGLEDGRMPALCTRLLKNLEEFVIAD--EQSVKEDPRGGTMSSLL

Query:  SGSLSMALCYIQKVFR--SGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFA
        +GSL+ ALCYI ++ +    +     RIL ++ + D   QY+  MN IF+AQ+  + ID+C + S +S  LQQA  ITGG+YLK  QM  L QYL  VF 
Subjt:  SGSLSMALCYIQKVFR--SGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFA

Query:  TDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRK
         D   R+ L LP  V VD+RA+CFCH+  I++GYVCSVCLSIFC     C+TC + F    + +  V K K+K
Subjt:  TDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRK

Q86IB5 General transcription factor IIH subunit 34.4e-4735.91Show/hide
Query:  NDKNDAELRSEGEVRMKKDENWAFFSVANS---------ETATMASVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSS-----HSNRG
        ND ND+  R  G   +  + N    + +N+          T+    +    + +  + F+N+ L LNQ N++ +I +      +++  S+        + 
Subjt:  NDKNDAELRSEGEVRMKKDENWAFFSVANS---------ETATMASVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSS-----HSNRG

Query:  LEDGRM---PALCTRLLKNLEEFVIADEQ----SVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRS
        LE  ++     L     K ++  ++A  Q     +K D +   +SS  S S+S+ALCYI ++ R  +    PRIL    SPD   QY+++MN IFS+Q+ 
Subjt:  LEDGRM---PALCTRLLKNLEEFVIADEQ----SVKEDPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRS

Query:  MVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG
         +P+DSC +   +S FLQQAS++T G+YLKPQ+ + L QYL T F  D  SR  L  P    VD+RASCFCHK+ +D+GYVCSVCLSIFC H   CSTCG
Subjt:  MVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG

Query:  SVFGETPVDVDSVSKQKRKTPET
        + F    + +D + KQ    P T
Subjt:  SVFGETPVDVDSVSKQKRKTPET

Q8LF41 General transcription and DNA repair factor IIH subunit TFB44.6e-10570.11Show/hide
Query:  SVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGR---MPALCTRLLKNLEEFVIADEQSVKEDPRGGTM-SSLLSGSLS
        S+    +   VLAFLN++L LNQLN+VVVI TGY+SC Y+Y+SS  SN G  +     MPA+   LLK LEEFV  DE+  KE+     + S LLSGSLS
Subjt:  SVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGR---MPALCTRLLKNLEEFVIADEQSVKEDPRGGTM-SSLLSGSLS

Query:  MALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRA
        MALCYIQ+VFRSG LHP PRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCYIG  NSAFLQQASYITGGV+  P+Q+DGLFQYL+T+FATDLHSR 
Subjt:  MALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRA

Query:  FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV-DVDSVSKQKRKTPET
        F+QLPK +GVDFRASCFCHKKTIDMGY+CSVCLSIFC+HHKKCSTCGSVFG++ + D  S S +KRK P T
Subjt:  FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV-DVDSVSKQKRKTPET

Q8VD76 General transcription factor IIH subunit 33.3e-3938.18Show/hide
Query:  DDVLAFLNSILDLNQLNEVVVIGTGYASCKYLY---------------NSSSHSN-RGLEDGRMPALCTRLLKNLEEFVIADE---QSVKEDPRGGTMSS
        D V+   NS L +N+ N++ VI +     + LY               N+    N  G +DG+   L       +   VIA+E      K D +G    +
Subjt:  DDVLAFLNSILDLNQLNEVVVIGTGYASCKYLY---------------NSSSHSN-RGLEDGRMPALCTRLLKNLEEFVIADE---QSVKEDPRGGTMSS

Query:  LLSGSLSMALCYIQKVFRS--GSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTV
        LL+GSL+ ALCYI +V ++   +     RIL ++ + D   QY+  MN IF+AQ+  + ID+C + S +S  LQQA  ITGG+YLK  QM  L QYL  V
Subjt:  LLSGSLSMALCYIQKVFRS--GSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTV

Query:  FATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRK
        F  D   R+ L LP  + VD+RA+CFCH+  I++GYVCSVCLSIFC     C+TC + F  +   V    K+K+K
Subjt:  FATDLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRK

Arabidopsis top hitse value%identityAlignment
AT1G18340.1 basal transcription factor complex subunit-related3.3e-10670.11Show/hide
Query:  SVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGR---MPALCTRLLKNLEEFVIADEQSVKEDPRGGTM-SSLLSGSLS
        S+    +   VLAFLN++L LNQLN+VVVI TGY+SC Y+Y+SS  SN G  +     MPA+   LLK LEEFV  DE+  KE+     + S LLSGSLS
Subjt:  SVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGR---MPALCTRLLKNLEEFVIADEQSVKEDPRGGTM-SSLLSGSLS

Query:  MALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRA
        MALCYIQ+VFRSG LHP PRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCYIG  NSAFLQQASYITGGV+  P+Q+DGLFQYL+T+FATDLHSR 
Subjt:  MALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFATDLHSRA

Query:  FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV-DVDSVSKQKRKTPET
        F+QLPK +GVDFRASCFCHKKTIDMGY+CSVCLSIFC+HHKKCSTCGSVFG++ + D  S S +KRK P T
Subjt:  FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPV-DVDSVSKQKRKTPET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCTCGTGACTTCGCTGACGACGGCTGCATCAAAGCCGACGAAGGAGATTGTTGCGTCGAAACAGAATATAGAAAAAGAAAAGAAAAACCCTATCCCCTCAGTAGA
CCCCTTCTCGGCGGCGACCATCCGCGGCGGCGCAGCTCAGAACGGCGACCGACGGACCCCCGACGACTCAAAAACCCGTACGGCGGCGCAACGACCCTTCGCGAACGGCA
AGGATCTGCAGCGGAAGCGAACTAATTCAAGAGCTAGGAGTGTTAGGCGTGTTGTAAGTAGTTCTGCTTGGTTGTCCCTATTGATGTGTTGGAATGACAAAAATGATGCT
GAATTGAGAAGTGAAGGTGAAGTAAGAATGAAAAAAGATGAGAATTGGGCGTTTTTCTCCGTTGCTAATTCTGAAACTGCTACCATGGCATCTGTTCCTTCAAAGCTCTA
TGCAGATGATGTACTTGCTTTCCTGAACTCCATTTTAGATCTGAATCAACTTAATGAGGTTGTGGTAATTGGTACTGGTTATGCTTCATGCAAGTATCTATACAACTCGT
CTTCACATTCAAATCGTGGTCTTGAAGATGGTAGAATGCCTGCACTTTGCACTCGTTTATTGAAGAATTTGGAGGAATTCGTGATTGCGGATGAGCAGTCTGTCAAGGAA
GACCCCAGAGGAGGGACCATGTCTTCACTTCTTTCTGGATCACTCTCCATGGCTTTGTGCTATATACAGAAAGTTTTCCGTTCTGGATCTCTCCATCCCCATCCTCGAAT
CCTTTGCTTGCAGGGATCCCCAGATGGGCCTGAACAATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAGCGTTCAATGGTTCCCATAGATTCGTGTTACATAGGTT
CACACAATTCTGCATTTCTTCAGCAGGCTTCTTACATCACGGGTGGAGTTTATCTGAAGCCTCAGCAAATGGATGGACTGTTTCAATATCTCTCTACTGTTTTTGCTACT
GATTTGCATTCTCGGGCCTTCTTACAACTTCCAAAGTCCGTTGGTGTGGATTTTCGTGCATCGTGTTTTTGCCACAAAAAAACAATTGACATGGGCTATGTCTGTTCGGT
TTGCTTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACTTGTGGGTCAGTATTTGGTGAGACCCCAGTAGATGTGGATTCAGTGTCCAAACAGAAGAGAAAAACTC
CAGAAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCTCGTGACTTCGCTGACGACGGCTGCATCAAAGCCGACGAAGGAGATTGTTGCGTCGAAACAGAATATAGAAAAAGAAAAGAAAAACCCTATCCCCTCAGTAGA
CCCCTTCTCGGCGGCGACCATCCGCGGCGGCGCAGCTCAGAACGGCGACCGACGGACCCCCGACGACTCAAAAACCCGTACGGCGGCGCAACGACCCTTCGCGAACGGCA
AGGATCTGCAGCGGAAGCGAACTAATTCAAGAGCTAGGAGTGTTAGGCGTGTTGTAAGTAGTTCTGCTTGGTTGTCCCTATTGATGTGTTGGAATGACAAAAATGATGCT
GAATTGAGAAGTGAAGGTGAAGTAAGAATGAAAAAAGATGAGAATTGGGCGTTTTTCTCCGTTGCTAATTCTGAAACTGCTACCATGGCATCTGTTCCTTCAAAGCTCTA
TGCAGATGATGTACTTGCTTTCCTGAACTCCATTTTAGATCTGAATCAACTTAATGAGGTTGTGGTAATTGGTACTGGTTATGCTTCATGCAAGTATCTATACAACTCGT
CTTCACATTCAAATCGTGGTCTTGAAGATGGTAGAATGCCTGCACTTTGCACTCGTTTATTGAAGAATTTGGAGGAATTCGTGATTGCGGATGAGCAGTCTGTCAAGGAA
GACCCCAGAGGAGGGACCATGTCTTCACTTCTTTCTGGATCACTCTCCATGGCTTTGTGCTATATACAGAAAGTTTTCCGTTCTGGATCTCTCCATCCCCATCCTCGAAT
CCTTTGCTTGCAGGGATCCCCAGATGGGCCTGAACAATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAGCGTTCAATGGTTCCCATAGATTCGTGTTACATAGGTT
CACACAATTCTGCATTTCTTCAGCAGGCTTCTTACATCACGGGTGGAGTTTATCTGAAGCCTCAGCAAATGGATGGACTGTTTCAATATCTCTCTACTGTTTTTGCTACT
GATTTGCATTCTCGGGCCTTCTTACAACTTCCAAAGTCCGTTGGTGTGGATTTTCGTGCATCGTGTTTTTGCCACAAAAAAACAATTGACATGGGCTATGTCTGTTCGGT
TTGCTTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACTTGTGGGTCAGTATTTGGTGAGACCCCAGTAGATGTGGATTCAGTGTCCAAACAGAAGAGAAAAACTC
CAGAAACATGA
Protein sequenceShow/hide protein sequence
MVLVTSLTTAASKPTKEIVASKQNIEKEKKNPIPSVDPFSAATIRGGAAQNGDRRTPDDSKTRTAAQRPFANGKDLQRKRTNSRARSVRRVVSSSAWLSLLMCWNDKNDA
ELRSEGEVRMKKDENWAFFSVANSETATMASVPSKLYADDVLAFLNSILDLNQLNEVVVIGTGYASCKYLYNSSSHSNRGLEDGRMPALCTRLLKNLEEFVIADEQSVKE
DPRGGTMSSLLSGSLSMALCYIQKVFRSGSLHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFAT
DLHSRAFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVDVDSVSKQKRKTPET