CuGenDBv2

HG10018065 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018065
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: general transcription and DNA repair factor IIH subunit TFB1-1-like
Genome location: Chr03:30206458..30216954
RNA-Seq Expression: HG10018065
Synteny: HG10018065
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006351 - transcription, DNA-templated (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
InterPro domains:
IPR005607 - BSD domain
IPR013876 - TFIIH p62 subunit, N-terminal
IPR027079 - TFIIH subunit Tfb1/GTF2H1
IPR035925 - BSD domain superfamily
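
The genome location above uses a `chromosome:start..end` notation. A minimal sketch of how such a string could be split and the locus span computed is given below. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'Chr03:30206458..30216954' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("Chr03:30206458..30216954")
    print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 10497 bp, considerably longer than the 750-nt CDS listed further down, which would be consistent with the gene containing introns.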


Homology
GenBank top hits (e value, % identity, alignment)
XP_008457278.1 PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucumis melo]; e value: 5.0e-177; identity: 93.04%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTSVKDPGTPGVLEMTE KFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        K GEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RDSSKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAA+TRKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

XP_038894178.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X1 [Benincasa hispida]; e value: 1.2e-175; identity: 92.17%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

XP_038894194.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X2 [Benincasa hispida]; e value: 1.2e-175; identity: 92.17%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

XP_038894195.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X3 [Benincasa hispida]; e value: 1.2e-175; identity: 92.17%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

XP_038894196.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X4 [Benincasa hispida]; e value: 1.2e-175; identity: 92.17%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C6E8 probable RNA polymerase II transcription factor B subunit 1-1 isoform X1; e value: 2.4e-177; identity: 93.04%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTSVKDPGTPGVLEMTE KFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        K GEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RDSSKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAA+TRKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

A0A6J1DNY2 probable RNA polymerase II transcription factor B subunit 1-1 isoform X2; e value: 3.4e-171; identity: 89.86%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVF+PSDPTS SKLDVEFR+IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGE AQA SER VA FPHEQLSK EMELRM+CLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD+SKKSKQL+GFK+SMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFL+HVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITES NE Y+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

A0A6J1DRT9 probable RNA polymerase II transcription factor B subunit 1-1 isoform X1; e value: 3.4e-171; identity: 89.86%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVF+PSDPTS SKLDVEFR+IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGE AQA SER VA FPHEQLSK EMELRM+CLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD+SKKSKQL+GFK+SMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFL+HVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITES NE Y+RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

A0A6J1EXP7 probable RNA polymerase II transcription factor B subunit 1-1; e value: 3.4e-171; identity: 90.72%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGEA QAPSE+ VA FPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD S KSKQL+GFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDLEADLGDDYT
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

A0A6J1IB04 probable RNA polymerase II transcription factor B subunit 1-1; e value: 1.7e-170; identity: 90.43%
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----
        KSGEA QAPSE+ VA FPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLL+RD S KSKQL+GFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR-----

Query:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT
                  IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDLEADLGDDY 
Subjt:  ----------IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT

Query:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
        HLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTID
Subjt:  HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID

SwissProt top hits (e value, % identity, alignment)
P32780 General transcription factor IIH subunit 1; e value: 2.5e-14; identity: 27.76%
Query:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG
        IK  K + EG  K   L L    G +  F F N            S   K  +A +   ++ +  F  ++ +  E+E + R LQED  L +L+K  V+S 
Subjt:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG

Query:  VLTESEFWAARKKLLKRDSSKKS--KQLIGFKSSMVLDTKPMSDG--------------RIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKN
        V++  EFWA R  +   DSS  S  KQ +G  ++ + D +P +DG               IF   PAV   +  +VP+ M+EK+FWT++F++ Y H  + 
Subjt:  VLTESEFWAARKKLLKRDSSKKS--KQLIGFKSSMVLDTKPMSDG--------------RIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKN

Query:  SIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNRQGA
        +  +    AE A+ +E  L          +T   +   +P LDL A      D G   + +P       I  +    I +  N H    L+  L +Q A
Subjt:  SIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNRQGA

Q3ECP0 General transcription and DNA repair factor IIH subunit TFB1-1; e value: 3.9e-108; identity: 60.4%
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK++VKDPGTPG L + E   +F P+DP S SKL V  + IK  K TKEGSNKPPWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK    
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------
         +    + V +   EQLS  E+ELR + L+E+SELQ+LHKQFV S VLTE EFWA RKKLL +DS +KSKQ +G KS MV   KP +DGR          
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------

Query:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH
             IFA KPAV QAF+N+VP+KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA ETR KIR VDPTLD+EAD GDDYTHL DH
Subjt:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH

Query:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT
        GI RDG  ++ E QN+ +KR+L QDLNR  AVVLEGR+ID   + T
Subjt:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT

Q55FP1 General transcription factor IIH subunit 1; e value: 7.6e-19; identity: 27.02%
Query:  LSKSEMELRMRCLQEDSELQKLHKQFVISG-VLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSD--------------GRIFALKPAVHQA
        LS+ +++ R+  LQ + EL++L++Q V    V++ES+FW +RK +LK DS++  KQ  G  S+++ D +P S+               +IF   P+V +A
Subjt:  LSKSEMELRMRCLQEDSELQKLHKQFVISG-VLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSD--------------GRIFALKPAVHQA

Query:  FLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNE
        +  +VP K+SE++FW KY +++Y +  ++S  A A   +D+  + +  D++      ++K+  ++P +DL +  G D      +G+  D  ++  + +  
Subjt:  FLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNE

Query:  HYKRTLSQDLNRQGAVVLEGRTIDTTRKITTPEDMIQVFLHLSTKNTT
             L +  NR  A+VL  + + T   I   +D   +      +N+T
Subjt:  HYKRTLSQDLNRQGAVVLEGRTIDTTRKITTPEDMIQVFLHLSTKNTT

Q9DBA9 General transcription factor IIH subunit 1; e value: 7.3e-14; identity: 27.52%
Query:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG
        IK  K + EG  K   L L    G +  F F N            S   K  +A +   ++ +  F  ++ +  E+E + R LQED  L +L+K  V+S 
Subjt:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG

Query:  VLTESEFWAARKKLLKRDSSKKS-KQLIGFKSSMVLDTKPMSDG--------------RIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNS
        V++  EFWA R  +   DSS  S KQ +G  ++ + D +P +DG               IF   PAV   +   VP+ M+EK+FWT++F++ Y H  + +
Subjt:  VLTESEFWAARKKLLKRDSSKKS-KQLIGFKSSMVLDTKPMSDG--------------RIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNS

Query:  IAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNRQGA
          +    AE A+ +E  L          +T   +   +P LDL +      D G   + +P       I  +    I +  N H    L+  L +Q A
Subjt:  IAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNRQGA

Q9M322 General transcription and DNA repair factor IIH subunit TFB1-3; e value: 1.1e-105; identity: 60.12%
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK+ VKDPGT G LE++E   +F P+DP S  KL V+   IK  K TKEGSNKPPWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E 
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------
              + V   P EQLS +E ELR + L+E+SELQKLHKQFV S VLTE EFW+ RKKLL +DS +KSKQ +G KS MV   KP +DGR          
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------

Query:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH
             IFA KPAV QAF+N+VP KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA E R+K+R VDPTLD++AD GDDYTHL DH
Subjt:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH

Query:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT
        GI RDG  +I E QN+  KR+L QDLNR  AVVLEGR I+   + T
Subjt:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT

Arabidopsis top hits (e value, % identity, alignment)
AT1G55750.1 BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins); e value: 2.8e-109; identity: 60.4%
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK++VKDPGTPG L + E   +F P+DP S SKL V  + IK  K TKEGSNKPPWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK    
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------
         +    + V +   EQLS  E+ELR + L+E+SELQ+LHKQFV S VLTE EFWA RKKLL +DS +KSKQ +G KS MV   KP +DGR          
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------

Query:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH
             IFA KPAV QAF+N+VP+KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA ETR KIR VDPTLD+EAD GDDYTHL DH
Subjt:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH

Query:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT
        GI RDG  ++ E QN+ +KR+L QDLNR  AVVLEGR+ID   + T
Subjt:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT

AT3G61420.1 BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins); e value: 7.6e-107; identity: 60.12%
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK+ VKDPGT G LE++E   +F P+DP S  KL V+   IK  K TKEGSNKPPWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E 
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------
              + V   P EQLS +E ELR + L+E+SELQKLHKQFV S VLTE EFW+ RKKLL +DS +KSKQ +G KS MV   KP +DGR          
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGR----------

Query:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH
             IFA KPAV QAF+N+VP KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA E R+K+R VDPTLD++AD GDDYTHL DH
Subjt:  -----IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDH

Query:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT
        GI RDG  +I E QN+  KR+L QDLNR  AVVLEGR I+   + T
Subjt:  GIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDTTRKIT


Sequences
CDS sequence
ATGGGAACCAAGTATGTCCATAGGAGTGCTAAGTACAAGACCTCAGTTAAGGATCCTGGCACGCCTGGCGTTTTGGAAATGACAGAGCGCAAGTTCGTATTTAGACCCAG
TGATCCCACTTCAACTTCTAAGCTTGACGTGGAGTTTAGATTTATTAAAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACC
AGGGTGGAAGTTACATTTTTGAGTTTAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTCCCTCT
GAGAGGCCTGTGGCGGCATTTCCTCATGAACAACTCAGTAAATCAGAAATGGAACTTCGAATGAGATGTTTGCAAGAGGATAGTGAACTGCAGAAACTCCATAAACAATT
TGTGATTAGTGGTGTGTTGACAGAATCTGAGTTTTGGGCAGCAAGGAAGAAATTACTGAAACGAGACAGCTCCAAAAAGTCAAAACAACTAATTGGTTTTAAGAGTTCAA
TGGTTTTGGATACCAAACCAATGTCTGATGGTCGGATTTTTGCTCTAAAGCCAGCTGTTCACCAGGCCTTCCTTAATCATGTTCCCAATAAGATGTCGGAGAAAGACTTT
TGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGACGAAGAACTTGCCCTTTTTCTGAAGGACGACGA
GATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCGGATCTAGGAGATGATTACACACACCTTCCAGATCATGGAATCTTTC
GTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGAAGAACTATAGAT
ACTACTCGTAAAATCACAACACCAGAGGATATGATCCAAGTCTTCCTCCACCTTTCGACTAAGAATACAACAGAAAGGTCCAACAAGTGA
mRNA sequence
ATGGGAACCAAGTATGTCCATAGGAGTGCTAAGTACAAGACCTCAGTTAAGGATCCTGGCACGCCTGGCGTTTTGGAAATGACAGAGCGCAAGTTCGTATTTAGACCCAG
TGATCCCACTTCAACTTCTAAGCTTGACGTGGAGTTTAGATTTATTAAAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACC
AGGGTGGAAGTTACATTTTTGAGTTTAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTCCCTCT
GAGAGGCCTGTGGCGGCATTTCCTCATGAACAACTCAGTAAATCAGAAATGGAACTTCGAATGAGATGTTTGCAAGAGGATAGTGAACTGCAGAAACTCCATAAACAATT
TGTGATTAGTGGTGTGTTGACAGAATCTGAGTTTTGGGCAGCAAGGAAGAAATTACTGAAACGAGACAGCTCCAAAAAGTCAAAACAACTAATTGGTTTTAAGAGTTCAA
TGGTTTTGGATACCAAACCAATGTCTGATGGTCGGATTTTTGCTCTAAAGCCAGCTGTTCACCAGGCCTTCCTTAATCATGTTCCCAATAAGATGTCGGAGAAAGACTTT
TGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGACGAAGAACTTGCCCTTTTTCTGAAGGACGACGA
GATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCGGATCTAGGAGATGATTACACACACCTTCCAGATCATGGAATCTTTC
GTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGAAGAACTATAGAT
ACTACTCGTAAAATCACAACACCAGAGGATATGATCCAAGTCTTCCTCCACCTTTCGACTAAGAATACAACAGAAAGGTCCAACAAGTGA
Protein sequence
MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPS
ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLKRDSSKKSKQLIGFKSSMVLDTKPMSDGRIFALKPAVHQAFLNHVPNKMSEKDF
WTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID
TTRKITTPEDMIQVFLHLSTKNTTERSNK
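
The CDS and protein sequences listed in this record should correspond codon by codon. The sketch below shows one way that relationship could be checked, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is a hypothetical helper written for illustration; it is not how CuGenDB derives its protein sequences.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the order T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as the wrapped lines shown in the record.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS listed in this record.
    print(translate("ATGGGAACCAAGTATGTCCATAGG"))
```

Passing the full CDS block to `translate` and comparing the result with the protein sequence above is a quick consistency check; any mismatch would point to a copy error or a non-standard code.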