; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G020130 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G020130
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptiongeneral transcription and DNA repair factor IIH subunit TFB1-1-like
Genome locationchr01:27493632..27506711
RNA-Seq ExpressionLsi01G020130
SyntenyLsi01G020130
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006351 - transcription, DNA-templated (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
InterPro domainsIPR005607 - BSD domain
IPR013876 - TFIIH p62 subunit, N-terminal
IPR027079 - TFIIH subunit Tfb1/GTF2H1
IPR035925 - BSD domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008457278.1 PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucumis melo]5.7e-17891.78Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTSVKDPGTPGVLEMTE KFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        K GEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAA+TRKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

XP_038894178.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X1 [Benincasa hispida]1.4e-17690.93Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

XP_038894194.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X2 [Benincasa hispida]1.4e-17690.93Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

XP_038894195.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X3 [Benincasa hispida]1.4e-17690.93Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

XP_038894196.1 general transcription and DNA repair factor IIH subunit TFB1-1-like isoform X4 [Benincasa hispida]1.4e-17690.93Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTS+KDPGTPGVLEMTE KF+FRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGS IFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGEAAQAP ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD+SKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAE RKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

TrEMBL top hitse value%identityAlignment
A0A1S3C6E8 probable RNA polymerase II transcription factor B subunit 1-1 isoform X12.8e-17891.78Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYVH+SAKYKTSVKDPGTPGVLEMTE KFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        K GEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAA+TRKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

A0A6J1DNY2 probable RNA polymerase II transcription factor B subunit 1-1 isoform X23.9e-17288.67Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVF+PSDPTS SKLDVEFR+IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGE AQA SER VA FPHEQLSK EMELRM+CLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD+SKKSKQL+GFK+SMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFL+HVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITES NE Y+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

A0A6J1DRT9 probable RNA polymerase II transcription factor B subunit 1-1 isoform X13.9e-17288.67Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVF+PSDPTS SKLDVEFR+IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGE AQA SER VA FPHEQLSK EMELRM+CLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD+SKKSKQL+GFK+SMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFL+HVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITES NE Y+RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

A0A6J1EXP7 probable RNA polymerase II transcription factor B subunit 1-13.9e-17289.52Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGEA QAPSE+ VA FPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD S KSKQL+GFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDYTHLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

A0A6J1IB04 probable RNA polymerase II transcription factor B subunit 1-11.9e-17189.24Show/hide
Query:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
        MGTKYV +SAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTS SKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA
Subjt:  MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALA

Query:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR
        KSGEA QAPSE+ VA FPHEQLSKSEMELRMRCLQEDSELQKLHKQFVI GVLTESEFWAARKKLLERD S KSKQL+GFKSSMVLDTKPMSDGR     
Subjt:  KSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQR

Query:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL
             N+V F++       IFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDL
Subjt:  FIPGFNEVVFDIR-----GIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDL

Query:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        EADLGDDY HLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTI
Subjt:  EADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

SwissProt top hitse value%identityAlignment
P32780 General transcription factor IIH subunit 16.5e-1527.96Show/hide
Query:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG
        IK  K + EG  K   L L    G +  F F N            S   K  +A +   ++ +  F  ++ +  E+E + R LQED  L +L+K  V+S 
Subjt:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG

Query:  VLTESEFWAARKKLLERDSSKKS--KQLIGFKSSMVLDTKPMSDGRLISQRFIPGFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYL
        V++  EFWA R  +   DSS  S  KQ +G  ++ + D +P +DG    +     +N     I  IF   PAV   +  +VP+ M+EK+FWT++F++ Y 
Subjt:  VLTESEFWAARKKLLERDSSKKS--KQLIGFKSSMVLDTKPMSDGRLISQRFIPGFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYL

Query:  HSTKNSIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLN
        H  + +  +    AE A+ +E  L          +T   +   +P LDL A      D G   + +P       I  +    I +  N H    L+  L 
Subjt:  HSTKNSIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLN

Query:  RQGA
        +Q A
Subjt:  RQGA

Q3ECP0 General transcription and DNA repair factor IIH subunit TFB1-17.7e-10960.87Show/hide
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK++VKDPGTPG L + E   +F P+DP S SKL V  + IK  K TKEGSNKPPWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK    
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRL--ISQRFIP
         +    + V +   EQLS  E+ELR + L+E+SELQ+LHKQFV S VLTE EFWA RKKLL +DS +KSKQ +G KS MV   KP +DGR   ++    P
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRL--ISQRFIP

Query:  GFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY
           E++F    IFA KPAV QAF+N+VP+KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA ETR KIR VDPTLD+EAD GDDY
Subjt:  GFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY

Query:  THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        THL DHGI RDG  ++ E QN+ +KR+L QDLNR  AVVLEGR+I
Subjt:  THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

Q55FP1 General transcription factor IIH subunit 13.3e-1928.38Show/hide
Query:  LSKSEMELRMRCLQEDSELQKLHKQFVISG-VLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSD-GRLISQRFIPGFNEVVFDIRGIFALK
        LS+ +++ R+  LQ + EL++L++Q V    V++ES+FW +RK +L+ DS++  KQ  G  S+++ D +P S+    +  RF P        I  IF   
Subjt:  LSKSEMELRMRCLQEDSELQKLHKQFVISG-VLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSD-GRLISQRFIPGFNEVVFDIRGIFALK

Query:  PAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEI
        P+V +A+  +VP K+SE++FW KY +++Y +  ++S  A A   +D+  + +  D++      ++K+  ++P +DL +  G D      +G+  D  ++ 
Subjt:  PAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEI

Query:  TESQNEHYKRTLSQDLNRQGAVVLEGRTI
         + +       L +  NR  A+VL  + +
Subjt:  TESQNEHYKRTLSQDLNRQGAVVLEGRTI

Q9DBA9 General transcription factor IIH subunit 11.9e-1427.72Show/hide
Query:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG
        IK  K + EG  K   L L    G +  F F N            S   K  +A +   ++ +  F  ++ +  E+E + R LQED  L +L+K  V+S 
Subjt:  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISG

Query:  VLTESEFWAARKKLLERDSSKKS-KQLIGFKSSMVLDTKPMSDGRLISQRFIPGFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLH
        V++  EFWA R  +   DSS  S KQ +G  ++ + D +P +DG    +     +N     I  IF   PAV   +   VP+ M+EK+FWT++F++ Y H
Subjt:  VLTESEFWAARKKLLERDSSKKS-KQLIGFKSSMVLDTKPMSDGRLISQRFIPGFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLH

Query:  STKNSIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNR
          + +  +    AE A+ +E  L          +T   +   +P LDL +      D G   + +P       I  +    I +  N H    L+  L +
Subjt:  STKNSIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNR

Query:  QGA
        Q A
Subjt:  QGA

Q9M322 General transcription and DNA repair factor IIH subunit TFB1-34.2e-10761.22Show/hide
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK+ VKDPGT G LE++E   +F P+DP S  KL V+   IK  K TKEGSNKPPWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E 
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQRFIPGF
              + V   P EQLS +E ELR + L+E+SELQKLHKQFV S VLTE EFW+ RKKLL +DS +KSKQ +G KS MV   KP +DGR     F    
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQRFIPGF

Query:  NEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTH
        +E++F    IFA KPAV QAF+N+VP KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA E R+K+R VDPTLD++AD GDDYTH
Subjt:  NEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTH

Query:  LPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        L DHGI RDG  +I E QN+  KR+L QDLNR  AVVLEGR I
Subjt:  LPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

Arabidopsis top hitse value%identityAlignment
AT1G55750.1 BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins)5.5e-11060.87Show/hide
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK++VKDPGTPG L + E   +F P+DP S SKL V  + IK  K TKEGSNKPPWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK    
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRL--ISQRFIP
         +    + V +   EQLS  E+ELR + L+E+SELQ+LHKQFV S VLTE EFWA RKKLL +DS +KSKQ +G KS MV   KP +DGR   ++    P
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRL--ISQRFIP

Query:  GFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY
           E++F    IFA KPAV QAF+N+VP+KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA ETR KIR VDPTLD+EAD GDDY
Subjt:  GFNEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY

Query:  THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        THL DHGI RDG  ++ E QN+ +KR+L QDLNR  AVVLEGR+I
Subjt:  THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI

AT3G61420.1 BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins)3.0e-10861.22Show/hide
Query:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA
        + +  KYK+ VKDPGT G LE++E   +F P+DP S  KL V+   IK  K TKEGSNKPPWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E 
Subjt:  VHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA

Query:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQRFIPGF
              + V   P EQLS +E ELR + L+E+SELQKLHKQFV S VLTE EFW+ RKKLL +DS +KSKQ +G KS MV   KP +DGR     F    
Subjt:  AQAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQRFIPGF

Query:  NEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTH
        +E++F    IFA KPAV QAF+N+VP KM+EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+FLK DEILA E R+K+R VDPTLD++AD GDDYTH
Subjt:  NEVVFDIRGIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTH

Query:  LPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI
        L DHGI RDG  +I E QN+  KR+L QDLNR  AVVLEGR I
Subjt:  LPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACCAAGTATGTCCATAGGAGTGCTAAGTACAAGACCTCAGTTAAGGATCCTGGCACGCCTGGCGTTTTGGAAATGACAGAGCGCAAGTTCGTATTTAGACCCAG
TGATCCCACTTCAACTTCTAAGCTTGACGTGGAGTTTAGATTTATTAAAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACC
AGGGTGGAAGTTACATTTTTGAGTTTAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTCCCTCT
GAGAGGCCTGTGGCGGCATTTCCTCATGAACAACTCAGTAAATCAGAAATGGAACTTCGAATGAGATGTTTGCAAGAGGATAGTGAACTGCAGAAACTCCATAAACAATT
TGTGATTAGTGGTGTGTTGACAGAATCTGAGTTTTGGGCAGCAAGGAAGAAATTACTGGAACGAGACAGCTCCAAAAAGTCAAAACAACTAATTGGTTTTAAGAGTTCAA
TGGTTTTGGATACCAAACCAATGTCTGATGGTCGGCTGATTAGTCAAAGGTTTATTCCTGGTTTCAACGAGGTGGTGTTCGATATCAGGGGTATTTTTGCTCTAAAGCCA
GCTGTTCACCAGGCCTTCCTTAATCATGTTCCCAATAAGATGTCGGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGC
AGCTGCAGCAGAGGCTGCTGAAGACGAAGAACTTGCCCTTTTTCTGAAGGACGACGAGATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGG
ATTTGGAAGCGGATCTAGGAGATGATTACACACACCTTCCAGATCATGGAATCTTTCGTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACT
TTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGAAGAACTATAGGTTAG
mRNA sequenceShow/hide mRNA sequence
TTAATAATTTAACCAAAAAAATAAAAAATCATACTCTTTCATCTCTTTCTCCCTCACGCTCGATTCGCTCTTTCACGGATCCTCCCGCCGTCGCCGATTGCCATCACTAT
CTCGCACGCCATTGCTCCGCCCTCTTCGACGCAGACTGCCGTCGTCCTCTAAGCGTCGATAAGTACAGTCGAACGTCTCCCTCTTTCTCTCTCTAAGTTCGATCTCGCCC
AGCGCCGCCGTTCATCCTCAATCTCAGATCTTAATCGCCGTCGCAGTCCAGTCGTGGCTGCCGTGCCCACGTGCAAAAACGTTTTTTCGACGAGCAGCTTAGATTCAAGA
GTTTGAATGTGTTTAGCACTCTGTCCAGCAAGATTAAGGACCTTGGCAGCGCATTCGTGTCAGTTTCGAACCTATTCAAACACGAACCAGCAAGTAATTAAGCTTTTCCG
GCATTAATCAGTGGATTGATCGAGCAAGAGCTAAGGAGGACGTTGGGCAAAATGAGTTAATAAGCCTCCTAAGTATCCTTTGGGTAAGGTTCGTCAGAGTAATTTTTAGT
TGTATAGTTAATTTTCGGATTGAATTGAAGTATTACCAGATCAGTTTCTAACCAATGGAGCCTATATGCGTTTAGGTTGAACCGATTAGATTGTGACTTATGAGATGACG
TTGGGTTGATCTGAGGAGATTTGATCGAGCAGACTGGGATATTTGTACTTTGGATTGCTAATCAGACTGATCGTCTCTAAGGGAAGATGGGAACCAAGTATGTCCATAGG
AGTGCTAAGTACAAGACCTCAGTTAAGGATCCTGGCACGCCTGGCGTTTTGGAAATGACAGAGCGCAAGTTCGTATTTAGACCCAGTGATCCCACTTCAACTTCTAAGCT
TGACGTGGAGTTTAGATTTATTAAAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACCAGGGTGGAAGTTACATTTTTGAGT
TTAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTCCCTCTGAGAGGCCTGTGGCGGCATTTCCT
CATGAACAACTCAGTAAATCAGAAATGGAACTTCGAATGAGATGTTTGCAAGAGGATAGTGAACTGCAGAAACTCCATAAACAATTTGTGATTAGTGGTGTGTTGACAGA
ATCTGAGTTTTGGGCAGCAAGGAAGAAATTACTGGAACGAGACAGCTCCAAAAAGTCAAAACAACTAATTGGTTTTAAGAGTTCAATGGTTTTGGATACCAAACCAATGT
CTGATGGTCGGCTGATTAGTCAAAGGTTTATTCCTGGTTTCAACGAGGTGGTGTTCGATATCAGGGGTATTTTTGCTCTAAAGCCAGCTGTTCACCAGGCCTTCCTTAAT
CATGTTCCCAATAAGATGTCGGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGA
CGAAGAACTTGCCCTTTTTCTGAAGGACGACGAGATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCGGATCTAGGAGATG
ATTACACACACCTTCCAGATCATGGAATCTTTCGTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACTTTGTCACAAGACCTTAATCGTCAA
GGTGCAGTTGTTCTTGAAGGAAGAACTATAGGTTAG
Protein sequenceShow/hide protein sequence
MGTKYVHRSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSTSKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAPS
ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVISGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRLISQRFIPGFNEVVFDIRGIFALKP
AVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRT
LSQDLNRQGAVVLEGRTIG