CuGenDBv2

CSPI05G07930 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G07930
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: General transcription factor IIH subunit
Genome location: Chr5:6746410..6754238
RNA-Seq Expression: CSPI05G07930
Synteny: CSPI05G07930
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
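
The genome location in this record uses a "chromosome:start..end" notation. As a minimal sketch of how such a string might be split into its parts, the Python below parses it and reports the span length. The names `Locus` and `parse_locus` are hypothetical helpers introduced only for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval: chromosome name plus start and end positions."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'Chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(chrom, start, end)


locus = parse_locus("Chr5:6746410..6754238")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene is 7829 bp; with half-open coordinates it would be one base shorter.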


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa]; e value 7.1e-247; identity 98.58%
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_004143721.1 general transcription factor IIH subunit 2 [Cucumis sativus]; e value 1.2e-249; identity 99.53%
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG ANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFRRPKLATSDE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_008467294.1 PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]; e value 6.0e-246; identity 98.35%
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP T NSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]; e value 1.3e-237; identity 94.1%
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]; e value 1.6e-243; identity 96.93%
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AATEM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH +LNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPR+QLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSDE
Subjt:  IHESLHNCPGCESFRRPKLATSDE

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KPM4 General transcription factor IIH subunit; e value 5.6e-250; identity 99.53%
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG ANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFRRPKLATSDE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A1S3CUH8 General transcription factor IIH subunit; e value 2.9e-246; identity 98.35%
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP T NSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A5A7UNG8 General transcription factor IIH subunit; e value 3.4e-247; identity 98.58%
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A6J1GC53 General transcription factor IIH subunit; e value 6.5e-238; identity 94.1%
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

A0A6J1K980 General transcription factor IIH subunit; e value 3.2e-237; identity 93.87%
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVA+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q13888 General transcription factor IIH subunit 2; e value 2.2e-89; identity 40.99%
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q2TBV5 General transcription factor IIH subunit 2; e value 2.2e-89; identity 40.49%
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR S+IGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein; e value 2.8e-89; identity 40.99%
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q9JIB4 General transcription factor IIH subunit 2; e value 1.1e-88; identity 40.64%
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDE+G L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  KT+K
Subjt:  EFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTC
        IR SVIGL+AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++      +       GGY C
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTC

Query:  PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESL
        P+C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S + +   R      C+GCQ  L +            C  C+  FC+DCD+++H+SL
Subjt:  PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESL

Query:  HNCPGC
        H CPGC
Subjt:  HNCPGC

Q9ZVN9 General transcription factor IIH subunit 2; e value 4.8e-190; identity 76.46%
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM
        A++AKHVEAF+REFFDQNPLSQIGLV+IK+GVA+ LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVL+LYSAL +CDPGDIM
Subjt:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+    GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRRPK
        NCPGCES  RPK
Subjt:  NCPGCESFRRPK

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G05055.1 general transcription factor II H2; e value 3.4e-191; identity 76.46%
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM
        A++AKHVEAF+REFFDQNPLSQIGLV+IK+GVA+ LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVL+LYSAL +CDPGDIM
Subjt:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+    GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRRPK
        NCPGCES  RPK
Subjt:  NCPGCESFRRPK


Sequences
CDS sequence
ATGAACAATGGGGAAAATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGATGATGATGCCAATGGACTTGCTGCATGGGAAAGGACTTATGCGGATGATAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGTCCGATCGACAATAAAGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGAACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTATATTGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTGGTGGCA
AAACATGTAGAGGCTTTTGTAAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTAATTGCTTAACAGATCTTGGTGG
AAGTCCTGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCACAGAGAAGTTTTAGTCTTATACTCTGCTCTTAATTCTTGCGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCA
GTTATTGGTCTTACTGCAGAAATTTTTATTTGCAGACATCTCTGTCAAGAAACTGGTGGCTCGTACTCTGTCGCATTGGATGAGTCTCACTTTAAAGAGTTGCTATTGGA
GCATGCACCCCCACCCCCAGCAATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCCATTGCAATATGTTCATGCCACA
AGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTTCCCACAGAGTGTCGAATTTGTGGACTGACGCTTATCTCCTCTCCCCAT
TTGGCTAGGTCGTATCATCATCTTTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCA
AGAAAGCCTCATGAATCCTAGCACAGGTAACAGTCCAAGCATACGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCT
TGCACAATTGTCCTGGCTGTGAGAGTTTCAGGCGTCCCAAATTAGCGACTTCTGACGAATGA
mRNA sequence
ACGCTACTCTTTCTTCAGGGTTTTCTTCCCACAACATTCTCTTCCTCTCTTCAATTTCACCGTTCTTCTCTATCTAAAAAGTAAACGAATCAAACAAAATTCAAACTGTT
TTCCTCTTCATTGTTCATACTTACACTCGGAGATCGGGAGCAGTTAAACCCACAAATCGGAAGAACATGGGTCAAGCTTTTCGCAAACTCTTCGACTCATTTTTCGGCAA
GTCCGAGATGAGGAAGATGATGGCTTAGGGGTAGAACATTTCTTGGAACCTGGGCTTCACAAATGCTTGTTTGGAAGAAGTAATTGAAATCCAATGAAATCAAACCCAAC
TGGACTACATTGCTGATTCCTCATCCCTCCTATTGACGAATAGGACGCCATTACCTAAATATTTTTACCGACCTACTGTATTCCCCAAGTTGGGATTTTTGTGAGTAGAA
TTTGTTTTAGGTATTGGGGACTCATCAATTTGATCTTGTTTCAAAACCCAACAAGTGTATATCTTCTGAAATAGTCCTTAAATATGAACAATGGGGAAAATAGGCGATTG
AATGGGGAAGCCGATGAAGAAGATGATGATGATGATGCCAATGGACTTGCTGCATGGGAAAGGACTTATGCGGATGATAGGTCGTGGGAAGCCCTGCAAGAGGATGAGTC
TGGACTCCTTCGTCCGATCGACAATAAAGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGAACCCTTTCTTCCTTAGCAACCACTGCTCGGATTCAGAAGGGTCTTA
TTCGCTATCTCTATATTGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTGGTGGCAAAACATGTAGAGGCTTTTGTAAGGGAA
TTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTAATTGCTTAACAGATCTTGGTGGAAGTCCTGAATCTCATGTTAAAGCGTT
AATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCATCATATGGGCACAGAGAAGTTTTAGTCT
TATACTCTGCTCTTAATTCTTGCGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCAGTTATTGGTCTTACTGCAGAAATTTTT
ATTTGCAGACATCTCTGTCAAGAAACTGGTGGCTCGTACTCTGTCGCATTGGATGAGTCTCACTTTAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATTGC
AGACTCTGCCATGCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCCATTGCAATATGTTCATGCCACAAGGAAGCTAAAGTTGGAGGGGGCTATA
CTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTTCCCACAGAGTGTCGAATTTGTGGACTGACGCTTATCTCCTCTCCCCATTTGGCTAGGTCGTATCATCATCTTTTT
CCAATTATACCATTTGATGAAGTCTCTGATAAAGTATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTAGCACAGG
TAACAGTCCAAGCATACGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTT
TCAGGCGTCCCAAATTAGCGACTTCTGACGAATGAATGTCTACTTTTAGATGCAACATGGCCCTAACCGAATCCAAACAGGACGCTTCTCTGTCTCTGTTCATTTGCGTA
AAAAGCTGCAATGAGAGCTCTGTTGAATGAATGCTGAATCCAAGGCTCACACTGCTACCATGCAGTCGTCTTTCATGTTTCATCCAGATCCAAAGCTTGTGACTTTTTGA
CACTTCATGTCATTGTTGAAATTGGCTAGAAGATTAATTTTTCAAGATGAAAGGCTGAAAGCCAATTTCCTTCTAGGCTGCAGGATGGACGACTTTACTTGAATGTACCC
AAAATTGACCCCCAAATAGACATTTTTCCATTTTCATGAGATTTTGTAAGAACTTGATATTCATTATTATCTGATTCAGGTGTATTTTGATACACATTCTTTTAGTTAAG
TTATTAAAGGAGCTTGTGCCAGAAACTTCAGTGTTTACTATTAACAATAAACACTCACGTGTAACAATGTTG
Protein sequence
MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVA
KHVEAFVREFFDQNPLSQIGLVTIKDGVANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCS
VIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPH
LARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLATSDE