; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0013205 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0013205
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGeneral transcription factor IIH subunit
Genome locationchr09:3859810..3862384
RNA-Seq ExpressionPI0013205
SyntenyPI0013205
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa]4.8e-22992.91Show/hide
Query:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD
        MNNGENR+LNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYL IVIDFSKAATEMD
Subjt:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD

Query:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--
        FRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVL+LYSALN  
Subjt:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--

Query:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
                           S IGLTAEIFICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_004143721.1 general transcription factor IIH subunit 2 [Cucumis sativus]4.1e-22892.43Show/hide
Query:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD
        MNNGENR+LNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYL IVIDFSKAATEMD
Subjt:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD

Query:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--
        FRPSRMAVVAKH++AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVL+LYSALN  
Subjt:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--

Query:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
                           S IGLTAEIFICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFRRPKLATSDE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_008467294.1 PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]4.1e-22892.67Show/hide
Query:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD
        MNNGENR+LNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYL IVIDFSKAATEMD
Subjt:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD

Query:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--
        FRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVL+LYSALN  
Subjt:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--

Query:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
                           S IGLTAEIFICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]5.3e-22088.92Show/hide
Query:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM
        MNNGENR+LNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYL +VIDFS+AA EM
Subjt:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM

Query:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-
        DFRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVLILYSALN 
Subjt:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-

Query:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
                            S IGLTAE+FICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]6.5e-22691.75Show/hide
Query:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM
        MNNGENR+LNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYL IVIDFS+AATEM
Subjt:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM

Query:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-
        DFRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH +LNQIPSYGHREVLILYSALN 
Subjt:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-

Query:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
                            S IGLTAEIFICRHLCQETGGSYS+ALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPR+QLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSDE
Subjt:  IHESLHNCPGCESFRRPKLATSDE

TrEMBL top hitse value%identityAlignment
A0A0A0KPM4 General transcription factor IIH subunit2.0e-22892.43Show/hide
Query:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD
        MNNGENR+LNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYL IVIDFSKAATEMD
Subjt:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD

Query:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--
        FRPSRMAVVAKH++AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVL+LYSALN  
Subjt:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--

Query:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
                           S IGLTAEIFICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFRRPKLATSDE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A1S3CUH8 General transcription factor IIH subunit2.0e-22892.67Show/hide
Query:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD
        MNNGENR+LNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYL IVIDFSKAATEMD
Subjt:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD

Query:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--
        FRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVL+LYSALN  
Subjt:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--

Query:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
                           S IGLTAEIFICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A5A7UNG8 General transcription factor IIH subunit2.3e-22992.91Show/hide
Query:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD
        MNNGENR+LNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYL IVIDFSKAATEMD
Subjt:  MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMD

Query:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--
        FRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVL+LYSALN  
Subjt:  FRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN--

Query:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
                           S IGLTAEIFICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  -------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A6J1GC53 General transcription factor IIH subunit2.6e-22088.92Show/hide
Query:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM
        MNNGENR+LNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYL +VIDFS+AA EM
Subjt:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM

Query:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-
        DFRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVLILYSALN 
Subjt:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-

Query:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
                            S IGLTAE+FICRHLCQETGGSYSVALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

A0A6J1K980 General transcription factor IIH subunit1.3e-21988.68Show/hide
Query:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM
        MNNGENR+LNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYL +VIDFS+AA EM
Subjt:  MNNGENRQLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEM

Query:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-
        DFRPSRMAVVAKH+EAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVLILYSALN 
Subjt:  DFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN-

Query:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
                            S IGLTAE+FICRHLCQETGGSY VALDE+HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  --------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

SwissProt top hitse value%identityAlignment
O74995 General transcription and DNA repair factor IIH subunit ssl18.6e-8038.66Show/hide
Query:  NNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDF
        +  E+ Q NG         D N    WE  Y   RSW+ +QED  G L  +    I   + +R LR       T  +Q+G+IR++ +V+D S +  E DF
Subjt:  NNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDF

Query:  RPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALNS--
           R  +  K+   FV EFF+QNP+SQ+ ++ + DG+AH +TDL G+P+SH++ L    +CSG+ SLQN LE+  + L+ I S+G REVLI++ ++ S  
Subjt:  RPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALNS--

Query:  -------------------FIGLTAEIFICRHLCQETGGS----YSVALDEAHFKELLLEHA-PPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEA
                            +GL AE+ IC+ +C +T  S    Y V + E HF+ELLLE   PP    A +   +L+ MGFP +  E   ++C+CH   
Subjt:  -------------------FIGLTAEIFICRHLCQETGGS----YSVALDEAHFKELLLEHA-PPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEA

Query:  KVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQ----ESLMNPGTGNSPGIRVSCPKCKQHF
           GG+ CPRCKA+VC LP EC  C L LI S HLARSYHHLFP+  + E+         H     CF CQ    +  ++P   ++  +R +CP CK HF
Subjt:  KVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQ----ESLMNPGTGNSPGIRVSCPKCKQHF

Query:  CLDCDIYIHESLHNCPGCE
        CLDCD++ HE LH C GC+
Subjt:  CLDCDIYIHESLHNCPGCE

Q13888 General transcription factor IIH subunit 22.8e-7838.52Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRMAVVAKHIEAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+L +V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRMAVVAKHIEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN------------------
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L                   
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN------------------

Query:  ---SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
           S IGL+AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  ---SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q2TBV5 General transcription factor IIH subunit 22.1e-7838.27Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRMAVVAKHIEAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+L +V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRMAVVAKHIEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN------------------
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L                   
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALN------------------

Query:  ---SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
           S IGL+AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  ---SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q86KZ2 General transcription factor IIH subunit 24.3e-7937.59Show/hide
Query:  NNGENRQLNGEA-DEED--------DDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSS---LATTARIQKGLIRYLCIV
        NN +N++ N    D+ED        +D+D      WE  +  +++W  + EDE G LRP + +        RRL+       L+   R+++G+ R+LC++
Subjt:  NNGENRQLNGEA-DEED--------DDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSS---LATTARIQKGLIRYLCIV

Query:  IDFSKAATEMDFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHRE
        +D SK  +  D +PSR  V+ +++E F++EFFDQNP+SQ+ ++  K+  A  +++L G+   H++A+   +   G+ S+QN LE+  S L  +P YG RE
Subjt:  IDFSKAATEMDFRPSRMAVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHRE

Query:  VLILYSALN---------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAI
        VL ++S+L                      SFI + AE++IC+ + ++T G+  V L+E HF E L+    PPP I  +    L++MGFPQ+   +  + 
Subjt:  VLILYSALN---------------------SFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAI

Query:  CSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCK
        C CH++ K   GY CPRC  + CELPT+C+IC L+L+SSPHLARSYHHLF I  F+EV+ K  +         C GC    ++    +   +  SCP+C+
Subjt:  CSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCK

Query:  QHFCLDCDIYIHESLHNCPGCES
        + FCLDCD++IHESLHNCPGCE+
Subjt:  QHFCLDCDIYIHESLHNCPGCES

Q9ZVN9 General transcription factor IIH subunit 24.6e-17472.33Show/hide
Query:  RQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYL IVIDFS+AA EMDFRPSRM
Subjt:  RQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRM

Query:  AVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSAL---------
        A++AKH+EAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVLILYSAL         
Subjt:  AVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSAL---------

Query:  ------------NSFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
                     S IGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ------------NSFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRRPK
        NCPGCES  RPK
Subjt:  NCPGCESFRRPK

Arabidopsis top hitse value%identityAlignment
AT1G05055.1 general transcription factor II H23.3e-17572.33Show/hide
Query:  RQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYL IVIDFS+AA EMDFRPSRM
Subjt:  RQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRM

Query:  AVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSAL---------
        A++AKH+EAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVLILYSAL         
Subjt:  AVVAKHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSAL---------

Query:  ------------NSFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
                     S IGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ------------NSFIGLTAEIFICRHLCQETGGSYSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRRPK
        NCPGCES  RPK
Subjt:  NCPGCESFRRPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGGGGAAAATAGGCAATTGAATGGGGAAGCCGATGAAGAAGATGACGACGACGATGCCAATGGACTTGCTGCGTGGGAAAGGACTTATGCGGATGACAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGTACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTGTATCGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTTGTGGCA
AAACACATAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGG
AAGTCCTGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCATAGAGAAGTTTTAATTTTATACTCTGCTCTTAATTCTTTTATTGGTCTTACTGCAGAAATTTTTATTTGCAGACATCTCTGCCAGGAAACTGGTGGCTCA
TACTCTGTCGCATTGGATGAGGCTCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTT
TCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAAGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTTC
CCACAGAGTGTCGAATTTGTGGACTGACACTTATCTCCTCACCTCATTTGGCTAGGTCATATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTA
TTTCATGACCCACGACACCAACTGCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGTCCAGGCATCCGTGTATCTTGCCCAAAGTG
CAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGATGTGAGAGTTTCAGGCGTCCCAAGTTAGCGACTTCTGATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATGGGGAAAATAGGCAATTGAATGGGGAAGCCGATGAAGAAGATGACGACGACGATGCCAATGGACTTGCTGCGTGGGAAAGGACTTATGCGGATGACAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGTACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTGTATCGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTTGTGGCA
AAACACATAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGG
AAGTCCTGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCATAGAGAAGTTTTAATTTTATACTCTGCTCTTAATTCTTTTATTGGTCTTACTGCAGAAATTTTTATTTGCAGACATCTCTGCCAGGAAACTGGTGGCTCA
TACTCTGTCGCATTGGATGAGGCTCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTT
TCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAAGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTTC
CCACAGAGTGTCGAATTTGTGGACTGACACTTATCTCCTCACCTCATTTGGCTAGGTCATATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTA
TTTCATGACCCACGACACCAACTGCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGTCCAGGCATCCGTGTATCTTGCCCAAAGTG
CAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGATGTGAGAGTTTCAGGCGTCCCAAGTTAGCGACTTCTGATGAATGA
Protein sequenceShow/hide protein sequence
MNNGENRQLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLCIVIDFSKAATEMDFRPSRMAVVA
KHIEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLILYSALNSFIGLTAEIFICRHLCQETGGS
YSVALDEAHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV
FHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLATSDE