CuGenDBv2

PI0016620 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0016620
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: General transcription factor IIH subunit
Genome location: chr09:3972851..3975407
RNA-Seq Expression: PI0016620
Synteny: PI0016620
Gene Ontology terms: GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
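
The genome location field above is a `chrom:start..end` string. The following is a minimal sketch for splitting it into its parts. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr09:3972851..3975407")
# Span length under the ASSUMED 1-based inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2557 bp on chr09. With 0-based half-open coordinates the span would instead be `end - start`.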


Homology
GenBank top hits
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa]; e value: 1.0e-229; % identity: 92.67
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKG+IRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VHSYLNQIPSYGHREVL+LYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQESLMN GTGNSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRCPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRCPKLATSDE
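
In a BLAST-style alignment such as the one above, the middle line carries a letter where query and subject are identical, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of counting identities from one midline block follows; `identity_from_midline` is a hypothetical helper, and it assumes the midline has been kept at the full width of the aligned block (trailing spaces included).

```python
def identity_from_midline(midline):
    """Return (identical, aligned_length, percent) for one midline block.

    Letters count as identical positions; '+' and spaces do not.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    length = len(midline)
    return identical, length, 100.0 * identical / length


# Midline of the final block of the alignment above (23 aligned columns).
midline = "HESLHNCPGCESFR PKLAT DE"
identical, length, percent = identity_from_midline(midline)
print(identical, length, round(percent, 2))
```

This covers a single block only. The reported % identity is computed over the whole alignment, so the per-block counts would need to be summed before dividing.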

XP_004143721.1 general transcription factor IIH subunit 2 [Cucumis sativus]; e value: 4.9e-229; % identity: 92.43
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKG+IRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC
        FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VHSYLNQIPSYGHREVL+LYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQESLMN  TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRCPKLATSDE
        HESLHNCPGCESFR PKLATSDE
Subjt:  HESLHNCPGCESFRCPKLATSDE

XP_008467294.1 PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]; e value: 8.4e-229; % identity: 92.43
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKG+IRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VHSYLNQIPSYGHREVL+LYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQESLMN GT NSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRCPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRCPKLATSDE

XP_022998882.1 general transcription factor IIH subunit 2 [Cucurbita maxima]; e value: 1.4e-220; % identity: 88.92
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKG+IRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL++V  YLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILY----------ADISAKKLSHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+            + + A   SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILY----------ADISAKKLSHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  N GTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRCPKLATSDE
        IHESLHNCPGCESFR PK ATSD+
Subjt:  IHESLHNCPGCESFRCPKLATSDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]; e value: 3.9e-226; % identity: 91.51
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM
        MNNGENRRLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKG+IRYLYIVIDFS+AATEM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VH +LNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPR+QLPKVCFGCQESLMN GTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRCPKLATSDE
        IHESLHNCPGCESFR PK ATSDE
Subjt:  IHESLHNCPGCESFRCPKLATSDE

TrEMBL top hits
A0A0A0KPM4 General transcription factor IIH subunit; e value: 2.4e-229; % identity: 92.43
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKG+IRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC
        FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VHSYLNQIPSYGHREVL+LYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQESLMN  TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRCPKLATSDE
        HESLHNCPGCESFR PKLATSDE
Subjt:  HESLHNCPGCESFRCPKLATSDE

A0A1S3CUH8 General transcription factor IIH subunit; e value: 4.1e-229; % identity: 92.43
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKG+IRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VHSYLNQIPSYGHREVL+LYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQESLMN GT NSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRCPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRCPKLATSDE

A0A5A7UNG8 General transcription factor IIH subunit; e value: 4.8e-230; % identity: 92.67
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKG+IRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLE+VHSYLNQIPSYGHREVL+LYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEI       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQESLMN GTGNSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRCPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRCPKLATSDE

A0A6J1GC53 General transcription factor IIH subunit; e value: 1.2e-220; % identity: 88.68
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKG+IRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL++V  YLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+       ++           SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  N GTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRCPKLATSDE
        IHESLHNCPGCESFR PK ATSD+
Subjt:  IHESLHNCPGCESFRCPKLATSDE

A0A6J1K980 General transcription factor IIH subunit; e value: 7.0e-221; % identity: 88.92
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKG+IRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL++V  YLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILY----------ADISAKKLSHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+            + + A   SHFKELLLEHAPPPPAIADSA+PNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEILY----------ADISAKKLSHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  N GTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRCPKLATSDE
        IHESLHNCPGCESFR PK ATSD+
Subjt:  IHESLHNCPGCESFRCPKLATSDE

SwissProt top hits
O74995 General transcription and DNA repair factor IIH subunit ssl1; e value: 6.8e-80; % identity: 38.95
Query:  NNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDF
        +  E+ + NG         D N    WE  Y   RSW+ +QED  G L  +    I   + +R LR       T  +Q+GIIR++ +V+D S +  E DF
Subjt:  NNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDF

Query:  RPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCD
           R  +  K+   FV EFF+QNP+SQ+ ++ + DG+AH +TDL G+P+SH++ L    +CSG+ SLQN LE+  + L+ I S+G REVLI++ ++ S D
Subjt:  RPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKLS--------------HFKELLLEHA-PPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEA
        PGDI +T+       IR  ++GL AE+        K +              HF+ELLLE   PP    A +   +L+ MGFP +  E   ++C+CH   
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKLS--------------HFKELLLEHA-PPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEA

Query:  KVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKV--CFGCQ----ESLMNAGTGNSPGIRVSCPKCKQ
           GG+ CPRCKA+VC LP EC  C L LI S HLARSYHHLFP+  + E+       P    PK   CF CQ    +  ++    ++  +R +CP CK 
Subjt:  KVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKV--CFGCQ----ESLMNAGTGNSPGIRVSCPKCKQ

Query:  HFCLDCDIYIHESLHNCPGCE
        HFCLDCD++ HE LH C GC+
Subjt:  HFCLDCDIYIHESLHNCPGCE

Q13888 General transcription factor IIH subunit 2; e value: 2.3e-80; % identity: 38.52
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K ++ FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+    + A++           SH+KELL  H  PPPA + S   +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein; e value: 5.2e-80; % identity: 38.52
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K ++ FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+    + A++           SH+KELL  H  PPPA + S   +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEILYADISAKKL----------SHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q86KZ2 General transcription factor IIH subunit 2; e value: 1.8e-80; % identity: 38.59
Query:  NNGENRRLNGEA-DEED--------DDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSS---LATTARIQKGIIRYLYIV
        NN +N+R N    D+ED        +D+D      WE  +  +++W  + EDE G LRP + +        RRL+       L+   R+++G+ R+L ++
Subjt:  NNGENRRLNGEA-DEED--------DDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSS---LATTARIQKGIIRYLYIV

Query:  IDFSKAATEMDFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHRE
        +D SK  +  D +PSR  V+ ++V+ F++EFFDQNP+SQ+ ++  K+  A  +++L G+   H++A+   +   G+ S+QN LEV  S L  +P YG RE
Subjt:  IDFSKAATEMDFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHRE

Query:  VLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKLS----------HFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAI
        VL ++S+L +CDP  + +T+Q  K   IR S I + AE+      A++ +          HF E L+    PPP I  +    L++MGFPQ+   +  + 
Subjt:  VLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLTAEILYADISAKKLS----------HFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAI

Query:  CSCHKEAKVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPK--VCFGCQESLMNAGTGNSPGIRVSCPK
        C CH++ K   GY CPRC  + CELPT+CQIC L+L+SSPHLARSYHHLF I  F+EV+ K       +L K   C GC    +++   +   +  SCP+
Subjt:  CSCHKEAKVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKIFHDPRHQLPK--VCFGCQESLMNAGTGNSPGIRVSCPK

Query:  CKQHFCLDCDIYIHESLHNCPGCES
        C++ FCLDCD++IHESLHNCPGCE+
Subjt:  CKQHFCLDCDIYIHESLHNCPGCES

Q9ZVN9 General transcription factor IIH subunit 2; e value: 8.6e-176; % identity: 72.33
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKG+IRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIM
        A++AKHV+AF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LE+VH +LNQ+PSYGHREVLILYSAL +CDPGDIM
Subjt:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEILYADISAKKLS----------HFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+       ++            H K+LLLEHAPPPPAIA+ AI NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEILYADISAKKLS----------HFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRCPK
        NCPGCES   PK
Subjt:  NCPGCESFRCPK

Arabidopsis top hits
AT1G05055.1 general transcription factor II H2; e value: 6.1e-177; % identity: 72.33
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKG+IRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIM
        A++AKHV+AF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LE+VH +LNQ+PSYGHREVLILYSAL +CDPGDIM
Subjt:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEILYADISAKKLS----------HFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+       ++            H K+LLLEHAPPPPAIA+ AI NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEILYADISAKKLS----------HFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRCPK
        NCPGCES   PK
Subjt:  NCPGCESFRCPK


Sequences
CDS sequence
ATGAACAATGGGGAAAATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGACGACGATGCCAATGGACTTGCTGCGTGGGAAAGGACTTATGCGGATGATAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGTACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTATTATTCGCTATCTCTATATCGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTTGTGGCA
AAACATGTAGATGCTTTTGTAAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGG
AAGTCCCGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAAGTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCATAGAGAAGTTTTAATTTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCA
GTTATTGGTCTTACTGCAGAAATTTTATATGCAGACATCTCTGCCAAGAAACTGTCTCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATTGCAGA
CTCTGCCATTCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAGGAAGCTAAAGTTGGAGGGGGCTATACTT
GCCCTCGATGCAAAGCACGGGTTTGTGAGCTTCCCACAGAGTGTCAAATTTGTGGACTGACACTTATCTCCTCACCCCATTTGGCTAGGTCGTATCATCATCTCTTTCCA
ATTATACCATTTGATGAAGTCTCTGATAAAATATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATGCTGGCACAGGTAA
CAGTCCAGGCATCCGTGTTTCTTGCCCAAAGTGCAAACAACATTTCTGCCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTCA
GGTGTCCCAAATTAGCGACTTCTGACGAATGA
mRNA sequence
ATGAACAATGGGGAAAATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGACGACGATGCCAATGGACTTGCTGCGTGGGAAAGGACTTATGCGGATGATAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGTACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTATTATTCGCTATCTCTATATCGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTTGTGGCA
AAACATGTAGATGCTTTTGTAAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGG
AAGTCCCGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAAGTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCATAGAGAAGTTTTAATTTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCA
GTTATTGGTCTTACTGCAGAAATTTTATATGCAGACATCTCTGCCAAGAAACTGTCTCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCAATTGCAGA
CTCTGCCATTCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTCACAAGGAAGCTAAAGTTGGAGGGGGCTATACTT
GCCCTCGATGCAAAGCACGGGTTTGTGAGCTTCCCACAGAGTGTCAAATTTGTGGACTGACACTTATCTCCTCACCCCATTTGGCTAGGTCGTATCATCATCTCTTTCCA
ATTATACCATTTGATGAAGTCTCTGATAAAATATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATGCTGGCACAGGTAA
CAGTCCAGGCATCCGTGTTTCTTGCCCAAAGTGCAAACAACATTTCTGCCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTCA
GGTGTCCCAAATTAGCGACTTCTGACGAATGA
Protein sequence
MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGIIRYLYIVIDFSKAATEMDFRPSRMAVVA
KHVDAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLEVVHSYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCS
VIGLTAEILYADISAKKLSHFKELLLEHAPPPPAIADSAIPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFP
IIPFDEVSDKIFHDPRHQLPKVCFGCQESLMNAGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRCPKLATSDE
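
The protein sequence above should correspond to the CDS sequence read codon by codon. A minimal sketch of that check follows, assuming the standard genetic code; `translate` and the table names are illustrative, not taken from CuGenDB. Only the first 30 bases of the CDS are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS sequence listed above.
cds_prefix = "ATGAACAATGGGGAAAATAGGCGATTGAAT"
print(translate(cds_prefix))
```

Joining the CDS lines into one string and passing it to `translate` allows the full result to be compared with the listed protein sequence.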