CuGenDBv2

CmoCh01G005670 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh01G005670
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: General transcription factor IIH subunit
Genome location: Cmo_Chr01:2849974..2852690
RNA-Seq Expression: CmoCh01G005670
Synteny: CmoCh01G005670
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG7036868.1 General transcription factor IIH subunit 2 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.6e-251; % identity: 99.76
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSATSDD
Subjt:  IHESLHNCPGCESFRRPKSATSDD
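
The % identity listed for a hit can be checked against the middle line of its alignment. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical and is not part of CuGenDB.

```python
from typing import Optional


def percent_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style middle line.

    Letters count as identical positions; '+' and spaces do not.
    aligned_length defaults to the length of the middle line; pass it
    explicitly if trailing spaces may have been stripped.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length == 0:
        raise ValueError("alignment is empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# A 16-column excerpt of the middle line above, containing the single '+'.
excerpt = "LDLVCGYL+QIPSYGH"
print(round(percent_identity(excerpt), 2))
```

Applied to the whole alignment above (424 columns with one `+`, assuming the blocks shown are the complete alignment), the same calculation gives 423/424, which rounds to the 99.76 reported for this hit.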

XP_022157930.1 general transcription factor IIH subunit 2 [Momordica charantia]; e value: 2.4e-239; % identity: 94.1
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSATSDD

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]; e value: 4.3e-252; % identity: 100
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSATSDD
Subjt:  IHESLHNCPGCESFRRPKSATSDD

XP_022998882.1 general transcription factor IIH subunit 2 [Cucurbita maxima]; e value: 2.1e-251; % identity: 99.76
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSATSDD
Subjt:  IHESLHNCPGCESFRRPKSATSDD

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]; e value: 6.8e-242; % identity: 95.28
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAA EM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV G+LNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPR+QLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSATSD+
Subjt:  IHESLHNCPGCESFRRPKSATSDD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPM4 General transcription factor IIH subunit; e value: 5.5e-237; % identity: 93.63
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKSATSDD

A0A5A7UNG8 General transcription factor IIH subunit; e value: 6.5e-238; % identity: 94.1
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLL PIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFR PK AT D+
Subjt:  IHESLHNCPGCESFRRPKSATSDD

A0A6J1DUE2 General transcription factor IIH subunit; e value: 1.2e-239; % identity: 94.1
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSATSDD

A0A6J1GC53 General transcription factor IIH subunit; e value: 2.1e-252; % identity: 100
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSATSDD
Subjt:  IHESLHNCPGCESFRRPKSATSDD

A0A6J1K980 General transcription factor IIH subunit; e value: 1.0e-251; % identity: 99.76
Query:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
        MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM
Subjt:  MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSDD
        IHESLHNCPGCESFRRPKSATSDD
Subjt:  IHESLHNCPGCESFRRPKSATSDD

SwissProt top hits (e value, % identity, alignment)
Q13888 General transcription factor IIH subunit 2; e value: 9.7e-90; % identity: 42.42
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LYVV+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ           + S ++       + G   GGY CP+C+A+ CEL
Subjt:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ    +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 2; e value: 7.4e-90; % identity: 41.92
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LYVV+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR S+IGL+
Subjt:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ           + S ++       + G   GGY CP+C+A+ CEL
Subjt:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ    +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein; e value: 1.3e-89; % identity: 42.42
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LYVV+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ           + S ++       + G   GGY CP+C+A+ CEL
Subjt:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ    +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9JIB4 General transcription factor IIH subunit 2; e value: 4.8e-89; % identity: 42.07
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDE+G L+      +F A+ +R            +++ G++R+LYVV+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  KT+KIR SVIGL+
Subjt:  QIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA------GESSIAICSCH------KEAKVGGGYTCPRCKARVCE
        AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ         ++  +    H      +     GGY CP+C+A+ CE
Subjt:  AELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA------GESSIAICSCH------KEAKVGGGYTCPRCKARVCE

Query:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S + +   R      C+GCQ    +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 2; e value: 4.5e-188; % identity: 75
Query:  NGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDF
        + + +R N E EEEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AI+HAQYRRRLR LS+ A   RIQKGLIRYLY+VIDFSRAAAEMDF
Subjt:  NGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH+LTDLGGSPE+H+KALMGKLE  G++SLQN L+LV  +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRA E S+AICSCHKE K+G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ++    G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKSAT
        HESLHNCPGCES  RPKS +
Subjt:  HESLHNCPGCESFRRPKSAT

Arabidopsis top hits (e value, % identity, alignment)
AT1G05055.1 general transcription factor II H2; e value: 3.2e-189; % identity: 75
Query:  NGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDF
        + + +R N E EEEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AI+HAQYRRRLR LS+ A   RIQKGLIRYLY+VIDFSRAAAEMDF
Subjt:  NGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH+LTDLGGSPE+H+KALMGKLE  G++SLQN L+LV  +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRA E S+AICSCHKE K+G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ++    G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKSAT
        HESLHNCPGCES  RPKS +
Subjt:  HESLHNCPGCESFRRPKSAT


Sequences
CDS sequence
ATGAACAATGGCGAAAATCGGCGATTAAATGGTGAGGCCGAAGAAGAAGACGATGATGACGATGCCAATGGCGGAATGGCGGCCTGGGAAAGGACTTATGCCGATGATAG
GTCGTGGGAAGCCCTACAAGAGGACGAGTCTGGGCTTCTTCGGCCGATTGACAATAAGGCCATTTTCCATGCCCAATATCGCAGACGTCTTCGTTCACTTTCTTCGTTAG
CAACCACTGCTCGAATTCAAAAGGGTCTTATTCGCTATCTCTATGTCGTCATTGATTTCTCCAGGGCAGCAGCGGAAATGGATTTTCGACCGAGTCGAATGGCTGTGGTT
GCAAAACATGTGGAAGCTTTTGTCAGGGAATTCTTTGACCAAAATCCTCTCAGTCAGATTGGTTTGGTGACTATAAAGGATGGAGTTGCTCATAGCTTAACAGATCTTGG
TGGAAGTCCTGAGTCCCATGTTAAAGCATTAATGGGGAAACTGGAATGCTCAGGTGAAGCATCCTTGCAGAATGGTCTGGATCTTGTTTGTGGCTATCTAAATCAAATAC
CATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTCAATTCCTGTGATCCTGGAGATATAATGGAGACAGTTCAGAAATGCAAAACATCTAAAATAAGGTGT
TCAGTAATTGGTCTTACTGCTGAACTTTTTATATGCAGACATCTCTGTCAGGAAACTGGTGGCTCGTACTCGGTCGCATTGGATGAGTCCCATTTCAAAGAGTTGCTATT
AGAGCACGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTGATCAAGATGGGGTTTCCACAAAGAGCAGGGGAGAGTTCTATTGCGATTTGTTCGTGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCTCGAGTTTGCGAGCTGCCAACTGAGTGTCGAATTTGTGGATTGACACTTATTTCTTCACCC
CATTTGGCTAGGTCGTATCATCATCTCTTTCCGATTATACCATTTGATGAAGTTTCTGATAAACTATTTCATGATCCACGACATCAACTACCGAAAGTTTGCTTTGGCTG
CCAAGAAAACTTCACAAACCCTGGTACAGGTAACAGCCCAGGCATCCGAGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGA
GCTTGCACAATTGCCCTGGCTGTGAGAGTTTCAGACGTCCCAAATCAGCGACTTCCGATGATTGA
mRNA sequence
ATACAACCACATGGGTTTGAAAGATGATGGCCCAAAAAAAATTAGGCATCCCTAAAAAATCCCAATGACATTAGCATTACAAGTACAGTCTACGCCGAAACATTTCCTAA
GCGCTCAGTTTCTGCCTCCGCCACTGAACACCATTGCAACTGCCACTGTTCTCTCATCTCTTCTCTTCACACTCGAATCCCACCCATCAAATTCATACCCACCTCAGTTC
GTCATTATCGATTCCTCAGCGCACCTTCAATCCCCGAATTGGCCCTCATTTTTCCAATTCTTTAGACCTACCTACTGTTACTCTTTTAACTCCCCTAATGCTGCTTGTTC
TCTAAGTATAGATTGTTTTCGATATTGGGGGTTCATCAATTTGATCTCATTTCAAAACCCTCAAGAAATTCCTTAGGAAAAACCATGAACAATGGCGAAAATCGGCGATT
AAATGGTGAGGCCGAAGAAGAAGACGATGATGACGATGCCAATGGCGGAATGGCGGCCTGGGAAAGGACTTATGCCGATGATAGGTCGTGGGAAGCCCTACAAGAGGACG
AGTCTGGGCTTCTTCGGCCGATTGACAATAAGGCCATTTTCCATGCCCAATATCGCAGACGTCTTCGTTCACTTTCTTCGTTAGCAACCACTGCTCGAATTCAAAAGGGT
CTTATTCGCTATCTCTATGTCGTCATTGATTTCTCCAGGGCAGCAGCGGAAATGGATTTTCGACCGAGTCGAATGGCTGTGGTTGCAAAACATGTGGAAGCTTTTGTCAG
GGAATTCTTTGACCAAAATCCTCTCAGTCAGATTGGTTTGGTGACTATAAAGGATGGAGTTGCTCATAGCTTAACAGATCTTGGTGGAAGTCCTGAGTCCCATGTTAAAG
CATTAATGGGGAAACTGGAATGCTCAGGTGAAGCATCCTTGCAGAATGGTCTGGATCTTGTTTGTGGCTATCTAAATCAAATACCATCATATGGGCATAGAGAAGTTTTA
ATCTTATACTCTGCTCTCAATTCCTGTGATCCTGGAGATATAATGGAGACAGTTCAGAAATGCAAAACATCTAAAATAAGGTGTTCAGTAATTGGTCTTACTGCTGAACT
TTTTATATGCAGACATCTCTGTCAGGAAACTGGTGGCTCGTACTCGGTCGCATTGGATGAGTCCCATTTCAAAGAGTTGCTATTAGAGCACGCACCCCCACCCCCAGCAA
TAGCAGACTCTGCAATGCCTAATTTGATCAAGATGGGGTTTCCACAAAGAGCAGGGGAGAGTTCTATTGCGATTTGTTCGTGTCACAAGGAAGCTAAAGTTGGAGGGGGC
TATACTTGCCCTCGATGCAAAGCTCGAGTTTGCGAGCTGCCAACTGAGTGTCGAATTTGTGGATTGACACTTATTTCTTCACCCCATTTGGCTAGGTCGTATCATCATCT
CTTTCCGATTATACCATTTGATGAAGTTTCTGATAAACTATTTCATGATCCACGACATCAACTACCGAAAGTTTGCTTTGGCTGCCAAGAAAACTTCACAAACCCTGGTA
CAGGTAACAGCCCAGGCATCCGAGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGCCCTGGCTGTGAG
AGTTTCAGACGTCCCAAATCAGCGACTTCCGATGATTGA
Protein sequence
MNNGENRRLNGEAEEEDDDDDANGGMAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYVVIDFSRAAAEMDFRPSRMAVV
AKHVEAFVREFFDQNPLSQIGLVTIKDGVAHSLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRC
SVIGLTAELFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSP
HLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCFGCQENFTNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSATSDD
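
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers, not a CuGenDB tool. The two test strings are the first 33 and last 18 nucleotides of the CDS as listed.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGAACAATGGCGAAAATCGGCGATTAAATGGT"
cds_end = "GCGACTTCCGATGATTGA"
print(translate(cds_start))  # MNNGENRRLNG
print(translate(cds_end))    # ATSDD
```

Both results agree with the first and last residues of the protein sequence listed above. Passing the full CDS, with or without its line breaks, to `translate` should reproduce the whole protein under the same assumption.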