; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0300 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0300
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionGeneral transcription factor IIH subunit
Genome locationMC03:8942004..8947084
RNA-Seq ExpressionMC03g0300
SyntenyMC03g0300
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036868.1 General transcription factor IIH subunit 2 [Cucurbita argyrosperma subsp. argyrosperma]9.72e-29893.4Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSAASNE

XP_022157930.1 general transcription factor IIH subunit 2 [Momordica charantia]1.94e-31399.53Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSG--NSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSG  NSQGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSG--NSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSAASNE
Subjt:  IHESLHNCPGCESFRRPKSAASNE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]1.68e-29893.63Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSAASNE

XP_022998882.1 general transcription factor IIH subunit 2 [Cucurbita maxima]1.38e-29793.4Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY +ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSAASNE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]4.31e-30195.52Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAA EM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV G+LNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPR++LPKVCFGCQESLMN  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSA S+E
Subjt:  IHESLHNCPGCESFRRPKSAASNE

TrEMBL top hitse value%identityAlignment
A0A0A0KPM4 General transcription factor IIH subunit1.19e-29393.16Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NG LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSS+ATTARIQKGLIRYLYIVIDFS+AA EM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV +DPRH+LPKVCFGCQESLMN  +GNS  IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPK A S+E
Subjt:  IHESLHNCPGCESFRRPKSAASNE

A0A5A7UNG8 General transcription factor IIH subunit5.91e-29493.4Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NG LAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSS+ATTARIQKGLIRYLYIVIDFS+AA EM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV +DPRH+LPKVCFGCQESLMN  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFR PK A  +E
Subjt:  IHESLHNCPGCESFRRPKSAASNE

A0A6J1DUE2 General transcription factor IIH subunit9.38e-31499.53Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSG--NSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSG  NSQGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSG--NSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSAASNE
Subjt:  IHESLHNCPGCESFRRPKSAASNE

A0A6J1GC53 General transcription factor IIH subunit8.14e-29993.63Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSAASNE

A0A6J1K980 General transcription factor IIH subunit6.69e-29893.4Show/hide
Query:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEA+EEDDDDD NGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY +ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +DPRH+LPKVCFGCQE+  N  +GNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMN--SGNSQGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAASNE
        IHESLHNCPGCESFRRPKSA S++
Subjt:  IHESLHNCPGCESFRRPKSAASNE

SwissProt top hitse value%identityAlignment
Q13888 General transcription factor IIH subunit 25.1e-9143.4Show/hide
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + I+  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y + LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  N  R      C+GCQ  L      Q + V C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 28.7e-9142.64Show/hide
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + I+  K +KIR S+IGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y + LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L      Q + V C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein6.7e-9143.4Show/hide
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + I+  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y + LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  N  R      C+GCQ  L      Q + V C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9JIB4 General transcription factor IIH subunit 25.7e-9042.53Show/hide
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDE+G L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + I+  KT+KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTCPRCKARVCE
        AE+ +C  L +ETGG+Y + LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++      +       GGY CP+C+A+ CE
Subjt:  AEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTCPRCKARVCE

Query:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S +      ++  + C+GCQ  L      Q + V C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 24.8e-19076.01Show/hide
Query:  NGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDF
        + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAAAEMDF
Subjt:  NGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN L+LV  +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGDIMETIQKCK SK+RCSVIGL+AE+FIC+HLCQETGG YS+A+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHE
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   LND R +L K CFGCQ+SL+ +GN     V+C KCK +FCLDCDIYIHE
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHE

Query:  SLHNCPGCESFRRPKSAASNE
        SLHNCPGCES  RPKS +  E
Subjt:  SLHNCPGCESFRRPKSAASNE

Arabidopsis top hitse value%identityAlignment
AT1G05055.1 general transcription factor II H23.4e-19176.01Show/hide
Query:  NGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDF
        + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAAAEMDF
Subjt:  NGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN L+LV  +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGDIMETIQKCK SK+RCSVIGL+AE+FIC+HLCQETGG YS+A+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDIMETIQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHE
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   LND R +L K CFGCQ+SL+ +GN     V+C KCK +FCLDCDIYIHE
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHE

Query:  SLHNCPGCESFRRPKSAASNE
        SLHNCPGCES  RPKS +  E
Subjt:  SLHNCPGCESFRRPKSAASNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGGCGAAGGTAGTCGATTGAATGGGGAAGCCGATGAAGAAGACGATGACGACGACGGCAATGGCGGACTCGCTGCGTGGGAGAGGACTTATGCGGATGATAG
GTCGTGGGAAGCCCTGCAAGAGGACGAGTCTGGCCTCCTCCGCCCCATCGATAATAAGGCCATTTACCATGCCCAGTATCGAAGGCGTCTCCGCTCCCTCTCTTCCATTG
CTACCACTGCTCGAATTCAGAAGGGTCTTATCCGCTATCTCTATATCGTCATTGACTTCTCCAGGGCAGCAGCAGAAATGGATTTTCGACCAAGTCGAATGGCAGTGGTG
GCGAAACATGTAGAGGCATTTGTGAGGGAATTCTTTGACCAGAATCCACTCAGTCAGATTGGTTTGGTGACAATAAAAGATGGAGTTGCTCATTGCCTCACGGATCTTGG
TGGCAGTCCTGAGTCCCATGTTAAAGCTTTAATGGGCAAATTGGAATGCTCTGGTGAAGCATCCCTGCAGAATGGTCTGGATCTTGTTTGCGGCTATCTGAATCAAATTC
CATCATATGGTCACAGAGAAGTCTTAATCTTATATTCTGCTCTTAATTCCTGTGATCCTGGGGATATAATGGAGACTATTCAGAAATGCAAAACATCTAAAATAAGGTGT
TCAGTAATTGGTCTTACTGCGGAAATTTTTATATGCAGACATCTCTGCCAGGAAACTGGTGGTTCATACTCTATCGCACTGGATGAGTCCCACTTCAAAGAGTTGCTATT
GGAGCATGCTCCGCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTGATCAAGATGGGCTTTCCACAAAGAGCTGCAGAGAGTTCCATTGCAATTTGTTCCTGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTACACTTGTCCTCGATGCAAAGCACGAGTTTGTGAGCTGCCGACTGAGTGTCGAATTTGTGGATTGACACTGATCTCCTCGCCC
CATTTGGCTCGGTCATATCATCATCTCTTTCCAATTATACCATTTGATGAGGTCTCTGACAAAGTACTTAACGATCCACGACATCGACTACCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTGATGAACTCTGGTAACAGCCAAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGC
ACAATTGCCCTGGCTGTGAAAGTTTCAGGCGTCCCAAATCAGCGGCTTCCAATGAA
mRNA sequenceShow/hide mRNA sequence
TTCACACTCGCCTTGTCTTGAAAATACAGAATTCTAAGTCTCCGGTTATGAACATCCTCGTCTACCCTATTCATGGTGTGTCTAAGACCGGTTATTAGGTCGAACTCCCG
CTTCCCAAAAGAGACCATATTCCCGAATAAGTTAAAGCTAATGAGGTCGTCTCTAGGTTCCTACACCTCTCTAAGCAACAGGTGATGGAGCAATGGACCGTTAAATACAA
CGTTCATCCCTAAAATCGGACCAAAACATGTTTGACTAAACATGTCTAACTGAGAAGGAGTTAACCTAGCCTTAAGACGATAAGAGGTTTTCCCTACGTGAGCGAGATTT
GACAGCGCGGCCGGGAACCAGTCATCTTGGTTGATCTTAAGTGTCATATCCATGTCTCCTGTAACAAAACAAAACAATATTGCTTAGAAACGGACCGGGTTGAAGCCTGG
GATGGAATCCAGAAAGTCGAAGTACAAGATACCTTCTTTCTGGATTCTATCTCGGGCCTCGTCCCCGTCCATTTTGAACCGGGATGAGTCCCAAGATGGATTCCAGACCA
TACCATTTCGGTCTTCATCCCGGGTCTATTTCGAATCGGGATGAGGCCCGAGATGGAATCCAGAAGGAATTCTGGAATGGACTTTCTGGGTTTCATCCCGGTCCTAAATC
CATCGGGCTTAAAGCCGAGATGGAACCCTGAAAGTCAAAAATTAAAACAAAACAATGGAAGCAACAATAAATCAAACTTCATAATAACCTTGGCTACTTCAGATAGAGCC
GAAGGTTGAAATCATTCACACCCACGCAAAGAACGAAGAAAAAAAAAGGTAGAAAATCATTCACAATAAGATCGCGAGTTCGAAAGAAAAAATACCTCAATTAGGGAAGG
GGGAAAGCTCTGATAGAGACGAAGTGAGGGAGACGAAGAAGAAACGACGGAGATGAACTTTCTGGTTGTGAGGGAGAGATCGGGATGAAGACTGGGATGGACAACGAGAT
AGTGAATGAGGGAGGGGAAGGACCGAGACAGGGAGGGAAAGGAGGGGAGGGCTGGGTTTGATTTCTTTTTTTAAATTGAGGGTAACTTGGTAATTTTACAAAGCAAAATA
AGGGGAGGGGTTAGAAATCTGAGGTGGAGGGGTCATAAATAGGTAAAAATCAAAATGGAGTTAAAAGTCAAAAGTGATTATTGGAAGGGGTCAAAAATCCCAATTTCCCT
AAAATCCGATCATTTTGGTAGAAAAAATTTGATACTAAAACACACTATTTGAAATCACTCAAGTCAAAAGTGTTCTTGCATTATTACATTACAACAATCAACGCCGAATC
AAGACCAATTTTTTTCCTAAGCCCTCCATCTGCAGTCGGCAGCCGGCAGCGGCGGCGAACCAACTACCACCGATCTCACTTCTCTTCTCTTTACTCTCAAATCCCAAATC
CCAATCCAAGCCAAACCAAATAAACCCAACTGAAAACTGTTACTCCCTCAACTGGGGTCGATCAATTTGATCTACTTTCAAAACCCTCGAGCATAGATTCTGGGAAAGCC
CCTCATCCGCCGGAAAAAAACATGAACAATGGCGAAGGTAGTCGATTGAATGGGGAAGCCGATGAAGAAGACGATGACGACGACGGCAATGGCGGACTCGCTGCGTGGGA
GAGGACTTATGCGGATGATAGGTCGTGGGAAGCCCTGCAAGAGGACGAGTCTGGCCTCCTCCGCCCCATCGATAATAAGGCCATTTACCATGCCCAGTATCGAAGGCGTC
TCCGCTCCCTCTCTTCCATTGCTACCACTGCTCGAATTCAGAAGGGTCTTATCCGCTATCTCTATATCGTCATTGACTTCTCCAGGGCAGCAGCAGAAATGGATTTTCGA
CCAAGTCGAATGGCAGTGGTGGCGAAACATGTAGAGGCATTTGTGAGGGAATTCTTTGACCAGAATCCACTCAGTCAGATTGGTTTGGTGACAATAAAAGATGGAGTTGC
TCATTGCCTCACGGATCTTGGTGGCAGTCCTGAGTCCCATGTTAAAGCTTTAATGGGCAAATTGGAATGCTCTGGTGAAGCATCCCTGCAGAATGGTCTGGATCTTGTTT
GCGGCTATCTGAATCAAATTCCATCATATGGTCACAGAGAAGTCTTAATCTTATATTCTGCTCTTAATTCCTGTGATCCTGGGGATATAATGGAGACTATTCAGAAATGC
AAAACATCTAAAATAAGGTGTTCAGTAATTGGTCTTACTGCGGAAATTTTTATATGCAGACATCTCTGCCAGGAAACTGGTGGTTCATACTCTATCGCACTGGATGAGTC
CCACTTCAAAGAGTTGCTATTGGAGCATGCTCCGCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTGATCAAGATGGGCTTTCCACAAAGAGCTGCAGAGAGTT
CCATTGCAATTTGTTCCTGTCACAAGGAAGCTAAAGTTGGAGGGGGCTACACTTGTCCTCGATGCAAAGCACGAGTTTGTGAGCTGCCGACTGAGTGTCGAATTTGTGGA
TTGACACTGATCTCCTCGCCCCATTTGGCTCGGTCATATCATCATCTCTTTCCAATTATACCATTTGATGAGGTCTCTGACAAAGTACTTAACGATCCACGACATCGACT
ACCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTGATGAACTCTGGTAACAGCCAAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATA
TTTATATTCACGAGAGCTTGCACAATTGCCCTGGCTGTGAAAGTTTCAGGCGTCCCAAATCAGCGGCTTCCAATGAA
Protein sequenceShow/hide protein sequence
MNNGEGSRLNGEADEEDDDDDGNGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSIATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVV
AKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETIQKCKTSKIRC
SVIGLTAEIFICRHLCQETGGSYSIALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSP
HLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCFGCQESLMNSGNSQGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSAASNE