CuGenDBv2

HG10009392 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009392
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: General transcription factor IIH subunit
Genome location: Chr06:5423798..5426282
RNA-Seq Expression: HG10009392
Synteny: HG10009392
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
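
The genome location given for this gene follows a chromosome:start..end pattern. As a minimal sketch, it can be split into its parts as shown here. The function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates, a convention this page does not state.

```python
import re


def parse_location(location):
    """Split a 'Chr06:5423798..5426282' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr06:5423798..5426282")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```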


Homology
GenBank top hits
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa] (e value: 1.5e-241; % identity: 95.99)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AATEM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGLELV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIF+CRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSDK
        IHESLHNCPGCESFR PK  T D+
Subjt:  IHESLHNCPGCESFRRPKSVTSDK

KAG7036868.1 General transcription factor IIH subunit 2 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.0e-241; % identity: 95.27)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAA EM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGL+LV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+F+CRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSD
        IHESLHNCPGCESFRRPKS TSD
Subjt:  IHESLHNCPGCESFRRPKSVTSD

XP_022157930.1 general transcription factor IIH subunit 2 [Momordica charantia] (e value: 1.2e-241; % identity: 95.52)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGE  RLNGEADEEDDDDD NGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAA EM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIF+CRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPRH+LPKVCFGCQESLMN GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSDK
        IHESLHNCPGCESFRRPKS  S++
Subjt:  IHESLHNCPGCESFRRPKSVTSDK

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata] (e value: 5.2e-242; % identity: 95.51)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAA EM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+F+CRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSD
        IHESLHNCPGCESFRRPKS TSD
Subjt:  IHESLHNCPGCESFRRPKSVTSD

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida] (e value: 4.1e-247; % identity: 97.64)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGLELV G+LNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIF+CRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPR+QLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSDK
        IHESLHNCPGCESFRRPKS TSD+
Subjt:  IHESLHNCPGCESFRRPKSVTSDK

TrEMBL top hits
A0A0A0KPM4 General transcription factor IIH subunit (e value: 6.2e-241; % identity: 95.52)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AATEM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSG+ASLQNGLELV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIF+CRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSDK
        IHESLHNCPGCESFRRPK  TSD+
Subjt:  IHESLHNCPGCESFRRPKSVTSDK

A0A5A7UNG8 General transcription factor IIH subunit (e value: 7.4e-242; % identity: 95.99)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AATEM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGLELV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIF+CRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSDK
        IHESLHNCPGCESFR PK  T D+
Subjt:  IHESLHNCPGCESFRRPKSVTSDK

A0A6J1DUE2 General transcription factor IIH subunit (e value: 5.6e-242; % identity: 95.52)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGE  RLNGEADEEDDDDD NGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAA EM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIF+CRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPRH+LPKVCFGCQESLMN GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSDK
        IHESLHNCPGCESFRRPKS  S++
Subjt:  IHESLHNCPGCESFRRPKSVTSDK

A0A6J1GC53 General transcription factor IIH subunit (e value: 2.5e-242; % identity: 95.51)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAA EM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+F+CRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSD
        IHESLHNCPGCESFRRPKS TSD
Subjt:  IHESLHNCPGCESFRRPKSVTSD

A0A6J1K980 General transcription factor IIH subunit (e value: 1.3e-241; % identity: 95.27)
Query:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM
        MNNGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAA EM
Subjt:  MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+F+CRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGY+CPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSVTSD
        IHESLHNCPGCESFRRPKS TSD
Subjt:  IHESLHNCPGCESFRRPKSVTSD

SwissProt top hits
Q13888 General transcription factor IIH subunit 2 (e value: 5.7e-90; % identity: 42.68)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYSCPRCKARVCEL
        AE+ VC  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYSCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  +  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 2 (e value: 2.6e-90; % identity: 42.17)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR S+IGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYSCPRCKARVCEL
        AE+ VC  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYSCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein (e value: 7.4e-90; % identity: 42.68)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYSCPRCKARVCEL
        AE+ VC  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYSCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  +  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9JIB4 General transcription factor IIH subunit 2 (e value: 2.2e-89; % identity: 42.07)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDE+G L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  KT+KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYSCPRCKARVCE
        AE+ VC  L +ETGG+Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++      +       GGY CP+C+A+ CE
Subjt:  AEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYSCPRCKARVCE

Query:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S +      ++  + C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 2 (e value: 2.2e-190; % identity: 76.19)
Query:  NGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDF
        + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAA EMDF
Subjt:  NGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN LELV  +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+F+C+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L+D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYI
Subjt:  YSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKSVT
        HESLHNCPGCES  RPKSV+
Subjt:  HESLHNCPGCESFRRPKSVT

Arabidopsis top hits
AT1G05055.1 general transcription factor II H2 (e value: 1.5e-191; % identity: 76.19)
Query:  NGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDF
        + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAA EMDF
Subjt:  NGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN LELV  +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+F+C+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L+D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYI
Subjt:  YSCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKSVT
        HESLHNCPGCES  RPKSV+
Subjt:  HESLHNCPGCESFRRPKSVT
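
In the alignments listed for each hit, the unlabeled line between Query and Subjt appears to follow the usual BLAST midline convention: a letter where the residues are identical, `+` for a conservative substitution, and a space otherwise. Assuming that convention, the sketch below counts identical positions in one alignment block. The helper name `midline_identity` is hypothetical, and the example uses the final block of the first GenBank hit.

```python
def midline_identity(query, midline):
    """Count identical positions in one alignment block.

    Returns (identical, aligned_length). The midline is padded to the
    query length because trailing spaces are easily lost when copying.
    """
    padded = midline.ljust(len(query))[:len(query)]
    # Under the assumed convention, only identical residues appear as letters.
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, len(query)


query = "IHESLHNCPGCESFRRPKSVTSDK"
midline = "IHESLHNCPGCESFR PK  T D+"
identical, length = midline_identity(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```

Summing the two counts over every block of a hit gives a whole-alignment figure that can be compared with the reported % identity. Small differences are possible, depending on how the source tool counts gaps.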


Sequences
CDS sequence
ATGAACAATGGCGAAAATTTGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGCGGACTTGCTGCGTGGGAAAGGACTTATGCAGATGATAG
GTCGTGGGAAGCCTTGCAAGAGGATGAGTCTGGACTCCTTCGCCCGATCGACAATAAAGCAATTTACCACGCCCAGTATCGAAGGCGCCTTCGTTCCCTTTCTTCCTTGG
CAACCACTGCTCGAATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATTGACTTCTCTAGGGCAGCTACGGAAATGGATTTTCGACCAAGTCGAATGGCTGTTGTG
GCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGTTTAACAGATCTTGG
TGGAAGTCCTGAATCCCATGTTAAAGCGTTAATGGGTAAACTCGAATGCTCAGGTGAAGCATCCTTGCAGAATGGTTTGGAACTTGTTCAGGGCTATCTAAATCAAATTC
CATCATATGGGCATCGAGAAGTTTTAATATTATACTCTGCTCTTAATTCTTGTGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGT
TCAGTCATTGGTCTTACAGCAGAAATTTTTGTATGCAGACATCTCTGTCAAGAAACTGGTGGCTCATACTCTGTTGCACTGGATGAGTCCCACTTCAAAGAGTTGCTATT
GGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTACAGTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTGCCCACAGAGTGTCGAATTTGTGGATTGACACTTATCTCCTCACCC
CATTTGGCTAGGTCGTATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTACTTCATGATCCACGACATCAACTTCCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATATTCACGAGA
GCTTGCACAATTGTCCTGGCTGTGAGAGTTTCAGGCGTCCCAAATCGGTGACTTCTGACAAGTGA
mRNA sequence
ATGAACAATGGCGAAAATTTGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGCGGACTTGCTGCGTGGGAAAGGACTTATGCAGATGATAG
GTCGTGGGAAGCCTTGCAAGAGGATGAGTCTGGACTCCTTCGCCCGATCGACAATAAAGCAATTTACCACGCCCAGTATCGAAGGCGCCTTCGTTCCCTTTCTTCCTTGG
CAACCACTGCTCGAATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATTGACTTCTCTAGGGCAGCTACGGAAATGGATTTTCGACCAAGTCGAATGGCTGTTGTG
GCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGTTTAACAGATCTTGG
TGGAAGTCCTGAATCCCATGTTAAAGCGTTAATGGGTAAACTCGAATGCTCAGGTGAAGCATCCTTGCAGAATGGTTTGGAACTTGTTCAGGGCTATCTAAATCAAATTC
CATCATATGGGCATCGAGAAGTTTTAATATTATACTCTGCTCTTAATTCTTGTGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGT
TCAGTCATTGGTCTTACAGCAGAAATTTTTGTATGCAGACATCTCTGTCAAGAAACTGGTGGCTCATACTCTGTTGCACTGGATGAGTCCCACTTCAAAGAGTTGCTATT
GGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTACAGTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTGCCCACAGAGTGTCGAATTTGTGGATTGACACTTATCTCCTCACCC
CATTTGGCTAGGTCGTATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTACTTCATGATCCACGACATCAACTTCCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATATTCACGAGA
GCTTGCACAATTGTCCTGGCTGTGAGAGTTTCAGGCGTCCCAAATCGGTGACTTCTGACAAGTGA
Protein sequence
MNNGENLRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAATEMDFRPSRMAVV
AKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLELVQGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRC
SVIGLTAEIFVCRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYSCPRCKARVCELPTECRICGLTLISSP
HLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSVTSDK
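
One way to cross-check the listed sequences is to translate the CDS and compare the result with the protein sequence. The sketch below is a minimal illustration that assumes the standard nuclear genetic code. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 27 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS listed above.
print(translate("ATGAACAATGGCGAAAATTTGAGATTG"))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence, joined into one string, is a quick consistency check on the record.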