CuGenDBv2

Clc01G05320 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G05320
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: General transcription factor IIH subunit
Genome location: ClcChr01:4934357..4938317
RNA-Seq Expression: Clc01G05320
Synteny: Clc01G05320
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
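
The genome location listed in this record is a `chromosome:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the span of the gene could be computed as follows. The function name `parse_location` is hypothetical and not part of any database tooling.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("ClcChr01:4934357..4938317")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention is worth confirming before comparing lengths across resources.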


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa]; E-value: 4.0e-242; identity: 95.99%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFR PKL T DE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

XP_004143721.1 general transcription factor IIH subunit 2 [Cucumis sativus]; E-value: 3.4e-241; identity: 95.52%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVT+KDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ PKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFRRPKL TSDE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

XP_008467294.1 PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]; E-value: 3.4e-241; identity: 95.75%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ PKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFR PKL T DE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]; E-value: 3.2e-239; identity: 93.87%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAA EM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DK+ HDPRHQ PKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFRRPK  TSD+
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]; E-value: 2.7e-246; identity: 97.17%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYI+IDFSRAATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHG+LNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKVLHDPR+Q PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFRRPK  TSDE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0KPM4 General transcription factor IIH subunit; E-value: 1.6e-241; identity: 95.52%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVT+KDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ PKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFRRPKL TSDE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

A0A1S3CUH8 General transcription factor IIH subunit; E-value: 1.6e-241; identity: 95.75%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ PKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFR PKL T DE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

A0A5A7UNG8 General transcription factor IIH subunit; E-value: 1.9e-242; identity: 95.99%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEADEEDDDDDAN GLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AATEM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKV HDPRHQ PKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFR PKL T DE
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

A0A6J1DUE2 General transcription factor IIH subunit; E-value: 2.0e-239; identity: 94.34%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGE  RLNGEADEEDDDDD NGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSS+ATTARIQKGLIRYLYI+IDFSRAA EM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DKVL+DPRH+ PKVCFGCQESLMN GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFRRPK   S+E
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

A0A6J1GC53 General transcription factor IIH subunit; E-value: 1.5e-239; identity: 93.87%
Query:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM
        M NGEN RLNGEA+EEDDDDDANGG+AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAA EM
Subjt:  MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVT+KDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAK G
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV+DK+ HDPRHQ PKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLLTSDE
        IHESLHNCPGCESFRRPK  TSD+
Subjt:  IHESLHNCPGCESFRRPKLLTSDE

SwissProt top hits (E-value, % identity, alignment)
A0JN27 General transcription factor IIH subunit 2; E-value: 4.8e-89; identity: 41.56%
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  KT+KIR SVIGL+
Subjt:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSC-HKEAKAG---GGYTCPRCKARVCE
        AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA + S   +LI+MGFPQ          A+ S ++    +   + G   GGY CP+C+A+ CE
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSC-HKEAKAG---GGYTCPRCKARVCE

Query:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      ++  + C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q13888 General transcription factor IIH subunit 2; E-value: 1.7e-89; identity: 41.67%
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      +   + C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 2; E-value: 4.3e-90; identity: 41.41%
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR S+IGL+
Subjt:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein; E-value: 2.2e-89; identity: 41.67%
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKAG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      +   + C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 2; E-value: 3.1e-189; identity: 76.26%
Query:  NGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDF
        + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYI+IDFSRAA EMDF
Subjt:  NGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV++K+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTD-KVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV     L+D R +  K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTD-KVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPK
        HESLHNCPGCES  RPK
Subjt:  HESLHNCPGCESFRRPK

Arabidopsis top hits (E-value, % identity, alignment)
AT1G05055.1 general transcription factor II H2; E-value: 2.2e-190; identity: 76.26%
Query:  NGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDF
        + +  R N E +EEDD+D    G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYI+IDFSRAA EMDF
Subjt:  NGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV++K+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTD-KVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV     L+D R +  K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVTD-KVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPK
        HESLHNCPGCES  RPK
Subjt:  HESLHNCPGCESFRRPK


Sequences
CDS sequence
ATGAAAAATGGCGAAAATATGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGTGGACTTGCTGCGTGGGAAAGGACTTACGCAGATGATAG
GTCGTGGGAAGCCTTGCAAGAAGACGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCTCAGTATCGAAGGCGCCTTCGCTCCCTTTCTTCCTTAG
CAACCACTGCTCGAATTCAGAAGGGTCTTATTCGTTATCTCTATATCATCATTGACTTCTCTAGGGCAGCTACAGAAATGGATTTTCGACCAAGTCGAATGGCTGTTGTG
GCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTAACTATGAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGG
TGGAAGTCCTGAATCCCATGTTAAAGCATTAATGGGCAAACTGGAATGCTCAGGTGATGCATCCTTGCAGAACGGTCTGGAACTTGTCCACGGCTATCTGAATCAAATTC
CATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGATATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGT
TCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCCCACTTCAAAGAGTTGCTATT
GGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATGTTCATGTC
ACAAGGAAGCTAAAGCTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTCCCCACAGAGTGTCGAATTTGTGGATTGACACTGATCTCCTCGCCC
CATTTGGCTAGGTCATATCACCATCTCTTTCCAATTATACCATTTGATGAAGTCACTGATAAAGTACTTCATGATCCACGACATCAATCTCCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATATTCACGAGA
GCTTGCACAATTGTCCTGGCTGTGAGAGTTTTAGGCGTCCAAAATTGTTGACTTCTGACGAATGA
mRNA sequence
AAAAGTTTGAGGGTATTATTTGTCCTCTAGAAACTTGGGAGCAATTTTTTTAAACAATATTTTCTATAATTCAGTCATTTCGAAAATAACTCGTCTTCTTGCTCTTACTC
TTTAGGGCTGAAATCATGAAATCATGGAAAGCTTAAAGTTGAAAAGAGGGTGAAAGCGCAGCAGCCTTAACAAGCCGTGGGTCACTGGGTGTTTTCCATGTCTTGGTTTT
GAGGCTGAATCGCGCCGCCGCGGGTCTTGCTGTAGAGGACTCTAACACGCCGTCGTTCGTATCAGGTCAGTGGCGGTCGGGGCGCGCTGGTCTCGGTTTTCAAGGGCAGT
TTCCGGCGGTTTATCTCCGCGCCGGTTTGAGTCTCGGGTTAGAGACTGTGAAGCACGCAGTGGGTTTCCCATTTTAGCAATCAAACGGTAACCTTGAGGTTGATATAAAG
ATTGTGGGTATATTTCAAGGTTGTTGATTTCTGTTGTGCTGGTGTTTTGGGCGTTTGAATCCGTTTTACTTCTTCATTGTCGATTCCTCATCACACCTACCTACTGTAAC
TCTTAGGTCCCCAAGTTGTGATTATTCTCTGTGTAGAGCTTGTTTTAGGTATTGGGGTTTCATCAATTTGATCTTGTTTCAAAACCTTCGAGCGTAGCTTCTGAAAAAGT
CCTTAAATATGAAAAATGGCGAAAATATGAGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGCCAATGGTGGACTTGCTGCGTGGGAAAGGACTTACGCA
GATGATAGGTCGTGGGAAGCCTTGCAAGAAGACGAGTCTGGACTCCTTCGCCCGATCGACAATAAGGCAATTTACCATGCTCAGTATCGAAGGCGCCTTCGCTCCCTTTC
TTCCTTAGCAACCACTGCTCGAATTCAGAAGGGTCTTATTCGTTATCTCTATATCATCATTGACTTCTCTAGGGCAGCTACAGAAATGGATTTTCGACCAAGTCGAATGG
CTGTTGTGGCAAAACATGTAGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCACTCAGTCAGATTGGTTTGGTAACTATGAAAGATGGAGTTGCTCATTGCTTAACA
GATCTTGGTGGAAGTCCTGAATCCCATGTTAAAGCATTAATGGGCAAACTGGAATGCTCAGGTGATGCATCCTTGCAGAACGGTCTGGAACTTGTCCACGGCTATCTGAA
TCAAATTCCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGATATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAA
TAAGGTGTTCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCCCACTTCAAAGAG
TTGCTATTGGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAATTTAATCAAGATGGGCTTCCCACAAAGAGCAGCAGAGAGTTCTATTGCAATATG
TTCATGTCACAAGGAAGCTAAAGCTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTCCCCACAGAGTGTCGAATTTGTGGATTGACACTGATCT
CCTCGCCCCATTTGGCTAGGTCATATCACCATCTCTTTCCAATTATACCATTTGATGAAGTCACTGATAAAGTACTTCATGATCCACGACATCAATCTCCAAAAGTTTGC
TTTGGCTGCCAAGAAAGCCTCATGAATCCTGGCACAGGTAATAGCCCAGGCATTCGTGTTTCTTGCCCAAAGTGCAAACAACACTTTTGTCTTGATTGTGATATTTATAT
TCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTTAGGCGTCCAAAATTGTTGACTTCTGACGAATGAACGTCTACTTTCGGATGCAACCATGTGGCCCCAACCG
AATCCAAATTCAATCTTCAATCTGACCGTGACTCTGCTCTGTCTGTAATGAACTTTGTACCACTATTTATAAAGTTTCAAGCATGTCTACCAATGATGGTGAATTCAAGG
CTCTCATTGTTACCGCGGAGTCGTCTTTCGTGTTTCTCGACAGCTCCTAAACCAACCTCAAAGAGATCCAAAGCTTGTGGCTTTCTGACACTTGATGGCATCGTTGAAAT
TGGCTAGAAGATTAATTTTTCAAGATGAAAGGCTGAAACTTACTTTCCTTCTGGGCTGCAGATGGCTGATATTACTTGGAGTGTTACCCAAAATTGACCCCCCAATGGAC
AATTTCCATTTTCATGAGATTTTGTAGGAACTTGATATTAATTCTTGTCTGACATTTCAGATGTATTTGACATTCATATTCTTTTAGTTAAGTTATTAAAAGAGCTTGTG
CCAGAAACTTCAGTGTGCCTACTATTAACAATAAACAATTATATGTACCAATGTTGTAAATGTTAGCCAATATGGCTATAGTGATCTTTCAAATGTAATCATTTTGGCTC
GTAAGGACGGAATTTTTTTGGGGGAAAATGGAAGAAAGAGGATTACTTTACCAATGCGTTGACCTGAAAACAAACTACCTTACCTAATGAAACTTTAGAAAGATTAGGCC
AATTAAAAATTAAAATTTCTTTCTGGTATATA
Protein sequence
MKNGENMRLNGEADEEDDDDDANGGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIIIDFSRAATEMDFRPSRMAVV
AKHVEAFVREFFDQNPLSQIGLVTMKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRC
SVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKAGGGYTCPRCKARVCELPTECRICGLTLISSP
HLARSYHHLFPIIPFDEVTDKVLHDPRHQSPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLLTSDE
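
The CDS and protein sequences in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and an in-frame CDS that starts at the first base; the helper name `translate` is hypothetical and this is not the pipeline the database itself uses.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons and the final codons of the CDS shown in this record.
print(translate("ATGAAAAATGGCGAAAATATGAGATTG"))
print(translate("AGGCGTCCAAAATTGTTGACTTCTGACGAATGA"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two entries belong to the same gene model.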