CuGenDBv2

Spg032642 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg032642
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: General transcription factor IIH subunit
Genome location: scaffold11:4857433..4861706
RNA-Seq Expression: Spg032642
Synteny: Spg032642
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
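
The genome location above follows a scaffold:start..end pattern. As a minimal sketch (Python is used here for illustration; `parse_location` and `span_length` are hypothetical helpers, and the coordinates are assumed to be 1-based and inclusive, which the page does not state), the string can be split into its parts and the span of the locus computed:

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    return end - start + 1

scaffold, start, end = parse_location("scaffold11:4857433..4861706")
print(scaffold, start, end, span_length(start, end))
```

Under that coordinate assumption, the locus spans 4,274 bp on scaffold11.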


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAG7036868.1 General transcription factor IIH subunit 2 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 5.2e-242 | % identity: 95.98
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEA+EEDDDDDANGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSE
        IHESLHNCPGCESFRRPKSATS+
Subjt:  IHESLHNCPGCESFRRPKSATSE

XP_022157930.1 general transcription factor IIH subunit 2 [Momordica charantia] | e-value: 2.0e-241 | % identity: 96.45
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+MET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPRH+LPKVCFGCQESL+N GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATS
        IHESLHNCPGCESFRRPKSA S
Subjt:  IHESLHNCPGCESFRRPKSATS

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata] | e-value: 1.4e-242 | % identity: 96.22
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEA+EEDDDDDANGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSE
        IHESLHNCPGCESFRRPKSATS+
Subjt:  IHESLHNCPGCESFRRPKSATSE

XP_022998882.1 general transcription factor IIH subunit 2 [Cucurbita maxima] | e-value: 6.8e-242 | % identity: 95.98
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEA+EEDDDDDANGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSE
        IHESLHNCPGCESFRRPKSATS+
Subjt:  IHESLHNCPGCESFRRPKSATSE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida] | e-value: 1.0e-245 | % identity: 97.4
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEADEEDDDDDANGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAA EM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LVHG+LNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPR+QLPKVCFGCQESL+NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSE
        IHESLHNCPGCESFRRPKSATS+
Subjt:  IHESLHNCPGCESFRRPKSATSE

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A1S3CUH8 General transcription factor IIH subunit | e-value: 9.0e-240 | % identity: 95.96
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEADEEDDDDDAN G AAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AA EM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESL+NPGT NSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAT
        IHESLHNCPGCESFR PK AT
Subjt:  IHESLHNCPGCESFRRPKSAT

A0A5A7UNG8 General transcription factor IIH subunit | e-value: 1.1e-240 | % identity: 96.2
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEADEEDDDDDAN G AAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AA EM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LVH YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESL+NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAT
        IHESLHNCPGCESFR PK AT
Subjt:  IHESLHNCPGCESFRRPKSAT

A0A6J1DUE2 General transcription factor IIH subunit | e-value: 9.6e-242 | % identity: 96.45
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+MET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPRH+LPKVCFGCQESL+N GTGNS GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATS
        IHESLHNCPGCESFRRPKSA S
Subjt:  IHESLHNCPGCESFRRPKSATS

A0A6J1GC53 General transcription factor IIH subunit | e-value: 6.7e-243 | % identity: 96.22
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEA+EEDDDDDANGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSE
        IHESLHNCPGCESFRRPKSATS+
Subjt:  IHESLHNCPGCESFRRPKSATSE

A0A6J1K980 General transcription factor IIH subunit | e-value: 3.3e-242 | % identity: 95.98
Query:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE+RRLNGEA+EEDDDDDANGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYLNQIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNS

Query:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGD+METVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSATSE
        IHESLHNCPGCESFRRPKSATS+
Subjt:  IHESLHNCPGCESFRRPKSATSE

SwissProt top hits (columns: hit, e-value, % identity, alignment)
Q13888 General transcription factor IIH subunit 2 | e-value: 2.2e-89 | % identity: 42.17
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP ++ + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  +  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 2 | e-value: 7.4e-90 | % identity: 41.67
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP ++ + ++  K +KIR S+IGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein | e-value: 2.8e-89 | % identity: 42.17
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP ++ + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  +  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9JIB4 General transcription factor IIH subunit 2 | e-value: 6.3e-89 | % identity: 41.56
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDE+G L+      ++ A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP ++ + ++  KT+KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTCPRCKARVCE
        AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++      +       GGY CP+C+A+ CE
Subjt:  AEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTCPRCKARVCE

Query:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S +      ++  + C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 2 | e-value: 3.7e-190 | % identity: 76.19
Query:  NGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF
        + + +R N E +EEDD+D    G   WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAAAEMDF
Subjt:  NGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN L+LVH +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGD+MET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L+D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKSAT
        HESLHNCPGCES  RPKS +
Subjt:  HESLHNCPGCESFRRPKSAT

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G05055.1 general transcription factor II H2 | e-value: 2.6e-191 | % identity: 76.19
Query:  NGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF
        + + +R N E +EEDD+D    G   WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAAAEMDF
Subjt:  NGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN L+LVH +LNQ+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCD

Query:  PGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGD+MET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDVMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L+D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKSAT
        HESLHNCPGCES  RPKS +
Subjt:  HESLHNCPGCESFRRPKSAT
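
The % identity values reported for the hits above can be related to the alignment midlines (the unlabeled line between Query and Subjt). The following is a rough sketch, not the database's own procedure: it assumes the usual BLAST midline convention, in which a letter marks an identical residue, '+' a conservative substitution, and a space a mismatch or gap. `midline_identity` is a hypothetical helper, and it scores a single block only, so a whole-alignment figure would need the counts summed over all blocks of a hit.

```python
def midline_identity(midline, aligned_length):
    """Percent identity for one alignment block.

    A letter in the midline is counted as an identical residue;
    '+' and spaces are not counted.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# Final block of the first GenBank hit (KAG7036868.1) as shown above.
query = "IHESLHNCPGCESFRRPKSATSE"
midline = "IHESLHNCPGCESFRRPKSATS+"
print(round(midline_identity(midline, len(query)), 2))
```

In this block, 22 of 23 aligned positions are identical and the last position is a conservative substitution.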


Sequences
CDS sequence
ATGAACAATGGCGAAGATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGATGACGATGCCAATGGCGGACGTGCTGCTTGGGAAAGGACTTATGCCGATGATAG
GTCGTGGGAAGCTCTGCAAGAGGACGAGTCTGGGCTTCTTCGGCCGATCGACAATAAGGCCATTTACCATGCTCAGTATCGAAGGCGTCTTCGCTCCCTTTCTTCGTTAG
CAACCACTGCTCGAATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATTGACTTCTCTAGGGCAGCTGCAGAAATGGATTTTCGACCAAGTCGAATGGCTGTGGTA
GCAAAACATGTAGAGGCTTTTGTCCGGGAATTCTTTGACCAGAATCCTCTCAGTCAGATTGGTTTGGTGACTATAAAGGATGGAGTTGCTCATTGCTTAACCGATCTTGG
TGGAAGTCCTGAGTCACATGTTAAAGCTTTAATGGGTAAACTGGAATGCTCAGGTGAAGCATCTTTGCAGAATGGTTTGGATCTTGTTCACGGCTATCTAAATCAAATTC
CATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCCTGCGATCCTGGGGATGTAATGGAGACAGTTCAGAAGTGCAAAACATCTAAAATAAGGTGT
TCAGTAATTGGTCTTACTGCTGAAATTTTTATATGCAGACATCTCTGCCAGGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCCCACTTCAAAGAGTTACTATT
GGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAACTTAATCAAGATGGGCTTTCCACAAAGAGCAGCAGAGAGTTCTATTGCAATTTGTTCATGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGAGTTTGTGAGCTGCCAACGGAGTGTCGGATTTGTGGATTGACACTTATCTCCTCACCC
CATTTGGCTAGGTCGTATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTACTTCATGATCCACGACATCAACTACCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTCGTGAACCCTGGCACAGGTAACAGCCCAGGCATCCGTGTTTCTTGCCCCAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCATGAGA
GCTTGCACAATTGTCCTGGCTGTGAGAGTTTCAGACGTCCGAAATCAGCGACTTCCGAATAA
mRNA sequence
ATGAACAATGGCGAAGATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGATGACGATGCCAATGGCGGACGTGCTGCTTGGGAAAGGACTTATGCCGATGATAG
GTCGTGGGAAGCTCTGCAAGAGGACGAGTCTGGGCTTCTTCGGCCGATCGACAATAAGGCCATTTACCATGCTCAGTATCGAAGGCGTCTTCGCTCCCTTTCTTCGTTAG
CAACCACTGCTCGAATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATTGACTTCTCTAGGGCAGCTGCAGAAATGGATTTTCGACCAAGTCGAATGGCTGTGGTA
GCAAAACATGTAGAGGCTTTTGTCCGGGAATTCTTTGACCAGAATCCTCTCAGTCAGATTGGTTTGGTGACTATAAAGGATGGAGTTGCTCATTGCTTAACCGATCTTGG
TGGAAGTCCTGAGTCACATGTTAAAGCTTTAATGGGTAAACTGGAATGCTCAGGTGAAGCATCTTTGCAGAATGGTTTGGATCTTGTTCACGGCTATCTAAATCAAATTC
CATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCCTGCGATCCTGGGGATGTAATGGAGACAGTTCAGAAGTGCAAAACATCTAAAATAAGGTGT
TCAGTAATTGGTCTTACTGCTGAAATTTTTATATGCAGACATCTCTGCCAGGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCCCACTTCAAAGAGTTACTATT
GGAGCATGCACCCCCACCCCCAGCAATAGCAGACTCTGCAATGCCTAACTTAATCAAGATGGGCTTTCCACAAAGAGCAGCAGAGAGTTCTATTGCAATTTGTTCATGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGAGTTTGTGAGCTGCCAACGGAGTGTCGGATTTGTGGATTGACACTTATCTCCTCACCC
CATTTGGCTAGGTCGTATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTACTTCATGATCCACGACATCAACTACCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTCGTGAACCCTGGCACAGGTAACAGCCCAGGCATCCGTGTTTCTTGCCCCAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCATGAGA
GCTTGCACAATTGTCCTGGCTGTGAGAGTTTCAGACGTCCGAAATCAGCGACTTCCGAATAA
Protein sequence
MNNGEDRRLNGEADEEDDDDDANGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVV
AKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVHGYLNQIPSYGHREVLILYSALNSCDPGDVMETVQKCKTSKIRC
SVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSP
HLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLVNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSATSE
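
The CDS and protein sequences above can be checked against each other by translation. The sketch below assumes the standard nuclear genetic code, which the page does not name explicitly; `translate` is a hypothetical helper. It is applied only to the first 24 and the last 12 nucleotides of the CDS to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGAACAATGGCGAAGATAGGCGA"))  # first 8 codons of the CDS
print(translate("ACTTCCGAATAA"))              # last 4 codons, including the stop
```

The two fragments translate to MNNGEDRR and TSE, which match the start and end of the protein sequence listed above.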