CuGenDBv2

Sed0018571 (gene) of Chayote v1 genome

Gene ID: Sed0018571
Organism: Sechium edule (Chayote v1)
Description: General transcription factor IIH subunit
Genome location: LG01:7343821..7347580
RNA-Seq Expression: Sed0018571
Synteny: Sed0018571
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
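
The genome location above follows a seqid:start..end pattern. As a minimal sketch, the string can be split into its parts and the gene span computed. The function name parse_location is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper; the pattern is inferred from the record above.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG01:7343821..7347580")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, span)
```

Under that assumption the gene spans 3760 bp on LG01, which includes introns and untranslated regions as well as the coding sequence.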


Homology
GenBank top hits
KAG7036868.1 General transcription factor IIH subunit 2 [Cucurbita argyrosperma subsp. argyrosperma] (e-value 1.1e-234, 92.92% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEAEE+DDDDD + GG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+  LTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYL+QIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AE+FIC+H+CQETGGSYS+ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +D RHQLPK+CFGCQE+   PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSAT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

XP_022157930.1 general transcription factor IIH subunit 2 [Momordica charantia] (e-value 8.9e-234, 92.92% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGE  RLNGEA+E+DDDDDG+ GG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLYI+IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+ CLTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMET+QKCKTSKIRCSVIGL+AEIFIC+H+CQETGGSYSIALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL D RH+LPK+CFGCQESLM  GTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSA ++
Subjt:  YIHESLHNCPGCESFRRPKSATAD

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata] (e-value 2.8e-235, 93.16% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEAEE+DDDDD + GG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+  LTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AE+FIC+H+CQETGGSYS+ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +D RHQLPK+CFGCQE+   PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSAT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

XP_022998882.1 general transcription factor IIH subunit 2 [Cucurbita maxima] (e-value 1.4e-234, 92.92% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEAEE+DDDDD + GG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+  LTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AE+FIC+H+CQETGGSY +ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +D RHQLPK+CFGCQE+   PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSAT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida] (e-value 8.9e-234, 93.16% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEA+E+DDDDD + GG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLYI+IDFSRAA E
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+ CLTDLGGSPESHVK+LMGKLECSG+ASLQNGL+LV G+LNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AEIFIC+H+CQETGGSYSIALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+D R+QLPK+CFGCQESLM PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSAT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

TrEMBL top hits
A0A0A0KPM4 General transcription factor IIH subunit (e-value 1.4e-229, 90.8% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEA+E+DDDDD +  G AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AA E
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG + CLTDLGGSPESHVK+LMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AEIFIC+H+CQETGGSYS+ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDKV +D RHQLPK+CFGCQESLM P TGNS  IRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPK AT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

A0A5A7UNG8 General transcription factor IIH subunit (e-value 5.0e-230, 91.27% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEA+E+DDDDD +  G AAWERTYADDRSWEALQEDESGLL PIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLYI+IDFS+AA E
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+ CLTDLGGSPESHVK+LMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AEIFIC+H+CQETGGSYS+ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDKV +D RHQLPK+CFGCQESLM PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFR PK AT D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

A0A6J1DUE2 General transcription factor IIH subunit (e-value 4.3e-234, 92.92% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGE  RLNGEA+E+DDDDDG+ GG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLYI+IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+ CLTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMET+QKCKTSKIRCSVIGL+AEIFIC+H+CQETGGSYSIALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL D RH+LPK+CFGCQESLM  GTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSA ++
Subjt:  YIHESLHNCPGCESFRRPKSATAD

A0A6J1GC53 General transcription factor IIH subunit (e-value 1.3e-235, 93.16% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEAEE+DDDDD + GG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+  LTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AE+FIC+H+CQETGGSYS+ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +D RHQLPK+CFGCQE+   PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSAT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

A0A6J1K980 General transcription factor IIH subunit (e-value 6.7e-235, 92.92% identity)
Query:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE
        MNNGENRRLNGEAEE+DDDDD + GG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY++IDFSRAAAE
Subjt:  MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAE

Query:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
        MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGV+  LTDLGGSPESHVK+LMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN
Subjt:  MDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALN

Query:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
        SCDPGDIMETVQKCKTSKIRCSVIGL+AE+FIC+H+CQETGGSY +ALDE HFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV
Subjt:  SCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKV

Query:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI
        GGGYTCPRCKARVCELPTEC+ICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ +D RHQLPK+CFGCQE+   PGTGNS GIRVSCPKCKQHFCLDCDI
Subjt:  GGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDI

Query:  YIHESLHNCPGCESFRRPKSATAD
        YIHESLHNCPGCESFRRPKSAT+D
Subjt:  YIHESLHNCPGCESFRRPKSATAD

SwissProt top hits
Q13888 General transcription factor IIH subunit 2 (e-value 2.4e-88, 41.41% identity)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLS
        QIG++  K   ++ LT+L G+P  H+ SL   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGLS
Subjt:  QIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLS

Query:  AEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  + +ETGG+Y + LDE H+KELL  H  PPPA + S+  +LI+MGFPQ           + S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      +   + C+GCQ  L              C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 2 (e-value 6.3e-89, 41.16% identity)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLS
        QIG++  K   ++ LT+L G+P  H+ SL   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR S+IGLS
Subjt:  QIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLS

Query:  AEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  + +ETGG+Y + LDE H+KELL  H  PPPA ++S   +LI+MGFPQ           + S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------GESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L              C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q86KZ2 General transcription factor IIH subunit 2 (e-value 1.4e-88, 39.48% identity)
Query:  NNGENRRLNGEAEEDDD-------DDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSS---LATTARIQKGLIRYLYIL
        NN +N+R N    +D+D        +D DG  +  WE  +  +++W  + EDE G LRP + +        RRL++      L+   R+++G+ R+L ++
Subjt:  NNGENRRLNGEAEEDDD-------DDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSS---LATTARIQKGLIRYLYIL

Query:  IDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHRE
        +D S+  +  D +PSR  V+ ++VE F++EFFDQNP+SQ+ ++  K+  ++ +++L G+   H++++   +   GE S+QN L++    L  +P YG RE
Subjt:  IDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHRE

Query:  VLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAI
        VL ++S+L +CDP  + +T+Q  K   IR S I ++AE++ICK + ++T G+  + L+E HF E L+    PPP I  +    L++MGFPQ+   +  + 
Subjt:  VLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAI

Query:  CSCHKEAKVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCK
        C CH++ K   GY CPRC  + CELPT+CQIC L+L+SSPHLARSYHHLF I  F+EV+ K L  +       C GC  S  K    +   +  SCP+C+
Subjt:  CSCHKEAKVGGGYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCK

Query:  QHFCLDCDIYIHESLHNCPGCES
        + FCLDCD++IHESLHNCPGCE+
Subjt:  QHFCLDCDIYIHESLHNCPGCES

Q9JIB4 General transcription factor IIH subunit 2 (e-value 2.4e-88, 41.31% identity)
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDE+G L+      +F A+ +R            +++ G++R+LY+++D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLS
        QIG++  K   ++ LT+L G+P  H+ SL   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  KT+KIR SVIGLS
Subjt:  QIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLE--CSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLS

Query:  AEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA------GESSIAICSCH------KEAKVGGGYTCPRCKARVCE
        AE+ +C  + +ETGG+Y + LDE H+KELL  H  PPPA + S+  +LI+MGFPQ         ++  +    H      +     GGY CP+C+A+ CE
Subjt:  AEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA------GESSIAICSCH------KEAKVGGGYTCPRCKARVCE

Query:  LPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S +      ++  + C+GCQ  L              C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECQICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 2 (e-value 1.6e-185, 74.35% identity)
Query:  NGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMD
        + + +R N E EE+DD+   D  G   WER Y DDRSWE LQEDESGLLRPIDN AI+HAQYRRRLR LS+ A   RIQKGLIRYLYI+IDFSRAAAEMD
Subjt:  NGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSC
        FRPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GV+  LTDLGGSPE+H+K+LMGKLE  G++SLQN L+LV  +LNQ+PSYGHREVLILYSAL +C
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGG
        DPGDIMET+QKCK SK+RCSVIGLSAE+FICKH+CQETGG YS+A+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRA E S+AICSCHKE K+G 
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIY
        GY CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L D+R +L K CFGCQ+SL+  G GN     V+C KCK +FCLDCDIY
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAT
        IHESLHNCPGCES  RPKS +
Subjt:  IHESLHNCPGCESFRRPKSAT

Arabidopsis top hits
AT1G05055.1 general transcription factor II H2 (e-value 1.1e-186, 74.35% identity)
Query:  NGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMD
        + + +R N E EE+DD+   D  G   WER Y DDRSWE LQEDESGLLRPIDN AI+HAQYRRRLR LS+ A   RIQKGLIRYLYI+IDFSRAAAEMD
Subjt:  NGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSC
        FRPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GV+  LTDLGGSPE+H+K+LMGKLE  G++SLQN L+LV  +LNQ+PSYGHREVLILYSAL +C
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGG
        DPGDIMET+QKCK SK+RCSVIGLSAE+FICKH+CQETGG YS+A+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRA E S+AICSCHKE K+G 
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIY
        GY CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L D+R +L K CFGCQ+SL+  G GN     V+C KCK +FCLDCDIY
Subjt:  GYTCPRCKARVCELPTECQICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAT
        IHESLHNCPGCES  RPKS +
Subjt:  IHESLHNCPGCESFRRPKSAT
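
In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. As a rough sketch, the identity of a single segment can be counted from that line. The function name midline_identity is hypothetical, and the code assumes the midline string has the same length as the aligned segment (trailing spaces not stripped).

```python
def midline_identity(midline: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for one BLAST midline.

    Hypothetical helper: letters count as identities; '+' and spaces do not.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline)


# Last segment of the first GenBank hit (KAG7036868.1) shown above.
midline = "YIHESLHNCPGCESFRRPKSAT+D"
identical, aligned = midline_identity(midline)
print(f"{identical}/{aligned} identical = {100 * identical / aligned:.2f}%")
```

This segment gives 23 of 24 identical positions. The % identity reported for each hit is calculated over the full alignment, so a single segment will generally not reproduce it.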


Sequences
CDS sequence
ATGAACAACGGCGAGAATCGGCGATTGAACGGAGAAGCCGAAGAAGACGACGACGACGACGACGGCGACGGCGGGGGGCGCGCTGCGTGGGAAAGGACCTACGCGGACGA
CAGGTCTTGGGAAGCTCTGCAAGAGGACGAGTCCGGCCTTCTCCGCCCCATCGATAACAAAGCCATTTTTCATGCTCAGTACCGGCGCCGCCTCCGTTCCCTGTCTTCCT
TGGCCACCACTGCTCGCATTCAAAAGGGTCTCATTCGATATCTCTACATCCTCATTGATTTCTCCAGGGCAGCTGCAGAAATGGATTTTCGACCAAGTCGAATGGCTGTG
GTGGCAAAACATGTGGAGGCTTTTGTTAGGGAGTTCTTTGACCAAAATCCTCTCAGCCAGATTGGTTTGGTGACTATAAAGGATGGAGTTTCTCAGTGCTTAACGGATCT
CGGAGGAAGTCCCGAGTCCCATGTGAAATCTTTAATGGGTAAACTAGAATGCTCAGGTGAAGCTTCTTTGCAGAATGGTCTGGATCTTGTTTGCGGTTATCTCAATCAAA
TACCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCCCTTAATTCCTGTGATCCTGGGGATATAATGGAGACAGTTCAGAAATGCAAAACATCTAAAATAAGG
TGTTCAGTAATTGGTCTTTCCGCAGAAATTTTTATATGCAAACATGTGTGCCAGGAAACTGGTGGCTCATACTCTATCGCATTGGATGAGCCCCACTTCAAAGAGCTGCT
ATTGGAGCATGCACCCCCACCCCCAGCAATAGCTGACTCTGCAATGCCTAATTTGATTAAGATGGGCTTTCCGCAAAGAGCAGGAGAGAGTTCTATTGCAATTTGTTCAT
GTCACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGGTGCAAAGCACGAGTTTGCGAGCTGCCAACCGAGTGTCAGATTTGTGGATTGACACTTATTTCCTCG
CCCCATTTGGCTAGGTCGTACCATCATCTCTTTCCAATCATACCATTTGATGAAGTCTCTGATAAAGTACTTTATGATTCACGACATCAACTACCAAAACTCTGCTTTGG
CTGCCAAGAAAGCCTCATGAAGCCTGGCACAGGTAACAGCTCAGGCATCCGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATCTATATTCACG
AGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTCCGGCGTCCAAAATCGGCAACTGCTGATTAA
mRNA sequence
GTCATTTTTGTTTGGCAAAGCGCTCTCAGTTTACTTTGCCACTGAATCCCCAACTCGATCTCTCTTCACTCTCAAATCCAAACCCTAAACTCTGTGTTTTCCATTGGCGA
TTGCTTAATTGATTCTTCCATCCTCAAATCAGCCCTCCTTTTCCAATTCATTTTACCTCTAAACCCTAAACTGTAACTCCCAACACTACACATTGTGTTTTAGCAATCAA
TTCGATCAGATTCTTCCAAAACCCCTAGAAACGATGAACAACGGCGAGAATCGGCGATTGAACGGAGAAGCCGAAGAAGACGACGACGACGACGACGGCGACGGCGGGGG
GCGCGCTGCGTGGGAAAGGACCTACGCGGACGACAGGTCTTGGGAAGCTCTGCAAGAGGACGAGTCCGGCCTTCTCCGCCCCATCGATAACAAAGCCATTTTTCATGCTC
AGTACCGGCGCCGCCTCCGTTCCCTGTCTTCCTTGGCCACCACTGCTCGCATTCAAAAGGGTCTCATTCGATATCTCTACATCCTCATTGATTTCTCCAGGGCAGCTGCA
GAAATGGATTTTCGACCAAGTCGAATGGCTGTGGTGGCAAAACATGTGGAGGCTTTTGTTAGGGAGTTCTTTGACCAAAATCCTCTCAGCCAGATTGGTTTGGTGACTAT
AAAGGATGGAGTTTCTCAGTGCTTAACGGATCTCGGAGGAAGTCCCGAGTCCCATGTGAAATCTTTAATGGGTAAACTAGAATGCTCAGGTGAAGCTTCTTTGCAGAATG
GTCTGGATCTTGTTTGCGGTTATCTCAATCAAATACCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCCCTTAATTCCTGTGATCCTGGGGATATAATGGAG
ACAGTTCAGAAATGCAAAACATCTAAAATAAGGTGTTCAGTAATTGGTCTTTCCGCAGAAATTTTTATATGCAAACATGTGTGCCAGGAAACTGGTGGCTCATACTCTAT
CGCATTGGATGAGCCCCACTTCAAAGAGCTGCTATTGGAGCATGCACCCCCACCCCCAGCAATAGCTGACTCTGCAATGCCTAATTTGATTAAGATGGGCTTTCCGCAAA
GAGCAGGAGAGAGTTCTATTGCAATTTGTTCATGTCACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGGTGCAAAGCACGAGTTTGCGAGCTGCCAACCGAG
TGTCAGATTTGTGGATTGACACTTATTTCCTCGCCCCATTTGGCTAGGTCGTACCATCATCTCTTTCCAATCATACCATTTGATGAAGTCTCTGATAAAGTACTTTATGA
TTCACGACATCAACTACCAAAACTCTGCTTTGGCTGCCAAGAAAGCCTCATGAAGCCTGGCACAGGTAACAGCTCAGGCATCCGTGTTTCTTGCCCAAAGTGCAAACAAC
ACTTCTGTCTTGATTGTGATATCTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGAGAGTTTCCGGCGTCCAAAATCGGCAACTGCTGATTAATGATCAGAACAC
GTCTGCTTTCGGATGCGATGCAACTGCGTTAAACAAATTCAGACTCCAATACTCAACATGATCGTAACAATGCTGTCATTTAGTTAAAAAGCTGGTATGAGCTTTGTACC
CAAGTTTCAAGTATGCCTTGCAATGCAATGATGATGAATCAGAGGCTCATGCTGTTACTGCTGAAGTTGGTCTTTCATGTTTTGATAGCTACTAAACTGATTTCACAAAG
ATGCCAAGCTCCTCATGGATGAGCCTTTTTTACACTTGATATCATTTTTTCTACACCTCTCTAAGGTAGTATCCAATTGTAAGTTGGTGTACCCAAAGTTCAACCCCCAA
TAGACAATTTTCACTTTCATGAGGATTTGTTGGAACTTTGTATTAATTGTCCAAATCGGGGGCAAGTTGACATTCGTTTAGCTAATTTTATTTATACAGCTTGTCAAATT
TTCGTGCTTCCAAAATGTGCGAACGTATTTTTTACTCGGCCCTCATTTAGGGTAGAAAATTTGAACATGTACTTTTTTATGTTATCG
Protein sequence
MNNGENRRLNGEAEEDDDDDDGDGGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYILIDFSRAAAEMDFRPSRMAV
VAKHVEAFVREFFDQNPLSQIGLVTIKDGVSQCLTDLGGSPESHVKSLMGKLECSGEASLQNGLDLVCGYLNQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIR
CSVIGLSAEIFICKHVCQETGGSYSIALDEPHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECQICGLTLISS
PHLARSYHHLFPIIPFDEVSDKVLYDSRHQLPKLCFGCQESLMKPGTGNSSGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSATAD