CuGenDBv2

Tan0008870 (gene) of Snake gourd v1 genome

Gene ID: Tan0008870
Organism: Trichosanthes anguina (Snake gourd v1)
Description: General transcription factor IIH subunit
Genome location: LG01:38784495..38788134
RNA-Seq Expression: Tan0008870
Synteny: Tan0008870
Gene Ontology terms:
GO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily
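The genome location string above follows a `sequence:start..end` pattern. As a minimal sketch, it can be split into its parts and used to compute the length of the gene region. The helper name `parse_location` is hypothetical, and the code assumes both coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQ:START..END' string into (sequence name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


name, start, end = parse_location("LG01:38784495..38788134")
# Assumption: both coordinates are 1-based and inclusive.
span = end - start + 1
print(name, span)
```

If the coordinates turn out to be 0-based or half-open, only the `+ 1` in the span calculation needs to change.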


Homology
GenBank top hits (e value, % identity, alignment)
KAG7036868.1 General transcription factor IIH subunit 2 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.5e-241 | % identity: 95.52
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEA+EEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGG YSVALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+ TNPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA SD+
Subjt:  IHESLHNCPGCESFRRPKSAISDE

XP_022157930.1 general transcription factor IIH subunit 2 [Momordica charantia] | e value: 8.4e-240 | % identity: 95.28
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGG YS+ALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPRH+LPKVCFGCQESL N GTGN+ GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA S+E
Subjt:  IHESLHNCPGCESFRRPKSAISDE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata] | e value: 1.2e-241 | % identity: 95.52
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEA+EEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGG YSVALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+ TNPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA SD+
Subjt:  IHESLHNCPGCESFRRPKSAISDE

XP_022998882.1 general transcription factor IIH subunit 2 [Cucurbita maxima] | e value: 5.8e-241 | % identity: 95.28
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEA+EEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGG Y VALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+ TNPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA SD+
Subjt:  IHESLHNCPGCESFRRPKSAISDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida] | e value: 1.4e-242 | % identity: 95.99
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEADEEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAA EM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV G+L+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGG YS+ALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPR+QLPKVCFGCQESL NPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA SDE
Subjt:  IHESLHNCPGCESFRRPKSAISDE
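In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities and positives for a single block, using the last block of the KAG7036868.1 hit as input. The function name `midline_stats` is hypothetical, and the counts cover only this one block, not the whole hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned length) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A position is an identity when the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q and m.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "IHESLHNCPGCESFRRPKSAISDE"
midline = "IHESLHNCPGCESFRRPKSA SD+"
identities, positives, length = midline_stats(query, midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

Summing the per-block counts over every block of a hit gives the totals from which a % identity value can be derived.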

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPM4 General transcription factor IIH subunit | e value: 2.1e-236 | % identity: 93.87
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEADEEDDDDD N G AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AA EM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YL+QIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGG YSVALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESL NP TGN+P IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPK A SDE
Subjt:  IHESLHNCPGCESFRRPKSAISDE

A0A5A7UNG8 General transcription factor IIH subunit | e value: 2.5e-237 | % identity: 94.34
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEADEEDDDDD N G AAWERTYADDRSWEALQEDESGLL PIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AA EM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YL+QIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGG YSVALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPRHQLPKVCFGCQESL NPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFR PK A  DE
Subjt:  IHESLHNCPGCESFRRPKSAISDE

A0A6J1DUE2 General transcription factor IIH subunit | e value: 4.0e-240 | % identity: 95.28
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGE  RLNGEADEEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLRSLSS+ATTARIQKGLIRYLYIVIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMET+QKCKTSKIRCSVIGLTAEIFICRHLCQETGG YS+ALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVL+DPRH+LPKVCFGCQESL N GTGN+ GIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA S+E
Subjt:  IHESLHNCPGCESFRRPKSAISDE

A0A6J1GC53 General transcription factor IIH subunit | e value: 5.6e-242 | % identity: 95.52
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEA+EEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGG YSVALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+ TNPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA SD+
Subjt:  IHESLHNCPGCESFRRPKSAISDE

A0A6J1K980 General transcription factor IIH subunit | e value: 2.8e-241 | % identity: 95.28
Query:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM
        MNNGEN RLNGEA+EEDDDDD NGG AAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLY+VIDFSRAAAEM
Subjt:  MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSGEASLQNGLDLV GYL+QIPSYGHREVLILYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGG Y VALDESHFKELLLEHAPPPPAIA+SAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+ HDPRHQLPKVCFGCQE+ TNPGTGN+PGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKSAISDE
        IHESLHNCPGCESFRRPKSA SD+
Subjt:  IHESLHNCPGCESFRRPKSAISDE

SwissProt top hits (e value, % identity, alignment)
Q13888 General transcription factor IIH subunit 2 | e value: 7.4e-90 | % identity: 42.68
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  +  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q2TBV5 General transcription factor IIH subunit 2 | e value: 3.3e-90 | % identity: 42.17
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR S+IGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG Y V LDESH+KELL  H  PPPA + S   +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein | e value: 9.7e-90 | % identity: 42.68
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDESG L+      +F A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  K +KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL
        AE+ +C  L +ETGG Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP+C+A+ CEL
Subjt:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCPRCKARVCEL

Query:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        P EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +  +  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  PTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9JIB4 General transcription factor IIH subunit 2 | e value: 2.2e-89 | % identity: 42.07
Query:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS
        WE  Y  +R+WE L+EDE+G L+      +F A+ +R            +++ G++R+LY+V+D SR   + D +P+R+    K +E FV E+FDQNP+S
Subjt:  WERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVVAKHVEAFVREFFDQNPLS

Query:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT
        QIG++  K   A  LT+L G+P  H+ +L   ++  C GE SL N L +    L  +P +  REVLI++S+L +CDP +I + ++  KT+KIR SVIGL+
Subjt:  QIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRCSVIGLT

Query:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTCPRCKARVCE
        AE+ +C  L +ETGG Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++      +       GGY CP+C+A+ CE
Subjt:  AEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTCPRCKARVCE

Query:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC
        LP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S +      ++  + C+GCQ  L +            C  C+  FC+DCD+++H+SLH CPGC
Subjt:  LPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGC

Q9ZVN9 General transcription factor IIH subunit 2 | e value: 4.1e-189 | % identity: 76.56
Query:  NGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF
        + +  R N E +EEDD+D    G   WER Y DDRSWE LQEDESGLLRPIDN AI+HAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAAAEMDF
Subjt:  NGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN L+LV  +L+Q+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGGLYSVA+DE H K+LLLEHAPPPPAIAE A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L+D R +L K CFGCQ+SL   G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKS
        HESLHNCPGCES  RPKS
Subjt:  HESLHNCPGCESFRRPKS

Arabidopsis top hits (e value, % identity, alignment)
AT1G05055.1 general transcription factor II H2 | e value: 2.9e-190 | % identity: 76.56
Query:  NGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF
        + +  R N E +EEDD+D    G   WER Y DDRSWE LQEDESGLLRPIDN AI+HAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFSRAAAEMDF
Subjt:  NGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCD
        RPSRMA++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  G++SLQN L+LV  +L+Q+PSYGHREVLILYSAL +CD
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG
        PGDIMET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGGLYSVA+DE H K+LLLEHAPPPPAIAE A+ NLIKMGFPQRAAE S+AICSCHKE K+G G
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGG

Query:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYI
        Y CPRCKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +   L+D R +L K CFGCQ+SL   G GN P   V+C KCK +FCLDCDIYI
Subjt:  YTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKS
        HESLHNCPGCES  RPKS
Subjt:  HESLHNCPGCESFRRPKS


Sequences
CDS sequence
ATGAACAATGGCGAAAATCCGCGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGTCAATGGCGGACGTGCTGCCTGGGAAAGAACTTATGCGGACGATAG
GTCGTGGGAAGCTCTGCAAGAGGATGAGTCTGGGCTTCTTCGCCCGATTGACAATAAGGCTATTTTCCATGCCCAGTATCGAAGGCGTCTTCGTTCCCTTTCTTCCTTAG
CAACCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATTGACTTCTCCAGGGCAGCTGCAGAAATGGATTTTCGACCAAGTCGAATGGCTGTGGTG
GCAAAACATGTGGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCTCTCAGTCAGATTGGTTTGGTGACTATAAAGGATGGAGTTGCTCATTGCTTAACAGATCTCGG
TGGAAGTCCCGAGTCCCATGTTAAAGCTTTAATGGGAAAACTGGAATGCTCAGGTGAAGCATCCTTGCAGAATGGTCTGGATCTTGTTCGCGGCTATCTAGATCAAATAC
CATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCCTGTGATCCTGGGGATATAATGGAGACAGTTCAGAAATGCAAAACATCTAAAATAAGGTGT
TCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAGGAAACTGGTGGCTTATACTCTGTGGCGTTGGATGAGTCCCACTTCAAAGAGTTACTATT
GGAGCATGCACCCCCACCCCCAGCAATAGCAGAATCTGCAATGCCCAATTTGATCAAGATGGGCTTTCCACAAAGAGCAGCAGAGAGTTCTATTGCAATTTGTTCATGTC
ACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGAGTCTGTGAACTGCCAACGGAGTGTCGAATTTGTGGATTGACACTTATTTCGTCGCCC
CATTTGGCTAGGTCGTATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTACTTCATGATCCACGACATCAACTACCAAAAGTTTGCTTTGGCTG
CCAAGAAAGCCTCACGAACCCTGGCACAGGTAACAACCCAGGCATCCGTGTTTCTTGTCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGACATTTATATTCACGAGA
GCTTGCACAATTGTCCTGGCTGCGAGAGTTTCAGACGTCCCAAATCAGCGATATCTGATGAATGA
mRNA sequence
TACCAAACAAGGTTTCAAAACCATCCCAATAGAAAACATCAAAAGCCCCTTAAAGGTTAGAGATATTAAACAAAACTAATTTAGCCTAAAAAATTACCATCAACGCCGAA
ACATAACATTTTCTCTCCTAAGCGCACACTTCCTGCCTCTGTCACTGAACACCAACTGCCACGGATCTCTCTTCTCTTCTCTTCTCTTCTTCTTCACTCTCAAATCCCAA
CCCAAAAAATTCAAACCCAACTTACTTTCTTCATTGCCGATTTCTCGTTGCATCTTCAATCGTCGAATTGACACTCATTTTCCATTCCTTTTGACCTACCTACTGTTACT
CTTAACTCCCCAAATGGTGCTTTTCCTTTGAGTGTATATTGTTTTAGGTATTGGGGTTCAATCAATTTGATCTTCTTTCAAAACCCTCGAGCCTAGCTTCTGAGAAAAGT
CCTCGAAAAACCATGAACAATGGCGAAAATCCGCGATTGAATGGGGAAGCCGATGAAGAAGACGATGATGACGATGTCAATGGCGGACGTGCTGCCTGGGAAAGAACTTA
TGCGGACGATAGGTCGTGGGAAGCTCTGCAAGAGGATGAGTCTGGGCTTCTTCGCCCGATTGACAATAAGGCTATTTTCCATGCCCAGTATCGAAGGCGTCTTCGTTCCC
TTTCTTCCTTAGCAACCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATTGACTTCTCCAGGGCAGCTGCAGAAATGGATTTTCGACCAAGTCGA
ATGGCTGTGGTGGCAAAACATGTGGAGGCTTTTGTCAGGGAATTCTTTGACCAAAATCCTCTCAGTCAGATTGGTTTGGTGACTATAAAGGATGGAGTTGCTCATTGCTT
AACAGATCTCGGTGGAAGTCCCGAGTCCCATGTTAAAGCTTTAATGGGAAAACTGGAATGCTCAGGTGAAGCATCCTTGCAGAATGGTCTGGATCTTGTTCGCGGCTATC
TAGATCAAATACCATCATATGGGCATAGAGAAGTTTTAATCTTATACTCTGCTCTTAATTCCTGTGATCCTGGGGATATAATGGAGACAGTTCAGAAATGCAAAACATCT
AAAATAAGGTGTTCAGTAATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAGGAAACTGGTGGCTTATACTCTGTGGCGTTGGATGAGTCCCACTTCAA
AGAGTTACTATTGGAGCATGCACCCCCACCCCCAGCAATAGCAGAATCTGCAATGCCCAATTTGATCAAGATGGGCTTTCCACAAAGAGCAGCAGAGAGTTCTATTGCAA
TTTGTTCATGTCACAAGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGAGTCTGTGAACTGCCAACGGAGTGTCGAATTTGTGGATTGACACTT
ATTTCGTCGCCCCATTTGGCTAGGTCGTATCATCATCTCTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTACTTCATGATCCACGACATCAACTACCAAAAGT
TTGCTTTGGCTGCCAAGAAAGCCTCACGAACCCTGGCACAGGTAACAACCCAGGCATCCGTGTTTCTTGTCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGACATTT
ATATTCACGAGAGCTTGCACAATTGTCCTGGCTGCGAGAGTTTCAGACGTCCCAAATCAGCGATATCTGATGAATGATCGTCTGCTTTCGGATGCGACTGCATGCCCTTG
TCCCTTAACCAAATCCAAACTCAATATTTAATATGACCATGACAATGCTGTATGTTCATAAGCTGTTATGAGCTTTGTACCAAAGTTTCAAGCATGTCTTGCTGCACTGA
TGATGATCAATCAGAAGCTCACATTGTTACTGTCGAAAGTTGTCTTTCTTGTTTCAACAGCTCCTAAACCGATTTCAGAGATACAAAGCTCGCGGATGCAAAAAGTTCAT
TACGAGAATAAGATTTTATTAGACGAAAGGCTGCTCACTTTCTTAAGGTGGCCGATTTCGAATAGTAAGTTTTACTTGGATTGTACCCAAAATTGACCCCTCAATCAATT
TCCATTTTCATGAGGATTTGTAGAAACTTGGTATTAATTGTCTAATTTTTTAGGGGTATTTGACATTTATATTCATTTAAGTTATTAAAAGAGCTTATGCCAAAACTTTC
Protein sequence
MNNGENPRLNGEADEEDDDDDVNGGRAAWERTYADDRSWEALQEDESGLLRPIDNKAIFHAQYRRRLRSLSSLATTARIQKGLIRYLYIVIDFSRAAAEMDFRPSRMAVV
AKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGEASLQNGLDLVRGYLDQIPSYGHREVLILYSALNSCDPGDIMETVQKCKTSKIRC
SVIGLTAEIFICRHLCQETGGLYSVALDESHFKELLLEHAPPPPAIAESAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSP
HLARSYHHLFPIIPFDEVSDKVLHDPRHQLPKVCFGCQESLTNPGTGNNPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKSAISDE
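The CDS and protein sequences listed above can be checked against each other by translation. The sketch below assumes the standard genetic code and translates only the first 33 and last 15 nucleotides of the CDS, copied from this page. The names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tooling.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered by first, second, third base over TCAG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' denotes a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGAACAATGGCGAAAATCCGCGATTGAATGGG"  # first 33 nt of the CDS
cds_end = "ATATCTGATGAATGA"                      # last 15 nt of the CDS
print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein, plus the stop
```

Passing the full CDS to `translate` and stripping the trailing `*` should reproduce the whole protein sequence if the two records are consistent.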