; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21203 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21203
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionADP-ribosylation factor-like
Genome locationctg910:3442267..3443716
RNA-Seq ExpressionCucsat.G21203
SyntenyCucsat.G21203
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0006886 - intracellular protein transport (biological process)
GO:0016192 - vesicle-mediated transport (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0003924 - GTPase activity (molecular function)
GO:0005525 - GTP binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa]1.53e-31098.11Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_004143721.1 general transcription factor IIH subunit 2 [Cucumis sativus]0.0100Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFRRPKLATSDE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_008467294.1 PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]2.54e-30997.87Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP T NSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]2.49e-29893.63Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDANG +AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]4.19e-30696.46Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEADEEDDDDDANG LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AATEM
Subjt:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH +LNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPR+QLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSDE
Subjt:  IHESLHNCPGCESFRRPKLATSDE

TrEMBL top hitse value%identityAlignment
A0A0A0KPM4 General transcription factor IIH subunit0.0100Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFRRPKLATSDE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A1S3CUH8 General transcription factor IIH subunit1.23e-30997.87Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP T NSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A5A7UNG8 General transcription factor IIH subunit7.42e-31198.11Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRRPKLATSDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRRPKLATSDE

A0A6J1GC53 General transcription factor IIH subunit1.20e-29893.63Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDANG +AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

A0A6J1K980 General transcription factor IIH subunit9.89e-29893.4Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDANG +AAWERTYADDRSWEALQEDESGLLRPIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+ LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NP TGNSP IRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRRPKLATSDE
        IHESLHNCPGCESFRRPK ATSD+
Subjt:  IHESLHNCPGCESFRRPKLATSDE

SwissProt top hitse value%identityAlignment
Q13888 General transcription factor IIH subunit 23.7e-8940.74Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K ++ FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR

Query:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q2TBV5 General transcription factor IIH subunit 23.7e-8940.25Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K ++ FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR

Query:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR S+IGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein4.8e-8940.74Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR
        D++      WE  Y  +R+WE L+EDESG L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K ++ FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR

Query:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q9JIB4 General transcription factor IIH subunit 21.8e-8840.39Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR
        D++      WE  Y  +R+WE L+EDE+G L+      ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K ++ FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVDAFVR

Query:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  KT+K
Subjt:  EFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTC
        IR SVIGL+AE+ +C  L +ETGG+Y V LDE+H+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++      +       GGY C
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAK----VGGGYTC

Query:  PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESL
        P+C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+S + +   R      C+GCQ  L +            C  C+  FC+DCD+++H+SL
Subjt:  PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESL

Query:  HNCPGC
        H CPGC
Subjt:  HNCPGC

Q9ZVN9 General transcription factor IIH subunit 24.1e-18975.97Show/hide
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM
        A++AKHV+AF+REFFDQNPLSQIGLV+IK+G A+ LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVL+LYSAL +CDPGDIM
Subjt:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+    GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRRPK
        NCPGCES  RPK
Subjt:  NCPGCESFRRPK

Arabidopsis top hitse value%identityAlignment
AT1G05055.1 general transcription factor II H22.9e-19075.97Show/hide
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLLRPIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM
        A++AKHV+AF+REFFDQNPLSQIGLV+IK+G A+ LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVL+LYSAL +CDPGDIM
Subjt:  AVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+    GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRRPK
        NCPGCES  RPK
Subjt:  NCPGCESFRRPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGGGGAAAATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGATGATGATGCCAATGGACTTGCTGCATGGGAAAGGACTTATGCGGATGATAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGTCCGATCGACAATAAAGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGAACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTATATTGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTGGTGGCA
AAACATGTAGACGCTTTTGTAAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGTTTTGCTAATTGCTTAACAGATCTTGGTGG
AAGTCCTGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCACCGAGAAGTTTTAGTCTTATACTCTGCTCTTAATTCTTGCGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCA
GTTATTGGTCTTACTGCAGAAATTTTTATTTGCAGACATCTCTGCCAAGAAACTGGTGGCTCGTACTCTGTCGCATTGGATGAGTCTCACTTTAAAGAGTTGCTATTGGA
GCATGCACCCCCACCCCCAGCGATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCCATTGCAATATGTTCATGCCACA
AGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTTCCCACAGAGTGTCGAATTTGTGGACTGACGCTTATCTCCTCTCCCCAT
TTGGCTAGGTCGTATCATCATCTTTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCA
AGAAAGCCTCATGAATCCTAGCACAGGTAACAGTCCAAGCATACGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCT
TGCACAATTGTCCTGGCTGTGAGAGTTTCAGGCGTCCCAAATTAGCGACTTCTGACGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATGGGGAAAATAGGCGATTGAATGGGGAAGCCGATGAAGAAGATGATGATGATGATGCCAATGGACTTGCTGCATGGGAAAGGACTTATGCGGATGATAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTCGTCCGATCGACAATAAAGCAATTTACCATGCCCAGTATCGAAGGCGCCTTCGAACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTATATTGTCATTGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTGGTGGCA
AAACATGTAGACGCTTTTGTAAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGTTTTGCTAATTGCTTAACAGATCTTGGTGG
AAGTCCTGAATCTCATGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCACCGAGAAGTTTTAGTCTTATACTCTGCTCTTAATTCTTGCGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCA
GTTATTGGTCTTACTGCAGAAATTTTTATTTGCAGACATCTCTGCCAAGAAACTGGTGGCTCGTACTCTGTCGCATTGGATGAGTCTCACTTTAAAGAGTTGCTATTGGA
GCATGCACCCCCACCCCCAGCGATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCCATTGCAATATGTTCATGCCACA
AGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCACGGGTTTGTGAGCTTCCCACAGAGTGTCGAATTTGTGGACTGACGCTTATCTCCTCTCCCCAT
TTGGCTAGGTCGTATCATCATCTTTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCA
AGAAAGCCTCATGAATCCTAGCACAGGTAACAGTCCAAGCATACGTGTTTCTTGCCCAAAGTGCAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCT
TGCACAATTGTCCTGGCTGTGAGAGTTTCAGGCGTCCCAAATTAGCGACTTCTGACGAATGA
Protein sequenceShow/hide protein sequence
MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLRPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVA
KHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCS
VIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPH
LARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRRPKLATSDE