; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0021230 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0021230
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGeneral transcription factor IIH subunit
Genome locationchr09:20120819..20127520
RNA-Seq ExpressionIVF0021230
SyntenyIVF0021230
Gene Ontology termsGO:0006289 - nucleotide-excision repair (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0000439 - transcription factor TFIIH core complex (cellular component)
GO:0005675 - transcription factor TFIIH holo complex (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002035 - von Willebrand factor, type A
IPR004595 - TFIIH C1-like domain
IPR007198 - Ssl1-like
IPR012170 - TFIIH subunit Ssl1/p44
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR013087 - Zinc finger C2H2-type
IPR036465 - von Willebrand factor A-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055121.1 general transcription factor IIH subunit 2 [Cucumis melo var. makuwa]0.0100Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRHPKLATFDE
        HESLHNCPGCESFRHPKLATFDE
Subjt:  HESLHNCPGCESFRHPKLATFDE

XP_004143721.1 general transcription factor IIH subunit 2 [Cucumis sativus]3.22e-31298.11Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRHPKLATFDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRHPKLATFDE

XP_008467294.1 PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]0.099.76Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRHPKLATFDE
        HESLHNCPGCESFRHPKLATFDE
Subjt:  HESLHNCPGCESFRHPKLATFDE

XP_022949453.1 general transcription factor IIH subunit 2 [Cucurbita moschata]3.16e-30194.1Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDANG +AAWERTYADDRSWEALQEDESGLL PIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRHPKLATFDE
        IHESLHNCPGCESFR PK AT D+
Subjt:  IHESLHNCPGCESFRHPKLATFDE

XP_038874496.1 general transcription factor IIH subunit 2 isoform X1 [Benincasa hispida]5.32e-30996.93Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEADEEDDDDDANG LAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLR+LSSLATTARIQKGLIRYLYIVIDFS+AATEM
Subjt:  MNNGENRRLNGEADEEDDDDDANG-LAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVH +LNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYS+ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKV HDPR+QLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRHPKLATFDE
        IHESLHNCPGCESFR PK AT DE
Subjt:  IHESLHNCPGCESFRHPKLATFDE

TrEMBL top hitse value%identityAlignment
A0A0A0KPM4 General transcription factor IIH subunit9.9e-24798.11Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLL PIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHV+AFVREFFDQNPLSQIGLVTIKDG A+CLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNP TGNSP IRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRHPKLATFDE
        HESLHNCPGCESFR PKLAT DE
Subjt:  HESLHNCPGCESFRHPKLATFDE

A0A1S3CUH8 General transcription factor IIH subunit3.5e-25299.76Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGT NSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRHPKLATFDE
        HESLHNCPGCESFRHPKLATFDE
Subjt:  HESLHNCPGCESFRHPKLATFDE

A0A5A7UNG8 General transcription factor IIH subunit4.2e-253100Show/hide
Query:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
        MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD
Subjt:  MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMD

Query:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
        FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC
Subjt:  FRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSC

Query:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
        DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG
Subjt:  DPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGG

Query:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
        GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI
Subjt:  GYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYI

Query:  HESLHNCPGCESFRHPKLATFDE
        HESLHNCPGCESFRHPKLATFDE
Subjt:  HESLHNCPGCESFRHPKLATFDE

A0A6J1GC53 General transcription factor IIH subunit2.2e-23894.1Show/hide
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLL PIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRHPKLATFDE
        IHESLHNCPGCESFR PK AT D+
Subjt:  IHESLHNCPGCESFRHPKLATFDE

A0A6J1K980 General transcription factor IIH subunit1.1e-23793.87Show/hide
Query:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM
        MNNGENRRLNGEA+EEDDDDDAN G+AAWERTYADDRSWEALQEDESGLL PIDNKAI+HAQYRRRLR+LSSLATTARIQKGLIRYLY+VIDFS+AA EM
Subjt:  MNNGENRRLNGEADEEDDDDDAN-GLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEM

Query:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS
        DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAH LTDLGGSPESHVKALMGKLECSG+ASLQNGL+LV  YLNQIPSYGHREVL+LYSALNS
Subjt:  DFRPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNS

Query:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG
        CDPGDIMETVQKCKTSKIRCSVIGLTAE+FICRHLCQETGGSY VALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA ESSIAICSCHKEAKVG
Subjt:  CDPGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVG

Query:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY
        GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDK+FHDPRHQLPKVCFGCQE+  NPGTGNSPGIRVSCPKCKQHFCLDCDIY
Subjt:  GGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIY

Query:  IHESLHNCPGCESFRHPKLATFDE
        IHESLHNCPGCESFR PK AT D+
Subjt:  IHESLHNCPGCESFRHPKLATFDE

SwissProt top hitse value%identityAlignment
O74995 General transcription and DNA repair factor IIH subunit ssl18.2e-8940.1Show/hide
Query:  NNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDF
        +  E+ + NG         D N    WE  Y   RSW+ +QED  G L  +    I   + +R LR       T  +Q+G+IR++ +V+D S +  E DF
Subjt:  NNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDF

Query:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCD
           R  +  K+   FV EFF+QNP+SQ+ ++ + DG+AH +TDL G+P+SH++ L    +CSG+ SLQN LE+  + L+ I S+G REVL+++ ++ S D
Subjt:  RPSRMAVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCD

Query:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGS----YSVALDESHFKELLLEHA-PPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEA
        PGDI +T+       IR  ++GL AE+ IC+ +C +T  S    Y V + E HF+ELLLE   PP    A +   +L+ MGFP +  E   ++C+CH   
Subjt:  PGDIMETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGS----YSVALDESHFKELLLEHA-PPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEA

Query:  KVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQ----ESLMNPGTGNSPGIRVSCPKCKQHF
           GG+ CPRCKA+VC LP EC  C L LI S HLARSYHHLFP+  + E+         H     CF CQ    +  ++P   ++  +R +CP CK HF
Subjt:  KVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQ----ESLMNPGTGNSPGIRVSCPKCKQHF

Query:  CLDCDIYIHESLHNCPGCE
        CLDCD++ HE LH C GC+
Subjt:  CLDCDIYIHESLHNCPGCE

Q13888 General transcription factor IIH subunit 22.8e-8940.99Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDESG L       ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q2TBV5 General transcription factor IIH subunit 22.8e-8940.49Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDESG L       ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR S+IGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA ++S   +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  +      H   + C+ CQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q6P1K8 General transcription factor IIH subunit 2-like protein3.7e-8940.99Show/hide
Query:  DDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR
        D++      WE  Y  +R+WE L+EDESG L       ++ A+ +R            +++ G++R+LY+V+D S+   + D +P+R+    K +E FV 
Subjt:  DDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVAKHVEAFVR

Query:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK
        E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   ++  C G+ SL N L +    L  +P +  REVL+++S+L +CDP +I + ++  K +K
Subjt:  EFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLE--CSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSK

Query:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP
        IR SVIGL+AE+ +C  L +ETGG+Y V LDESH+KELL  H  PPPA + S+  +LI+MGFPQ          A+ S ++       + G   GGY CP
Subjt:  IRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRA--------AESSIAICSCHKEAKVG---GGYTCP

Query:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        +C+A+ CELP EC+ICGLTL+S+PHLARSYHHLFP+  F E+  + ++  R      C+GCQ  L +            C  C+  FC+DCD+++H+SLH
Subjt:  RCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGC
         CPGC
Subjt:  NCPGC

Q9ZVN9 General transcription factor IIH subunit 27.4e-19175.6Show/hide
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLL PIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM
        A++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVL+LYSAL +CDPGDIM
Subjt:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRHPKLATFDE
        NCPGCES   PK  +  E
Subjt:  NCPGCESFRHPKLATFDE

Arabidopsis top hitse value%identityAlignment
AT1G05055.1 general transcription factor II H25.3e-19275.6Show/hide
Query:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM
        R+ + +  EE+DD+DA G+  WER Y DDRSWE LQEDESGLL PIDN AIYHAQYRRRLR LS+ A   RIQKGLIRYLYIVIDFS+AA EMDFRPSRM
Subjt:  RRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRM

Query:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM
        A++AKHVEAF+REFFDQNPLSQIGLV+IK+GVAH LTDLGGSPE+H+KALMGKLE  GD+SLQN LELVH +LNQ+PSYGHREVL+LYSAL +CDPGDIM
Subjt:  AVVAKHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIM

Query:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR
        ET+QKCK SK+RCSVIGL+AE+FIC+HLCQETGG YSVA+DE H K+LLLEHAPPPPAIA+ A+ NLIKMGFPQRAAE S+AICSCHKE K+G GY CPR
Subjt:  ETVQKCKTSKIRCSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPR

Query:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH
        CKARVC+LPTEC ICGLTL+SSPHLARSYHHLFPI PFDEV +    +D R +L K CFGCQ+SL+  G GN P   V+C KCK +FCLDCDIYIHESLH
Subjt:  CKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEV-SDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLH

Query:  NCPGCESFRHPKLATFDE
        NCPGCES   PK  +  E
Subjt:  NCPGCESFRHPKLATFDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGGGGAAAATAGGCGATTGAATGGGGAAGCTGATGAAGAAGATGATGATGATGATGCCAATGGACTTGCTGCGTGGGAAAGGACTTATGCGGATGATAGGTC
GTGGGAAGCCCTGCAAGAGGATGAGTCTGGACTCCTTTGTCCGATCGACAATAAGGCAATTTACCATGCCCAGTATCGTAGGCGCCTTCGTACCCTTTCTTCCTTAGCAA
CCACTGCTCGGATTCAGAAGGGTCTTATTCGCTATCTCTATATCGTCATCGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTTGTGGCA
AAACATGTAGAGGCTTTCGTAAGGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGG
AAGTCCTGAATCTCACGTTAAAGCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCAT
CATATGGGCATAGAGAAGTTTTAGTCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCA
GTTATTGGTCTTACTGCAGAAATTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCTCACTTCAAAGAGTTGCTATTGGA
GCATGCACCCCCACCCCCAGCAATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCCATTGCAATATGTTCATGTCACA
AGGAAGCTAAAGTTGGAGGGGGCTATACTTGCCCTCGATGCAAAGCAAGGGTTTGTGAGCTTCCCACAGAGTGTCGAATTTGTGGACTGACTCTTATCTCCTCACCCCAT
TTGGCTAGGTCGTATCATCATCTTTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCA
AGAAAGCCTCATGAATCCTGGCACAGGTAACAGTCCAGGCATCCGTGTTTCTTGCCCAAAGTGTAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCT
TGCACAATTGTCCTGGCTGTGAGAGTTTCAGGCATCCCAAATTAGCGACTTTTGACGAATGA
mRNA sequenceShow/hide mRNA sequence
AGAAAAACCACTCATACGCTACTCCTTCAAGGTTTTCTTCCCAGCACACTCTCTTCATCTCTTCAATTTCACCGTTTTTCTCTATCTAAAAAGTAAACGAATCAAACAAA
CTTCAAACTGTTTTTCTCTTCATTCTTTCATACTTACACTCAGAGATCGGGAGCAGTTAAATCCACAAATCGGAAGAACATGGGTCAAGCTTTTCGCGAACTCTTCGACT
CATTTTTTGGCAACTCCGAGATATGATGGCTTAGGGGTGGAACATTTCTTGGAACCTGGGTTTCGCAAATGCTTGTTGGGAAGAAGTAATTGAAATCCAATGAAATCAAA
CCCAACTGGAGGTCATTGCTGATTCCCCATCCCTCCTATTGACGAATAGGACGCCATTACCTAAATTTTTTTACCGACCTACTGTATTCCCCAAGTTGGGATTTTTGTGT
GTAGAATTTGTTTTAGGTACTGGGGACTCATCAATTTGATCTTGTTTCAAAACCCAACAAGAGTAATCTTCTGAAATAGTCCTTAAATATGAACAATGGGGAAAATAGGC
GATTGAATGGGGAAGCTGATGAAGAAGATGATGATGATGATGCCAATGGACTTGCTGCGTGGGAAAGGACTTATGCGGATGATAGGTCGTGGGAAGCCCTGCAAGAGGAT
GAGTCTGGACTCCTTTGTCCGATCGACAATAAGGCAATTTACCATGCCCAGTATCGTAGGCGCCTTCGTACCCTTTCTTCCTTAGCAACCACTGCTCGGATTCAGAAGGG
TCTTATTCGCTATCTCTATATCGTCATCGACTTCTCCAAGGCTGCTACAGAAATGGATTTCCGACCAAGTCGAATGGCTGTTGTGGCAAAACATGTAGAGGCTTTCGTAA
GGGAATTCTTTGACCAAAATCCACTCAGCCAGATTGGTTTGGTGACTATAAAAGATGGAGTTGCTCATTGCTTAACAGATCTTGGTGGAAGTCCTGAATCTCACGTTAAA
GCGTTAATGGGTAAACTGGAATGCTCAGGTGATGCTTCCTTGCAGAATGGTCTGGAACTTGTCCACAGCTATCTAAATCAAATTCCATCATATGGGCATAGAGAAGTTTT
AGTCTTATACTCTGCTCTTAATTCTTGTGATCCTGGGGACATCATGGAGACAGTTCAGAAATGCAAAACTTCTAAAATAAGGTGTTCAGTTATTGGTCTTACTGCAGAAA
TTTTTATATGCAGACATCTCTGCCAAGAAACTGGTGGCTCATACTCTGTCGCATTGGATGAGTCTCACTTCAAAGAGTTGCTATTGGAGCATGCACCCCCACCCCCAGCA
ATTGCAGACTCTGCCATGCCTAATTTAATCAAGATGGGTTTTCCACAAAGAGCAGCAGAGAGTTCCATTGCAATATGTTCATGTCACAAGGAAGCTAAAGTTGGAGGGGG
CTATACTTGCCCTCGATGCAAAGCAAGGGTTTGTGAGCTTCCCACAGAGTGTCGAATTTGTGGACTGACTCTTATCTCCTCACCCCATTTGGCTAGGTCGTATCATCATC
TTTTTCCAATTATACCATTTGATGAAGTCTCTGATAAAGTATTTCATGACCCACGACACCAACTTCCAAAAGTTTGCTTTGGCTGCCAAGAAAGCCTCATGAATCCTGGC
ACAGGTAACAGTCCAGGCATCCGTGTTTCTTGCCCAAAGTGTAAACAACACTTCTGTCTTGATTGTGATATTTATATTCACGAGAGCTTGCACAATTGTCCTGGCTGTGA
GAGTTTCAGGCATCCCAAATTAGCGACTTTTGACGAATGAATGTCTACTTTTAGATGCAACAATATGGCCCTAACCGAATCCAAACAGGACGCTTCTCAGTCTATTCATT
TGCGTAAAAAGCTGGAATGAGAGCTTTGTATACCACTGCTTAAAATAATGTTGAATGAATGCTGAATCCAAGCTCACATTGTTACCATGCAATCATCTTTCGTGTTTCTC
AACAGCTCCTAAACCAATTTCAGACCGATCCAAAGCTTGTGGCTTTTTGACACTTGATGCCATAGTTGAAATTGGGCGGAAGATTAATTTTTCAAGATGAAAGGCTGAAA
GCCAATTTCCTTCTAGGCTGCAGGATGGACGACTTTACTTGAGTGTACCCAAAATTGACCCCCCAAATAGACATTTTTCCATTTTCATGAGATTTTGTAAAAACTTGATA
TTAATTATTATCCGATTCAGGGTGTATTTGACATTCACATTCTTTTAGTTAAGTTATTAAAGAAGCGCGTGCCAGAAAACTTCAGTGTTTACTATTGACAATAAACATTC
ACATGTATCAATGTTGTAAATATTAGCCAATATGGTTATAGTGATCGTTATTAAAATGTAATCATATTGGCTCATGAGGACAGGGG
Protein sequenceShow/hide protein sequence
MNNGENRRLNGEADEEDDDDDANGLAAWERTYADDRSWEALQEDESGLLCPIDNKAIYHAQYRRRLRTLSSLATTARIQKGLIRYLYIVIDFSKAATEMDFRPSRMAVVA
KHVEAFVREFFDQNPLSQIGLVTIKDGVAHCLTDLGGSPESHVKALMGKLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIRCS
VIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPH
LARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPGTGNSPGIRVSCPKCKQHFCLDCDIYIHESLHNCPGCESFRHPKLATFDE