CuGenDBv2

Sed0026384 (gene) of Chayote v1 genome

Gene ID: Sed0026384
Organism: Sechium edule (Chayote v1)
Description: tobamovirus multiplication protein 1-like
Genome location: LG06:12847826..12854355
RNA-Seq Expression: Sed0026384
Synteny: Sed0026384
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR009457 - THH1/TOM1/TOM3 domain; IPR040226 - THH1/TOM1/TOM3
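
The genome location field above packs a sequence name and a coordinate range into one string. A minimal sketch of how such a string could be split and its span computed follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names `Locus` and `parse_locus` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCUS_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_locus(text: str) -> Locus:
    """Parse a location string such as 'LG06:12847826..12854355'."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(match["seqid"], start, end)


locus = parse_locus("LG06:12847826..12854355")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 12854355 - 12847826 + 1 = 6530 bp.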


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136876.1 tobamovirus multiplication protein 1 isoform X1 [Cucumis sativus] | e value: 2.8e-54 | % identity: 72.67
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVLEMV+MEIPGLLFFSTYTL VLFWAEIYHQARSLPI+KLKP YC +NGVMY IQICIWI +ML  SPGAVI A LFF+VVSFSA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
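
In the alignments on this page the Subjt row reads the same as the Query row, so the middle row is the one that carries the match information. A minimal sketch of how a percent identity could be derived from that row follows. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes that identity is counted over all alignment columns, gaps included. The function name `percent_identity` is hypothetical.

```python
from typing import Iterable


def percent_identity(query_rows: Iterable[str], midline_rows: Iterable[str]) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Each element of query_rows / midline_rows is one alignment block with
    the 'Query:  ' prefix (or the matching indentation) already removed.
    """
    identical = 0
    columns = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces of a midline are easily lost in copy/paste,
        # so pad (or trim) the midline to the width of the query row.
        midline = midline.ljust(len(query))[: len(query)]
        columns += len(query)
        identical += sum(ch.isalpha() for ch in midline)
    if columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / columns


# Toy alignment, not taken from this page: 3 identical columns out of 6.
print(percent_identity(["ACDE-F"], ["AC+  F"]))
```

To apply this to a hit listed here, pass the Query rows and the unlabeled middle rows of all blocks of that hit, in order, after stripping the eight-character prefix from each.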

XP_011658787.1 tobamovirus multiplication protein 1 isoform X2 [Cucumis sativus] | e value: 2.8e-54 | % identity: 72.67
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVLEMV+MEIPGLLFFSTYTL VLFWAEIYHQARSLPI+KLKP YC +NGVMY IQICIWI +ML  SPGAVI A LFF+VVSFSA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

XP_023511656.1 tobamovirus multiplication protein 1-like [Cucurbita pepo subsp. pepo] | e value: 3.6e-54 | % identity: 72.67
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVLEMVVMEIPGLLFFSTYTL VLFWAEIYHQARSLPI KLKP YC INGVMY IQICIWI +ML++SPGAVIAA LFF+VV+FSA +GFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YY+
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

XP_038888866.1 tobamovirus multiplication protein 1-like isoform X1 [Benincasa hispida] | e value: 1.6e-54 | % identity: 73.29
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KV+EMVVMEIPGLLFFSTYTL VLFWAEIYHQARSLPI+KLKP YC INGVMY IQICIWI +ML+ SPG+VIAA LFF+VVSFSA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

XP_038888867.1 tobamovirus multiplication protein 1-like isoform X2 [Benincasa hispida] | e value: 1.6e-54 | % identity: 73.29
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KV+EMVVMEIPGLLFFSTYTL VLFWAEIYHQARSLPI+KLKP YC INGVMY IQICIWI +ML+ SPG+VIAA LFF+VVSFSA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5Z8 DUF1084 domain-containing protein | e value: 1.3e-54 | % identity: 72.67
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVLEMV+MEIPGLLFFSTYTL VLFWAEIYHQARSLPI+KLKP YC +NGVMY IQICIWI +ML  SPGAVI A LFF+VVSFSA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

A0A6J1EME9 tobamovirus multiplication protein 1-like isoform X1 | e value: 3.0e-54 | % identity: 72.05
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        K LEMVVME+PGLLFFSTYTL VLFWAEIY+QARSLPI++LKP YC INGV+Y IQICIWIF+MLNKSPGAVI A LFF+VVS SA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

A0A6J1HJF7 tobamovirus multiplication protein 1-like | e value: 1.7e-54 | % identity: 72.67
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVLEMVVMEIPGLLFFSTYTL VLFWAEIYHQARSLPI KLKP YC INGVMY IQICIWI +ML++SPGAVIAA LFF+VV+FSA +GFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHPILNL YY+
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

A0A6J1HVA7 tobamovirus multiplication protein 1-like | e value: 3.0e-54 | % identity: 71.43
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVLEMV+MEIPGLLFFSTYTL VLFWAEIYHQARSLPI KLKP YC INGVMY IQICIWI +ML++SPGAVIAA LFF+VV+FSA +GFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          DADLDVLDHP+LNL YY+
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

A0A6J1JNL1 tobamovirus multiplication protein 1-like | e value: 3.0e-54 | % identity: 72.05
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        K LEMVVME+PGLLFFSTYTL VLFWAEIY+QARSLPI+KLKP YC INGV+Y IQICIWIF+MLNKSPGAVI A LFF+VVS SA LGFLIYGGRLFVM
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L+QFP ESRGRQKKLYE                          D+DLDVLDHPILNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

SwissProt top hits (e value, % identity, alignment)
Q402F3 Tobamovirus multiplication protein 3 | e value: 2.1e-33 | % identity: 46.58
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        ++++ +++++P L FF+TY L VLFWAEIY+QAR++  + L+P++ TINGV+Y IQI +W+ +        VI + +FFA VS  A LGFL+YGGRLF+M
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L++FP ES+GR+KKL E                           ADLDVLDHPILNL YY+
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

Q402F4 Tobamovirus multiplication protein 1 | e value: 1.4e-40 | % identity: 52.8
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        KVL + ++++PGLLFFST+TL VLFWAEIYHQARSLP +KL+ +Y +ING +Y IQ CIW++L  N +        +F AVVSF A LGFL+YGGRLF+M
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L++FP ES+GR+KKL+E                          DA LDVLDHP+LNL YY+
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

Q948R8 Protein TOM THREE HOMOLOG 1 | e value: 8.1e-33 | % identity: 46.95
Query:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL
        M  ++L+ ++++IP L FF+TY L VLFWAEIY+QAR++  + L+P++ TIN V+Y IQI +W+ L        VI + +FFA VS  A LGFL+YGGRL
Subjt:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL

Query:  FVMLKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        F+ML++FP ES+GR+KKL E                           ADLDVLDHPILN  YY+
Subjt:  FVMLKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

Q9FEG2 Tobamovirus multiplication protein 1 | e value: 1.0e-35 | % identity: 49.69
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        K L  V++++PGLLFFS YTL VLFWAEIYHQARSLP +KL+  Y ++N  +Y  QI IW ++ ++ +    +   +F AVVSF A LGFL+YGGRLF M
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L++FP ES+GR+KKL+E                          D  LDVLDHP+LNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

Q9ZUM2 Tobamovirus multiplication protein 3 | e value: 2.4e-32 | % identity: 45.73
Query:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL
        M  ++L+ ++++IP L FF+TY L VLFWAEIY+QAR++  + L+P++ TIN V+Y +QI +W+ L        VI + +FFA VS  A LGFL+YGGRL
Subjt:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL

Query:  FVMLKQFPTESRGRQKKLY--------------------------EDADLDVLDHPILNLTYYM
        F+ML++FP ES+GR+KKL                           E A+LDVLDHPILN  YY+
Subjt:  FVMLKQFPTESRGRQKKLY--------------------------EDADLDVLDHPILNLTYYM

Arabidopsis top hits (e value, % identity, alignment)
AT1G14530.1 Protein of unknown function (DUF1084) | e value: 5.8e-34 | % identity: 46.95
Query:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL
        M  ++L+ ++++IP L FF+TY L VLFWAEIY+QAR++  + L+P++ TIN V+Y IQI +W+ L        VI + +FFA VS  A LGFL+YGGRL
Subjt:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL

Query:  FVMLKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        F+ML++FP ES+GR+KKL E                           ADLDVLDHPILN  YY+
Subjt:  FVMLKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

AT1G14530.2 Protein of unknown function (DUF1084) | e value: 5.8e-34 | % identity: 46.95
Query:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL
        M  ++L+ ++++IP L FF+TY L VLFWAEIY+QAR++  + L+P++ TIN V+Y IQI +W+ L        VI + +FFA VS  A LGFL+YGGRL
Subjt:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL

Query:  FVMLKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        F+ML++FP ES+GR+KKL E                           ADLDVLDHPILN  YY+
Subjt:  FVMLKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM

AT2G02180.1 tobamovirus multiplication protein 3 | e value: 1.7e-33 | % identity: 45.73
Query:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL
        M  ++L+ ++++IP L FF+TY L VLFWAEIY+QAR++  + L+P++ TIN V+Y +QI +W+ L        VI + +FFA VS  A LGFL+YGGRL
Subjt:  MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRL

Query:  FVMLKQFPTESRGRQKKLY--------------------------EDADLDVLDHPILNLTYYM
        F+ML++FP ES+GR+KKL                           E A+LDVLDHPILN  YY+
Subjt:  FVMLKQFPTESRGRQKKLY--------------------------EDADLDVLDHPILNLTYYM

AT4G21790.1 tobamovirus multiplication 1 | e value: 7.3e-37 | % identity: 49.69
Query:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM
        K L  V++++PGLLFFS YTL VLFWAEIYHQARSLP +KL+  Y ++N  +Y  QI IW ++ ++ +    +   +F AVVSF A LGFL+YGGRLF M
Subjt:  KVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVM

Query:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM
        L++FP ES+GR+KKL+E                          D  LDVLDHP+LNL YYM
Subjt:  LKQFPTESRGRQKKLYE--------------------------DADLDVLDHPILNLTYYM


Sequences
CDS sequence
ATGCTGGTAAAGGTTCTTGAAATGGTGGTGATGGAGATTCCTGGACTTCTATTTTTTTCAACATACACACTAGCTGTTCTATTTTGGGCAGAGATATACCATCAGGCGCG
AAGTCTTCCTATAAATAAACTTAAGCCTGCTTACTGCACCATCAATGGAGTTATGTACGCTATACAGATCTGCATTTGGATATTTTTGATGTTAAACAAATCTCCTGGTG
CAGTTATAGCTGCCAATCTCTTCTTTGCAGTGGTTTCGTTTTCTGCTGTACTGGGTTTCCTGATATACGGTGGAAGGTTATTTGTCATGCTGAAGCAATTCCCTACTGAA
TCTAGAGGACGCCAAAAAAAGTTGTATGAGGATGCAGATCTTGACGTCCTAGATCATCCCATTCTCAATCTGACATACTACATGGCAAGTTATTTTTAA
mRNA sequence
GAGAAATAAAAGAAAAACTAATTGCTGTTAACAATGCACAACTATAATTTTTTTTTGTGCTTGTATTTCTATTGAAATTTTTTTTTGTTTTCTAACTATGAACATCCAAA
AGGTTTCTGAAGGTGAAATTAATAAATAAACATCAACAACAAAAGCCTTGTCTTGTTGTGAACCGTTGGTCAAAAAAATCTCGGGATTTCTTCTTCCTCAAACCATAACA
CATTTTTAGATTAAGATTAGTTACATAATGTATAGTTTTGATACCCACCCCAATCCACACTCAACAGAGCATCCTATTCCCATAGGCCATATAAAAATGTTAGTCAAGGT
ATATACTTCAAATTGATAATCAAGATATTCTGAGGGACCGTTCCTTTCCCATGATATAACTGTTATTACATACTGACTGTATGACTTGAAATATTCATGCTGGTAAAGGT
TCTTGAAATGGTGGTGATGGAGATTCCTGGACTTCTATTTTTTTCAACATACACACTAGCTGTTCTATTTTGGGCAGAGATATACCATCAGGCGCGAAGTCTTCCTATAA
ATAAACTTAAGCCTGCTTACTGCACCATCAATGGAGTTATGTACGCTATACAGATCTGCATTTGGATATTTTTGATGTTAAACAAATCTCCTGGTGCAGTTATAGCTGCC
AATCTCTTCTTTGCAGTGGTTTCGTTTTCTGCTGTACTGGGTTTCCTGATATACGGTGGAAGGTTATTTGTCATGCTGAAGCAATTCCCTACTGAATCTAGAGGACGCCA
AAAAAAGTTGTATGAGGATGCAGATCTTGACGTCCTAGATCATCCCATTCTCAATCTGACATACTACATGGCAAGTTATTTTTAAAAATTTCTGATAGCAGTACATGGCA
AGTTTTAGAATCCGTATTTTCGTTCCAATATTTTTTTATGATGAATAAATTATCTTCTTAGTTAGAAATTGCCAGTAGAACAACTGTGGAGGTTCAAAATGACTCTATTT
GTTGGGATTGATAGGGCCGTTAGGTAACTTTCTCCTCACTCACCAAAAAAAGTGTCACGATACTAATGGAACATTAGATTTAACTTGCCATACTTTTACATGCAGAGAAT
TTTTTTCGAAGTAGCTTTAACTCAATAAAGTAACGGGATCTCCACCTTTCTGTTTAATTCAGAATCTGTCAAAAACTTCGCATGGTATCAATTTTTTTTTGAAACGAGGG
TAGTCAAACCTGTCCCTAGGCCAGGTATCGGAGACATTGAAGCAGTATTGTCATAGGTAAGTCTCGAACCTGGGACTTCAATCGGAACATACTTTTAAAGCCCAAGGTCT
TCAACCACTATGCCATCCCTTGAGGACATGGTATCAATTTTTCATAAATTCACACTGCAGTCATTTGGAAACGTTTTCATACGTCCATACTGCATTATGGGGCCTGCAAA
TTCTGTGAATTATAGTCTCCTGTTGAATTTGCAATCCTAGTTTCAAATTTCTTTCTCCCAGATAAGGAATTCAGACGGTTTTTCCATTGAGTCTCTTCCCTTTTCGTCAC
TCCATGATGTTGCTTGACTTCCAAATTTTAATAATTCTCAGAAGTTCAGAACTAGCATTTAGCTCAAATAAATTGATCTTCACTTGGGTGAGAGGTTATATGAAGGGTAT
ATTCTCTTGGAGCCTTTACAACCAGTTCATTTTTTTTTTTGATAGTCTTCAATAATTTTTTCAGTCCTCTAATAAGCACGGATATGGATACGGATATGGATATGAGACAC
AAACTCAAAACACTACGAAACAACTATACATCAATTTCTGAAAAACACAGGACATAGAAATCCATTTCTGTATCATGAGTTTCATTTTTCTCAAGGAACATTTGAATTTG
TTGAATAAACTTTAATGTTGAGAACAAATTAAGGAATTAGTGGATCCTTGATTTATCAATTAGTAACTTTCTCTCTAGACTTTCTCTCTCTAAATTTTCTCTCTAGACCT
CTAACCCTAAACCTTTCGCTCACTTCCTTTGGCACCGACGCCACCAGTATCGACATCGGAGGGGGGAAATCACAGCGACCCACATCGTTTCGACGGATGCTTCTCTCCAT
CTCTCACCTCGGGAGAGGTTTGATTTTTCTTGATGCCTCTACTTCTCCCGTTAATTTTTTCTTCAACTGCTTCATGGCAAAGTTATCTCCTATTTCACCTACGACTTGCT
TCTCCAATATGTTTTGGCGCATCTGTTTGGGTTCCAATGGCAGTGAAGACTTTTTTTTTAATTTTTGGTGATGACGATGACGATGACGATTCGAAGATGAGAAAGATAGG
TAAACCCTAGTTTGTGGTGCGGCTCATACCACAAATCGGTTTGGTAGAGCATGTTTCGGTCCACCTGGGACTGAACCGGCGGGCATCACTACACCAAGGTGAAATGTCTT
ATCCGTATTGGTTTTTAGACGGTCGGTTTTGGATGCCATCTTCGTATCAAATAGAGATCGTCTTTCTCATTGGTTTGGTCAACTCGCCTTTTGGTTTTGTCAATTCTATC
ACTGACTCCTGTAACAAATGGTCCTCACAACCCCTCGATATGGCTTCTTACAATCCGTTTGTGGCTCTTTTACCAGTATTAGAGGGAATTTTCTCTTGGTTGTAACGGTT
TTTCATACACAACCCCAATCCTTTATTTTTCCATGTATGGATTAGCTTAATGTAACACACTTTTTATCAATGAATGGTTTGTTTCCTAATAAAAAAAAACTTGGAGACCG
TGTTGTAATAGGGCTATAACTAGAGTGCCAATTCCATAGAGTAAAGTAGAGTGAAGGAGAGTAGAGAAATTTATAGAGAAACATGTAGAGAAATAAAGAGAGCATTTAGA
GATTTTGTGTGCCATGTGTCTCCAATCTCGAAGTGAAAAGCTTGCGCGATTGTTATTTGTATTTTATTGTTTTTGTTTATTCGCTTGTCATTATTTATAGAATTCTTTGT
CAGATTTTAAAAAATAATGTAATTTTAAAACATAGGGAACAAGAGATAACAAACCTAGAGAAATTATAGGTGGTAATAGTTTAGGCTTAATTTTCTAAAATTAAAAACGA
ATGATTTTTCGACCTTTTTGTTTTCAAATTTTCCCTCAGTTGGGCTTTATGGATGGCTCTTTACACTAAATTGTAATGAAATCTCTTTCTAAAAGTTCCTCTCAAAAGCA
CTTACCAAGCTATAGTTTTTAAGAAGTTTTCCCCTTCGACGCTTGGCCTTGAAAGGGGACCTTTGACTCCTCTCTTCGCCGCCACCTTCCCTTCTCCTCCCTTTCATCGT
GCTACTGCTTCCCTTTTCCACTTGATAGAGTCATTCAACACGAGTCCAAGCATCCTTGGGAGCTTCCTTGCATTTTTTTCTCGCGCGAGCAACTTCCTTCTTCTCTCCTC
TTTGCTTTCTTCCTTCGCCTTCATTCCACATGAGATTTGAGCTTTCTAAAGTGAGTTATCTCACATTTTCTTCGCGTGGCTTTCCTTGAAGCGATGCTCGATGTCTCTTT
AGGCCCACTCCCACTGTGGCATCGGCAATGGAGTAGATCGCCTGCTTTTTACAAGTTCATCATCGGTATTCGCACTTCGCAAGAGTTGCAACCTATTGTCATTCCACCTT
ATCTTTTCTCCTTCCACCAAATCTAGTCTTTTGAGTTGTTTCCTACTAAACTTTGCGACTAGGTTTCTACTCAAGCTCAGATCATCTTGACAAGATTTGAGTCTTTTTTC
TCACCATGTAGCAACCTCTACCAATTTACTTAACTTTACACCAATTTACCAAACTTTCTCCTCTGTTTTCTTGTGCTTCAATTACAGCTTCAGTAGTCCTCCAATAAGTC
TTATGATGAGCAGAGTCTATTGGCAGTTGTAATTTCTATTTGTTGTTGCTTAGCTTGTAGGGTTAAGTTTTATTTTAGAATTATTATTAAGAAATATGTTATTCATGTTT
GTAATCTTGTTAATTTAATAAAAACTTGAACGTAAATGTGAATAGAAAAAAATCGCTCAAGCTTTCACTTTTGCGATTTGAGAGACAAGGTACACAACGAAAATTTCTCT
ATGAACACTTCTTTATTTCTCTAAAAACTTTCTCTTGAGCTTTACATTTGCTCTTCCTTTCTCTAATTTTCACAATGAAATAAGTCACCGATTTATAGTCCTATTATAAC
TTGGTCTTCAAGTTCTTACCAAAAACGTTTTTCACTAATTGCTCAATTAAGGATCCACTGATTCCATAATTAGCTAAAAAATTGACAACCCATTCATTCCTTAATAAAAT
GTCAACCAATTACATTTTAAACAAAGTATTAAATTTACATTAAAGTTTATTCAACAAAATTGATCCAAAAACTTTAATGCTTCTTTCCTCTGTCATGCTAATCATCTTCT
TGAATCTTTTGAACAAAACCTTTGGCAATGATTTTGTAAAAATATCCGCAACTTGATCTTGACTCGCCACATCTTCAGTTCCACGCTTCCTTCCTTCATATGATCTCAGA
TAAAATGAAAGTGAACATCAATGTGTTTGCTTCTTTCATGATTCATTGGATTCTTTGCTAACTCAATTGCTGATCTATTGTCAACTTGAATCACGGTTGCACCAGGTTGC
TTTAGCTCCATCTTGCTCAACTAATTTCTCAACATATCGCATGACATATGTACCAAGAAGCTGCAACATATTCTGCTTCGCATGTCGAAGGCTTCACGATTGGTTGTTTC
TTTGACAGCCAAGTAAATGTTGTGTTTCTCATAAAGAATGCATATCCTGATGTACTTTTTTGATCGTTTATGTCACCGCACCAATTGCTGTTGGAGTAACCAATCAACTT
ATAGTCTTCTGCTTTTGAATAAAACATCTCAAGTGCAGTTTCTTGGATGTATCACAAAATTTGCCACAATGATTTCCAATGCGAATAAATCGACTCCTCTATGAACAAAC
TAACAATGCCAACACTTAATGCGAGATTCGGCCTTGTGCATGTGAGATAAGAAGGCTTCCCACCAAGTTTCGATATCTGCTCGCATCTACATGTTCCCCTCCATCGTACT
TCGAGAGTTTCATATCAGGTTCCGTTGGTATTGAAATCGGGTTGCAATTTGCCATTTTGAATTTCTTTAAAATCTCTTTCACATATTGCTCTTGCGATATAAAGATTCCC
ATCTCTTATTGTTTAACATTCAAACCGAGGAAAAACTTCAACAATCCTAAATCAGTCATCTCAAACTCTCGCATCATTGTGCTCTTAAACTCTCTCTTCTATCTAGTGTT
CTAAAAAGCATGCTTGGACACAAGTCCCACCTCTAGCACCTCGC
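
The CDS listed earlier on this page appears as a contiguous stretch inside the mRNA sequence above. A minimal sketch of how its position, and with it the lengths of the flanking untranslated regions, could be located follows. The function name `locate_cds` is hypothetical, and the sketch assumes an exact, gap-free match between the two sequences.

```python
from typing import Tuple


def _clean(seq: str) -> str:
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def locate_cds(mrna: str, cds: str) -> Tuple[int, int, int]:
    """Return (5' UTR length, CDS length, 3' UTR length) in nucleotides.

    Assumption: the CDS occurs verbatim in the mRNA; only the first
    occurrence is reported.
    """
    mrna, cds = _clean(mrna), _clean(cds)
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(cds), len(mrna) - start - len(cds)


# Toy sequences, not taken from this page.
print(locate_cds("ggATGAAATAAcc", "ATG AAA TAA"))
```

Because `_clean` strips whitespace, the line-wrapped sequences can be pasted in as they appear here.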
Protein sequence
MLVKVLEMVVMEIPGLLFFSTYTLAVLFWAEIYHQARSLPINKLKPAYCTINGVMYAIQICIWIFLMLNKSPGAVIAANLFFAVVSFSAVLGFLIYGGRLFVMLKQFPTE
SRGRQKKLYEDADLDVLDHPILNLTYYMASYF
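
The protein sequence above can be cross-checked against the CDS sequence listed earlier on this page by translating the CDS. A minimal sketch follows, assuming the standard nuclear genetic code and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are hypothetical and belong to no particular library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Codons containing ambiguous bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First four and last six codons of the CDS shown on this page.
print(translate("ATGCTGGTAAAG"))        # MLVK
print(translate("ATGGCAAGTTATTTTTAA"))  # MASYF
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence is a quick consistency check for a record like this one.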