; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004086 (gene) of Chayote v1 genome

Gene IDSed0004086
OrganismSechium edule (Chayote v1)
Descriptiontobamovirus multiplication protein 1
Genome locationLG01:5436672..5445606
RNA-Seq ExpressionSed0004086
SyntenySed0004086
Gene Ontology termsGO:0005774 - vacuolar membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009457 - THH1/TOM1/TOM3 domain
IPR040226 - THH1/TOM1/TOM3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456345.1 PREDICTED: tobamovirus multiplication protein 1 isoform X1 [Cucumis melo]2.1e-16586.07Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELL+     +P+DLLVL+V  ASFNGLLAF+AFSQLIRIHMRSQQDGWTRQKA+HLMIGSSNLGYMIYFIFALVAI +LWH WSHV  FVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID
        LFLAAFLL+LSFWVDLCHQAND +DDDD+EENN RQ+LLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVV LVF LMVAVSILIWIG G+NPID
Subjt:  LFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID

Query:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS
        ST VARVYE F+A+TVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGVEA V+L LYFCMGS
Subjt:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS

Query:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        LIPSAFLLWIMRELPP KK QR EESRAIAFISHG ADV+PQGWTAV  SKNQ S  SP
Subjt:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

XP_022970675.1 uncharacterized protein LOC111469590 [Cucurbita maxima]4.9e-16788.3Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELLA     VPVDLLVLNVA ASFNGLLAF+AFSQLIRIHMRSQQDGWTRQK +HLMI SSNLGYM YFIFALVAIF+  H WSHV GFVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQANDDDDDDE-EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID
        LFLAAFLL+LSFWVDL HQANDDDD+DE EENNTRQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQKFVIVVVMLVF LMVAVSILIWIG G+NPID
Subjt:  LFLAAFLLVLSFWVDLCHQANDDDDDDE-EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID

Query:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS
        STAVA+VYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGV+ALVLL LYFCMGS
Subjt:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS

Query:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        LIPSAFL+W MRELPP KK QR EESRAIAFISHG ADV+PQGWTAVT SKNQVS  SP
Subjt:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

XP_023534011.1 uncharacterized protein LOC111795688 [Cucurbita pepo subsp. pepo]5.1e-16486.98Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELLA     VPVDLLVLNVA ASFNGLLAF+AFSQLIRIHMRSQQD WTRQK +HLMI SSNLGYM YFIFALVAIF+  H WSHV GFVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQANDDDDDDE---EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENP
        LFLAAFLL+LSFWVDL HQANDDDDDDE   EENNTRQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQK VIVVVMLV  LMVAVSILIWIG G+NP
Subjt:  LFLAAFLLVLSFWVDLCHQANDDDDDDE---EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENP

Query:  IDSTAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCM
        IDSTAVA+VYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+R NGV+ALVLL LYFCM
Subjt:  IDSTAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCM

Query:  GSLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        GSLIPSAFL+W MRELPP KK QR EESRAIAFISHG ADV+PQGWTAVT SKNQVS  SP
Subjt:  GSLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

XP_031744161.1 tobamovirus multiplication protein 1 [Cucumis sativus]2.7e-16586.07Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELL+     +P+DLLVL+V  ASFNGLLAF+AFSQLIRIHMRSQQDGWTRQKA+HLMIGSSNLGYMIYFIFALV I +LWH WSHV  FVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID
        LFLAAFLL+LSFWVDLCHQAND +DDDD+EENN RQ LLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVVMLVF LMVAVSILIWIG G+NPID
Subjt:  LFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID

Query:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS
        STAVARVYE F+A+TVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGVEA V+L LYFCMGS
Subjt:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS

Query:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        LIPSAFLLWIMRELPP KK QR EESRAIAFISHG AD +PQGWTAV  SKNQ S  SP
Subjt:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

XP_038902574.1 tobamovirus multiplication protein 1 [Benincasa hispida]2.9e-16786.94Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELLA     +P+DLLVL+VA AS NGLLAF+AF QLIRIHMRSQQDGWTRQKA+HLMIGSSNLGYMIYFIFALVAI +LWH WSHV GFVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQAND--DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPI
        LFLAAFLL+LSFWVDLCHQAND  DDD+D+EENN RQALLENSKNKPGSSNVDGHRRCCGFPA+HLGSRQK VI+VVMLVF LMVAVSILIWIG G NPI
Subjt:  LFLAAFLLVLSFWVDLCHQAND--DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPI

Query:  DSTAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMG
        DSTAVARVYE F+A+TVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGV+ALVLLILYFCMG
Subjt:  DSTAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        SLIPSAFLLWIMRELPP KK QR EESRAIAFISHG ADV+PQGWTAV  SKNQ S  SP
Subjt:  SLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

TrEMBL top hitse value%identityAlignment
A0A1S3C2L2 tobamovirus multiplication protein 1 isoform X19.9e-16686.07Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELL+     +P+DLLVL+V  ASFNGLLAF+AFSQLIRIHMRSQQDGWTRQKA+HLMIGSSNLGYMIYFIFALVAI +LWH WSHV  FVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID
        LFLAAFLL+LSFWVDLCHQAND +DDDD+EENN RQ+LLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVV LVF LMVAVSILIWIG G+NPID
Subjt:  LFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID

Query:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS
        ST VARVYE F+A+TVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGVEA V+L LYFCMGS
Subjt:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS

Query:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        LIPSAFLLWIMRELPP KK QR EESRAIAFISHG ADV+PQGWTAV  SKNQ S  SP
Subjt:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

A0A1S3C336 tobamovirus multiplication protein 1 isoform X21.4e-15188.29Show/hide
Query:  MRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSK
        MRSQQDGWTRQKA+HLMIGSSNLGYMIYFIFALVAI +LWH WSHV  FVLMAFPKILFLAAFLL+LSFWVDLCHQAND +DDDD+EENN RQ+LLENSK
Subjt:  MRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSK

Query:  NKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEAS
        NKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVV LVF LMVAVSILIWIG G+NPIDST VARVYE F+A+TVLLSGGALGFYGFMLFYRLKKVRSEEAS
Subjt:  NKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEAS

Query:  SEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQG
        SEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGVEA V+L LYFCMGSLIPSAFLLWIMRELPP KK QR EESRAIAFISHG ADV+PQG
Subjt:  SEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQG

Query:  WTAVTLSKNQVSCLSP
        WTAV  SKNQ S  SP
Subjt:  WTAVTLSKNQVSCLSP

A0A6J1D8P5 tobamovirus multiplication protein 18.7e-16285.03Show/hide
Query:  MALELLAVPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAA
        +A     VP+DL++L+ A ASFNG+LAFLAFSQLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA FELW+  SHV GFVLMAFPKILFLAA
Subjt:  MALELLAVPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAA

Query:  FLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVA
        FLL+LSFWVDLCHQAND +DDDD+EEN+T+QALLENSKNKPGSSNVDGHRRCCGFPA HLGSRQK VIVVV+LVF LMVAVS+LIWIG G+NPIDSTAVA
Subjt:  FLLVLSFWVDLCHQAND-DDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVA

Query:  RVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSA
        RVYE F+AVTVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF RTNGV ALVLLILYFCMGSLIPSA
Subjt:  RVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSA

Query:  FLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        FLLWIMRELPP KK QR EESRAIAFISHG AD +PQ W  VT SKNQV+  SP
Subjt:  FLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

A0A6J1G615 uncharacterized protein LOC1114511269.3e-16486.91Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELLA     VPVDLL LNVA ASFNGLLAF+AFSQLIRIHMRSQQD WTRQK +HLMI SSNLGYM YFIFALVAIF+  H WSHV GFVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQANDDDDDDE-EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID
        LFLAAFLL+LSFWVDL HQANDDDD+DE EENNTRQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQK VIVVVMLVF LMVAVSILIWIG G+NPID
Subjt:  LFLAAFLLVLSFWVDLCHQANDDDDDDE-EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID

Query:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS
        STAVA+VYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWR +R NGV+ALVLL LYFCMGS
Subjt:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS

Query:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        LIPSAFL+W MRELPP KK QR EESRAIAFISHG ADV+PQGWTAVT SKNQVS  SP
Subjt:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

A0A6J1I3I8 uncharacterized protein LOC1114695902.4e-16788.3Show/hide
Query:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI
        M LELLA     VPVDLLVLNVA ASFNGLLAF+AFSQLIRIHMRSQQDGWTRQK +HLMI SSNLGYM YFIFALVAIF+  H WSHV GFVLMAFPKI
Subjt:  MALELLA-----VPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKI

Query:  LFLAAFLLVLSFWVDLCHQANDDDDDDE-EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID
        LFLAAFLL+LSFWVDL HQANDDDD+DE EENNTRQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQKFVIVVVMLVF LMVAVSILIWIG G+NPID
Subjt:  LFLAAFLLVLSFWVDLCHQANDDDDDDE-EENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPID

Query:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS
        STAVA+VYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSAL+DLLTDIPL YNWRF+RTNGV+ALVLL LYFCMGS
Subjt:  STAVARVYEYFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGS

Query:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP
        LIPSAFL+W MRELPP KK QR EESRAIAFISHG ADV+PQGWTAVT SKNQVS  SP
Subjt:  LIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGEADVSPQGWTAVTLSKNQVSCLSP

SwissProt top hitse value%identityAlignment
Q402F3 Tobamovirus multiplication protein 32.2e-0822.48Show/hide
Query:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD
        G+++ +A  QLIRI MR  + GWT QK  H       L +++  + +LV  F  ++      ++  +L+  P + F   + L++ FW ++ +QA     D
Subjt:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD

Query:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--
                                              G R  F       +  ++  + I++W+     P+    V     +F  V++     ALGF  
Subjt:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--

Query:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSK
        YG  LF  L++  V S+    ++++VG +  +C  CF       L+  + + +N  F +   ++ L   +L ++Y+ +  ++PS+ +L+I+R+LPP +
Subjt:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSK

Q402F4 Tobamovirus multiplication protein 18.5e-1325.66Show/hide
Query:  LLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFE----LWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDD
        L++ +A  QLIRI +R  + GWT QK  HLM       +++  + A+V  F     L+H    VL   ++  P +LF + F L++ FW ++ HQA     
Subjt:  LLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFE----LWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDD

Query:  DDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF-
                    L   K +    +++G        AI+                     +   IW+    N  D++ V  + + FIAV   ++  ALGF 
Subjt:  DDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF-

Query:  -YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKK
         YG  LF  L++  + S+    ++ +VG +  +C  CF  S  + +L+         F+    ++ L   VL ++Y+ +  ++PSA +L+I+R+LPP + 
Subjt:  -YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKK

Query:  NQRH
        + ++
Subjt:  NQRH

Q948R8 Protein TOM THREE HOMOLOG 11.3e-0823.15Show/hide
Query:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD
        G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV +F  +  +    +L  +L+  P + F   + L++ FW ++ +QA     D
Subjt:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD

Query:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--
                                              G R  F   +  +V+ + +A+ +++W      P+    +  + + F A   L +  ALGF  
Subjt:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--

Query:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSK
        YG  LF  L++  V S+    ++++VG +  +C  CF       L+  I + ++  F+    ++ L   +L  +Y+ +  ++PS+ +L+I+R+LPP +
Subjt:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSK

Q9FEG2 Tobamovirus multiplication protein 19.4e-1225Show/hide
Query:  LLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDDDEE
        L++ +A  QLIRI MR  + GWT QK  HLM    N    + F F +    +++      L +VL+  P +LF +A+ L++ FW ++ HQA     D   
Subjt:  LLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDDDEE

Query:  ENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--YGF
                                              +   I V + V+   + +   IW+       D++ V  V + FIAV   ++  ALGF  YG 
Subjt:  ENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--YGF

Query:  MLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRH
         LF+ L++  + S+    ++ +VG +  +C  CF    ++  ++         F++   ++ L   VL ++Y+ +  ++PSA +L+I+R+LPP + + ++
Subjt:  MLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRH

Q9ZUM2 Tobamovirus multiplication protein 38.2e-0822.15Show/hide
Query:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD
        G+++ +A  QL+RI +R  + GWT QK  H       L +++  + A+V +F   +      +L  +L+  P + F   + L++ FW ++ +QA     D
Subjt:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD

Query:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--
                                              G R  F       + +++  V I +W+     P+    V     +F  V++     ALGF  
Subjt:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--

Query:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKKN
        YG  LF  L++  V S+    ++++VG +  +C  CF       L+  I + +   F+    ++ L   +L  +Y+ +  ++PS+ +L+I+R+LPP +  
Subjt:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKKN

Query:  QRHEESR
         ++ + R
Subjt:  QRHEESR

Arabidopsis top hitse value%identityAlignment
AT1G14530.1 Protein of unknown function (DUF1084)9.0e-1023.15Show/hide
Query:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD
        G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV +F  +  +    +L  +L+  P + F   + L++ FW ++ +QA     D
Subjt:  GLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIF--ELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDD

Query:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--
                                              G R  F   +  +V+ + +A+ +++W      P+    +  + + F A   L +  ALGF  
Subjt:  DEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--

Query:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSK
        YG  LF  L++  V S+    ++++VG +  +C  CF       L+  I + ++  F+    ++ L   +L  +Y+ +  ++PS+ +L+I+R+LPP +
Subjt:  YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSK

AT3G59090.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457)4.8e-10454.34Show/hide
Query:  LVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLC
        + +N+  A  +  LAF+AF QL R H R++Q GWTRQK +HLMI SSN G +IYF+ A++A    WH WS+ LGF+LMAFPKILFLA FLL+LSFWVD+C
Subjt:  LVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLC

Query:  HQAN-DDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVL
        HQ N ++DDDD+EEN+ +Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF LM++ +ILIWI +G+NP++S+ +A VY    A  +L
Subjt:  HQAN-DDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVL

Query:  LSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPS
        ++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+LI LLT IPL Y+W   + +G++ALVLLI+Y+ +GS +P AF+LW++RELPP 
Subjt:  LSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPS

Query:  KKNQRHEESRAIAFISHGEADVSPQG----WTAVTLSKNQVSCLSP
            R E++R I ++++      PQG    W + T+SKNQVS  SP
Subjt:  KKNQRHEESRAIAFISHGEADVSPQG----WTAVTLSKNQVSCLSP

AT3G59090.2 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457)2.0e-10253.96Show/hide
Query:  LVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLC
        + +N+  A  +  LAF+AF QL R H R++Q GWTRQK +HLMI SSN G +IYF+ A++A    WH WS+ LGF+LMAFPKILFLA FLL+LSFWVD+C
Subjt:  LVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLC

Query:  HQAN-DDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVL
        HQ N ++DDDD+EEN+ +Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF LM++ +ILIWI +G+NP++S+ +A VY    A  +L
Subjt:  HQAN-DDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVL

Query:  LSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPS
        ++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+LI LLT IPL Y+W   + +G++ALVLLI+Y+ +GS +P AF+LW++RELPP 
Subjt:  LSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPS

Query:  KKNQRHEESRAIAFISHGEADVSPQG----WTAVTLSKNQV
            R E++R I ++++      PQG    W + T+SKNQ+
Subjt:  KKNQRHEESRAIAFISHGEADVSPQG----WTAVTLSKNQV

AT3G59090.3 LOCATED IN: endomembrane system5.0e-10153.67Show/hide
Query:  LLAVPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLV
        LL++P     L +++ SF  +LA   F QL R H R++Q GWTRQK +HLMI SSN G +IYF+ A++A    WH WS+ LGF+LMAFPKILFLA FLL+
Subjt:  LLAVPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLV

Query:  LSFWVDLCHQAN-DDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYE
        LSFWVD+CHQ N ++DDDD+EEN+ +Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF LM++ +ILIWI +G+NP++S+ +A VY 
Subjt:  LSFWVDLCHQAN-DDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYE

Query:  YFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLW
           A  +L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+LI LLT IPL Y+W   + +G++ALVLLI+Y+ +GS +P AF+LW
Subjt:  YFIAVTVLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLW

Query:  IMRELPPSKKNQRHEESRAIAFISHGEADVSPQG----WTAVTLSKNQVSCLSP
        ++RELPP     R E++R I ++++      PQG    W + T+SKNQVS  SP
Subjt:  IMRELPPSKKNQRHEESRAIAFISHGEADVSPQG----WTAVTLSKNQVSCLSP

AT4G21790.1 tobamovirus multiplication 16.7e-1325Show/hide
Query:  LLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDDDEE
        L++ +A  QLIRI MR  + GWT QK  HLM    N    + F F +    +++      L +VL+  P +LF +A+ L++ FW ++ HQA     D   
Subjt:  LLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVDLCHQANDDDDDDEE

Query:  ENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--YGF
                                              +   I V + V+   + +   IW+       D++ V  V + FIAV   ++  ALGF  YG 
Subjt:  ENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGF--YGF

Query:  MLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRH
         LF+ L++  + S+    ++ +VG +  +C  CF    ++  ++         F++   ++ L   VL ++Y+ +  ++PSA +L+I+R+LPP + + ++
Subjt:  MLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEAL---VLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCTGGAATTGCTTGCCGTTCCCGTTGACCTTCTCGTTCTCAACGTAGCGACGGCTTCCTTCAATGGCCTTCTTGCTTTCCTCGCCTTCTCGCAGCTTATCAGAAT
TCACATGCGGAGTCAACAGGATGGATGGACACGTCAAAAAGCCATCCATCTGATGATAGGATCATCTAACTTGGGCTATATGATCTACTTCATATTTGCATTAGTTGCTA
TTTTTGAGCTCTGGCACACCTGGTCTCATGTGCTTGGATTTGTCCTCATGGCCTTCCCTAAAATACTGTTTCTTGCAGCTTTTCTCCTGGTCCTTTCTTTCTGGGTCGAC
CTTTGCCATCAGGCAAATGATGACGACGACGACGATGAAGAAGAAAATAACACTCGACAAGCCTTGTTGGAAAATTCAAAGAACAAACCTGGTTCATCAAATGTAGATGG
CCATCGAAGATGTTGTGGCTTTCCTGCTATTCATCTTGGAAGTAGGCAAAAATTTGTAATTGTGGTTGTCATGCTGGTATTTTCCCTCATGGTCGCAGTTTCCATTCTGA
TCTGGATTGGGACAGGGGAAAATCCTATTGATTCTACCGCTGTTGCCAGGGTATATGAATACTTTATTGCTGTTACAGTTCTTCTATCAGGAGGAGCCTTAGGCTTCTAT
GGTTTTATGTTATTTTACAGATTGAAAAAAGTACGTTCAGAGGAAGCTTCTTCAGAGATGAAGAAGGTTGGGGGTCTAGCAGTTGTCTGTGTTGTGTGTTTTACATCAAG
TGCTTTGATAGATCTTCTAACAGATATTCCTCTTTTGTACAATTGGCGCTTCGAGAGAACAAATGGAGTAGAAGCATTAGTCCTTTTGATTTTGTACTTCTGCATGGGTT
CGTTGATTCCCTCAGCCTTTTTATTGTGGATTATGAGAGAGTTGCCGCCTTCTAAGAAAAATCAAAGGCATGAAGAATCGAGGGCAATTGCTTTTATAAGCCACGGGGAA
GCTGATGTAAGTCCTCAGGGTTGGACTGCTGTAACTCTTTCAAAGAATCAGGTTTCTTGCTTATCACCCTCTAAAGTATCAATATTTTATTTCCCTTATTTTTTCTTTGC
ACAACATATGGGGAGTTGGAGATTTTGA
mRNA sequenceShow/hide mRNA sequence
AAAACCAAAACCAAAACCAAAACCAAAACCACAGCGACAAGACTTGCAGAAAAGCAAATTAGCCTTTGTTTCTGCTTCTTCTTTTCTCTTATCTTCTCGTACGCTCGGAT
TGTTCCCCCTCCATTTCCCGCCGATGCAACACGCCTCGCCGCCGCCGCCGCCGCCGTCTCCATCGCCGCCGCCATCAATCGCTTCCTCTTCGTTGCACGCGATCTGAAAC
AACTACTTGGTTTTCCCGTTCAAGATGGCGCTGGAATTGCTTGCCGTTCCCGTTGACCTTCTCGTTCTCAACGTAGCGACGGCTTCCTTCAATGGCCTTCTTGCTTTCCT
CGCCTTCTCGCAGCTTATCAGAATTCACATGCGGAGTCAACAGGATGGATGGACACGTCAAAAAGCCATCCATCTGATGATAGGATCATCTAACTTGGGCTATATGATCT
ACTTCATATTTGCATTAGTTGCTATTTTTGAGCTCTGGCACACCTGGTCTCATGTGCTTGGATTTGTCCTCATGGCCTTCCCTAAAATACTGTTTCTTGCAGCTTTTCTC
CTGGTCCTTTCTTTCTGGGTCGACCTTTGCCATCAGGCAAATGATGACGACGACGACGATGAAGAAGAAAATAACACTCGACAAGCCTTGTTGGAAAATTCAAAGAACAA
ACCTGGTTCATCAAATGTAGATGGCCATCGAAGATGTTGTGGCTTTCCTGCTATTCATCTTGGAAGTAGGCAAAAATTTGTAATTGTGGTTGTCATGCTGGTATTTTCCC
TCATGGTCGCAGTTTCCATTCTGATCTGGATTGGGACAGGGGAAAATCCTATTGATTCTACCGCTGTTGCCAGGGTATATGAATACTTTATTGCTGTTACAGTTCTTCTA
TCAGGAGGAGCCTTAGGCTTCTATGGTTTTATGTTATTTTACAGATTGAAAAAAGTACGTTCAGAGGAAGCTTCTTCAGAGATGAAGAAGGTTGGGGGTCTAGCAGTTGT
CTGTGTTGTGTGTTTTACATCAAGTGCTTTGATAGATCTTCTAACAGATATTCCTCTTTTGTACAATTGGCGCTTCGAGAGAACAAATGGAGTAGAAGCATTAGTCCTTT
TGATTTTGTACTTCTGCATGGGTTCGTTGATTCCCTCAGCCTTTTTATTGTGGATTATGAGAGAGTTGCCGCCTTCTAAGAAAAATCAAAGGCATGAAGAATCGAGGGCA
ATTGCTTTTATAAGCCACGGGGAAGCTGATGTAAGTCCTCAGGGTTGGACTGCTGTAACTCTTTCAAAGAATCAGGTTTCTTGCTTATCACCCTCTAAAGTATCAATATT
TTATTTCCCTTATTTTTTCTTTGCACAACATATGGGGAGTTGGAGATTTTGAACAGGAGTACATGTCAATTACCTACCACTGAGTTAGGTCACTTTGGCAGTATCAACAT
TATCTTTTTGAAGTTTTGTTCTTGATTTATTTTTTGCGAATGTAATGAATAAGACTTGTACATGTCCATGCCAATGTATGGATCAAAACCTGAGGTGATTTGGCACCTTA
AAATTTTCCATCTGGTGATAAGATGCATCTGAACCACTCCAATTTTTTCGATATTGCTCATAAAAGCCGAAGCTCTTTTTTCTAGGGTAATAAACAACTTGGATCACCCA
ACCCAATCTGTAAAAGTTTGGGTTGGGTTAGTTTTTTTTTTTTCGGTTTGAGTTGAATTGTTTTTGGAGAACCTAAATAACCTCTATCCAACTCCCAAACGATTGATGTT
GAGGTTTGAATTTACTTTGTCCAATCATGATTCGAAGAGTTGACACCATCAAACCGATAGACCATGAACTTGCCTCTTACCTAAGACTATTGTAATGGTGTACAACACAT
TAATACCTATTCTATATAGGTTGTTAAAGGTATATCTAAAAGCTTAAACTCCCCCACCTAAACGACTTTCTAATTTCTCTTTCTAGAAACCTTAAACTCCCTCACGTCTG
ATCTTTCCTCCCTTTGTGTGTAGCTCCCTTGTGTTGAGGAGTAGGAAAGGTTGCCTTCAGAGCTCGAGCTTCTCTTCTCCATCTCAGTTCGATGTCTCTTTCTTCTTCGG
CGGCACGATCTTTTTTCTTTTCCTCCATTGACCTCGTTGTCGCGTCCCTCTTTCTTTCCAACACTTGCACTCTTCCATTGACATCAATGTCATGTCTCTTTCCATGAAGT
TGCATTCTTCGTTGGCGCTGTCGTTTTCAGCATATATCTTTTTCATCCCCGACAGAACTGTGGCTAGTTGCATTTCCTTCTCAGACATCATGTCTTTCTGACACCATCTC
TCTCATTGTCTTTGGCTCTCTACTCTTGGCCGCCTCTTGCCAGCTGAAGGCTCTTTCTCCAACTTTTCTGGCAATGACGACCTGGCATCTTGCTGTCGAGCATAACGTAA
GCCTGAGGTGTTTTTCCTCGACCTCCTATTTCCTTTGGCTTTTCGAGCATAGCGAGACTTCACTTTTTTATCACAGACAATTTTCTTCACCGGCATGTAAACGAGATTTT
TCCTCGTACATCTACCATTTTCTGGCAGCAGCACCACATGTTGGCTGTTGCTTCCTACTTTTTCACTTGGTCATCTTCTGAAATTAGAGGACAATTTCCTTGGTAGTTCT
CTGATCTTCGACATGTATGCTTAGTGCAGGAAGAGATGCCTTTGGCAAGAAAAAGAGAATGAAACACGAATGGATTGCATTCGGAAATCAGTACACCAGTAACAATTCAG
TTGATGGGGAGCTTGTAGCTTATGGTGCTTCCTTAACACCCACCTCATGTTTCTTCCTCTTCTTTGAGGGTTCTTTCTGTTATATTGGTGTATAGACTTGTTATTCTGAT
AGTTCCTTTGTAGTTGTCCTTTAATTGGCCCCTCTTTTAGGTTTTTTTTTTCTTGTTTCAATAGGTATATGTGCAACACTTTCAACACTGAACGAAATTTTTATGTTTCC
TATTTTTTTTAAAAAATAAAATAAAAGAAAAATCTAAAGGCTCCATGCTACGTTCTTAAATTCTTCACAGGAAAGAAGCTTATACCTGAGCCTAAATAGTTGGACACCAA
CTAAAAAGTTTGTTAGGTTTCAAGCTTCATGTAGGTCCGTGGATTTGGAATTGTTAACCATTGGGAGTCGACGCACTTTACAAAGATTTTAAACTTCATTAAAAGTTTCA
GTAGAGTTGTGGCTCCGATAATAACTTTCCATAATGTCCCCAAGGGGTGGTACTGTGGTTGAAGACTTGGGCTTTGAGGGGTACCCTCAAAGTCCTAAGTTCAAGACTCA
ACTGTGACATTATTCCTTCGATGTCTCCCGGTGCTTTGCCTAAGGACGAGCGTCGTTACCCTTGTTTAAAAAAAAAAATAATGGTCCATAATGGATATGATTGAACAAGG
GCCGTGAAATGAAAGGAAACTAACTCCAGATTTTAAAGCTGGGGATTTGGAGCAACCATCATGCAATGGTTTAGTAGTCAAAAAGGGTCTTGGAGATTAAGAGGTAAAAG
GTTCAATCCATGGTGGTTGCATACCAAGGAATTAATTTCCTAAGGGTTTTCTTGACATCCAAATGTTGTACGGTTAGACGATAAGTCGTGTGATATTAATCAAGGTGTGC
AAGTTGGTCCTCAAAGGGTGGTGCAATGGTTGAAGACTTGGGTTTTGAAGGAGTGCTTCCCTCAAGGTCTCAGGTTTGAGACTTACCTGTGATATTAATTTGTAGATGTC
TCCCGTATCTCCATTCCTTCGATGTCTCCCCGTGCTTGGCCTAAGGACGGGCATGGTTATCCTTGTTTCAAAAAAAAAATAAAAAAATCAAGGTGCGCAAGTTGACCCAA
ACAGACAAGATCCTAGCGAGGCCAATCCTTGGGACAATATCATCATCTTCTTTTAGTTATTTTTTCATATTCTAAACTGGTATGCATGGTGATTGCAGGCGTCGAGAGCA
AGTCCCATATAACAACAACACTTTGAAGAAGATGAAAGAAATTATAACCAAAGATGAGATACAAAATCTAACCTACGGGATGATTGAATGGTTTTAGAGCTTGACCATGC
TGCTGCTGCAACACCTGTATATTCCAACAAAAAGTGGATGACTGACATCACATGTGAGAGGAATCAAAATGGAGGTCCTTTATGTCTTTTCTACATATTTGCCACAACGG
ATTGCTCTTTTTCTTCAATGTTGTACAATTCTTACTTCCTTACAAGCTTCCAGTTTTGCAGATTATGTTGTAAAATTACCCTTCTGTACCCACCCTTGTCCTTGTTCTGA
TGGGAACATCTTACATCTTTCCCTTCAAGGTTTTTTAAATGCCCTATATTTATTTATTTAGTTTCCAACACATCATTTATTTAGTTTTTGGTTGCAGAGTGTCATTTCTA
ATTTACTGCATTGTTGTGTCCAATTATACAGGTTGTTGATTTGAATTCAACCACTTTACAT
Protein sequenceShow/hide protein sequence
MALELLAVPVDLLVLNVATASFNGLLAFLAFSQLIRIHMRSQQDGWTRQKAIHLMIGSSNLGYMIYFIFALVAIFELWHTWSHVLGFVLMAFPKILFLAAFLLVLSFWVD
LCHQANDDDDDDEEENNTRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLMVAVSILIWIGTGENPIDSTAVARVYEYFIAVTVLLSGGALGFY
GFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALIDLLTDIPLLYNWRFERTNGVEALVLLILYFCMGSLIPSAFLLWIMRELPPSKKNQRHEESRAIAFISHGE
ADVSPQGWTAVTLSKNQVSCLSPSKVSIFYFPYFFFAQHMGSWRF