CuGenDBv2

HG10001656 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10001656
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: tobamovirus multiplication protein 1 isoform X1
Genome location: Chr09:19169015..19176321
RNA-Seq Expression: HG10001656
Synteny: HG10001656
Gene Ontology terms:
  GO:0005774 - vacuolar membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR009457 - THH1/TOM1/TOM3 domain
  IPR040226 - THH1/TOM1/TOM3
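
The genome location above is given as a `chromosome:start..end` string. A minimal sketch of how such a string could be parsed is shown here; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Hypothetical helper names; the "Chr:start..end" layout is taken from this record.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'Chr09:19169015..19176321' style string into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    # Normalise so that start <= end, whatever order the source used.
    if start > end:
        start, end = end, start
    return match["chrom"], start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr09:19169015..19176321")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 7,307 bp on Chr09.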


Homology
GenBank top hits (e value, % identity, alignment)
XP_008456345.1 PREDICTED: tobamovirus multiplication protein 1 isoform X1 [Cucumis melo] | e value: 4.9e-181 | % identity: 94.07
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELL++NT+C+PLDLL+LDV MASFNGLLAFVAF QLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVF FVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDE+DDD DDEENNIRQ+LLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVV LVFLLMVAVSILIWIGAGKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DST VARVYEDFLAIT LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGV+A V+L LYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVN QGWTAVARSKNQ
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ

XP_022149697.1 tobamovirus multiplication protein 1 [Momordica charantia] | e value: 7.8e-171 | % identity: 89.01
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        ++L+ LAANTAC+PLDL+ILD AMASFNG+LAF+AF QLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA  +LW+C SHVFGFVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDE+DDD DDEEN+ +QALLENSKNKPGSSNVDGHRRCCGFPA HLGSRQK VIVVV+LVF+LMVAVS+LIWIGAG+NPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVARVYEDFLA+T LLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV ALVLLILYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD N Q W  V RSKNQV
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV

XP_022970675.1 uncharacterized protein LOC111469590 [Cucurbita maxima] | e value: 7.1e-172 | % identity: 90.7
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELLAANTA +P+DLL+L+VAMASFNGLLAFVAF QLIRIHMRSQQDGWTRQK LHLMI SSNLGYM YFIFALVAI    HCWSHVFGFVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDL HQAND+DD+DE DEENN RQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQKFVIVVVMLVF LMVAVSILIWIG GKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVA+VYE F+A+T LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGVKALVLL LYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV
        SLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVN QGWTAV RSKNQV
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV

XP_031744161.1 tobamovirus multiplication protein 1 [Cucumis sativus] | e value: 2.9e-181 | % identity: 94.07
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELL++NT+CLPLDLL+LDV MASFNGLLAFVAF QLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALV IIQLWHCWSHVF FVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDE+DDD DDEENNIRQ LLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVVMLVFLLMVAVSILIWIGAGKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVARVYEDFLAIT LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGV+A V+L LYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ
        SLIPSAFLLWIMRELPPPKK+QRQEESRAIAFISHGAAD N QGWTAVARSKNQ
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ

XP_038902574.1 tobamovirus multiplication protein 1 [Benincasa hispida] | e value: 8.3e-189 | % identity: 97.18
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELLAANTACLPLDLL+LDVAMAS NGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDE+DDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPA+HLGSRQK VI+VVMLVFLLMVAVSILIWIGAG+NPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVARVYEDFLAIT LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGVKALVLLILYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVN QGWTAVARSKNQ
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C2L2 tobamovirus multiplication protein 1 isoform X1 | e value: 2.4e-181 | % identity: 94.07
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELL++NT+C+PLDLL+LDV MASFNGLLAFVAF QLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVF FVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDE+DDD DDEENNIRQ+LLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVV LVFLLMVAVSILIWIGAGKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DST VARVYEDFLAIT LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGV+A V+L LYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVN QGWTAVARSKNQ
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQ

A0A1S3C336 tobamovirus multiplication protein 1 isoform X2 | e value: 1.2e-161 | % identity: 95.5
Query:  MRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENS
        MRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVF FVLMAFPKILFLAAFLLLLSFWVDLCHQANDE+DDD DDEENNIRQ+LLENS
Subjt:  MRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENS

Query:  KNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEA
        KNKPGSSNVDGHRRCCGFPAIHLGSRQK VIVVV LVFLLMVAVSILIWIGAGKNPIDST VARVYEDFLAIT LLSGGALGFYGFMLFYRLKKVRSEEA
Subjt:  KNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEA

Query:  SSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQ
        SSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGV+A V+L LYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVN Q
Subjt:  SSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQ

Query:  GWTAVARSKNQ
        GWTAVARSKNQ
Subjt:  GWTAVARSKNQ

A0A6J1D8P5 tobamovirus multiplication protein 1 | e value: 3.8e-171 | % identity: 89.01
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        ++L+ LAANTAC+PLDL+ILD AMASFNG+LAF+AF QLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA  +LW+C SHVFGFVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDE+DDD DDEEN+ +QALLENSKNKPGSSNVDGHRRCCGFPA HLGSRQK VIVVV+LVF+LMVAVS+LIWIGAG+NPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVARVYEDFLA+T LLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV ALVLLILYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD N Q W  V RSKNQV
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV

A0A6J1G615 uncharacterized protein LOC111451126 | e value: 1.3e-168 | % identity: 89.58
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELLAANTA +P+DLL L+VAMASFNGLLAFVAF QLIRIHMRSQQD WTRQK LHLMI SSNLGYM YFIFALVAI    HCWSHVFGFVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDL HQAND+DD+DE DEENN RQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQK VIVVVMLVF LMVAVSILIWIG GKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVA+VYE F+A+T LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWR +R NGVKALVLL LYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV
        SLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVN QGWTAV RSKNQV
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV

A0A6J1I3I8 uncharacterized protein LOC111469590 | e value: 3.4e-172 | % identity: 90.7
Query:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI
        MVLELLAANTA +P+DLL+L+VAMASFNGLLAFVAF QLIRIHMRSQQDGWTRQK LHLMI SSNLGYM YFIFALVAI    HCWSHVFGFVLMAFPKI
Subjt:  MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDL HQAND+DD+DE DEENN RQALLENSKNKPGSS+VDG+RRCCGFPAIHLGSRQKFVIVVVMLVF LMVAVSILIWIG GKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG
        DSTAVA+VYE F+A+T LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF+RTNGVKALVLL LYFCMG
Subjt:  DSTAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV
        SLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVN QGWTAV RSKNQV
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQGWTAVARSKNQV

SwissProt top hits (e value, % identity, alignment)
Q402F3 Tobamovirus multiplication protein 3 | e value: 6.1e-09 | % identity: 23.7
Query:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDE
        +A   G+++ VA  QLIRI MR  + GWT QK  H +  +  +  +   +FA    +Q  H    +   +L+  P + F   + LL+ FW ++ +QA   
Subjt:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDE

Query:  DDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGGA
                                 + + DG R     P+    +   +VI +++ + +    V +L+        I S         F A+ FLL GG 
Subjt:  DDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGGA

Query:  LGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKAL---VLLILYFCMGSLIPSAFLLWIMRELPPPKK
        L    F++  R   V S+    ++++VG +  +C  CF       L+  + + +N  F +   +  L   +L ++Y+ +  ++PS+ +L+I+R+LPP + 
Subjt:  LGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKAL---VLLILYFCMGSLIPSAFLLWIMRELPPPKK

Query:  IQRQEESR
        I +    R
Subjt:  IQRQEESR

Q402F4 Tobamovirus multiplication protein 1 | e value: 1.4e-10 | % identity: 24.08
Query:  LLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEDDDDED
        L++ VA  QLIRI +R  + GWT QK  HLM  +  +  +   +F     + L+H    V    ++  P +LF + F LL+ FW ++ HQA         
Subjt:  LLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEDDDDED

Query:  DEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGGALGF--Y
                  L   K +    +++G                         ++ +   + + +W        D++ V  + + F+A+   ++  ALGF  Y
Subjt:  DEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGGALGF--Y

Query:  GFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQ
        G  LF  L++  + S+    ++ +VG +  +C  CF  S  V +L+      +      + +   VL ++Y+ +  ++PSA +L+I+R+L PPK++  Q
Subjt:  GFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQ

Q948R8 Protein TOM THREE HOMOLOG 1 | e value: 1.5e-07 | % identity: 22.93
Query:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQ--LWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV + +    +    +   +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQ--LWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFL------LMVAVSILIWIGAGKNPIDSTAVARVYEDFLAI
                                   + + DG R     P+    +   +VI + + + L      LMV +S + + G                 F A+
Subjt:  DEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFL------LMVAVSILIWIGAGKNPIDSTAVARVYEDFLAI

Query:  TFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKAL---VLLILYFCMGSLIPSAFLLWIM
         FLL GG L    F++  R   V S+    ++++VG +  +C  CF       L+  I + ++  F     +  L   +L  +Y+ +  ++PS+ +L+I+
Subjt:  TFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKAL---VLLILYFCMGSLIPSAFLLWIM

Query:  RELPPPKKIQRQEE
        R+LPP + I +  +
Subjt:  RELPPPKKIQRQEE

Q9FEG2 Tobamovirus multiplication protein 1 | e value: 2.9e-11 | % identity: 25.82
Query:  AMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND
        A+     L++ VA  QLIRI MR  + GWT QK  HLM    N    + F F +    Q++        +VL+  P +LF +A+ LL+ FW ++ HQA  
Subjt:  AMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGG
           D                                           +   I V + V+L  + +   IW+       D++ V  V + F+A+   ++  
Subjt:  EDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGG

Query:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPP
        ALGF  YG  LF+ L++  + S+    ++ +VG +  +C  CF    +V     + +S   +    + +   VL ++Y+ +  ++PSA +L+I+R+L PP
Subjt:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPP

Query:  KKIQRQ
        K++  Q
Subjt:  KKIQRQ

Q9ZUM2 Tobamovirus multiplication protein 3 | e value: 1.5e-07 | % identity: 22.15
Query:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFAL-VAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND
        +A   G+++ VA  QL+RI +R  + GWT QK  H +    N    + F+F   V  +Q       +   +L+  P + F   + LL+ FW ++ +QA  
Subjt:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFAL-VAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGK-NPIDSTAVARVYEDFLAITFLLSG
           D                                        G R  F   +  +V+++ +A+ +++W    +   I S         F A+ FLL G
Subjt:  EDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGK-NPIDSTAVARVYEDFLAITFLLSG

Query:  GALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKI
        G L    F++  R   V S+    ++++VG +  +C  CF    ++          N      + +   +L  +Y+ +  ++PS+ +L+I+R+LPP + I
Subjt:  GALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKI

Query:  QRQEESR
         +  + R
Subjt:  QRQEESR

Arabidopsis top hits (e value, % identity, alignment)
AT1G14530.1 Protein of unknown function (DUF1084) | e value: 1.1e-08 | % identity: 22.93
Query:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQ--LWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV + +    +    +   +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQ--LWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFL------LMVAVSILIWIGAGKNPIDSTAVARVYEDFLAI
                                   + + DG R     P+    +   +VI + + + L      LMV +S + + G                 F A+
Subjt:  DEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFL------LMVAVSILIWIGAGKNPIDSTAVARVYEDFLAI

Query:  TFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKAL---VLLILYFCMGSLIPSAFLLWIM
         FLL GG L    F++  R   V S+    ++++VG +  +C  CF       L+  I + ++  F     +  L   +L  +Y+ +  ++PS+ +L+I+
Subjt:  TFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKAL---VLLILYFCMGSLIPSAFLLWIM

Query:  RELPPPKKIQRQEE
        R+LPP + I +  +
Subjt:  RELPPPKKIQRQEE

AT3G59090.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457) | e value: 3.0e-104 | % identity: 54.6
Query:  CLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLS
        C     + +++ +A  +  LAF+AF QL R H R++Q GWTRQK LHLMI SSN G +IYF+ A++A    WH WS+  GF+LMAFPKILFLA FLLLLS
Subjt:  CLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLS

Query:  FWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYED
        FWVD+CHQ N E+DDD DDEEN+I+Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF+LM++ +ILIWI +GKNP++S+ +A VY D
Subjt:  FWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYED

Query:  FLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWI
          A   L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT IPL Y+W   + +G+KALVLLI+Y+ +GS +P AF+LW+
Subjt:  FLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWI

Query:  MRELPPPKKIQRQEESRAIAFISHGAADVNSQG----WTAVARSKNQV
        +RELPP   + RQE++R I ++++       QG    W +   SKNQV
Subjt:  MRELPPPKKIQRQEESRAIAFISHGAADVNSQG----WTAVARSKNQV

AT3G59090.2 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457) | e value: 6.7e-104 | % identity: 54.31
Query:  CLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLS
        C     + +++ +A  +  LAF+AF QL R H R++Q GWTRQK LHLMI SSN G +IYF+ A++A    WH WS+  GF+LMAFPKILFLA FLLLLS
Subjt:  CLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLS

Query:  FWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYED
        FWVD+CHQ N E+DDD DDEEN+I+Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF+LM++ +ILIWI +GKNP++S+ +A VY D
Subjt:  FWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYED

Query:  FLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWI
          A   L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT IPL Y+W   + +G+KALVLLI+Y+ +GS +P AF+LW+
Subjt:  FLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWI

Query:  MRELPPPKKIQRQEESRAIAFISHGAADVNSQG----WTAVARSKNQV
        +RELPP   + RQE++R I ++++       QG    W +   SKNQ+
Subjt:  MRELPPPKKIQRQEESRAIAFISHGAADVNSQG----WTAVARSKNQV

AT3G59090.3 LOCATED IN: endomembrane system | e value: 2.4e-101 | % identity: 53.78
Query:  LLAANTACLPLDLLILDVAMA--SFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILF
        +L A++  + + LL + + ++  SF  +LA   F+QL R H R++Q GWTRQK LHLMI SSN G +IYF+ A++A    WH WS+  GF+LMAFPKILF
Subjt:  LLAANTACLPLDLLILDVAMA--SFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDS
        LA FLLLLSFWVD+CHQ N E+DDD DDEEN+I+Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF+LM++ +ILIWI +GKNP++S
Subjt:  LAAFLLLLSFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDS

Query:  TAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSL
        + +A VY D  A   L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT IPL Y+W   + +G+KALVLLI+Y+ +GS 
Subjt:  TAVARVYEDFLAITFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSL

Query:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQG----WTAVARSKNQV
        +P AF+LW++RELPP   + RQE++R I ++++       QG    W +   SKNQV
Subjt:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNSQG----WTAVARSKNQV

AT4G21790.1 tobamovirus multiplication 1 | e value: 2.1e-12 | % identity: 25.82
Query:  AMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND
        A+     L++ VA  QLIRI MR  + GWT QK  HLM    N    + F F +    Q++        +VL+  P +LF +A+ LL+ FW ++ HQA  
Subjt:  AMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGG
           D                                           +   I V + V+L  + +   IW+       D++ V  V + F+A+   ++  
Subjt:  EDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLSGG

Query:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPP
        ALGF  YG  LF+ L++  + S+    ++ +VG +  +C  CF    +V     + +S   +    + +   VL ++Y+ +  ++PSA +L+I+R+L PP
Subjt:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPP

Query:  KKIQRQ
        K++  Q
Subjt:  KKIQRQ
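
In the alignments above, the unlabeled middle line of each block follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch, identities and positives can be counted from the query line and that middle line alone. The function names below are hypothetical, the inputs are assumed to have the `Query:` label and leading padding already removed, and the result is not guaranteed to reproduce the % identity values reported on this page, which may be computed over a different alignment length.

```python
def midline_counts(query, midline):
    """Count identities and positives from a BLAST-style midline.

    Returns (identical, positives, columns), where columns is the number of
    aligned columns in this block (the length of the query line).
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Trailing blanks are often stripped from the midline; restore them.
    midline = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, columns


def percent(part, whole):
    """Percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# First ten columns of the first GenBank alignment block.
identical, positives, columns = midline_counts("MVLELLAANT", "MVLELL++NT")
print(percent(identical, columns), percent(positives, columns))
```

For a full hit, the counts from each block would be summed before taking the percentage.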


Sequences
CDS sequence
ATGGTGCTGGAATTACTTGCCGCCAACACCGCTTGCCTTCCTCTCGACCTTCTTATTCTCGATGTAGCTATGGCTTCCTTCAATGGCCTTCTTGCTTTCGTCGCCTTCTG
GCAGCTCATCAGAATTCACATGCGGAGTCAACAAGATGGATGGACACGTCAAAAAGCACTCCATCTGATGATAGGCTCATCTAACTTGGGCTATATGATCTATTTCATAT
TTGCACTTGTTGCTATTATTCAGCTCTGGCACTGCTGGTCTCATGTGTTTGGATTTGTCCTCATGGCCTTCCCTAAAATACTGTTTCTTGCTGCTTTTCTCCTACTTCTT
TCTTTCTGGGTCGACCTTTGCCATCAGGCAAACGATGAAGATGACGACGACGAGGATGATGAAGAAAATAACATTCGACAGGCCTTGTTGGAAAATTCAAAGAATAAACC
TGGTTCATCAAATGTAGATGGGCATCGAAGATGTTGTGGATTTCCTGCTATTCATCTTGGAAGCAGGCAAAAATTTGTGATTGTGGTTGTCATGCTGGTATTTTTACTCA
TGGTTGCAGTTTCTATTCTGATCTGGATTGGGGCAGGGAAAAATCCTATTGATTCTACAGCTGTTGCCAGAGTGTATGAGGACTTTCTTGCTATTACATTTCTCCTATCA
GGAGGAGCCCTAGGCTTCTATGGTTTCATGTTATTTTACAGATTGAAAAAAGTACGTTCTGAGGAAGCTTCTTCAGAGATGAAGAAGGTTGGTGGTCTAGCAGTTGTCTG
TGTTGTATGTTTTACATCAAGTGCTTTGGTAGATCTTCTTACGGATATTCCTCTTTCCTATAATTGGCGCTTCAGGAGAACAAATGGAGTAAAAGCACTAGTCCTTTTGA
TTTTGTACTTCTGCATGGGTTCTCTGATTCCATCAGCGTTTTTGTTGTGGATTATGAGAGAGCTTCCACCTCCTAAAAAAATTCAAAGACAAGAAGAATCAAGGGCAATT
GCTTTTATAAGCCATGGGGCAGCTGATGTAAATTCTCAGGGTTGGACTGCTGTAGCTCGTTCAAAGAATCAGGTAGTACCATTGAAGGTGCATTTAAAGGCCATCCTTGC
TCTGTTCTTAAATTCTTCACGGAAAAGAGGCGTCCAGAGCAAGTCCCATATAATGAAAACATTTAAAGAGACGCCACTGATTAAGGGGCTGCTTTGTGTTACATGA
mRNA sequence
ATGGTGCTGGAATTACTTGCCGCCAACACCGCTTGCCTTCCTCTCGACCTTCTTATTCTCGATGTAGCTATGGCTTCCTTCAATGGCCTTCTTGCTTTCGTCGCCTTCTG
GCAGCTCATCAGAATTCACATGCGGAGTCAACAAGATGGATGGACACGTCAAAAAGCACTCCATCTGATGATAGGCTCATCTAACTTGGGCTATATGATCTATTTCATAT
TTGCACTTGTTGCTATTATTCAGCTCTGGCACTGCTGGTCTCATGTGTTTGGATTTGTCCTCATGGCCTTCCCTAAAATACTGTTTCTTGCTGCTTTTCTCCTACTTCTT
TCTTTCTGGGTCGACCTTTGCCATCAGGCAAACGATGAAGATGACGACGACGAGGATGATGAAGAAAATAACATTCGACAGGCCTTGTTGGAAAATTCAAAGAATAAACC
TGGTTCATCAAATGTAGATGGGCATCGAAGATGTTGTGGATTTCCTGCTATTCATCTTGGAAGCAGGCAAAAATTTGTGATTGTGGTTGTCATGCTGGTATTTTTACTCA
TGGTTGCAGTTTCTATTCTGATCTGGATTGGGGCAGGGAAAAATCCTATTGATTCTACAGCTGTTGCCAGAGTGTATGAGGACTTTCTTGCTATTACATTTCTCCTATCA
GGAGGAGCCCTAGGCTTCTATGGTTTCATGTTATTTTACAGATTGAAAAAAGTACGTTCTGAGGAAGCTTCTTCAGAGATGAAGAAGGTTGGTGGTCTAGCAGTTGTCTG
TGTTGTATGTTTTACATCAAGTGCTTTGGTAGATCTTCTTACGGATATTCCTCTTTCCTATAATTGGCGCTTCAGGAGAACAAATGGAGTAAAAGCACTAGTCCTTTTGA
TTTTGTACTTCTGCATGGGTTCTCTGATTCCATCAGCGTTTTTGTTGTGGATTATGAGAGAGCTTCCACCTCCTAAAAAAATTCAAAGACAAGAAGAATCAAGGGCAATT
GCTTTTATAAGCCATGGGGCAGCTGATGTAAATTCTCAGGGTTGGACTGCTGTAGCTCGTTCAAAGAATCAGGTAGTACCATTGAAGGTGCATTTAAAGGCCATCCTTGC
TCTGTTCTTAAATTCTTCACGGAAAAGAGGCGTCCAGAGCAAGTCCCATATAATGAAAACATTTAAAGAGACGCCACTGATTAAGGGGCTGCTTTGTGTTACATGA
Protein sequence
MVLELLAANTACLPLDLLILDVAMASFNGLLAFVAFWQLIRIHMRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIIQLWHCWSHVFGFVLMAFPKILFLAAFLLLL
SFWVDLCHQANDEDDDDEDDEENNIRQALLENSKNKPGSSNVDGHRRCCGFPAIHLGSRQKFVIVVVMLVFLLMVAVSILIWIGAGKNPIDSTAVARVYEDFLAITFLLS
GGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFRRTNGVKALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAI
AFISHGAADVNSQGWTAVARSKNQVVPLKVHLKAILALFLNSSRKRGVQSKSHIMKTFKETPLIKGLLCVT
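
The protein sequence above should correspond to the CDS sequence translated with the standard genetic code. The sketch below is one way to check that; it is not the database's own pipeline. The name `translate` is hypothetical, the codon table is built from the standard code (NCBI translation table 1), and translation is assumed to start at the first base and stop at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGTGCTGGAATTACTTGCCGCCAACACC"))
```

Pasting the full CDS, line breaks included, into `translate` and comparing the result with the protein sequence gives a quick consistency check on the record.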