CuGenDBv2

Tan0007435 (gene) of Snake gourd v1 genome

Gene ID: Tan0007435
Organism: Trichosanthes anguina (Snake gourd v1)
Description: tobamovirus multiplication protein 1 isoform X1
Genome location: LG05:4618630..4630102
RNA-Seq Expression: Tan0007435
Synteny: Tan0007435
Gene Ontology terms:
GO:0005774 - vacuolar membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
IPR009457 - THH1/TOM1/TOM3 domain
IPR040226 - THH1/TOM1/TOM3
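
The genome location string above packs a sequence name and a coordinate range into one field. A minimal Python sketch of splitting it apart follows. The names `Locus` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'SEQID:START..END' string such as 'LG05:4618630..4630102'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("LG05:4618630..4630102")
print(locus.seqid, locus.start, locus.end, locus.span)
```

Under the inclusive-coordinate assumption the gene spans 11473 bp on LG05, considerably longer than the mRNA listed further down, which is consistent with the gene containing introns.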


Homology
GenBank top hits (e value, % identity, alignment)
XP_008456345.1 PREDICTED: tobamovirus multiplication protein 1 isoform X1 [Cucumis melo] (e value: 1.7e-177; identity: 90.03%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELL++NT+C+P+DLLVL+V MASFNGLLAFVAFSQLIRIH+RSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAI +LWHCWSH F FVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDEE DDDDDEENN RQ+LLENSKNKPGSSNV+GHRRCCGFPAIHLGSRQK VIVVV LVF L VAVSILIWIGAGKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DST VARVYEDFLA+  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGV+  V+L LYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAV RSKNQ S+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

XP_022970675.1 uncharacterized protein LOC111469590 [Cucurbita maxima] (e value: 1.5e-173; identity: 90.03%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELLAANTA VPVDLLVLNVAMASFNGLLAFVAFSQLIRIH+RSQQDGWTRQK LHLMI SSNLGYM YFIFALVAIF+  HCWSH FGFVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDL HQAND +DD+D+DEENN RQALLENSKNKPGSS+V+G+RRCCGFPAIHLGSRQKFVIVVVMLVF L VAVSILIWIG GKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DSTAVA+VYE F+AV  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGVK LVLL LYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQ S+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

XP_023534011.1 uncharacterized protein LOC111795688 [Cucurbita pepo subsp. pepo] (e value: 7.0e-171; identity: 88.95%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELLAANTA VPVDLLVLNVAMASFNGLLAFVAFSQLIRIH+RSQQD WTRQK LHLMI SSNLGYM YFIFALVAIF+  HCWSH FGFVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDD-DDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNP
        LFLAAFLLLLSFWVDL HQAND++DDD D+DEENN RQALLENSKNKPGSS+V+G+RRCCGFPAIHLGSRQK VIVVVMLV  L VAVSILIWIG GKNP
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDD-DDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNP

Query:  IDSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCM
        IDSTAVA+VYE F+AV  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK  NGVK LVLL LYFCM
Subjt:  IDSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCM

Query:  GSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        GSLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQ S+ASPI
Subjt:  GSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

XP_031744161.1 tobamovirus multiplication protein 1 [Cucumis sativus] (e value: 2.9e-177; identity: 89.75%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELL++NT+C+P+DLLVL+V MASFNGLLAFVAFSQLIRIH+RSQQDGWTRQKALHLMIGSSNLGYMIYFIFALV I +LWHCWSH F FVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDEE DDDDDEENN RQ LLENSKNKPGSSNV+GHRRCCGFPAIHLGSRQK VIVVVMLVF L VAVSILIWIGAGKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DSTAVARVYEDFLA+  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGV+  V+L LYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFLLWIMRELPPPKK+QRQEESRAIAFISHGAAD NPQGWTAV RSKNQ S+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

XP_038902574.1 tobamovirus multiplication protein 1 [Benincasa hispida] (e value: 1.8e-182; identity: 92.24%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELLAANTAC+P+DLLVL+VAMAS NGLLAFVAF QLIRIH+RSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAI +LWHCWSH FGFVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDEEDDD+DDEENN RQALLENSKNKPGSSNV+GHRRCCGFPA+HLGSRQK VI+VVMLVF L VAVSILIWIGAG+NPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DSTAVARVYEDFLA+  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGVK LVLLILYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAV RSKNQAS+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C2L2 tobamovirus multiplication protein 1 isoform X1 (e value: 8.3e-178; identity: 90.03%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELL++NT+C+P+DLLVL+V MASFNGLLAFVAFSQLIRIH+RSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAI +LWHCWSH F FVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDLCHQANDEE DDDDDEENN RQ+LLENSKNKPGSSNV+GHRRCCGFPAIHLGSRQK VIVVV LVF L VAVSILIWIGAGKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DST VARVYEDFLA+  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGV+  V+L LYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAV RSKNQ S+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

A0A1S3C336 tobamovirus multiplication protein 1 isoform X2 (e value: 2.5e-158; identity: 91.19%)
Query:  IRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENS
        +RSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAI +LWHCWSH F FVL+AFPKILFLAAFLLLLSFWVDLCHQANDEE DDDDDEENN RQ+LLENS
Subjt:  IRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENS

Query:  KNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEA
        KNKPGSSNV+GHRRCCGFPAIHLGSRQK VIVVV LVF L VAVSILIWIGAGKNPIDST VARVYEDFLA+  LLSGGALGFYGFMLFYRLKKVRSEEA
Subjt:  KNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEA

Query:  SSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQ
        SSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGV+  V+L LYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQ
Subjt:  SSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQ

Query:  GWTAVTRSKNQASKASPI
        GWTAV RSKNQ S+ASPI
Subjt:  GWTAVTRSKNQASKASPI

A0A6J1D8P5 tobamovirus multiplication protein 1 (e value: 5.8e-171; identity: 87.74%)
Query:  LELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILF
        L+ LAANTACVP+DL++L+ AMASFNG+LAF+AFSQLIRIH+R QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA FELW+C SH FGFVL+AFPKILF
Subjt:  LELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDS
        LAAFLLLLSFWVDLCHQANDEE DDDDDEEN+ +QALLENSKNKPGSSNV+GHRRCCGFPA HLGSRQK VIVVV+LVF L VAVS+LIWIGAG+NPIDS
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDS

Query:  TAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSL
        TAVARVYEDFLAV  LLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRF  TNGV  LVLLILYFCMGSL
Subjt:  TAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSL

Query:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  VTRSKNQ ++ASPI
Subjt:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

A0A6J1G615 uncharacterized protein LOC111451126 (e value: 2.9e-170; identity: 88.64%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELLAANTA VPVDLL LNVAMASFNGLLAFVAFSQLIRIH+RSQQD WTRQK LHLMI SSNLGYM YFIFALVAIF+  HCWSH FGFVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDL HQAND +DD+D+DEENN RQALLENSKNKPGSS+V+G+RRCCGFPAIHLGSRQK VIVVVMLVF L VAVSILIWIG GKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DSTAVA+VYE F+AV  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWR K  NGVK LVLL LYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQ S+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

A0A6J1I3I8 uncharacterized protein LOC111469590 (e value: 7.3e-174; identity: 90.03%)
Query:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI
        M LELLAANTA VPVDLLVLNVAMASFNGLLAFVAFSQLIRIH+RSQQDGWTRQK LHLMI SSNLGYM YFIFALVAIF+  HCWSH FGFVL+AFPKI
Subjt:  MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKI

Query:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI
        LFLAAFLLLLSFWVDL HQAND +DD+D+DEENN RQALLENSKNKPGSS+V+G+RRCCGFPAIHLGSRQKFVIVVVMLVF L VAVSILIWIG GKNPI
Subjt:  LFLAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPI

Query:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG
        DSTAVA+VYE F+AV  LLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLT+IPLSYNWRFK TNGVK LVLL LYFCMG
Subjt:  DSTAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI
        SLIPSAFL+W MRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQ S+ASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI

SwissProt top hits (e value, % identity, alignment)
Q402F3 Tobamovirus multiplication protein 3 (e value: 5.1e-07; identity: 22.98%)
Query:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIF--ELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ VA  QLIRI +R  + GWT QK  H       L +++  + +LV  F  ++          +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIF--ELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSG
            D                   +P    + G                  V+ V+ ++  L     I+ W       I S         F A+ FLL G
Subjt:  DEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSG

Query:  GALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKST--NGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPK
        G L    F++  R   V S+    ++++VG +  +C  CF       L+  + + +N   K+   + +   +L ++Y+ +  ++PS+ +L+I+R+LPP +
Subjt:  GALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKST--NGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPK

Query:  KIQRQEESR
         I +    R
Subjt:  KIQRQEESR

Q402F4 Tobamovirus multiplication protein 1 (e value: 9.0e-12; identity: 26.07%)
Query:  LLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFE----LWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDEED
        L++ VA  QLIRI +R  + GWT QK  HLM       +++  + A+V  F     L+H         ++  P +LF + F LL+ FW ++ HQA     
Subjt:  LLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFE----LWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDEED

Query:  DDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGGALG
                      L   K +    ++ G        AI+                     +   IW+    N  D++ V  + + F+AV   ++  ALG
Subjt:  DDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGGALG

Query:  F--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKI
        F  YG  LF  L++  + S+    ++ +VG +  +C  CF  S  V +L+           S + +   VL ++Y+ +  ++PSA +L+I+R+L PPK++
Subjt:  F--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKI

Query:  QRQ
          Q
Subjt:  QRQ

Q948R8 Protein TOM THREE HOMOLOG 1 (e value: 7.9e-08; identity: 22.76%)
Query:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIF--ELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV +F  +  +        +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIF--ELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSG
            D                                        G R  F   +  +V+ + +A+ +++W      P+    +  + + F A   L + 
Subjt:  DEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSG

Query:  GALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKEL---VLLILYFCMGSLIPSAFLLWIMRE
         ALGF  YG  LF  L++  V S+    ++++VG +  +C  CF       L+  I + ++  F     +  L   +L  +Y+ +  ++PS+ +L+I+R+
Subjt:  GALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKEL---VLLILYFCMGSLIPSAFLLWIMRE

Query:  LPPPKKIQRQEE
        LPP + I +  +
Subjt:  LPPPKKIQRQEE

Q9FEG2 Tobamovirus multiplication protein 1 (e value: 2.0e-11; identity: 25.49%)
Query:  AMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAND
        A+     L++ VA  QLIRI +R  + GWT QK  HLM    N    + F F +    +++     A  +VL+  P +LF +A+ LL+ FW ++ HQA  
Subjt:  AMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGG
           D                                           +   I V + V+   + +   IW+       D++ V  V + F+AV   ++  
Subjt:  EEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGG

Query:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPP
        ALGF  YG  LF+ L++  + S+    ++ +VG +  +C  CF    +V     + +S   +  + + +   VL ++Y+ +  ++PSA +L+I+R+L PP
Subjt:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPP

Query:  KKIQRQ
        K++  Q
Subjt:  KKIQRQ

Q9ZUM2 Tobamovirus multiplication protein 3 (e value: 3.5e-08; identity: 22.22%)
Query:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDE
        +A   G+++ VA  QL+RI +R  + GWT QK  H +    N    + F+F     F       H    +L+  P + F   + LL+ FW ++ +QA   
Subjt:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDE

Query:  EDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGK-NPIDSTAVARVYEDFLAVAFLLSGG
          D                                        G R  F   +  +V+ + +A+ +++W    +   I S         F A+ FLL GG
Subjt:  EDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGK-NPIDSTAVARVYEDFLAVAFLLSGG

Query:  ALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQ
         L    F++  R   V S+    ++++VG +  +C  CF    ++          N      + +   +L  +Y+ +  ++PS+ +L+I+R+LPP + I 
Subjt:  ALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQ

Query:  RQEESR
        +  + R
Subjt:  RQEESR

Arabidopsis top hits (e value, % identity, alignment)
AT2G02180.1 tobamovirus multiplication protein 3 (e value: 2.5e-09; identity: 22.22%)
Query:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDE
        +A   G+++ VA  QL+RI +R  + GWT QK  H +    N    + F+F     F       H    +L+  P + F   + LL+ FW ++ +QA   
Subjt:  MASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQANDE

Query:  EDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGK-NPIDSTAVARVYEDFLAVAFLLSGG
          D                                        G R  F   +  +V+ + +A+ +++W    +   I S         F A+ FLL GG
Subjt:  EDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGK-NPIDSTAVARVYEDFLAVAFLLSGG

Query:  ALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQ
         L    F++  R   V S+    ++++VG +  +C  CF    ++          N      + +   +L  +Y+ +  ++PS+ +L+I+R+LPP + I 
Subjt:  ALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQ

Query:  RQEESR
        +  + R
Subjt:  RQEESR

AT3G59090.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457) (e value: 3.4e-107; identity: 55.37%)
Query:  CVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLS
        C     + +N+ +A  +  LAF+AF QL R H R++Q GWTRQK LHLMI SSN G +IYF+ A++A    WH WS+A GF+L+AFPKILFLA FLLLLS
Subjt:  CVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLS

Query:  FWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYED
        FWVD+CHQ N EE DDDDDEEN+ +Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF L ++ +ILIWI +GKNP++S+ +A VY D
Subjt:  FWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYED

Query:  FLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWI
          A   L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT+IPL Y+W     +G+K LVLLI+Y+ +GS +P AF+LW+
Subjt:  FLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWI

Query:  MRELPPPKKIQRQEESRAIAFISHGAADVNPQG----WTAVTRSKNQASKASPI
        +RELPP   + RQE++R I ++++      PQG    W + T SKNQ SKASPI
Subjt:  MRELPPPKKIQRQEESRAIAFISHGAADVNPQG----WTAVTRSKNQASKASPI

AT3G59090.2 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457) (e value: 3.5e-104; identity: 54.76%)
Query:  CVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLS
        C     + +N+ +A  +  LAF+AF QL R H R++Q GWTRQK LHLMI SSN G +IYF+ A++A    WH WS+A GF+L+AFPKILFLA FLLLLS
Subjt:  CVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLS

Query:  FWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYED
        FWVD+CHQ N EE DDDDDEEN+ +Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF L ++ +ILIWI +GKNP++S+ +A VY D
Subjt:  FWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYED

Query:  FLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWI
          A   L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT+IPL Y+W     +G+K LVLLI+Y+ +GS +P AF+LW+
Subjt:  FLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWI

Query:  MRELPPPKKIQRQEESRAIAFISHGAADVNPQG----WTAVTRSKNQ
        +RELPP   + RQE++R I ++++      PQG    W + T SKNQ
Subjt:  MRELPPPKKIQRQEESRAIAFISHGAADVNPQG----WTAVTRSKNQ

AT3G59090.3 LOCATED IN: endomembrane system (e value: 1.8e-103; identity: 54.55%)
Query:  LLAANTACVPVDLLVLNVAMA--SFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILF
        +L A++  + V LL + + ++  SF  +LA   F QL R H R++Q GWTRQK LHLMI SSN G +IYF+ A++A    WH WS+A GF+L+AFPKILF
Subjt:  LLAANTACVPVDLLVLNVAMA--SFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDS
        LA FLLLLSFWVD+CHQ N EE DDDDDEEN+ +Q LLE SK+KPGSSN    R+CC F  IH+G+RQKFV+  ++LVF L ++ +ILIWI +GKNP++S
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDS

Query:  TAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSL
        + +A VY D  A   L++GG + FYG  L + L+KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT+IPL Y+W     +G+K LVLLI+Y+ +GS 
Subjt:  TAVARVYEDFLAVAFLLSGGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSL

Query:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQG----WTAVTRSKNQASKASPI
        +P AF+LW++RELPP   + RQE++R I ++++      PQG    W + T SKNQ SKASPI
Subjt:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQG----WTAVTRSKNQASKASPI

AT4G21790.1 tobamovirus multiplication 1 (e value: 1.4e-12; identity: 25.49%)
Query:  AMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAND
        A+     L++ VA  QLIRI +R  + GWT QK  HLM    N    + F F +    +++     A  +VL+  P +LF +A+ LL+ FW ++ HQA  
Subjt:  AMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGG
           D                                           +   I V + V+   + +   IW+       D++ V  V + F+AV   ++  
Subjt:  EEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLSGG

Query:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPP
        ALGF  YG  LF+ L++  + S+    ++ +VG +  +C  CF    +V     + +S   +  + + +   VL ++Y+ +  ++PSA +L+I+R+L PP
Subjt:  ALGF--YGFMLFYRLKK--VRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPP

Query:  KKIQRQ
        K++  Q
Subjt:  KKIQRQ
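
Each alignment block in the homology listings has a Query line, a match line, and a Subjt line. In the usual BLAST convention, which is assumed here, the match line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The Python sketch below counts identities and positives for one block under that assumption. `midline_stats` is a hypothetical helper, and the sample strings are the final block of the XP_008456345.1 alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes the BLAST convention: the match line repeats the residue on an
    identity, shows '+' on a conservative substitution, and is blank otherwise.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # Positives are identities plus conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAVTRSKNQASKASPI"
match = "SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADVNPQGWTAV RSKNQ S+ASPI"

identities, positives, length = midline_stats(query, match)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

The % identity values in the hit lines refer to whole alignments, so summing the counts over every block of a hit would be needed to compare against them.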


Sequences
CDS sequence
ATGGCGCTGGAATTACTTGCCGCCAACACCGCTTGCGTTCCCGTCGACCTTCTCGTTCTCAATGTAGCTATGGCTTCCTTCAATGGCCTTCTTGCTTTCGTCGCCTTCTC
GCAGCTCATCAGAATTCACATCCGGAGTCAACAGGATGGATGGACACGTCAAAAAGCACTCCATCTGATGATCGGCTCATCTAACTTAGGCTATATGATCTATTTCATAT
TTGCACTTGTTGCTATTTTTGAGCTCTGGCACTGCTGGTCTCATGCGTTTGGATTTGTACTCATCGCCTTCCCTAAAATACTGTTTCTTGCAGCTTTTCTCCTACTTCTT
TCTTTCTGGGTCGACCTTTGCCATCAGGCAAACGATGAAGAGGATGACGACGACGACGATGAAGAAAATAACCCTCGTCAGGCCTTGTTGGAAAATTCAAAGAACAAACC
TGGTTCATCAAATGTAGAAGGCCATCGGAGATGTTGTGGATTTCCTGCTATTCATCTTGGAAGTAGGCAAAAATTTGTAATTGTGGTTGTCATGCTGGTATTTTCCCTCA
CGGTCGCAGTTTCCATTCTGATCTGGATTGGGGCAGGGAAAAATCCTATTGATTCTACAGCTGTTGCCAGGGTGTATGAAGACTTTCTTGCTGTTGCATTTCTCCTATCA
GGAGGAGCCCTAGGCTTCTATGGTTTCATGTTATTTTACAGATTGAAAAAAGTACGTTCTGAGGAAGCTTCTTCAGAGATGAAGAAGGTTGGTGGTCTAGCAGTTGTCTG
TGTTGTGTGTTTTACATCAAGTGCTTTGGTAGATCTTCTTACAAATATTCCTCTTTCCTATAATTGGCGCTTCAAGAGCACAAATGGAGTAAAAGAACTAGTCCTTTTGA
TTTTGTACTTCTGTATGGGTTCTTTGATTCCATCAGCCTTTTTATTGTGGATTATGAGAGAGTTGCCACCTCCTAAAAAAATTCAAAGACAAGAAGAATCGAGGGCAATT
GCTTTTATAAGCCATGGGGCAGCTGATGTAAATCCTCAGGGTTGGACTGCTGTAACTCGTTCAAAGAATCAGGCGTCCAAAGCAAGCCCCATATAA
mRNA sequence
CCAAAATCAGAGACGTACACCGTCGCCGACAAGACTTGCAGAAAAGCAAATCAGCGACACACCCTTTCTTTCTGCTTCTTCTCTTTCACTTCTCTTCTTATCTACCCGTA
CGCTCAGATTGTTTCCTCTGTTTCCCGCCGATGCAACACGCCTCGCCGCCATCATCATCATCCTCCTCCTCAACTTTCCACACGATCTGAAACAAGTGCTTCATTTCACC
GTTCAAGATGGCGCTGGAATTACTTGCCGCCAACACCGCTTGCGTTCCCGTCGACCTTCTCGTTCTCAATGTAGCTATGGCTTCCTTCAATGGCCTTCTTGCTTTCGTCG
CCTTCTCGCAGCTCATCAGAATTCACATCCGGAGTCAACAGGATGGATGGACACGTCAAAAAGCACTCCATCTGATGATCGGCTCATCTAACTTAGGCTATATGATCTAT
TTCATATTTGCACTTGTTGCTATTTTTGAGCTCTGGCACTGCTGGTCTCATGCGTTTGGATTTGTACTCATCGCCTTCCCTAAAATACTGTTTCTTGCAGCTTTTCTCCT
ACTTCTTTCTTTCTGGGTCGACCTTTGCCATCAGGCAAACGATGAAGAGGATGACGACGACGACGATGAAGAAAATAACCCTCGTCAGGCCTTGTTGGAAAATTCAAAGA
ACAAACCTGGTTCATCAAATGTAGAAGGCCATCGGAGATGTTGTGGATTTCCTGCTATTCATCTTGGAAGTAGGCAAAAATTTGTAATTGTGGTTGTCATGCTGGTATTT
TCCCTCACGGTCGCAGTTTCCATTCTGATCTGGATTGGGGCAGGGAAAAATCCTATTGATTCTACAGCTGTTGCCAGGGTGTATGAAGACTTTCTTGCTGTTGCATTTCT
CCTATCAGGAGGAGCCCTAGGCTTCTATGGTTTCATGTTATTTTACAGATTGAAAAAAGTACGTTCTGAGGAAGCTTCTTCAGAGATGAAGAAGGTTGGTGGTCTAGCAG
TTGTCTGTGTTGTGTGTTTTACATCAAGTGCTTTGGTAGATCTTCTTACAAATATTCCTCTTTCCTATAATTGGCGCTTCAAGAGCACAAATGGAGTAAAAGAACTAGTC
CTTTTGATTTTGTACTTCTGTATGGGTTCTTTGATTCCATCAGCCTTTTTATTGTGGATTATGAGAGAGTTGCCACCTCCTAAAAAAATTCAAAGACAAGAAGAATCGAG
GGCAATTGCTTTTATAAGCCATGGGGCAGCTGATGTAAATCCTCAGGGTTGGACTGCTGTAACTCGTTCAAAGAATCAGGCGTCCAAAGCAAGCCCCATATAATGAAATA
TTTAAGGACACGCCAGTGATTAAGGCTGCTTGCTTGTGTGTTACATCACATGAAACTGAAAGAAATTATGACCAGAGATACAAAATCTAACATACGGGATGAAAGAATGT
TGAGAGTTTGAGCATGCTGTGACACCTGTATATTTCAACCAAAGTTGATGACTGACTTCACATGTGAGAATAATCAAAACTGCAGCCGCATATATGATGGAGGTTTTTAT
ATGTTTTCCTTTTTGCCTTTTCTTTTCCCCGCTCTTCTTCTGCACATAGTAGCCAAGAGATTGCTCGTTTTCTTTCAGGGGAAAAATGTTGTACAATTTTTGCTTCCCTA
GAAGCTTCAAGTTTTGCAGATTATGTTGTAAATTTACCTTTCTGTACCGCCCTAGTTCTGATGGCAACATATATTACAATGGTTGAGTTTGATAAGCCAAAGGGTATAAT
ACATTAACTACCAATTTCTATTATGCCAATGCAAAATCTCTTTAAGAAACAG
Protein sequence
MALELLAANTACVPVDLLVLNVAMASFNGLLAFVAFSQLIRIHIRSQQDGWTRQKALHLMIGSSNLGYMIYFIFALVAIFELWHCWSHAFGFVLIAFPKILFLAAFLLLL
SFWVDLCHQANDEEDDDDDDEENNPRQALLENSKNKPGSSNVEGHRRCCGFPAIHLGSRQKFVIVVVMLVFSLTVAVSILIWIGAGKNPIDSTAVARVYEDFLAVAFLLS
GGALGFYGFMLFYRLKKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTNIPLSYNWRFKSTNGVKELVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAI
AFISHGAADVNPQGWTAVTRSKNQASKASPI
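
The CDS and protein sequences above can be cross-checked by translating the CDS. The Python sketch below assumes the standard nuclear genetic code (NCBI translation table 1). The `translate` function is a hypothetical helper written for this illustration, and only the first 30 nucleotides and the last 9 nucleotides of the CDS are used as sample input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


cds_start = "ATGGCGCTGGAATTACTTGCCGCCAACACC"  # first 30 nt of the CDS
cds_end = "CCCATATAA"                         # last 9 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS into `translate` and dropping the trailing `*` should reproduce the protein sequence if the record is internally consistent.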