CuGenDBv2

Lag0014616 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014616
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: tobamovirus multiplication protein 1-like
Genome location: chr12:2671502..2682116
RNA-Seq Expression: Lag0014616
Synteny: Lag0014616
Gene Ontology terms: GO:0005774 - vacuolar membrane (cellular component)
                     GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR009457 - THH1/TOM1/TOM3 domain
                  IPR040226 - THH1/TOM1/TOM3
                  IPR043502 - DNA/RNA polymerase superfamily
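The genome location field above uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state), the string can be split into its parts and the genomic span computed as follows. The function name `parse_location` is hypothetical and is not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr12:2671502..2682116")
# Assumption: both endpoints are included in the gene span.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span covers the whole gene region on the chromosome, which is typically longer than the CDS because it can include introns.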


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588768.1 Tobamovirus multiplication protein 1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.8e-132 | % identity: 69.92
Query:  SLSRAALATKNKKPFSSCDLLFPILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRL
        SL   AL  K+ + F      F I    M R L I SAVGGFE WN IDES+ WQKGIY  LSASYGLIS+IAL                          
Subjt:  SLSRAALATKNKKPFSSCDLLFPILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRL

Query:  FNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVL
                                     VQLIRIQ+RVPEFEWTTQKGFHLMNF+VNGLRAILFGLYKSVF IRPK LEMVVME+PGLLFFSTYTLLVL
Subjt:  FNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVL

Query:  FWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTI
        FWAEIY+QARSLPID+LKPTYCIINGV+YVIQICIWIFVMLNKSPGAVI AKLFFSVVS SAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTI
Subjt:  FWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTI

Query:  CFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIERIGSMEGKVVPG
        CF+CFFIRCFVLA SAFDKDADLDVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR      +    G+++ G
Subjt:  CFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIERIGSMEGKVVPG

XP_022927927.1 tobamovirus multiplication protein 1-like isoform X1 [Cucurbita moschata] | e value: 1.2e-131 | % identity: 75.81
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        M R L I SAVGGFE WN IDES+ WQKGIY  LSASYGLIS+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQ+RVPEFEWTTQKGFHLMNF+VNGLRAILFGLYKSVF IRPK LEMVVME+PGLLFFSTYTLLVLFWAEIY+QARSLPID+LKPTYCIINGV+
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        YVIQICIWIFVMLNKSPGAVI AKLFFSVVS SAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICF+CFFIRCFVLA SAFDKDADLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        PILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

XP_022988878.1 tobamovirus multiplication protein 1-like [Cucurbita maxima] | e value: 2.4e-132 | % identity: 76.11
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        M R L I SAVGGFE WN IDES+ WQKGIY ALSASYGLIS+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQ+RVPEFEWTTQKGFHLMNF+VNGLRAILFGLYKSVF IRPK LEMVVME+PGLLFFSTYTLLVLFWAEIY+QARSLPIDKLKPTYCIINGV+
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        YVIQICIWIFVMLNKSPGAVI AKLFFSVVS SAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICF+CFFIRCFVLA SAFDKD+DLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        PILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

XP_023530045.1 tobamovirus multiplication protein 1-like [Cucurbita pepo subsp. pepo] | e value: 9.2e-132 | % identity: 76.11
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        M R L I SAVGGFE WN IDES+ WQKGIY ALSASYGLIS+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQ+RVPEFEWTTQKGFHLMNF+VNGLRAILFGLYKSVF IRPK LEMVVME+PGLLFFSTYTLLVLFWAEIY+QARSLPIDKLKPTYCIINGV+
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        Y+IQICIWIFVMLNKSPGAVI AKLFFSVVS SAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICF CFFIRCFVLA SAFDKDADLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        PILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

XP_038888866.1 tobamovirus multiplication protein 1-like isoform X1 [Benincasa hispida] | e value: 7.0e-132 | % identity: 77.35
Query:  MARAL-PIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAF
        MARAL  I S VGG + WN IDES+ WQ+GIY  LSASYGLISLIAL                                                     
Subjt:  MARAL-PIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAF

Query:  VKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGV
          VQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVF IRPKV+EMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGV
Subjt:  VKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGV

Query:  MYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLD
        MYVIQICIWI VML+ SPG+VIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLD
Subjt:  MYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLD

Query:  HPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        HPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDR   IE
Subjt:  HPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K5Z8 DUF1084 domain-containing protein | e value: 2.2e-131 | % identity: 76.18
Query:  MARAL-PIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAF
        MARAL  I S +GGF+LWN ID+S  WQKGIY ALSASY LISLIAL                                                     
Subjt:  MARAL-PIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAF

Query:  VKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGV
          VQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVF I+PKVLEMV+MEIPGLLFFSTYTLLVLFWAEIYHQARSLPI KLKPTYCI+NGV
Subjt:  VKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGV

Query:  MYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLD
        MY+IQICIWI VML  SPGAVI AKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLD
Subjt:  MYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLD

Query:  HPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        HPILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR   IE
Subjt:  HPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

A0A6J1EME9 tobamovirus multiplication protein 1-like isoform X1 | e value: 5.8e-132 | % identity: 75.81
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        M R L I SAVGGFE WN IDES+ WQKGIY  LSASYGLIS+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQ+RVPEFEWTTQKGFHLMNF+VNGLRAILFGLYKSVF IRPK LEMVVME+PGLLFFSTYTLLVLFWAEIY+QARSLPID+LKPTYCIINGV+
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        YVIQICIWIFVMLNKSPGAVI AKLFFSVVS SAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICF+CFFIRCFVLA SAFDKDADLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        PILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

A0A6J1HJF7 tobamovirus multiplication protein 1-like | e value: 1.3e-128 | % identity: 74.63
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        M RAL    AV G E WNGIDES+QWQKGIYSALS+SYGL+S+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAI+FGLYKSVF IRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPI KLKPTYCIINGVM
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        Y+IQICIWI VML++SPGAVIAAKLFFSVV+FSAA+GFLIYGGRLFVMLRQFPIESRGRQKKLYEVG VT+ICFSCF IRC VLALSAFDKDADLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        PILNLIYY+LVE+VPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

A0A6J1HVA7 tobamovirus multiplication protein 1-like | e value: 1.6e-129 | % identity: 74.63
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        MARAL   SAV G E WNGIDES+QWQKGIYSALS+SYGL+S+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAI+FGLYKSVF IRPKVLEMV+MEIPGLLFFSTYTLLVLFWAEIYHQARSLPI KLKPTYCIINGVM
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        Y+IQICIWI VML++SPGAVIAAKLFFSVV+FSAA+GFLIYGGRLFVMLRQFPIESRGRQKKLYEVG VT+ICFSCF IRC VLALSAFDKDADLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        P+LNLIYY+LVE+VPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

A0A6J1JNL1 tobamovirus multiplication protein 1-like | e value: 1.2e-132 | % identity: 76.11
Query:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV
        M R L I SAVGGFE WN IDES+ WQKGIY ALSASYGLIS+IAL                                                      
Subjt:  MARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFV

Query:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM
         VQLIRIQ+RVPEFEWTTQKGFHLMNF+VNGLRAILFGLYKSVF IRPK LEMVVME+PGLLFFSTYTLLVLFWAEIY+QARSLPIDKLKPTYCIINGV+
Subjt:  KVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVM

Query:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH
        YVIQICIWIFVMLNKSPGAVI AKLFFSVVS SAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICF+CFFIRCFVLA SAFDKD+DLDVLDH
Subjt:  YVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDH

Query:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE
        PILNLIYYMLVEVVPSALVLFILRKLPPRR+SDR   IE
Subjt:  PILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIE

SwissProt top hits (e value, % identity, alignment)
Q402F3 Tobamovirus multiplication protein 3 | e value: 3.6e-78 | % identity: 59.17
Query:  LVSSLVAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLK
        ++  +V+A   VQLIRIQ+RVPE+ WTTQK FH +NF+VNG+R+++F   + V  + P++++ +++++P L FF+TY LLVLFWAEIY+QAR++  D L+
Subjt:  LVSSLVAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLK

Query:  PTYCIINGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFD
        P++  INGV+YVIQI +W+ +        VI +K+FF+ VS  AALGFL+YGGRLF+ML++FP+ES+GR+KKL EVG VTTICFSCF IRC ++  +AF+
Subjt:  PTYCIINGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFD

Query:  KDADLDVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR
        K ADLDVLDHPILNLIYY+LVE++PS+LVLFILRKLPP+R
Subjt:  KDADLDVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR

Q402F4 Tobamovirus multiplication protein 1 | e value: 1.1e-92 | % identity: 55.7
Query:  WNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEW
        W+ I+ES QWQ GI+ +L ASY L+S +AL                                                       +QLIRI++RVPE+ W
Subjt:  WNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEW

Query:  TTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKS
        TTQK FHLMNF+VNG+RAI+FG +K VF   PKVL + ++++PGLLFFST+TLLVLFWAEIYHQARSLP DKL+ +Y  ING +Y IQ CIW+++  N +
Subjt:  TTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKS

Query:  PGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPS
               K+F +VVSF AALGFL+YGGRLF+MLR+FPIES+GR+KKL+EVG VT ICF+CF I CFV+ LSAFD DA LDVLDHP+LNLIYY+LVE++PS
Subjt:  PGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPS

Query:  ALVLFILRKLPPRRVS
        ALVL+ILRKLPP+RVS
Subjt:  ALVLFILRKLPPRRVS

Q948R8 Protein TOM THREE HOMOLOG 1 | e value: 1.1e-76 | % identity: 45.81
Query:  LEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLV
        LE   + A+   +   G   W  ++ES  WQ  I+  L+  YG++S+IA+                                                  
Subjt:  LEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLV

Query:  AAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCII
             +QL+RIQ+RVPE+ WTTQK FH +NF+VNG+RA++F   +    ++P++L+ ++++IP L FF+TY LLVLFWAEIY+QAR++  D L+P++  I
Subjt:  AAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCII

Query:  NGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLD
        N V+YVIQI +W+ +        VI +K+FF+ VS  AALGFL+YGGRLF+ML++FP+ES+GR+KKL EVG VTTICF+CF IRC ++   AFD  ADLD
Subjt:  NGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLD

Query:  VLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR
        VLDHPILN IYY+LVE++PS+LVLFILRKLPP+R
Subjt:  VLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR

Q9FEG2 Tobamovirus multiplication protein 1 | e value: 1.2e-89 | % identity: 55.06
Query:  WNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEW
        W+ ++ES QWQ GI+ AL  +Y L+S +AL                                                       VQLIRIQ+RVPE+ W
Subjt:  WNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEW

Query:  TTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKS
        TTQK FHLMNF+VNG+RA+LFG +  VF + PK L  V++++PGLLFFS YTLLVLFWAEIYHQARSLP DKL+ TY  +N  +Y+ QI IW ++ ++ +
Subjt:  TTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKS

Query:  PGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPS
            +  K+F +VVSF AALGFL+YGGRLF MLR+FPIES+GR+KKL+EVG VT ICF+CF IRC V+A+SAFDKD  LDVLDHP+LNLIYYM+VEV+PS
Subjt:  PGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPS

Query:  ALVLFILRKLPPRRVS
        ALVLFILRKLPP+RVS
Subjt:  ALVLFILRKLPPRRVS

Q9ZUM2 Tobamovirus multiplication protein 3 | e value: 9.4e-79 | % identity: 44.78
Query:  ILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSL
        ++ +  + A+ +++       W+ ++ES  WQ  I+  L+  YG++SL+A+                                                 
Subjt:  ILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSL

Query:  VAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCI
              +QL+RIQ+RVPE+ WTTQK FH +NF+VNG+RA++F   ++V F++P++L+ ++++IP L FF+TY LLVLFWAEIY+QAR++  D L+P++  
Subjt:  VAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCI

Query:  INGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADL
        IN V+YV+QI +W+ +        VI +K+FF+ VS  AALGFL+YGGRLF+ML++FP+ES+GR+KKL EVG VTTICF+CF IRC ++  +AFD+ A+L
Subjt:  INGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADL

Query:  DVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR
        DVLDHPILN IYY+LVE++PS+LVLFILRKLPP+R
Subjt:  DVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR

Arabidopsis top hits (e value, % identity, alignment)
AT1G14530.1 Protein of unknown function (DUF1084) | e value: 8.1e-78 | % identity: 45.81
Query:  LEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLV
        LE   + A+   +   G   W  ++ES  WQ  I+  L+  YG++S+IA+                                                  
Subjt:  LEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLV

Query:  AAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCII
             +QL+RIQ+RVPE+ WTTQK FH +NF+VNG+RA++F   +    ++P++L+ ++++IP L FF+TY LLVLFWAEIY+QAR++  D L+P++  I
Subjt:  AAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCII

Query:  NGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLD
        N V+YVIQI +W+ +        VI +K+FF+ VS  AALGFL+YGGRLF+ML++FP+ES+GR+KKL EVG VTTICF+CF IRC ++   AFD  ADLD
Subjt:  NGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLD

Query:  VLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR
        VLDHPILN IYY+LVE++PS+LVLFILRKLPP+R
Subjt:  VLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR

AT1G14530.2 Protein of unknown function (DUF1084) | e value: 8.1e-78 | % identity: 45.81
Query:  LEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLV
        LE   + A+   +   G   W  ++ES  WQ  I+  L+  YG++S+IA+                                                  
Subjt:  LEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLV

Query:  AAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCII
             +QL+RIQ+RVPE+ WTTQK FH +NF+VNG+RA++F   +    ++P++L+ ++++IP L FF+TY LLVLFWAEIY+QAR++  D L+P++  I
Subjt:  AAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCII

Query:  NGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLD
        N V+YVIQI +W+ +        VI +K+FF+ VS  AALGFL+YGGRLF+ML++FP+ES+GR+KKL EVG VTTICF+CF IRC ++   AFD  ADLD
Subjt:  NGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLD

Query:  VLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR
        VLDHPILN IYY+LVE++PS+LVLFILRKLPP+R
Subjt:  VLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR

AT2G02180.1 tobamovirus multiplication protein 3 | e value: 6.7e-80 | % identity: 44.78
Query:  ILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSL
        ++ +  + A+ +++       W+ ++ES  WQ  I+  L+  YG++SL+A+                                                 
Subjt:  ILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSL

Query:  VAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCI
              +QL+RIQ+RVPE+ WTTQK FH +NF+VNG+RA++F   ++V F++P++L+ ++++IP L FF+TY LLVLFWAEIY+QAR++  D L+P++  
Subjt:  VAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCI

Query:  INGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADL
        IN V+YV+QI +W+ +        VI +K+FF+ VS  AALGFL+YGGRLF+ML++FP+ES+GR+KKL EVG VTTICF+CF IRC ++  +AFD+ A+L
Subjt:  INGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADL

Query:  DVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR
        DVLDHPILN IYY+LVE++PS+LVLFILRKLPP+R
Subjt:  DVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRR

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 3.0e-16 | % identity: 25
Query:  GIESQQTSQMASIFGCKPGTWPITYLGLPLNDNPKRANFWTPVIEKVQKRLNNCGSSNISKGGRHTLIQATLTNLPIYYLSLFQAPKKVTKAMEMLYRRF
        G++    + +   F    G  P+ YLGLPL       + + P++EK++ R+    + ++S  GR  LI + + +L  +++S F+ P    K ++ +   F
Subjt:  GIESQQTSQMASIFGCKPGTWPITYLGLPLNDNPKRANFWTPVIEKVQKRLNNCGSSNISKGGRHTLIQATLTNLPIYYLSLFQAPKKVTKAMEMLYRRF

Query:  LWSDRSEEKGCHLLRWSHIQLPMEEGGLGIYDIHKKNISLLAKWSWRFYRESNALWRKIIAAKFGLARNPHKMGEHSLSSSKGPWRNIFKNRHLLYYFTD
        LWS          + WS +  P +EGGLGI  + + N                + W                 G  +L S    W+ I K+R L   F  
Subjt:  LWSDRSEEKGCHLLRWSHIQLPMEEGGLGIYDIHKKNISLLAKWSWRFYRESNALWRKIIAAKFGLARNPHKMGEHSLSSSKGPWRNIFKNRHLLYYFTD

Query:  SRVGKGDKTLFWEDVW
          +  G  T FW D W
Subjt:  SRVGKGDKTLFWEDVW

AT4G21790.1 tobamovirus multiplication 1 | e value: 8.4e-91 | % identity: 55.06
Query:  WNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEW
        W+ ++ES QWQ GI+ AL  +Y L+S +AL                                                       VQLIRIQ+RVPE+ W
Subjt:  WNGIDESKQWQKGIYSALSASYGLISLIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEW

Query:  TTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKS
        TTQK FHLMNF+VNG+RA+LFG +  VF + PK L  V++++PGLLFFS YTLLVLFWAEIYHQARSLP DKL+ TY  +N  +Y+ QI IW ++ ++ +
Subjt:  TTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLEMVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKS

Query:  PGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPS
            +  K+F +VVSF AALGFL+YGGRLF MLR+FPIES+GR+KKL+EVG VT ICF+CF IRC V+A+SAFDKD  LDVLDHP+LNLIYYM+VEV+PS
Subjt:  PGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKKLYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPS

Query:  ALVLFILRKLPPRRVS
        ALVLFILRKLPP+RVS
Subjt:  ALVLFILRKLPPRRVS


Sequences
CDS sequence
ATGATTCCACGTGTAAAGTCAAGAACCGTCCACGTGTCAGAGATATCTGTCTCATGGACCAAATTAGAGTTACGCTCTCTCGTAGCCGCGCGCGGCATTAATGGCGAAGA
AGGTTTCTTCTCCCTCTCTCGCGCGGCATTAGCAACGAAGAATAAGAAGCCTTTCTCTTCATGCGATCTTCTCTTTCCCATTTTGGAAGCTCAAATGGCGAGGGCTTTAC
CGATCATCTCTGCGGTTGGTGGATTCGAATTGTGGAACGGGATTGATGAATCCAAGCAATGGCAGAAGGGGATTTACTCCGCCTTGTCCGCCTCTTATGGCCTCATCTCG
CTCATTGCCTTAGGTTTTATTCTTTGGTGCTCTGCTTTTTCTGCCACCTTGTTACCTTATGGAGGACTTGATCCGATTGAAAGGAGACTTTTCAATAGTGTTGAGAGACA
TGGTTCTCTTGGATTGGAGCTCTTTTTTGTATGGCTAGTTAGTTCTTTGGTAGCTGCTTTTGTTAAGGTTCAACTTATTCGGATTCAAATAAGAGTACCAGAATTCGAGT
GGACAACACAAAAGGGGTTCCATTTGATGAATTTCATTGTAAATGGATTGAGGGCTATTCTCTTTGGGCTTTATAAGAGCGTCTTCTTCATTAGACCGAAGGTTCTTGAA
ATGGTGGTTATGGAGATTCCTGGACTTCTATTTTTTTCAACGTATACACTACTTGTTCTATTTTGGGCGGAGATATACCATCAGGCACGAAGTCTTCCTATTGATAAACT
TAAGCCTACTTACTGTATCATCAATGGAGTTATGTACGTTATACAGATCTGCATTTGGATATTTGTGATGTTAAACAAGTCTCCTGGTGCAGTTATAGCAGCTAAGCTCT
TCTTTTCAGTGGTTTCATTTTCTGCTGCACTGGGTTTCCTTATATATGGTGGGAGGTTATTTGTCATGCTGAGGCAGTTCCCTATTGAATCTAGAGGACGCCAAAAAAAG
TTGTACGAGGTTGGTTGTGTGACAACAATTTGTTTTAGCTGTTTCTTTATACGATGTTTTGTGCTTGCTCTCTCCGCATTTGACAAGGATGCAGATCTTGACGTCTTAGA
TCATCCTATTCTCAACCTGATATACTACATGTTGGTAGAGGTTGTTCCTTCGGCATTGGTTCTGTTTATACTGAGAAAACTTCCTCCAAGGCGTGTATCTGATCGCTGTT
TCAACATTGAACGCATTGGAAGTATGGAAGGTAAGGTGGTGCCTGGTGTAGCTACATTAGGGAATTACCACTTCACATGTTGGAGGAGGTCTCATCATCAAAACTATAGA
TTTCTCATTATCTTTTCTAAAGCCTTTGACACAGTAGACTGGGACTTCCTCGACAACATATTAAAAGCCAAGGGATTCGGTACTTTATGGAGAAGGTGGATCAAAGGATG
TGTCTCATCAGCAAACTTTTCGGTCATTATTAATGGCAAACCAAGAGGCAAGATCATCGCTACTCGAGGCCTTAGACAAGGGGATCCTCTCTCCCCATTCTTATTTATCT
TGGTGGCCGATTGCCTTAGCAGATTGCTTCTTGTGGCTGAAAAGGGGGGCTCTATTGAGGGTTTTCGGGTAGGGGTCGGTCCACAAGCCATCCAAACCACCCACCTTCTG
TTTGCTGATGATACAATTCTATTTTCATCCCACAATCAGCAATCCGTCAACAACCTTCTGAATGTTGTTCAACATTTTGAAAGGGTATCGGGCTTAAAGATTAACCACAG
CAAATCAGAGCTCCTTGGCCTTGGCATCGAATCGCAGCAGACATCTCAGATGGCATCCATCTTTGGTTGTAAACCGGGAACATGGCCGATTACATACTTGGGTCTCCCTT
TGAATGACAATCCGAAAAGGGCCAATTTTTGGACTCCAGTGATTGAAAAGGTCCAAAAGAGACTTAACAATTGTGGATCATCTAATATTTCAAAGGGAGGTCGACACACC
CTTATCCAAGCTACCCTCACCAACCTCCCTATCTACTATCTCTCACTTTTTCAAGCCCCTAAGAAGGTAACCAAAGCTATGGAGATGTTATACCGCAGATTCCTTTGGAG
TGACCGTTCTGAGGAAAAAGGATGCCACCTTCTAAGGTGGTCTCACATCCAATTGCCTATGGAAGAGGGAGGATTGGGCATATATGATATTCACAAGAAAAATATCTCTC
TTTTAGCCAAATGGTCTTGGAGATTCTACCGAGAATCGAACGCCCTATGGAGGAAAATTATTGCTGCAAAATTCGGGCTTGCTAGGAACCCCCACAAAATGGGAGAACAT
TCCTTGAGCAGCTCTAAGGGGCCGTGGAGGAATATTTTCAAGAATCGACATCTCCTATATTACTTCACTGATAGTAGAGTTGGTAAAGGTGATAAAACCCTTTTTTGGGA
AGATGTCTGGTTGGGTGCTACTCCCTTTAAAACCACATATCCATCTTTATACAACCTATCTCTCAAGGAGGCCAGCATTGCTGATTTATGGATCACCGAAAATGAGGCCT
GGAATCTCTTTTTAAGGAGGCACCTTCAGGAGTCTGAAATCCTTGAATGGGCCAATCTATCACACCACCTATCATCCTTTTCCTTCACTAATAGGGATGATGTTTGGATT
TGA
mRNA sequence
ATGATTCCACGTGTAAAGTCAAGAACCGTCCACGTGTCAGAGATATCTGTCTCATGGACCAAATTAGAGTTACGCTCTCTCGTAGCCGCGCGCGGCATTAATGGCGAAGA
AGGTTTCTTCTCCCTCTCTCGCGCGGCATTAGCAACGAAGAATAAGAAGCCTTTCTCTTCATGCGATCTTCTCTTTCCCATTTTGGAAGCTCAAATGGCGAGGGCTTTAC
CGATCATCTCTGCGGTTGGTGGATTCGAATTGTGGAACGGGATTGATGAATCCAAGCAATGGCAGAAGGGGATTTACTCCGCCTTGTCCGCCTCTTATGGCCTCATCTCG
CTCATTGCCTTAGGTTTTATTCTTTGGTGCTCTGCTTTTTCTGCCACCTTGTTACCTTATGGAGGACTTGATCCGATTGAAAGGAGACTTTTCAATAGTGTTGAGAGACA
TGGTTCTCTTGGATTGGAGCTCTTTTTTGTATGGCTAGTTAGTTCTTTGGTAGCTGCTTTTGTTAAGGTTCAACTTATTCGGATTCAAATAAGAGTACCAGAATTCGAGT
GGACAACACAAAAGGGGTTCCATTTGATGAATTTCATTGTAAATGGATTGAGGGCTATTCTCTTTGGGCTTTATAAGAGCGTCTTCTTCATTAGACCGAAGGTTCTTGAA
ATGGTGGTTATGGAGATTCCTGGACTTCTATTTTTTTCAACGTATACACTACTTGTTCTATTTTGGGCGGAGATATACCATCAGGCACGAAGTCTTCCTATTGATAAACT
TAAGCCTACTTACTGTATCATCAATGGAGTTATGTACGTTATACAGATCTGCATTTGGATATTTGTGATGTTAAACAAGTCTCCTGGTGCAGTTATAGCAGCTAAGCTCT
TCTTTTCAGTGGTTTCATTTTCTGCTGCACTGGGTTTCCTTATATATGGTGGGAGGTTATTTGTCATGCTGAGGCAGTTCCCTATTGAATCTAGAGGACGCCAAAAAAAG
TTGTACGAGGTTGGTTGTGTGACAACAATTTGTTTTAGCTGTTTCTTTATACGATGTTTTGTGCTTGCTCTCTCCGCATTTGACAAGGATGCAGATCTTGACGTCTTAGA
TCATCCTATTCTCAACCTGATATACTACATGTTGGTAGAGGTTGTTCCTTCGGCATTGGTTCTGTTTATACTGAGAAAACTTCCTCCAAGGCGTGTATCTGATCGCTGTT
TCAACATTGAACGCATTGGAAGTATGGAAGGTAAGGTGGTGCCTGGTGTAGCTACATTAGGGAATTACCACTTCACATGTTGGAGGAGGTCTCATCATCAAAACTATAGA
TTTCTCATTATCTTTTCTAAAGCCTTTGACACAGTAGACTGGGACTTCCTCGACAACATATTAAAAGCCAAGGGATTCGGTACTTTATGGAGAAGGTGGATCAAAGGATG
TGTCTCATCAGCAAACTTTTCGGTCATTATTAATGGCAAACCAAGAGGCAAGATCATCGCTACTCGAGGCCTTAGACAAGGGGATCCTCTCTCCCCATTCTTATTTATCT
TGGTGGCCGATTGCCTTAGCAGATTGCTTCTTGTGGCTGAAAAGGGGGGCTCTATTGAGGGTTTTCGGGTAGGGGTCGGTCCACAAGCCATCCAAACCACCCACCTTCTG
TTTGCTGATGATACAATTCTATTTTCATCCCACAATCAGCAATCCGTCAACAACCTTCTGAATGTTGTTCAACATTTTGAAAGGGTATCGGGCTTAAAGATTAACCACAG
CAAATCAGAGCTCCTTGGCCTTGGCATCGAATCGCAGCAGACATCTCAGATGGCATCCATCTTTGGTTGTAAACCGGGAACATGGCCGATTACATACTTGGGTCTCCCTT
TGAATGACAATCCGAAAAGGGCCAATTTTTGGACTCCAGTGATTGAAAAGGTCCAAAAGAGACTTAACAATTGTGGATCATCTAATATTTCAAAGGGAGGTCGACACACC
CTTATCCAAGCTACCCTCACCAACCTCCCTATCTACTATCTCTCACTTTTTCAAGCCCCTAAGAAGGTAACCAAAGCTATGGAGATGTTATACCGCAGATTCCTTTGGAG
TGACCGTTCTGAGGAAAAAGGATGCCACCTTCTAAGGTGGTCTCACATCCAATTGCCTATGGAAGAGGGAGGATTGGGCATATATGATATTCACAAGAAAAATATCTCTC
TTTTAGCCAAATGGTCTTGGAGATTCTACCGAGAATCGAACGCCCTATGGAGGAAAATTATTGCTGCAAAATTCGGGCTTGCTAGGAACCCCCACAAAATGGGAGAACAT
TCCTTGAGCAGCTCTAAGGGGCCGTGGAGGAATATTTTCAAGAATCGACATCTCCTATATTACTTCACTGATAGTAGAGTTGGTAAAGGTGATAAAACCCTTTTTTGGGA
AGATGTCTGGTTGGGTGCTACTCCCTTTAAAACCACATATCCATCTTTATACAACCTATCTCTCAAGGAGGCCAGCATTGCTGATTTATGGATCACCGAAAATGAGGCCT
GGAATCTCTTTTTAAGGAGGCACCTTCAGGAGTCTGAAATCCTTGAATGGGCCAATCTATCACACCACCTATCATCCTTTTCCTTCACTAATAGGGATGATGTTTGGATT
TGA
Protein sequence
MIPRVKSRTVHVSEISVSWTKLELRSLVAARGINGEEGFFSLSRAALATKNKKPFSSCDLLFPILEAQMARALPIISAVGGFELWNGIDESKQWQKGIYSALSASYGLIS
LIALGFILWCSAFSATLLPYGGLDPIERRLFNSVERHGSLGLELFFVWLVSSLVAAFVKVQLIRIQIRVPEFEWTTQKGFHLMNFIVNGLRAILFGLYKSVFFIRPKVLE
MVVMEIPGLLFFSTYTLLVLFWAEIYHQARSLPIDKLKPTYCIINGVMYVIQICIWIFVMLNKSPGAVIAAKLFFSVVSFSAALGFLIYGGRLFVMLRQFPIESRGRQKK
LYEVGCVTTICFSCFFIRCFVLALSAFDKDADLDVLDHPILNLIYYMLVEVVPSALVLFILRKLPPRRVSDRCFNIERIGSMEGKVVPGVATLGNYHFTCWRRSHHQNYR
FLIIFSKAFDTVDWDFLDNILKAKGFGTLWRRWIKGCVSSANFSVIINGKPRGKIIATRGLRQGDPLSPFLFILVADCLSRLLLVAEKGGSIEGFRVGVGPQAIQTTHLL
FADDTILFSSHNQQSVNNLLNVVQHFERVSGLKINHSKSELLGLGIESQQTSQMASIFGCKPGTWPITYLGLPLNDNPKRANFWTPVIEKVQKRLNNCGSSNISKGGRHT
LIQATLTNLPIYYLSLFQAPKKVTKAMEMLYRRFLWSDRSEEKGCHLLRWSHIQLPMEEGGLGIYDIHKKNISLLAKWSWRFYRESNALWRKIIAAKFGLARNPHKMGEH
SLSSSKGPWRNIFKNRHLLYYFTDSRVGKGDKTLFWEDVWLGATPFKTTYPSLYNLSLKEASIADLWITENEAWNLFLRRHLQESEILEWANLSHHLSSFSFTNRDDVWI
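The protein sequence above appears to be the conceptual translation of the CDS: the CDS begins `ATGATTCCACGTGTAAAGTCA...` and the protein begins `MIPRVKS...`. The sketch below shows one way to check that relationship, assuming the standard nuclear genetic code (NCBI translation table 1), which the record does not name explicitly. The `translate` helper is illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS, copied from the record above.
cds_start = "ATGATTCCACGTGTAAAGTCA"
cds_end = "GATGATGTTTGGATTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full wrapped CDS text to `translate` and comparing the result with the protein sequence, after removing its line breaks, would confirm the correspondence over the whole length.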