; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0617 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0617
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontobamovirus multiplication protein 1
Genome locationMC02:4941968..4951382
RNA-Seq ExpressionMC02g0617
SyntenyMC02g0617
Gene Ontology termsGO:0005774 - vacuolar membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009457 - THH1/TOM1/TOM3 domain
IPR040226 - THH1/TOM1/TOM3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456345.1 PREDICTED: tobamovirus multiplication protein 1 isoform X1 [Cucumis melo]5.60e-22287.71Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ L++NT+C+PLDL++LD  MASFNG+LAF+AFSQLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA  +LW+C SHVF FVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST
        LAAFLLLLSFWVDLCHQANDEEDDDDDEEN+ +Q+LLENSKNKPGSSNVDGHRRCCGFPA HLGSRQKIVIVVV LVF+LMVAVS+LIWIGAG+NPIDST
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST

Query:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI
         VARVYEDFLA+TVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV A V+L LYFCMGSLI
Subjt:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI

Query:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  V RSKNQ +RASPI
Subjt:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

XP_022149697.1 tobamovirus multiplication protein 1 [Momordica charantia]1.37e-252100Show/hide
Query:  MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPK
        MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPK
Subjt:  MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPK

Query:  ILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPI
        ILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPI
Subjt:  ILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPI

Query:  DSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMG
        DSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMG
Subjt:  DSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

XP_022970675.1 uncharacterized protein LOC111469590 [Cucurbita maxima]2.21e-21285.75Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ LAANTA VP+DL++L+ AMASFNG+LAF+AFSQLIRIHMR QQDGWTRQK +HLMI SSNLGYM YFIFALVA F+  +C SHVFGFVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST
        LAAFLLLLSFWVDL HQAND++D+D+DEEN+T+QALLENSKNKPGSS+VDG+RRCCGFPA HLGSRQK VIVVV+LVF LMVAVS+LIWIG G+NPIDST
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST

Query:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI
        AVA+VYE F+AVTVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV ALVLL LYFCMGSLI
Subjt:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI

Query:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        PSAFL+W MRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  VTRSKNQV+RASPI
Subjt:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

XP_031744161.1 tobamovirus multiplication protein 1 [Cucumis sativus]2.78e-22287.71Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ L++NT+C+PLDL++LD  MASFNG+LAF+AFSQLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALV   +LW+C SHVF FVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST
        LAAFLLLLSFWVDLCHQANDEEDDDDDEEN+ +Q LLENSKNKPGSSNVDGHRRCCGFPA HLGSRQKIVIVVV+LVF+LMVAVS+LIWIGAG+NPIDST
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST

Query:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI
        AVARVYEDFLA+TVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV A V+L LYFCMGSLI
Subjt:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI

Query:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        PSAFLLWIMRELPPPKK+QRQEESRAIAFISHGAADANPQ W  V RSKNQ +RASPI
Subjt:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

XP_038902574.1 tobamovirus multiplication protein 1 [Benincasa hispida]3.01e-22489.42Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ LAANTAC+PLDL++LD AMAS NG+LAF+AF QLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA  +LW+C SHVFGFVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDD-DDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDS
        LAAFLLLLSFWVDLCHQANDEEDDD DDEEN+ +QALLENSKNKPGSSNVDGHRRCCGFPA HLGSRQKIVI+VV+LVF+LMVAVS+LIWIGAG+NPIDS
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDD-DDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDS

Query:  TAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSL
        TAVARVYEDFLA+TVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV ALVLLILYFCMGSL
Subjt:  TAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSL

Query:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  V RSKNQ +RASPI
Subjt:  IPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

TrEMBL top hitse value%identityAlignment
A0A1S3C2L2 tobamovirus multiplication protein 1 isoform X12.71e-22287.71Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ L++NT+C+PLDL++LD  MASFNG+LAF+AFSQLIRIHMR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA  +LW+C SHVF FVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST
        LAAFLLLLSFWVDLCHQANDEEDDDDDEEN+ +Q+LLENSKNKPGSSNVDGHRRCCGFPA HLGSRQKIVIVVV LVF+LMVAVS+LIWIGAG+NPIDST
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST

Query:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI
         VARVYEDFLA+TVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV A V+L LYFCMGSLI
Subjt:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI

Query:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  V RSKNQ +RASPI
Subjt:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

A0A1S3C336 tobamovirus multiplication protein 1 isoform X21.59e-19989.91Show/hide
Query:  MRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSK
        MR QQDGWTRQKA+HLMIGSSNLGYMIYFIFALVA  +LW+C SHVF FVLMAFPKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEEN+ +Q+LLENSK
Subjt:  MRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSK

Query:  NKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEAS
        NKPGSSNVDGHRRCCGFPA HLGSRQKIVIVVV LVF+LMVAVS+LIWIGAG+NPIDST VARVYEDFLA+TVLLSGGALGFYGFMLFYRL KVRSEEAS
Subjt:  NKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEAS

Query:  SEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQR
        SEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV A V+L LYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAAD NPQ 
Subjt:  SEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQR

Query:  WADVTRSKNQVTRASPI
        W  V RSKNQ +RASPI
Subjt:  WADVTRSKNQVTRASPI

A0A6J1D8P5 tobamovirus multiplication protein 16.62e-253100Show/hide
Query:  MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPK
        MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPK
Subjt:  MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPK

Query:  ILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPI
        ILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPI
Subjt:  ILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPI

Query:  DSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMG
        DSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMG
Subjt:  DSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMG

Query:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
Subjt:  SLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

A0A6J1G615 uncharacterized protein LOC1114511263.40e-20984.92Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ LAANTA VP+DL+ L+ AMASFNG+LAF+AFSQLIRIHMR QQD WTRQK +HLMI SSNLGYM YFIFALVA F+  +C SHVFGFVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST
        LAAFLLLLSFWVDL HQAND++D+D+DEEN+T+QALLENSKNKPGSS+VDG+RRCCGFPA HLGSRQK+VIVVV+LVF LMVAVS+LIWIG G+NPIDST
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST

Query:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI
        AVA+VYE F+AVTVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWR  R NGV ALVLL LYFCMGSLI
Subjt:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI

Query:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        PSAFL+W MRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  VTRSKNQV+RASPI
Subjt:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

A0A6J1I3I8 uncharacterized protein LOC1114695901.07e-21285.75Show/hide
Query:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF
        L+ LAANTA VP+DL++L+ AMASFNG+LAF+AFSQLIRIHMR QQDGWTRQK +HLMI SSNLGYM YFIFALVA F+  +C SHVFGFVLMAFPKILF
Subjt:  LQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILF

Query:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST
        LAAFLLLLSFWVDL HQAND++D+D+DEEN+T+QALLENSKNKPGSS+VDG+RRCCGFPA HLGSRQK VIVVV+LVF LMVAVS+LIWIG G+NPIDST
Subjt:  LAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDST

Query:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI
        AVA+VYE F+AVTVLLSGGALGFYGFMLFYRL KVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRF RTNGV ALVLL LYFCMGSLI
Subjt:  AVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLI

Query:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI
        PSAFL+W MRELPPPKKIQRQEESRAIAFISHGAAD NPQ W  VTRSKNQV+RASPI
Subjt:  PSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANPQRWADVTRSKNQVTRASPI

SwissProt top hitse value%identityAlignment
Q402F3 Tobamovirus multiplication protein 36.1e-0822.68Show/hide
Query:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ +A  QLIRI MR  + GWT QK  H       L +++  + +LV  F  ++      +   +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGG
            D                  +P    ++G                        +V+++ + + ++IW      P+    +  + + F A   L +  
Subjt:  DEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGG

Query:  ALGF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNAL---VLLILYFCMGSLIPSAFLLWIMREL
        ALGF  YG  LF  L    V S+    ++++VG +  +C  CF       L+  + + +N  F +   ++ L   +L ++Y+ +  ++PS+ +L+I+R+L
Subjt:  ALGF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNAL---VLLILYFCMGSLIPSAFLLWIMREL

Query:  PPPKKIQRQEESR
        PP + I +    R
Subjt:  PPPKKIQRQEESR

Q402F4 Tobamovirus multiplication protein 13.8e-1024Show/hide
Query:  VLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEEDDD
        +++ +A  QLIRI +R  + GWT QK  HLM       +++  + A+V  F  +++     V    ++  P +LF + F LL+ FW ++ HQA       
Subjt:  VLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEEDDD

Query:  DDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGALGF--
                   L   K +    +++G                         ++ +   + V +W        D++ V  + + F+AV   ++  ALGF  
Subjt:  DDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGALGF--

Query:  YGFMLF--YRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQ
        YG  LF   R   + S+    ++ +VG +  +C  CF  S  V +L+      +      + ++  VL ++Y+ +  ++PSA +L+I+R+L PPK++  Q
Subjt:  YGFMLF--YRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQ

Q948R8 Protein TOM THREE HOMOLOG 17.9e-0822.83Show/hide
Query:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV  F  +  N    +   +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGG
                                  + + DG R                   +  +V+++ +A+ +++W      P+    +  + + F A   L +  
Subjt:  DEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGG

Query:  ALGF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNAL---VLLILYFCMGSLIPSAFLLWIMREL
        ALGF  YG  LF  L    V S+    ++++VG +  +C  CF       L+  I + ++  F     ++ L   +L  +Y+ +  ++PS+ +L+I+R+L
Subjt:  ALGF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNAL---VLLILYFCMGSLIPSAFLLWIMREL

Query:  PPPKKIQRQEE
        PP + I +  +
Subjt:  PPPKKIQRQEE

Q9FEG2 Tobamovirus multiplication protein 17.6e-1124.92Show/hide
Query:  AMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND
        A+     +++ +A  QLIRI MR  + GWT QK  HLM    N    + F F +    +++        +VL+  P +LF +A+ LL+ FW ++ HQA  
Subjt:  AMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGA
           D                                          +   I V + V++  + +   IW+       D++ V  V + F+AV   ++  A
Subjt:  EEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGA

Query:  LGF--YGFMLFY--RLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPK
        LGF  YG  LF+  R   + S+    ++ +VG +  +C  CF    +V     + +S   + +  + ++  VL ++Y+ +  ++PSA +L+I+R+L PPK
Subjt:  LGF--YGFMLFY--RLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPK

Query:  KIQRQ
        ++  Q
Subjt:  KIQRQ

Q9ZUM2 Tobamovirus multiplication protein 38.7e-0721.43Show/hide
Query:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDE
        +A   G+++ +A  QL+RI +R  + GWT QK  H +    N    + F+F     F        +   +L+  P + F   + LL+ FW ++ +QA   
Subjt:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDE

Query:  EDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGAL
                                + + DG R                   +  +V+++ +A+ +++W      P+    +  + + F A   L +  AL
Subjt:  EDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGAL

Query:  GF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKK
        GF  YG  LF  L    V S+    ++++VG +  +C  CF    ++          N      + ++  +L  +Y+ +  ++PS+ +L+I+R+LPP + 
Subjt:  GF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKK

Query:  IQRQEESR
        I +  + R
Subjt:  IQRQEESR

Arabidopsis top hitse value%identityAlignment
AT1G14530.1 Protein of unknown function (DUF1084)5.6e-0922.83Show/hide
Query:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN
        +A   G+++ +A  QL+RI +R  + GWT QK  H       L +M+  + ALV  F  +  N    +   +L+  P + F   + LL+ FW ++ +QA 
Subjt:  MASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATF--ELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAN

Query:  DEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGG
                                  + + DG R                   +  +V+++ +A+ +++W      P+    +  + + F A   L +  
Subjt:  DEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGG

Query:  ALGF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNAL---VLLILYFCMGSLIPSAFLLWIMREL
        ALGF  YG  LF  L    V S+    ++++VG +  +C  CF       L+  I + ++  F     ++ L   +L  +Y+ +  ++PS+ +L+I+R+L
Subjt:  ALGF--YGFMLFYRL--IKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNAL---VLLILYFCMGSLIPSAFLLWIMREL

Query:  PPPKKIQRQEE
        PP + I +  +
Subjt:  PPPKKIQRQEE

AT3G59090.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457)1.1e-10553.68Show/hide
Query:  MVLLQSLAANT--ACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAF
        M +L +L  +T   C     + ++  +A  +  LAF+AF QL R H R +Q GWTRQK +HLMI SSN G +IYF+ A++AT   W+  S+  GF+LMAF
Subjt:  MVLLQSLAANT--ACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAF

Query:  PKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQN
        PKILFLA FLLLLSFWVD+CHQ N EEDDDDDEENS QQ LLE SK+KPGSSN    R+CC F   H+G+RQK V+  +ILVFILM++ ++LIWI +G+N
Subjt:  PKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQN

Query:  PIDSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFC
        P++S+ +A VY D  A  +L++GG + FYG  L + L KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT IPL Y+W   + +G+ ALVLLI+Y+ 
Subjt:  PIDSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFC

Query:  MGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANP----QRWADVTRSKNQVTRASPI
        +GS +P AF+LW++RELPP   + RQE++R I ++++      P    Q WA  T SKNQV++ASPI
Subjt:  MGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANP----QRWADVTRSKNQVTRASPI

AT3G59090.2 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457)8.7e-10353.19Show/hide
Query:  MVLLQSLAANT--ACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAF
        M +L +L  +T   C     + ++  +A  +  LAF+AF QL R H R +Q GWTRQK +HLMI SSN G +IYF+ A++AT   W+  S+  GF+LMAF
Subjt:  MVLLQSLAANT--ACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAF

Query:  PKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQN
        PKILFLA FLLLLSFWVD+CHQ N EEDDDDDEENS QQ LLE SK+KPGSSN    R+CC F   H+G+RQK V+  +ILVFILM++ ++LIWI +G+N
Subjt:  PKILFLAAFLLLLSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQN

Query:  PIDSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFC
        P++S+ +A VY D  A  +L++GG + FYG  L + L KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT IPL Y+W   + +G+ ALVLLI+Y+ 
Subjt:  PIDSTAVARVYEDFLAVTVLLSGGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFC

Query:  MGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANP----QRWADVTRSKNQV
        +GS +P AF+LW++RELPP   + RQE++R I ++++      P    Q WA  T SKNQ+
Subjt:  MGSLIPSAFLLWIMRELPPPKKIQRQEESRAIAFISHGAADANP----QRWADVTRSKNQV

AT3G59090.3 LOCATED IN: endomembrane system3.0e-10356.34Show/hide
Query:  SFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEED
        SF  +LA   F QL R H R +Q GWTRQK +HLMI SSN G +IYF+ A++AT   W+  S+  GF+LMAFPKILFLA FLLLLSFWVD+CHQ N EED
Subjt:  SFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQANDEED

Query:  DDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGALGF
        DDDDEENS QQ LLE SK+KPGSSN    R+CC F   H+G+RQK V+  +ILVFILM++ ++LIWI +G+NP++S+ +A VY D  A  +L++GG + F
Subjt:  DDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGALGF

Query:  YGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEE
        YG  L + L KVRSE+ SSEM+KV GLA V VVCFT S+L+ LLT IPL Y+W   + +G+ ALVLLI+Y+ +GS +P AF+LW++RELPP   + RQE+
Subjt:  YGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEE

Query:  SRAIAFISHGAADANP----QRWADVTRSKNQVTRASPI
        +R I ++++      P    Q WA  T SKNQV++ASPI
Subjt:  SRAIAFISHGAADANP----QRWADVTRSKNQVTRASPI

AT4G21790.1 tobamovirus multiplication 15.4e-1224.92Show/hide
Query:  AMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND
        A+     +++ +A  QLIRI MR  + GWT QK  HLM    N    + F F +    +++        +VL+  P +LF +A+ LL+ FW ++ HQA  
Subjt:  AMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLLLSFWVDLCHQAND

Query:  EEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGA
           D                                          +   I V + V++  + +   IW+       D++ V  V + F+AV   ++  A
Subjt:  EEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLSGGA

Query:  LGF--YGFMLFY--RLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPK
        LGF  YG  LF+  R   + S+    ++ +VG +  +C  CF    +V     + +S   + +  + ++  VL ++Y+ +  ++PSA +L+I+R+L PPK
Subjt:  LGF--YGFMLFY--RLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPK

Query:  KIQRQ
        ++  Q
Subjt:  KIQRQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGCTTCAATCACTTGCCGCCAACACTGCTTGTGTTCCCCTGGACCTTGTCATTCTCGATGCAGCTATGGCTTCCTTCAACGGCGTTCTAGCTTTCTTAGCCTT
TTCGCAGCTCATCAGAATTCACATGCGGGGTCAACAGGATGGATGGACGCGTCAAAAAGCAGTTCATCTGATGATAGGCTCTTCTAACTTGGGCTATATGATTTATTTCA
TATTTGCACTTGTTGCTACTTTTGAACTCTGGAATTGCGGGTCTCATGTTTTTGGATTTGTCCTCATGGCTTTCCCTAAAATACTGTTTCTAGCAGCTTTTCTCCTACTT
CTTTCTTTCTGGGTCGACCTTTGCCATCAGGCAAACGATGAAGAGGATGACGACGACGATGAAGAAAATAGCACTCAACAGGCCTTGTTGGAAAATTCAAAGAACAAACC
TGGTTCATCAAATGTAGATGGCCATCGAAGATGTTGCGGATTTCCTGCTACTCATCTTGGAAGTAGGCAAAAAATTGTAATTGTGGTTGTCATACTGGTATTTATCCTCA
TGGTGGCAGTTTCTGTTCTGATCTGGATTGGGGCAGGACAAAATCCTATTGATTCTACAGCTGTTGCCAGGGTGTATGAAGACTTTCTTGCTGTTACAGTCCTCCTATCA
GGAGGTGCCCTAGGTTTCTATGGTTTCATGTTATTTTATAGATTGATAAAAGTACGTTCTGAGGAAGCCTCTTCCGAGATGAAGAAGGTTGGTGGGCTAGCAGTTGTCTG
TGTTGTGTGTTTTACATCAAGTGCTCTGGTAGATCTTCTTACAGATATTCCTCTTTCCTATAATTGGCGCTTCATGAGAACAAATGGAGTAAATGCCCTAGTCCTTTTGA
TTTTGTACTTCTGCATGGGTTCTTTGATTCCATCAGCCTTTTTATTGTGGATTATGAGAGAGCTGCCACCTCCTAAAAAAATTCAGAGACAAGAAGAATCGAGGGCAATA
GCTTTTATAAGCCATGGGGCAGCTGATGCAAATCCTCAGCGTTGGGCTGATGTAACTCGTTCAAAGAATCAGGTTACCAGAGCAAGCCCCATATAA
mRNA sequenceShow/hide mRNA sequence
CAGAAGCGAAGGGAGATTTACGATGGCATTATAAAACAGAGCCGAGAGGAATGCCTAATGCGTAATGTATTTATCTGTTTTTCACGTGGCGGTTTCTTTTAATTACAATT
ATTAAAAAATCGGAAGTCTCTCACGCACAGCGTCGCCAGCGACAAGACTTGCAGAAAAGCAAATCAGCGACACCTCTCTCTCTCTCACTCCTCTTCCTCCTCCTCTTATC
TAATCCGTACGCTCGATTGGTACCTCCATTTCCCCCGCCGATGCAGCACGCGCCGCCGTCACCAATCGCTTTCTCTTCTCTCCATACGATCTGGATCAGAGGTTGATTCT
CCCTCTGTTTTTGTCTTTTTCTTTTCTTTTTGTGTTCAAGACACCGACGGGAACAACATGGTGCTGCTTCAATCACTTGCCGCCAACACTGCTTGTGTTCCCCTGGACCT
TGTCATTCTCGATGCAGCTATGGCTTCCTTCAACGGCGTTCTAGCTTTCTTAGCCTTTTCGCAGCTCATCAGAATTCACATGCGGGGTCAACAGGATGGATGGACGCGTC
AAAAAGCAGTTCATCTGATGATAGGCTCTTCTAACTTGGGCTATATGATTTATTTCATATTTGCACTTGTTGCTACTTTTGAACTCTGGAATTGCGGGTCTCATGTTTTT
GGATTTGTCCTCATGGCTTTCCCTAAAATACTGTTTCTAGCAGCTTTTCTCCTACTTCTTTCTTTCTGGGTCGACCTTTGCCATCAGGCAAACGATGAAGAGGATGACGA
CGACGATGAAGAAAATAGCACTCAACAGGCCTTGTTGGAAAATTCAAAGAACAAACCTGGTTCATCAAATGTAGATGGCCATCGAAGATGTTGCGGATTTCCTGCTACTC
ATCTTGGAAGTAGGCAAAAAATTGTAATTGTGGTTGTCATACTGGTATTTATCCTCATGGTGGCAGTTTCTGTTCTGATCTGGATTGGGGCAGGACAAAATCCTATTGAT
TCTACAGCTGTTGCCAGGGTGTATGAAGACTTTCTTGCTGTTACAGTCCTCCTATCAGGAGGTGCCCTAGGTTTCTATGGTTTCATGTTATTTTATAGATTGATAAAAGT
ACGTTCTGAGGAAGCCTCTTCCGAGATGAAGAAGGTTGGTGGGCTAGCAGTTGTCTGTGTTGTGTGTTTTACATCAAGTGCTCTGGTAGATCTTCTTACAGATATTCCTC
TTTCCTATAATTGGCGCTTCATGAGAACAAATGGAGTAAATGCCCTAGTCCTTTTGATTTTGTACTTCTGCATGGGTTCTTTGATTCCATCAGCCTTTTTATTGTGGATT
ATGAGAGAGCTGCCACCTCCTAAAAAAATTCAGAGACAAGAAGAATCGAGGGCAATAGCTTTTATAAGCCATGGGGCAGCTGATGCAAATCCTCAGCGTTGGGCTGATGT
AACTCGTTCAAAGAATCAGGTTACCAGAGCAAGCCCCATATAATGAGACATTTAAGGACATACCACTGATTAAGGCTGCTTATCTTATGTTACATGAAAGAAATTATGAC
CAGAGATACAGAACCTAACATTTGGGACGAGTGTTAAGAGTTTGACCATGCTGCGAGCGACGCCTGTATATTTCAACAAAAAGTTGATGACTGACATCAAATGTGAGAGT
AATCAAAACTGCAGCTGGATACACAGAAGCTTTTATATGTTTTTCCTTTCCCCTCTTCTTTTCCCTTTTCCTTTGCACATAGTAGCCAAGGATTGCACATTTTCTATGAG
GGAAAAATGTTGTACAATTTTTGCTTCCCTAGAAGCTTGAAAGTTTTGCAGATTATGTTGTAAAATTACCCTTCTGCACCGCAACCTTGTTCTGATGGGTACTCATATTA
TGATGGTTGAGTATGTTAAAGCCTAGGGTATATACCAGTAACCAGCAATTTCTAACGAGTGGAGCAATTTATTTTACACTTACGTGAATACTCTCTCTTTCTCTCTGCGT
CCTAAAAATGCC
Protein sequenceShow/hide protein sequence
MVLLQSLAANTACVPLDLVILDAAMASFNGVLAFLAFSQLIRIHMRGQQDGWTRQKAVHLMIGSSNLGYMIYFIFALVATFELWNCGSHVFGFVLMAFPKILFLAAFLLL
LSFWVDLCHQANDEEDDDDDEENSTQQALLENSKNKPGSSNVDGHRRCCGFPATHLGSRQKIVIVVVILVFILMVAVSVLIWIGAGQNPIDSTAVARVYEDFLAVTVLLS
GGALGFYGFMLFYRLIKVRSEEASSEMKKVGGLAVVCVVCFTSSALVDLLTDIPLSYNWRFMRTNGVNALVLLILYFCMGSLIPSAFLLWIMRELPPPKKIQRQEESRAI
AFISHGAADANPQRWADVTRSKNQVTRASPI