; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015666 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015666
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionvacuolar protein-sorting-associated protein 33 homolog
Genome locationChr02:28734579..28753847
RNA-Seq ExpressionHG10015666
SyntenyHG10015666
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006886 - intracellular protein transport (biological process)
GO:0009116 - nucleoside metabolic process (biological process)
GO:0016192 - vesicle-mediated transport (biological process)
GO:0005773 - vacuole (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0033263 - CORVET complex (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR043154 - Sec1-like, domain 1
IPR043127 - Sec1-like, domain 3a
IPR039417 - Papain-like cysteine endopeptidase
IPR038765 - Papain-like cysteine peptidase superfamily
IPR036045 - Sec1-like superfamily
IPR027482 - Sec1-like, domain 2
IPR025661 - Cysteine peptidase, asparagine active site
IPR025660 - Cysteine peptidase, histidine active site
IPR016621 - Uncharacterised conserved protein UCP014543
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR001619 - Sec1-like protein
IPR000668 - Peptidase C1A, papain C-terminal
IPR000169 - Cysteine peptidase, cysteine active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142391.1 vacuolar protein-sorting-associated protein 33 homolog isoform X4 [Cucumis sativus]1.5e-27986.17Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHL+TFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPK+HFDYL                                                   +PTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
Subjt:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVF+GGVTFAEISALRFLS QEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

XP_008446838.1 PREDICTED: vacuolar protein-sorting-associated protein 33 homolog isoform X1 [Cucumis melo]1.8e-28086.39Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND-----------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV
        ADILNHLQTEEPVNSND                 VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV
Subjt:  ADILNHLQTEEPVNSND-----------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV

Query:  VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLL
        VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLL
Subjt:  VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLL

Query:  ILLSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPI
        ILLSVTNSGLPKKHFDY                                                   ++PTDIAYVFSGYAPLSIRLVQQAVRSGWRPI
Subjt:  ILLSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPI

Query:  EEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        EEILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  EEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

XP_008446839.1 PREDICTED: vacuolar protein-sorting-associated protein 33 homolog isoform X2 [Cucumis melo]1.1e-28086.68Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPKKHFDY                                                   ++PTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
Subjt:  LSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

XP_031741424.1 vacuolar protein-sorting-associated protein 33 homolog isoform X3 [Cucumis sativus]2.6e-27985.88Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND-----------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV
        ADILNHLQTEEPVNSND                 VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV
Subjt:  ADILNHLQTEEPVNSND-----------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV

Query:  VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLL
        VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHL+TFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLL
Subjt:  VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLL

Query:  ILLSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPI
        ILLSVTNSGLPK+HFDYL                                                   +PTDIAYVFSGYAPLSIRLVQQAVRSGWRPI
Subjt:  ILLSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPI

Query:  EEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        EEILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVF+GGVTFAEISALRFLS QEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  EEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

XP_038891944.1 vacuolar protein-sorting-associated protein 33 homolog [Benincasa hispida]7.4e-28286.85Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKK+KVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSK SFLGQLDMEHTIIEAESYD+CFEYIEELIHKQEPLVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPK+HFDYL                                                   +PTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
Subjt:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRGGFLSSSSYD+LQGAS SNDKVTDGRRTVVLVVF+GGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

TrEMBL top hitse value%identityAlignment
A0A0A0KWR0 Uncharacterized protein7.5e-28086.17Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHL+TFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPK+HFDYL                                                   +PTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
Subjt:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVF+GGVTFAEISALRFLS QEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

A0A1S3BFI5 vacuolar protein-sorting-associated protein 33 homolog isoform X25.2e-28186.68Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPKKHFDY                                                   ++PTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
Subjt:  LSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

A0A1S3BGU5 vacuolar protein-sorting-associated protein 33 homolog isoform X18.8e-28186.39Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND-----------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV
        ADILNHLQTEEPVNSND                 VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV
Subjt:  ADILNHLQTEEPVNSND-----------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVV

Query:  VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLL
        VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLL
Subjt:  VQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLL

Query:  ILLSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPI
        ILLSVTNSGLPKKHFDY                                                   ++PTDIAYVFSGYAPLSIRLVQQAVRSGWRPI
Subjt:  ILLSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPI

Query:  EEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        EEILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  EEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

A0A5D3CBN6 Vacuolar protein-sorting-associated protein 33-like protein isoform X25.2e-28186.68Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTS+LKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQ+DI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKG+ASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQE LVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPKKHFDY                                                   ++PTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
Subjt:  LSVTNSGLPKKHFDY---------------------------------------------------LSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRG FLSSSSYDSLQGAS SNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTETF+EK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

A0A6J1HQW9 vacuolar protein-sorting-associated protein 33 homolog1.7e-27986Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICS+IQND 
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        SKGLQREYFVYF PRRTVVCE+VLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILNHLQTEEPVNSND               VDMVTPMCSQLTYEGLVDEFL VNNG+VELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        ILRQKAM+MK+DYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYD+CFEYIEELIHKQEPLVKVLRLLIL
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
        LSVTNSGLPKK FDYL                                                   +PTDIAYVFSGYAPLSIRLVQQAVRSGWRP+EE
Subjt:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVF+GGVTFAEISALRFLSGQEGMAYEL+VGTTKIVSGN+LTE+FMEK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

SwissProt top hitse value%identityAlignment
P25804 Cysteine proteinase 15A3.4e-10451.87Show/hide
Query:  LTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSE
        L   F  A +   ++  T    D   +RQV D E  + L A  E  F  F  K+ K Y T++E+ +R G+F  NLI+A  HQ  DPTA HG+T+FSDL+ 
Subjt:  LTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSE

Query:  EEFERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDK
         EF R F+G++      +L+    A          LPE FDWREKGAVT +K QG+CGSCWAFST GA+EGA+++ATGKL+SLSEQQLVDCDH CDP   
Subjt:  EEFERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDK

Query:  TACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRF
         +C++GCNGGLM NA++YL++SGG+ +E  Y YTGR G C F   K+   VSNF+ + +DEDQIAA+LV+ GPLAV +NA +MQTY+ GVSCP +C K  
Subjt:  TACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRF

Query:  VNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG
        ++HGVL+VG+G   ++ +R ++ PYWIIKNSWG+ WGE GYY++CRG
Subjt:  VNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG

P43295 Probable cysteine protease RD19B4.7e-10654.15Show/hide
Query:  LTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSE
        L   FS++L+   +S S     D   +RQV D      L   SE  F +F +K+GK+Y + +E+ +R  +F  NL+RA  HQ +DP+A HGVTQFSDL+ 
Subjt:  LTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSE

Query:  EEFERMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTD
         EF R  +GV+   GG KL ++ NQA  +     + LPE FDWR++GAVT +K QG+CGSCW+FST GA+EGA+F+ATGKL+SLSEQQLVDCDH CDP +
Subjt:  EEFERMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTD

Query:  KTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGR-RGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGK
        + +C++GCNGGLM +A++Y +K+GGL  E  YPYTG   G C     KI   VSNF+ + I+EDQIAA+L++ GPLAV +NA +MQTYIGGVSCP IC +
Subjt:  KTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGR-RGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGK

Query:  RFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG
        R +NHGVL+VGYG  GFS  R ++ PYWIIKNSWGE WGE G+Y++C+G
Subjt:  RFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG

P43296 Cysteine protease RD19A3.6e-10655.07Show/hide
Query:  FSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFE
        F L+  I  +SSS     D   +RQV  G     L   SE  F +F  K+GK+Y + +E+ +R  +F  NL RA  HQ LDP+A HGVTQFSDL+  EF 
Subjt:  FSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFE

Query:  RMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDKTAC
        +  +GVR    G KL ++ N+A  +     E LPE FDWR+ GAVT +K QG+CGSCW+FS  GA+EGANF+ATGKL+SLSEQQLVDCDH CDP +  +C
Subjt:  RMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDKTAC

Query:  NNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGE-CTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRFVN
        ++GCNGGLM +A++Y +K+GGL +E  YPYTG+ G+ C     KI   VSNF+ I IDE+QIAA+LV+ GPLAV +NA +MQTYIGGVSCP IC +R +N
Subjt:  NNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGE-CTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRFVN

Query:  HGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG
        HGVL+VGYG  G++  RF++ PYWIIKNSWGE WGE G+Y++C+G
Subjt:  HGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG

Q8VYS0 Probable cysteine protease RD19D4.1e-13466.02Show/hide
Query:  MARAESLLLTCAFSLALLICEISSSTALRRDSEFLRQVT-DGEII--NNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAV
        +A+A + L+TC     +L C + +S     +   +RQVT D   I  N L   +E KF +FM  YGK Y TR+EY+HRLGIFAKN+++AAEHQ +DP+AV
Subjt:  MARAESLLLTCAFSLALLICEISSSTALRRDSEFLRQVT-DGEII--NNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAV

Query:  HGVTQFSDLSEEEFERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLV
        HGVTQFSDL+EEEF+RM+ GV    GG++   +   A M   EV+GLPE FDWREKG VT +K QG CGSCWAFST GA EGA+F++TGKLLSLSEQQLV
Subjt:  HGVTQFSDLSEEEFERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLV

Query:  DCDHTCDPTDKTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGG
        DCD  CDP DK AC+NGC GGLMTNAY+YL+++GGLEEE SYPYTG+RG C F  +K+AVRV NFTTIP+DE+QIAA+LVR GPLAVGLNAVFMQTYIGG
Subjt:  DCDHTCDPTDKTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGG

Query:  VSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRGH
        VSCPLIC KR VNHGVL+VGYG +GFSILR    PYWIIKNSWG++WGE GYY+LCRGH
Subjt:  VSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRGH

Q94KJ7 Vacuolar protein-sorting-associated protein 33 homolog3.4e-22165.43Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IP+L+NAPLNL+++R++S++EL+N+LK++RG KCLV+DPKL GS+SLII TS LKE G ELRHL+++P+QT+C KVVYLVR+Q+  M+FI S+IQNDI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        +K +QR+Y+VYF PRR+V CE++LE+EKVH+L+T+ E+PLY++PLDED++SFEL+ S K+ LVDGD SSLWHIAKAIH+LEFSFG I  +RAKGKASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILN +Q EEPVNSND               VDMVTPMCSQLTYEGL+DE LH++NG+VE+DSS+MGAQQ+GKK+KVPLNSSDKL+KETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        +LRQKAM MK+DY E++ +TQ+VSELKDFVKKLNSLPEMTRHI+LAQHL+TFTSK SF  QLDME T++EAE+YD+C+EYIEE+IHKQEPL  VLRLL+L
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
         SVTNSGLPKK FDY+                                                    P DIAYV+SGYAPLSIRL+QQA+ SGWRP+E+
Subjt:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPH ETKR GF SS S DSL GAS     V DGRR++VLVVF+GGVTFAEISALR+L+ +EGMAY+L+V TTKIV+G TL ETFMEK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

Arabidopsis top hitse value%identityAlignment
AT2G21430.1 Papain family cysteine protease3.4e-10754.15Show/hide
Query:  LTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSE
        L   FS++L+   +S S     D   +RQV D      L   SE  F +F +K+GK+Y + +E+ +R  +F  NL+RA  HQ +DP+A HGVTQFSDL+ 
Subjt:  LTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSE

Query:  EEFERMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTD
         EF R  +GV+   GG KL ++ NQA  +     + LPE FDWR++GAVT +K QG+CGSCW+FST GA+EGA+F+ATGKL+SLSEQQLVDCDH CDP +
Subjt:  EEFERMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTD

Query:  KTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGR-RGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGK
        + +C++GCNGGLM +A++Y +K+GGL  E  YPYTG   G C     KI   VSNF+ + I+EDQIAA+L++ GPLAV +NA +MQTYIGGVSCP IC +
Subjt:  KTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGR-RGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGK

Query:  RFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG
        R +NHGVL+VGYG  GFS  R ++ PYWIIKNSWGE WGE G+Y++C+G
Subjt:  RFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG

AT3G54860.1 Sec1/munc18-like (SM) proteins superfamily2.4e-22265.43Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IP+L+NAPLNL+++R++S++EL+N+LK++RG KCLV+DPKL GS+SLII TS LKE G ELRHL+++P+QT+C KVVYLVR+Q+  M+FI S+IQNDI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        +K +QR+Y+VYF PRR+V CE++LE+EKVH+L+T+ E+PLY++PLDED++SFEL+ S K+ LVDGD SSLWHIAKAIH+LEFSFG I  +RAKGKASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ
        ADILN +Q EEPVNSND               VDMVTPMCSQLTYEGL+DE LH++NG+VE+DSS+MGAQQ+GKK+KVPLNSSDKL+KETRDLNFEVVVQ
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSSDKLYKETRDLNFEVVVQ

Query:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL
        +LRQKAM MK+DY E++ +TQ+VSELKDFVKKLNSLPEMTRHI+LAQHL+TFTSK SF  QLDME T++EAE+YD+C+EYIEE+IHKQEPL  VLRLL+L
Subjt:  ILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHKQEPLVKVLRLLIL

Query:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE
         SVTNSGLPKK FDY+                                                    P DIAYV+SGYAPLSIRL+QQA+ SGWRP+E+
Subjt:  LSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEE

Query:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK
        ILKLLPGPH ETKR GF SS S DSL GAS     V DGRR++VLVVF+GGVTFAEISALR+L+ +EGMAY+L+V TTKIV+G TL ETFMEK
Subjt:  ILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK

AT3G54860.2 Sec1/munc18-like (SM) proteins superfamily3.3e-21963.71Show/hide
Query:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI
        A IP+L+NAPLNL+++R++S++EL+N+LK++RG KCLV+DPKL GS+SLII TS LKE G ELRHL+++P+QT+C KVVYLVR+Q+  M+FI S+IQNDI
Subjt:  AMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPKLGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDI

Query:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV
        +K +QR+Y+VYF PRR+V CE++LE+EKVH+L+T+ E+PLY++PLDED++SFEL+ S K+ LVDGD SSLWHIAKAIH+LEFSFG I  +RAKGKASVRV
Subjt:  SKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSFELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRV

Query:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSS----------------D
        ADILN +Q EEPVNSND               VDMVTPMCSQLTYEGL+DE LH++NG+VE+DSS+MGAQQ+GKK+KVPLNSS                D
Subjt:  ADILNHLQTEEPVNSND---------------VDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIKVPLNSS----------------D

Query:  KLYKETRDLNFEVVVQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEEL
        KL+KETRDLNFEVVVQ+LRQKAM MK+DY E++ +TQ+VSELKDFVKKLNSLPEMTRHI+LAQHL+TFTSK SF  QLDME T++EAE+YD+C+EYIEE+
Subjt:  KLYKETRDLNFEVVVQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEEL

Query:  IHKQEPLVKVLRLLILLSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSI
        IHKQEPL  VLRLL+L SVTNSGLPKK FDY+                                                    P DIAYV+SGYAPLSI
Subjt:  IHKQEPLVKVLRLLILLSVTNSGLPKKHFDYL---------------------------------------------------SPTDIAYVFSGYAPLSI

Query:  RLVQQAVRSGWRPIEEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGN
        RL+QQA+ SGWRP+E+ILKLLPGPH ETKR GF SS S DSL GAS     V DGRR++VLVVF+GGVTFAEISALR+L+ +EGMAY+L+V TTKIV+G 
Subjt:  RLVQQAVRSGWRPIEEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVFVGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGN

Query:  TLTETFMEK
        TL ETFMEK
Subjt:  TLTETFMEK

AT3G54940.2 Papain family cysteine protease2.9e-13566.02Show/hide
Query:  MARAESLLLTCAFSLALLICEISSSTALRRDSEFLRQVT-DGEII--NNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAV
        +A+A + L+TC     +L C + +S     +   +RQVT D   I  N L   +E KF +FM  YGK Y TR+EY+HRLGIFAKN+++AAEHQ +DP+AV
Subjt:  MARAESLLLTCAFSLALLICEISSSTALRRDSEFLRQVT-DGEII--NNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAV

Query:  HGVTQFSDLSEEEFERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLV
        HGVTQFSDL+EEEF+RM+ GV    GG++   +   A M   EV+GLPE FDWREKG VT +K QG CGSCWAFST GA EGA+F++TGKLLSLSEQQLV
Subjt:  HGVTQFSDLSEEEFERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLV

Query:  DCDHTCDPTDKTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGG
        DCD  CDP DK AC+NGC GGLMTNAY+YL+++GGLEEE SYPYTG+RG C F  +K+AVRV NFTTIP+DE+QIAA+LVR GPLAVGLNAVFMQTYIGG
Subjt:  DCDHTCDPTDKTACNNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGG

Query:  VSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRGH
        VSCPLIC KR VNHGVL+VGYG +GFSILR    PYWIIKNSWG++WGE GYY+LCRGH
Subjt:  VSCPLICGKRFVNHGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRGH

AT4G39090.1 Papain family cysteine protease2.6e-10755.07Show/hide
Query:  FSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFE
        F L+  I  +SSS     D   +RQV  G     L   SE  F +F  K+GK+Y + +E+ +R  +F  NL RA  HQ LDP+A HGVTQFSDL+  EF 
Subjt:  FSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSEEEFE

Query:  RMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDKTAC
        +  +GVR    G KL ++ N+A  +     E LPE FDWR+ GAVT +K QG+CGSCW+FS  GA+EGANF+ATGKL+SLSEQQLVDCDH CDP +  +C
Subjt:  RMFMGVRVGAGGTKL-QEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDKTAC

Query:  NNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGE-CTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRFVN
        ++GCNGGLM +A++Y +K+GGL +E  YPYTG+ G+ C     KI   VSNF+ I IDE+QIAA+LV+ GPLAV +NA +MQTYIGGVSCP IC +R +N
Subjt:  NNGCNGGLMTNAYKYLIKSGGLEEESSYPYTGRRGE-CTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRFVN

Query:  HGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG
        HGVL+VGYG  G++  RF++ PYWIIKNSWGE WGE G+Y++C+G
Subjt:  HGVLMVGYGDEGFSILRFRKLPYWIIKNSWGERWGEGGYYRLCRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGGGCGGAATCCCTCCTTCTAACATGCGCATTTAGCCTCGCATTACTAATCTGCGAAATTTCCTCCTCCACCGCTCTCCGCCGGGACTCGGAATTTCTACGCCA
AGTCACCGACGGCGAGATTATCAACAATCTCCCCGCAGGGAGCGAGCGGAAGTTCGTGATGTTCATGGAGAAGTACGGGAAGATCTATCCGACGAGGAAGGAGTATTTGC
ACCGCCTCGGGATTTTTGCTAAAAATTTGATTAGGGCGGCGGAGCACCAGGCGTTGGATCCGACCGCCGTGCACGGCGTGACGCAGTTCTCGGACTTGTCGGAGGAGGAG
TTTGAGCGGATGTTTATGGGTGTGAGAGTCGGCGCAGGCGGAACGAAGTTGCAGGAGATGAATCAGGCGGCGGCGATGACGGCGGAGGAGGTGGAGGGATTGCCGGAGAG
GTTTGATTGGCGGGAGAAGGGGGCTGTTACCGGGATTAAGATGCAGGGCACGTGCGGATCATGCTGGGCATTCAGTACGTGTGGAGCAGTGGAAGGCGCCAATTTCATAG
CCACGGGAAAGCTTCTTAGCCTCAGCGAGCAACAACTCGTCGATTGCGACCACACGTGCGATCCGACGGACAAAACCGCCTGCAACAACGGCTGCAACGGCGGCCTAATG
ACCAACGCCTACAAATACCTAATCAAATCCGGCGGCCTCGAGGAGGAATCTTCATACCCCTACACCGGCCGCCGTGGTGAATGCACCTTCCAATCCGACAAAATCGCAGT
TAGGGTTTCTAATTTCACCACTATCCCCATCGACGAGGATCAGATCGCCGCCCACCTGGTCCGCGGCGGCCCGCTCGCCGTCGGCCTCAACGCCGTCTTCATGCAGACCT
ACATCGGTGGCGTATCCTGTCCGTTGATATGTGGCAAGAGATTCGTCAACCATGGCGTGCTTATGGTCGGGTACGGCGATGAAGGGTTCTCGATTCTGCGGTTTCGGAAA
TTGCCATACTGGATTATTAAAAATTCTTGGGGAGAACGGTGGGGCGAAGGGGGTTATTATCGGCTGTGCCGCGGCCATGCCACCTTTTTCCATTTCATCAATGGCGTGCC
TTCCAAAATTCCACCACTATCCGTTTCTCCTACAAGTAATCCCCTCAAGTTGAAGCTCCCACGAATTTCCAGAGCGTCTTCTGAAGGAGCTCCCAATGATTTAATCGAGG
ACTCGAAGTTTGTTCCCTTGAATGCTGAAGATCCCAGATATGGTCCACCTGTATTGCTGTTGCTGGGCTTTGAACTGGAGGAGGCAGTGAAGATTCAAGAGCTTTTGAAA
GATTTGGACGGCGAATTCATGCAGATAGCCAGATCATTGCCACGAATCTGTCTCTTGTCCGGTCTTAGCGGAGAGGAAATGATGATGTTCATAGATGCTTTTCCAGAAAC
TGGACTTGAACCATCTGTATTTGCTGCTCTTGTTCCCAACGCTGCTAATAAACCAGTAGAAGAGTTAATAGAAGAGATCATGGGGGACCATGAAGCGATGATTCCAAATT
TGGATAATGCTCCTCTGAATCTCCGAGCGCTCAGGGAACAATCTCAGAAGGAGCTCATAAACATCCTGAAAAATATTCGAGGCAGGAAATGTTTAGTCGTTGATCCTAAG
CTGGGAGGTTCTCTATCCTTAATCATCCAAACGTCAATACTTAAGGAACATGGAGCTGAATTGCGACATCTTTCATCTGATCCAATTCAAACTGATTGCAATAAGGTGGT
TTATCTTGTTCGCGCTCAGATGGATTTGATGCGATTCATATGTTCCAATATTCAAAATGACATTTCGAAAGGACTTCAAAGAGAATATTTTGTTTATTTTGCCCCTCGCC
GTACAGTGGTTTGTGAGAGGGTTCTGGAGGAGGAAAAGGTCCACCACCTATTGACTATCGGGGAGTATCCTTTATACGTAATTCCATTGGATGAGGATATTCTGTCCTTT
GAGCTTGATCGTTCGAACAAAGAATACCTTGTTGATGGTGATACTAGCTCACTCTGGCATATTGCAAAGGCAATTCACAAGCTTGAGTTTTCCTTTGGGGCAATACCAAA
TGTGAGGGCCAAAGGGAAAGCATCAGTACGTGTTGCTGACATTCTAAATCATTTGCAAACAGAGGAACCTGTTAACTCAAACGATGTGGACATGGTCACTCCTATGTGTT
CTCAGTTGACATATGAAGGGCTGGTTGATGAGTTTTTGCATGTGAATAATGGTTCTGTGGAGCTTGATTCATCAATCATGGGTGCTCAGCAAGATGGAAAAAAGATCAAG
GTTCCACTTAATTCAAGTGACAAGCTGTATAAGGAGACGAGGGATCTCAACTTCGAAGTAGTTGTCCAGATTCTACGTCAAAAAGCTATGAACATGAAGCAGGACTATGC
AGAGATGTCAACTACTACACAGTCAGTTTCTGAGTTGAAAGACTTTGTTAAAAAGCTTAATTCGTTGCCAGAAATGACAAGGCACATTAACTTGGCTCAACACTTGTCGA
CGTTCACATCAAAGCCATCCTTTCTTGGGCAGCTTGACATGGAACACACAATTATTGAAGCTGAAAGCTATGACGTATGTTTTGAGTACATTGAAGAACTGATCCATAAG
CAGGAGCCCCTTGTTAAAGTCCTCCGTCTTCTCATCTTGCTTTCTGTTACAAATTCTGGTTTACCTAAAAAGCATTTTGACTACTTGAGCCCGACTGATATTGCCTATGT
CTTTTCTGGATATGCACCTCTTAGCATTCGTCTTGTCCAACAAGCTGTAAGATCTGGATGGCGTCCCATTGAAGAAATTTTGAAGTTATTACCTGGGCCTCATTCAGAAA
CAAAGAGAGGTGGATTCCTTAGCAGTTCATCCTATGATTCATTGCAAGGGGCTTCAACCAGTAATGACAAAGTGACTGATGGAAGGCGCACAGTTGTACTTGTTGTTTTT
GTTGGAGGAGTAACATTTGCTGAGATTTCTGCTCTTCGATTTCTGAGTGGTCAGGAAGGAATGGCCTACGAGTTGTTAGTTGGTACCACAAAGATTGTTAGTGGCAATAC
CTTGACTGAAACATTCATGGAGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGGGCGGAATCCCTCCTTCTAACATGCGCATTTAGCCTCGCATTACTAATCTGCGAAATTTCCTCCTCCACCGCTCTCCGCCGGGACTCGGAATTTCTACGCCA
AGTCACCGACGGCGAGATTATCAACAATCTCCCCGCAGGGAGCGAGCGGAAGTTCGTGATGTTCATGGAGAAGTACGGGAAGATCTATCCGACGAGGAAGGAGTATTTGC
ACCGCCTCGGGATTTTTGCTAAAAATTTGATTAGGGCGGCGGAGCACCAGGCGTTGGATCCGACCGCCGTGCACGGCGTGACGCAGTTCTCGGACTTGTCGGAGGAGGAG
TTTGAGCGGATGTTTATGGGTGTGAGAGTCGGCGCAGGCGGAACGAAGTTGCAGGAGATGAATCAGGCGGCGGCGATGACGGCGGAGGAGGTGGAGGGATTGCCGGAGAG
GTTTGATTGGCGGGAGAAGGGGGCTGTTACCGGGATTAAGATGCAGGGCACGTGCGGATCATGCTGGGCATTCAGTACGTGTGGAGCAGTGGAAGGCGCCAATTTCATAG
CCACGGGAAAGCTTCTTAGCCTCAGCGAGCAACAACTCGTCGATTGCGACCACACGTGCGATCCGACGGACAAAACCGCCTGCAACAACGGCTGCAACGGCGGCCTAATG
ACCAACGCCTACAAATACCTAATCAAATCCGGCGGCCTCGAGGAGGAATCTTCATACCCCTACACCGGCCGCCGTGGTGAATGCACCTTCCAATCCGACAAAATCGCAGT
TAGGGTTTCTAATTTCACCACTATCCCCATCGACGAGGATCAGATCGCCGCCCACCTGGTCCGCGGCGGCCCGCTCGCCGTCGGCCTCAACGCCGTCTTCATGCAGACCT
ACATCGGTGGCGTATCCTGTCCGTTGATATGTGGCAAGAGATTCGTCAACCATGGCGTGCTTATGGTCGGGTACGGCGATGAAGGGTTCTCGATTCTGCGGTTTCGGAAA
TTGCCATACTGGATTATTAAAAATTCTTGGGGAGAACGGTGGGGCGAAGGGGGTTATTATCGGCTGTGCCGCGGCCATGCCACCTTTTTCCATTTCATCAATGGCGTGCC
TTCCAAAATTCCACCACTATCCGTTTCTCCTACAAGTAATCCCCTCAAGTTGAAGCTCCCACGAATTTCCAGAGCGTCTTCTGAAGGAGCTCCCAATGATTTAATCGAGG
ACTCGAAGTTTGTTCCCTTGAATGCTGAAGATCCCAGATATGGTCCACCTGTATTGCTGTTGCTGGGCTTTGAACTGGAGGAGGCAGTGAAGATTCAAGAGCTTTTGAAA
GATTTGGACGGCGAATTCATGCAGATAGCCAGATCATTGCCACGAATCTGTCTCTTGTCCGGTCTTAGCGGAGAGGAAATGATGATGTTCATAGATGCTTTTCCAGAAAC
TGGACTTGAACCATCTGTATTTGCTGCTCTTGTTCCCAACGCTGCTAATAAACCAGTAGAAGAGTTAATAGAAGAGATCATGGGGGACCATGAAGCGATGATTCCAAATT
TGGATAATGCTCCTCTGAATCTCCGAGCGCTCAGGGAACAATCTCAGAAGGAGCTCATAAACATCCTGAAAAATATTCGAGGCAGGAAATGTTTAGTCGTTGATCCTAAG
CTGGGAGGTTCTCTATCCTTAATCATCCAAACGTCAATACTTAAGGAACATGGAGCTGAATTGCGACATCTTTCATCTGATCCAATTCAAACTGATTGCAATAAGGTGGT
TTATCTTGTTCGCGCTCAGATGGATTTGATGCGATTCATATGTTCCAATATTCAAAATGACATTTCGAAAGGACTTCAAAGAGAATATTTTGTTTATTTTGCCCCTCGCC
GTACAGTGGTTTGTGAGAGGGTTCTGGAGGAGGAAAAGGTCCACCACCTATTGACTATCGGGGAGTATCCTTTATACGTAATTCCATTGGATGAGGATATTCTGTCCTTT
GAGCTTGATCGTTCGAACAAAGAATACCTTGTTGATGGTGATACTAGCTCACTCTGGCATATTGCAAAGGCAATTCACAAGCTTGAGTTTTCCTTTGGGGCAATACCAAA
TGTGAGGGCCAAAGGGAAAGCATCAGTACGTGTTGCTGACATTCTAAATCATTTGCAAACAGAGGAACCTGTTAACTCAAACGATGTGGACATGGTCACTCCTATGTGTT
CTCAGTTGACATATGAAGGGCTGGTTGATGAGTTTTTGCATGTGAATAATGGTTCTGTGGAGCTTGATTCATCAATCATGGGTGCTCAGCAAGATGGAAAAAAGATCAAG
GTTCCACTTAATTCAAGTGACAAGCTGTATAAGGAGACGAGGGATCTCAACTTCGAAGTAGTTGTCCAGATTCTACGTCAAAAAGCTATGAACATGAAGCAGGACTATGC
AGAGATGTCAACTACTACACAGTCAGTTTCTGAGTTGAAAGACTTTGTTAAAAAGCTTAATTCGTTGCCAGAAATGACAAGGCACATTAACTTGGCTCAACACTTGTCGA
CGTTCACATCAAAGCCATCCTTTCTTGGGCAGCTTGACATGGAACACACAATTATTGAAGCTGAAAGCTATGACGTATGTTTTGAGTACATTGAAGAACTGATCCATAAG
CAGGAGCCCCTTGTTAAAGTCCTCCGTCTTCTCATCTTGCTTTCTGTTACAAATTCTGGTTTACCTAAAAAGCATTTTGACTACTTGAGCCCGACTGATATTGCCTATGT
CTTTTCTGGATATGCACCTCTTAGCATTCGTCTTGTCCAACAAGCTGTAAGATCTGGATGGCGTCCCATTGAAGAAATTTTGAAGTTATTACCTGGGCCTCATTCAGAAA
CAAAGAGAGGTGGATTCCTTAGCAGTTCATCCTATGATTCATTGCAAGGGGCTTCAACCAGTAATGACAAAGTGACTGATGGAAGGCGCACAGTTGTACTTGTTGTTTTT
GTTGGAGGAGTAACATTTGCTGAGATTTCTGCTCTTCGATTTCTGAGTGGTCAGGAAGGAATGGCCTACGAGTTGTTAGTTGGTACCACAAAGATTGTTAGTGGCAATAC
CTTGACTGAAACATTCATGGAGAAGTGA
Protein sequenceShow/hide protein sequence
MARAESLLLTCAFSLALLICEISSSTALRRDSEFLRQVTDGEIINNLPAGSERKFVMFMEKYGKIYPTRKEYLHRLGIFAKNLIRAAEHQALDPTAVHGVTQFSDLSEEE
FERMFMGVRVGAGGTKLQEMNQAAAMTAEEVEGLPERFDWREKGAVTGIKMQGTCGSCWAFSTCGAVEGANFIATGKLLSLSEQQLVDCDHTCDPTDKTACNNGCNGGLM
TNAYKYLIKSGGLEEESSYPYTGRRGECTFQSDKIAVRVSNFTTIPIDEDQIAAHLVRGGPLAVGLNAVFMQTYIGGVSCPLICGKRFVNHGVLMVGYGDEGFSILRFRK
LPYWIIKNSWGERWGEGGYYRLCRGHATFFHFINGVPSKIPPLSVSPTSNPLKLKLPRISRASSEGAPNDLIEDSKFVPLNAEDPRYGPPVLLLLGFELEEAVKIQELLK
DLDGEFMQIARSLPRICLLSGLSGEEMMMFIDAFPETGLEPSVFAALVPNAANKPVEELIEEIMGDHEAMIPNLDNAPLNLRALREQSQKELINILKNIRGRKCLVVDPK
LGGSLSLIIQTSILKEHGAELRHLSSDPIQTDCNKVVYLVRAQMDLMRFICSNIQNDISKGLQREYFVYFAPRRTVVCERVLEEEKVHHLLTIGEYPLYVIPLDEDILSF
ELDRSNKEYLVDGDTSSLWHIAKAIHKLEFSFGAIPNVRAKGKASVRVADILNHLQTEEPVNSNDVDMVTPMCSQLTYEGLVDEFLHVNNGSVELDSSIMGAQQDGKKIK
VPLNSSDKLYKETRDLNFEVVVQILRQKAMNMKQDYAEMSTTTQSVSELKDFVKKLNSLPEMTRHINLAQHLSTFTSKPSFLGQLDMEHTIIEAESYDVCFEYIEELIHK
QEPLVKVLRLLILLSVTNSGLPKKHFDYLSPTDIAYVFSGYAPLSIRLVQQAVRSGWRPIEEILKLLPGPHSETKRGGFLSSSSYDSLQGASTSNDKVTDGRRTVVLVVF
VGGVTFAEISALRFLSGQEGMAYELLVGTTKIVSGNTLTETFMEK