; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010632 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010632
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionaspartyl protease family protein 1
Genome locationscaffold35:1102379..1105127
RNA-Seq ExpressionMS010632
SyntenyMS010632
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132517.1 aspartyl protease family protein 1 [Momordica charantia]5.8e-27889.72Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPYAS                   DFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYKDSEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

XP_022926302.1 aspartyl protease family protein 1-like [Cucurbita moschata]3.4e-25481.83Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYY++LALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAP EGSPYAS                   DF+LS+Y+PKESSTSKTVPCNNSLC Q+DQCI AFGNCPY+VSYVSAETST+GILIEDVLHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRI+FGDKGS EQEETPFN+NQLHPTYNITVT ++VGT LID DI ALFDSGTSFTYFTDPIY+KLSESFH QTRD R PPN RIPFEYCY MSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASL P +SLTMKGGS FPV+DPIIVIST+NELIYCLAVVKSAEL+IIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAG+
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GN+SSPGLTKETK S+Q STESEFN  HSSLLTCFRFFIILLFLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

XP_023003873.1 aspartyl protease family protein 1-like [Cucurbita maxima]1.5e-25481.83Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYY++LALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPYAS                   DF+LS+Y+P ESSTSKTVPCNNSLC Q+DQCI AFGNCPY+VSYVSAETST+GILIEDVLHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGS EQEETPFN+NQLHPTYNITVT ++VGT LID DI ALFDSGTSFTYFTDPIY+KLSESFH QTRD R PPN RIPFEYCY MSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASL P +SLTMKGGS FPV+DPIIVIST+NELIYCLAVVKSAEL+IIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GN+SSPGLTKETK S+Q+STESEF+  HSSLLTCFRFFIIL FLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

XP_023517281.1 aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo]2.4e-25581.83Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYY++LALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPYAS                   DF+LS+Y+PKESSTSKTVPCNNSLC Q+DQCI AFGNCPY+VSYVSAETST+GILIED+LHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGS EQEETPFN+NQLHPTYNITVT ++VGT LID DI ALFDSGTSFTYFTDPIY+KLSESFH QTRD R PPN RIPFEYCY MSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASL P +SLTMKGGS FPV+DPIIVIST+NELIYCLAVVKSAEL+IIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GN+SSPGLT+ETK S+Q+STESEFN  HSSLLTCFRFFIIL FLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

XP_038881847.1 aspartyl protease family protein 1 [Benincasa hispida]2.8e-25682.2Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVK WSGVSGKLSLPDSWPAKGSIEYYA+LA RDR+FR +RLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPYAS                   DF+LSVY+PK+SS+SKTVPC+NSLC Q+DQC   FGNCPYVVSYVSAETSTTGILIED+LHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREG MANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGD+GSPEQEETPFN+NQLHPTYNITVT +RVGTTLIDADITALFDSGTSFTYFTDP+Y+KLS SFH+QTRDGR PPN RIPFEYCYNMSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASLTP ISLTMKGGSPFPVYDPIIVIST+NELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGW+KFDCYDIEEQNLFP KPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
         N SSPGLTKET+ S+QISTESEFNSCHSSLL+CFRFFIILLFLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

TrEMBL top hitse value%identityAlignment
A0A1S3B1N6 LOW QUALITY PROTEIN: aspartyl protease family protein 12.8e-24980Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFS+Q+K WSGVS       SWPAKG+IEYYA+LA RDR+FRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PG KFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPY+S                   DF+LSVY+PK+SSTSKTVPCNN LC Q+DQC  AFGNCPYVVSYVSAETSTTGIL+ED+LHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREG MANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGS EQEETPFN+NQLHP YNITVT +RVGTTLIDADITALFDSGTSF+YFTDPIY+KLS SFH+QTRDGR PPN RIPFEYCYNMSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASLTPGISLTMKGGSPFPVYDPIIVIST+NELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGW+KFDCYDIEE+NLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GN+SSPGLTK+ + S+QISTESEF SC+SSLL+CFRFFIILLFLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

A0A5A7SN20 Aspartyl protease family protein 18.6e-23580.75Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFS+Q+K WSGVSGKL+LPDSWPAKG+IEYYA+LA RDR+FRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PG KFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPY+S                   DF+LSVY+PK+SSTSKTVPCNN LC Q+DQC  AFGNCPYVVSYVSAETSTTGIL+ED+LHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREG MANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGS EQEETPFN+NQLHP YNITVT +RVGTTLIDADITALFDSGTSF+YFTDPIY+KLS SFH+QTRDGR PPN RIPFEYCYNMSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASLTPGISLTMKGGSPFPVYDPIIVIST+NELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGW+KFDCYDIEE+NLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNS
          N+
Subjt:  GNNS

A0A6J1BU18 aspartyl protease family protein 12.8e-27889.72Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPYAS                   DFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYKDSEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

A0A6J1EE57 aspartyl protease family protein 1-like1.7e-25481.83Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYY++LALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAP EGSPYAS                   DF+LS+Y+PKESSTSKTVPCNNSLC Q+DQCI AFGNCPY+VSYVSAETST+GILIEDVLHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRI+FGDKGS EQEETPFN+NQLHPTYNITVT ++VGT LID DI ALFDSGTSFTYFTDPIY+KLSESFH QTRD R PPN RIPFEYCY MSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASL P +SLTMKGGS FPV+DPIIVIST+NELIYCLAVVKSAEL+IIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAG+
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GN+SSPGLTKETK S+Q STESEFN  HSSLLTCFRFFIILLFLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

A0A6J1KXV5 aspartyl protease family protein 1-like7.5e-25581.83Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV
        MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYY++LALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLG+PGMKFMVALDTGSDLFWV
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWV

Query:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT
        PCDCSRCAPTEGSPYAS                   DF+LS+Y+P ESSTSKTVPCNNSLC Q+DQCI AFGNCPY+VSYVSAETST+GILIEDVLHLKT
Subjt:  PCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKT

Query:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD
        EYK SEPIQAYITFG                                     CGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF D
Subjt:  EYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD

Query:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA
        DGVGRINFGDKGS EQEETPFN+NQLHPTYNITVT ++VGT LID DI ALFDSGTSFTYFTDPIY+KLSESFH QTRD R PPN RIPFEYCY MSPDA
Subjt:  DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDA

Query:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
        NASL P +SLTMKGGS FPV+DPIIVIST+NELIYCLAVVKSAEL+IIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV
Subjt:  NASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGV

Query:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        GN+SSPGLTKETK S+Q+STESEF+  HSSLLTCFRFFIIL FLL
Subjt:  GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 361.3e-2224.9Show/hide
Query:  VSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWVPCDCSRCAPTEGSP
        VSG      +    G  +  + L   D +   R L+  D PL    G+S  R  S+G L++T ++LGSP  ++ V +DTGSD+ WV      CAP     
Subjt:  VSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWVPCDCSRCAPTEGSP

Query:  YASRHIFKSHASLHPFNCAIMQD--FQLSVYNPKESSTSKTVPCNNSLCE--QQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKTEYKDSEPIQA
                         C +  D    LS+Y+ K SSTSK V C +  C    Q +   A   C Y V Y    TS  G  I+D + L+           
Subjt:  YASRHIFKSHASLHPFNCAIMQD--FQLSVYNPKESSTSKTVPCNNSLCE--QQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKTEYKDSEPIQA

Query:  YITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDV-AAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD-DGVGRINF
                        +TG L          + P     +  CG+ QSG      +A +G+ G G    S+ S L+  G     FS C  + +G G    
Subjt:  YITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDV-AAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD-DGVGRINF

Query:  GDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLID---------ADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPD
        G+  SP  + TP   NQ+H  YN+ +  + V    ID          D   + DSGT+  Y    +Y  L E   ++ +              C++ + +
Subjt:  GDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLID---------ADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPD

Query:  ANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYC-------LAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDC
         + +  P ++L  +      VY    + S   ++ YC       +     A++ ++G   ++   +V+D E  V+GW   +C
Subjt:  ANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYC-------LAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDC

Q8VYV9 Aspartyl protease family protein 11.1e-12545.72Show/hide
Query:  HHRFSEQVKKWSGVSGKLSLP-DSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPL-AFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFW
        HHRFS+QV    GV     LP D  P + S +YY  +A RDR  RGRRL+  D  L  FSDGN + R+ +LGFLHY  V +G+P   FMVALDTGSDLFW
Subjt:  HHRFSEQVKKWSGVSGKLSLP-DSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPL-AFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFW

Query:  VPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLK
        +PCDC+ C     +P  S                      L++Y+P  SSTS  VPCN++LC + D+C     +CPY + Y+S  TS+TG+L+EDVLHL 
Subjt:  VPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLK

Query:  TEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFG
        +  K S+ I A +TFG                                     CGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG  ANSFSMCFG
Subjt:  TEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFG

Query:  DDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGR-RPPNTRIPFEYCYNMSP
        +DG GRI+FGDKGS +Q ETP N+ Q HPTYNITVT++ VG    D +  A+FDSGTSFTY TD  YT +SESF+S   D R +  ++ +PFEYCY +SP
Subjt:  DDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGR-RPPNTRIPFEYCYNMSP

Query:  DANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCY--DIEEQNLFPTKPDVTTVPPAV
        + ++   P ++LTMKGGS +PVY P++VI  ++  +YCLA++K  +++IIGQNFMTGYR+VFDREKL+LGW++ DCY  +   + L   +   +  PPA 
Subjt:  DANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCY--DIEEQNLFPTKPDVTTVPPAV

Query:  AAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
             ++  P   + T   SQ    S  ++ +S  ++   FF  +L +L
Subjt:  AAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

Q9LTW4 Aspartic proteinase NANA, chloroplast8.5e-1422.86Show/hide
Query:  HYTTVQLGSPGMKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGN
        ++T +++G+P  KF V +DTGS+L WV C            Y +R                       V+   ES + KTV C    C+     + +   
Subjt:  HYTTVQLGSPGMKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGN

Query:  CPYVVSYVSAETSTTGILIEDVLHLKTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGL
        CP         T +T            +Y+ ++        GS    V  K  IT  L     + LP        HL  C    +G     A  +G+ GL
Subjt:  CPYVVSYVSAETSTTGILIEDVLHLKTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGL

Query:  GMEQISVPSILSREGFMANSFSMCFGDDGVGR-----INFGDKGSPE---QEETPFNVNQLHPTYNITVTRVRVGTTLIDA-----DITA----LFDSGT
             S  S  +        FS C  D    +     + FG   S +   +  TP ++ ++ P Y I V  + +G  ++D      D T+    + DSGT
Subjt:  GMEQISVPSILSREGFMANSFSMCFGDDGVGR-----INFGDKGSPE---QEETPFNVNQLHPTYNITVTRVRVGTTLIDA-----DITA----LFDSGT

Query:  SFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSA--ELNIIGQNFMT
        S T   D  Y ++         + +R     +P EYC++ +   N S  P ++  +KGG+ F  +    ++      + CL  V +     N+IG     
Subjt:  SFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSA--ELNIIGQNFMT

Query:  GYRIVFDREKLVLGWRKFDC
         Y   FD     L +    C
Subjt:  GYRIVFDREKLVLGWRKFDC

Q9LX20 Aspartic proteinase-like protein 13.5e-6833.93Show/hide
Query:  HRFSEQVKKWSGVSGKL-SLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISS---LGFLHYTTVQLGSPGMKFMVALDTGSDLF
        HRFS++ +     S K  S  DS P K S+EYY  LA  D  FR +R++      +      S  ISS    G+LHYT + +G+P + F+VALDTGS+L 
Subjt:  HRFSEQVKKWSGVSGKL-SLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISS---LGFLHYTTVQLGSPGMKFMVALDTGSDLF

Query:  WVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHL
        W+PC+C +CAP   + Y+S                 +    L+ YNP  SSTSK   C++ LC+    C      CPY V+Y+S  TS++G+L+ED+LHL
Subjt:  WVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHL

Query:  KTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF
           Y  +  +       SV A+V+I                             CG+ QSG +LD  AP+GL GLG  +ISVPS LS+ G M NSFS+CF
Subjt:  KTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF

Query:  GDDGVGRINFGDKGSPEQEETPFNV--NQLHPTYNITVTRVRVGTT-LIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYN
         ++  GRI FGD G   Q+ TPF    N  +  Y + V    +G + L     T   DSG SFTY  + IY K++        +        + +EYCY 
Subjt:  GDDGVGRINFGDKGSPEQEETPFNV--NQLHPTYNITVTRVRVGTT-LIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYN

Query:  MSPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELI-YCLAVVKSAELNI--IGQNFMTGYRIVFDREKLVLGWRKFDCYD--IEEQNLFP---TKP
         S +      P I L     + F ++ P+ V      L+ +CL +  S +  I  IGQN+M GYR+VFDRE + LGW    C +  IE     P   + P
Subjt:  MSPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELI-YCLAVVKSAELNI--IGQNFMTGYRIVFDREKLVLGWRKFDCYD--IEEQNLFP---TKP

Query:  DVTTVPPAVAAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFL
        +        + G G+  SP +  +T   +  S+ S      SS++  F   ++L +L
Subjt:  DVTTVPPAVAAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFL

Q9S9K4 Aspartic proteinase 392.6e-2627.05Show/hide
Query:  RRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWVPC-DCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNP
        R L+  D PL    G+S  R+ S+G L++T ++LGSP  ++ V +DTGSD+ W+ C  C +C PT+ +                       +F+LS+++ 
Subjt:  RRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWVPC-DCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNP

Query:  KESSTSKTVPCNNSLC---EQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKTEYKD--SEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLP
          SSTSK V C++  C    Q D C  A G C Y + Y   E+++ G  I D+L L+    D  + P+   + FG                         
Subjt:  KESSTSKTVPCNNSLC---EQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKTEYKD--SEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLP

Query:  MSLPCPSPHLCSCGQVQSGSFLD-VAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD-DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVR
                    CG  QSG   +  +A +G+ G G    SV S L+  G     FS C  +  G G    G   SP+ + TP   NQ+H  YN+ +  + 
Subjt:  MSLPCPSPHLCSCGQVQSGSFLD-VAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGD-DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVR

Query:  VGTTLIDADIT------ALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFE--YCYNMSPDANASLTPGISLTMKGGSPFPVYDPIIVISTE
        V  T +D   +       + DSGT+  YF   +Y  L E+  +     R+P    I  E   C++ S + + +  P +S   +      VY    + + E
Subjt:  VGTTLIDADIT------ALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFE--YCYNMSPDANASLTPGISLTMKGGSPFPVYDPIIVISTE

Query:  NELIYC-------LAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDC
         EL YC       L   + +E+ ++G   ++   +V+D +  V+GW   +C
Subjt:  NELIYC-------LAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDC

Arabidopsis top hitse value%identityAlignment
AT2G17760.1 Eukaryotic aspartyl protease family protein7.7e-12745.72Show/hide
Query:  HHRFSEQVKKWSGVSGKLSLP-DSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPL-AFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFW
        HHRFS+QV    GV     LP D  P + S +YY  +A RDR  RGRRL+  D  L  FSDGN + R+ +LGFLHY  V +G+P   FMVALDTGSDLFW
Subjt:  HHRFSEQVKKWSGVSGKLSLP-DSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPL-AFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFW

Query:  VPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLK
        +PCDC+ C     +P  S                      L++Y+P  SSTS  VPCN++LC + D+C     +CPY + Y+S  TS+TG+L+EDVLHL 
Subjt:  VPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLK

Query:  TEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFG
        +  K S+ I A +TFG                                     CGQVQ+G F D AAPNGLFGLG+E ISVPS+L++EG  ANSFSMCFG
Subjt:  TEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFG

Query:  DDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGR-RPPNTRIPFEYCYNMSP
        +DG GRI+FGDKGS +Q ETP N+ Q HPTYNITVT++ VG    D +  A+FDSGTSFTY TD  YT +SESF+S   D R +  ++ +PFEYCY +SP
Subjt:  DDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGR-RPPNTRIPFEYCYNMSP

Query:  DANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCY--DIEEQNLFPTKPDVTTVPPAV
        + ++   P ++LTMKGGS +PVY P++VI  ++  +YCLA++K  +++IIGQNFMTGYR+VFDREKL+LGW++ DCY  +   + L   +   +  PPA 
Subjt:  DANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCY--DIEEQNLFPTKPDVTTVPPAV

Query:  AAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
             ++  P   + T   SQ    S  ++ +S  ++   FF  +L +L
Subjt:  AAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL

AT3G51330.1 Eukaryotic aspartyl protease family protein4.2e-10139.77Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRL--SEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLF
        +HH FS++VK+       L L D  P KGS+EY+  LA RDR  RGR L  +  + P+ F  GN +  I  LGFLHY  V +G+P   F+VALDTGSDLF
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRL--SEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLF

Query:  WVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHL
        W+PC+C      +                      + Q   L++Y+P  SSTS ++ C++  C    +C     +CPY + Y+S +T TTG L EDVLHL
Subjt:  WVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHL

Query:  KTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF
         TE +  EP++A IT G                                     CG+ Q+G     AA NGL GLG++  SVPSIL++    ANSFSMCF
Subjt:  KTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF

Query:  GD--DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM
        G+  D VGRI+FGDKG  +Q ETP    +  PTY ++VT V VG   +   + ALFD+GTSFT+  +P Y  ++++F     D RRP +  +PFE+CY++
Subjt:  GD--DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM

Query:  SPDANASLTPGISLTMKGGSPFPVYDPIIVI-STENELIYCLAVVKSAE--LNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVP
        SP+    L P +++T +GGS   + +P+ ++ + +N  +YCL ++KS +  +NIIGQNFM+GYRIVFDRE+++LGW++ DC+  E+++L  T P     P
Subjt:  SPDANASLTPGISLTMKGGSPFPVYDPIIVI-STENELIYCLAVVKSAE--LNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVP

Query:  PAVAAGVGNNSSP
        P   A   + S+P
Subjt:  PAVAAGVGNNSSP

AT3G51350.1 Eukaryotic aspartyl protease family protein4.9e-8936.96Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRL--SEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLF
        +HH FS+ VK+       L L D  P +GS+EY+  LA RDR  RGR L  +  + P+ F  GN +  +  LG L+Y  V +G+P   F+VALDTGSDLF
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRL--SEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLF

Query:  WVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHL
        W+PC+C      +                   +  + Q   L++Y P  S+TS ++ C++  C    +C      CPY +SY S  T T G L++DVLHL
Subjt:  WVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHL

Query:  KTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF
         TE ++  P++A +T G                                     CGQ Q+G F    + NG+ GLG++  SVPS+L++    ANSFSMCF
Subjt:  KTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCF

Query:  GD--DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM
        G     VGRI+FGD+G  +QEETPF        Y + ++ V V    +D  + A FD+G+SFT+  +P Y  L++SF     D RRP +  +PFE+CY++
Subjt:  GD--DGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM

Query:  SPDANASLTPGISLTMKGGSPFPVYDPIIVIST-ENELIYCLAVVKSA--ELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEE-QNLFPTKPDVTTV
        SP+A     P + +T  GGS   + +P     T E  ++YCL V+KS   ++N+IGQNF+ GYRIVFDRE+++LGW++  C++ E  ++  P  P+V   
Subjt:  SPDANASLTPGISLTMKGGSPFPVYDPIIVIST-ENELIYCLAVVKSA--ELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEE-QNLFPTKPDVTTV

Query:  PPAVAA
         P+V+A
Subjt:  PPAVAA

AT3G51360.1 Eukaryotic aspartyl protease family protein2.6e-9037.16Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDG---PLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDL
        +HHRFSEQVK   G  G        P  GS++YY  L  RD   RGR+L+  +     ++F+ GNS+  IS   FLHY  V +G+P   F+VALDTGSDL
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDG---PLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDL

Query:  FWVPCDC-SRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVL
        FW+PC+C S C  +  +    R                    +L++YNP +S +S  V CN++LC  +++CI    +CPY + Y+S  + +TG+L+EDV+
Subjt:  FWVPCDC-SRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVL

Query:  HLKTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSM
        H+ TE  + E   A ITFG                                     C + Q G F +VA  NG+ GL +  I+VP++L + G  ++SFSM
Subjt:  HLKTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSM

Query:  CFGDDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM
        CFG +G G I+FGDKGS +Q ETP +       Y++++T+ +VG   +D + TA FDSGT+ T+  +P YT L+ +FH    D R   +   PFE+CY +
Subjt:  CFGDDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM

Query:  SPDANASLTPGISLTMKGGSPFPVYDPIIVISTENE--LIYCLAVVK--SAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTV
        +  ++    P +S  MKGG+ + V+ PI+V  T +    +YCLAV+K  +A+ +IIGQNFMT YRIV DRE+ +LGW+K +C D    N F T P     
Subjt:  SPDANASLTPGISLTMKGGSPFPVYDPIIVISTENE--LIYCLAVVK--SAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTV

Query:  PPAVAAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIIL
        PP++A      SSP   +    SS++   +   +  S  + CF  F+ L
Subjt:  PPAVAAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIIL

AT4G35880.1 Eukaryotic aspartyl protease family protein2.7e-18059.71Show/hide
Query:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRL----SEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSD
        MHHRFS++VK+WS  +G+ +    +P KGS EY+  L LRD   RGRRL    SE +  L FSDGNS+ RISSLGFLHYTTV+LG+PGM+FMVALDTGSD
Subjt:  MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRL----SEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSD

Query:  LFWVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVL
        LFWVPCDC +CAPTEG+ YAS                   +F+LS+YNPK S+T+K V CNNSLC Q++QC+  F  CPY+VSYVSA+TST+GIL+EDV+
Subjt:  LFWVPCDCSRCAPTEGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVL

Query:  HLKTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSM
        HL TE K+ E ++AY+TFG                                     CGQVQSGSFLD+AAPNGLFGLGME+ISVPS+L+REG +A+SFSM
Subjt:  HLKTEYKDSEPIQAYITFGSVYAKVLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSM

Query:  CFGDDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM
        CFG DGVGRI+FGDKGS +QEETPFN+N  HP YNITVTRVRVGTTLID + TALFD+GTSFTY  DP+YT +SESFHSQ +D R  P++RIPFEYCY+M
Subjt:  CFGDDGVGRINFGDKGSPEQEETPFNVNQLHPTYNITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNM

Query:  SPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVT-TVPPA
        S DANASL P +SLTMKG S F + DPIIVISTE EL+YCLA+VKS+ELNIIGQN+MTGYR+VFDREKLVL W+KFDCYDIEE N      + T  V PA
Subjt:  SPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVVKSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVT-TVPPA

Query:  VAAGV-GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL
        +AAG+  +N+S  L K  +  S+ ++     S    + + FRF  ILL L+
Subjt:  VAAGV-GNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCACCGCTTCTCCGAACAGGTCAAGAAGTGGTCCGGTGTCTCTGGGAAGTTGTCTCTTCCTGATTCTTGGCCGGCCAAGGGAAGCATTGAGTATTATGCGCGACT
TGCCCTCCGCGATCGCTACTTCCGCGGCCGGAGGCTCTCCGAATTTGATGGACCGCTGGCTTTCTCTGATGGGAACTCGAGCTTTCGGATTAGCTCGTTGGGATTTCTGC
ATTACACTACCGTACAATTGGGGTCTCCGGGAATGAAGTTTATGGTGGCACTTGACACTGGAAGTGATCTGTTCTGGGTGCCCTGTGATTGTAGTAGATGCGCGCCCACG
GAGGGCTCTCCTTACGCTTCTAGACATATCTTCAAAAGTCATGCATCCCTTCATCCTTTCAATTGTGCAATCATGCAGGATTTTCAGTTGAGTGTATACAACCCCAAAGA
GTCATCTACTAGCAAAACTGTTCCTTGCAACAATAGTTTGTGTGAACAGCAGGATCAATGCATAATGGCCTTTGGAAATTGTCCTTACGTTGTCTCCTATGTCTCAGCTG
AAACATCAACCACTGGTATATTGATAGAGGATGTTCTACACTTGAAAACTGAATATAAGGATTCGGAACCTATCCAGGCATACATCACGTTTGGGTCAGTATACGCTAAA
GTTTTGATTAAGCACCTCATTACAGGGCGACTCTGTTATGTAAACTGTTCTTATTTGCCCATGTCATTGCCTTGCCCTTCTCCCCATCTGTGCAGCTGCGGGCAGGTGCA
GAGTGGCTCATTTCTTGATGTTGCAGCTCCCAATGGCTTGTTTGGGCTCGGCATGGAGCAGATATCAGTTCCTAGCATATTATCTAGAGAAGGTTTCATGGCAAATTCTT
TTTCTATGTGCTTTGGTGATGATGGAGTCGGAAGGATCAATTTTGGAGACAAGGGTAGTCCAGAGCAGGAAGAGACCCCATTTAATGTGAACCAATTACACCCAACCTAT
AATATCACAGTTACTCGTGTTCGAGTGGGCACAACTCTAATTGATGCAGATATAACTGCTCTTTTTGACTCCGGGACATCTTTTACATACTTCACCGACCCAATCTACAC
CAAGCTTTCCGAAAGTTTCCATTCACAAACAAGAGATGGACGTCGCCCCCCTAATACAAGGATACCTTTTGAATATTGTTATAACATGAGTCCAGATGCAAATGCTTCTC
TGACACCTGGTATTAGTTTAACTATGAAAGGTGGAAGTCCCTTTCCTGTCTATGATCCGATTATTGTCATCTCCACTGAGAATGAACTCATTTATTGCCTGGCCGTGGTC
AAGAGTGCTGAACTGAATATAATTGGACAAAACTTCATGACTGGCTACCGTATTGTATTTGACCGGGAAAAGCTTGTCTTGGGCTGGAGGAAGTTTGATTGTTATGACAT
TGAGGAACAAAATCTCTTTCCAACGAAACCAGATGTTACTACAGTTCCTCCTGCTGTTGCTGCTGGAGTAGGTAATAACTCTAGTCCAGGATTAACAAAAGAGACAAAGT
TTAGTTCTCAAATTTCAACTGAATCAGAATTCAATAGTTGCCATTCTTCTCTTTTGACTTGTTTCAGATTTTTCATCATATTGCTTTTTTTACTA
mRNA sequenceShow/hide mRNA sequence
ATGCACCACCGCTTCTCCGAACAGGTCAAGAAGTGGTCCGGTGTCTCTGGGAAGTTGTCTCTTCCTGATTCTTGGCCGGCCAAGGGAAGCATTGAGTATTATGCGCGACT
TGCCCTCCGCGATCGCTACTTCCGCGGCCGGAGGCTCTCCGAATTTGATGGACCGCTGGCTTTCTCTGATGGGAACTCGAGCTTTCGGATTAGCTCGTTGGGATTTCTGC
ATTACACTACCGTACAATTGGGGTCTCCGGGAATGAAGTTTATGGTGGCACTTGACACTGGAAGTGATCTGTTCTGGGTGCCCTGTGATTGTAGTAGATGCGCGCCCACG
GAGGGCTCTCCTTACGCTTCTAGACATATCTTCAAAAGTCATGCATCCCTTCATCCTTTCAATTGTGCAATCATGCAGGATTTTCAGTTGAGTGTATACAACCCCAAAGA
GTCATCTACTAGCAAAACTGTTCCTTGCAACAATAGTTTGTGTGAACAGCAGGATCAATGCATAATGGCCTTTGGAAATTGTCCTTACGTTGTCTCCTATGTCTCAGCTG
AAACATCAACCACTGGTATATTGATAGAGGATGTTCTACACTTGAAAACTGAATATAAGGATTCGGAACCTATCCAGGCATACATCACGTTTGGGTCAGTATACGCTAAA
GTTTTGATTAAGCACCTCATTACAGGGCGACTCTGTTATGTAAACTGTTCTTATTTGCCCATGTCATTGCCTTGCCCTTCTCCCCATCTGTGCAGCTGCGGGCAGGTGCA
GAGTGGCTCATTTCTTGATGTTGCAGCTCCCAATGGCTTGTTTGGGCTCGGCATGGAGCAGATATCAGTTCCTAGCATATTATCTAGAGAAGGTTTCATGGCAAATTCTT
TTTCTATGTGCTTTGGTGATGATGGAGTCGGAAGGATCAATTTTGGAGACAAGGGTAGTCCAGAGCAGGAAGAGACCCCATTTAATGTGAACCAATTACACCCAACCTAT
AATATCACAGTTACTCGTGTTCGAGTGGGCACAACTCTAATTGATGCAGATATAACTGCTCTTTTTGACTCCGGGACATCTTTTACATACTTCACCGACCCAATCTACAC
CAAGCTTTCCGAAAGTTTCCATTCACAAACAAGAGATGGACGTCGCCCCCCTAATACAAGGATACCTTTTGAATATTGTTATAACATGAGTCCAGATGCAAATGCTTCTC
TGACACCTGGTATTAGTTTAACTATGAAAGGTGGAAGTCCCTTTCCTGTCTATGATCCGATTATTGTCATCTCCACTGAGAATGAACTCATTTATTGCCTGGCCGTGGTC
AAGAGTGCTGAACTGAATATAATTGGACAAAACTTCATGACTGGCTACCGTATTGTATTTGACCGGGAAAAGCTTGTCTTGGGCTGGAGGAAGTTTGATTGTTATGACAT
TGAGGAACAAAATCTCTTTCCAACGAAACCAGATGTTACTACAGTTCCTCCTGCTGTTGCTGCTGGAGTAGGTAATAACTCTAGTCCAGGATTAACAAAAGAGACAAAGT
TTAGTTCTCAAATTTCAACTGAATCAGAATTCAATAGTTGCCATTCTTCTCTTTTGACTTGTTTCAGATTTTTCATCATATTGCTTTTTTTACTA
Protein sequenceShow/hide protein sequence
MHHRFSEQVKKWSGVSGKLSLPDSWPAKGSIEYYARLALRDRYFRGRRLSEFDGPLAFSDGNSSFRISSLGFLHYTTVQLGSPGMKFMVALDTGSDLFWVPCDCSRCAPT
EGSPYASRHIFKSHASLHPFNCAIMQDFQLSVYNPKESSTSKTVPCNNSLCEQQDQCIMAFGNCPYVVSYVSAETSTTGILIEDVLHLKTEYKDSEPIQAYITFGSVYAK
VLIKHLITGRLCYVNCSYLPMSLPCPSPHLCSCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGFMANSFSMCFGDDGVGRINFGDKGSPEQEETPFNVNQLHPTY
NITVTRVRVGTTLIDADITALFDSGTSFTYFTDPIYTKLSESFHSQTRDGRRPPNTRIPFEYCYNMSPDANASLTPGISLTMKGGSPFPVYDPIIVISTENELIYCLAVV
KSAELNIIGQNFMTGYRIVFDREKLVLGWRKFDCYDIEEQNLFPTKPDVTTVPPAVAAGVGNNSSPGLTKETKFSSQISTESEFNSCHSSLLTCFRFFIILLFLL