; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010755 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010755
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionaspartic proteinase-like
Genome locationscaffold35:1805503..1808763
RNA-Seq ExpressionMS010755
SyntenyMS010755
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR007856 - Saposin-like type B, region 1
IPR008138 - Saposin B type, region 2
IPR008139 - Saposin B type domain
IPR011001 - Saposin-like
IPR021109 - Aspartic peptidase domain superfamily
IPR033121 - Peptidase family A1 domain
IPR033869 - Phytepsin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033959.1 Aspartic proteinase A1, partial [Cucurbita argyrosperma subsp. argyrosperma]6.8e-24180.82Show/hide
Query:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK
        MRNSL+PLLVSL + I Y    SSASNEGL+RIGLKKIKV+KN  LKA +ESK       K  +  N++GES ++D+VALKNY+DAQYYGEIG+GTPPQK
Subjt:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK

Query:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF
        FTVIFDTGSSNLWVPSSKCVFSIACFFH RYQS +S+TY++NGTSA+IQYGSGAI+GFFSYD+VQVGD++VR+QQFIE TS+SSMTFIAAKFDGILGLGF
Subjt:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF

Query:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA
        QEIS G+AVPVWYNM+ QKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVT KGYWQFNIGDILIG KPTEYCA GCSAIADSGTSLLA
Subjt:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA

Query:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL
        GPS IVTLIN+AIGA+     ECKA+VSQHG++IMDLLLAK QPEKICSKIGVCTFDGTHGVSM+IES++NEK GRSSG FSDAMCSACEMAVSWM ++L
Subjt:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL

Query:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS
        KQNKT++ +I+YV++LCDR  NQGETLVDC RIS+MPTVSFTIGDK+FEL++ DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMGRYHTVFD 
Subjt:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS

Query:  GKVRVGFAEAA
        GK+RVGFAEAA
Subjt:  GKVRVGFAEAA

XP_022142263.1 aspartic proteinase-like [Momordica charantia]7.9e-29099.8Show/hide
Query:  MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSN
        MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSN
Subjt:  MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSN

Query:  LWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPV
        LWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPV
Subjt:  LWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPV

Query:  WYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINK
        WYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINK
Subjt:  WYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINK

Query:  AIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIIN
        AIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIIN
Subjt:  AIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIIN

Query:  YVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
        YVDELCDRGSNQGETLV+CDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
Subjt:  YVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

XP_022950077.1 aspartic proteinase-like [Cucurbita moschata]3.4e-24080.82Show/hide
Query:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK
        M NSL+PLLVSL + I Y    SSASNEGL+RIGLKKIKV+KN  LKA +ESK       K  +  N++GES ++D+VALKNY+DAQYYGEIG+GTPPQK
Subjt:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK

Query:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF
        FTVIFDTGSSNLWVPSSKCVFSIACFFH RYQS +S+TY++NGTSA+IQYGSGAI+GFFSYD+VQVGD+VVR+QQFIE TS+SSMTFIAAKFDGILGLGF
Subjt:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF

Query:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA
        QEIS G+AVPVWYNM+ QKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVT KGYWQFNIGDILIG KPTEYCA GCSAIADSGTSLLA
Subjt:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA

Query:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL
        GPS IVTLIN+AIGA+     ECKA+VSQHG++IMDLLLAK QPEKICSKIGVCTFDGTHGVSM+IES++NEK GRSSG FSDAMCSACEMAVSWM ++L
Subjt:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL

Query:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS
        KQNKT++ +I+YV++LCDR  NQGETLVDC RIS+MPTVSFTIGDK+FEL++ DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMGRYHTVFD 
Subjt:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS

Query:  GKVRVGFAEAA
        GK+RVGFAEAA
Subjt:  GKVRVGFAEAA

XP_022977721.1 aspartic proteinase-like [Cucurbita maxima]8.1e-24280.9Show/hide
Query:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF---------NNLGESTDTDVVALKNYLDAQYYGEIGMGTPP
        MRNSL+PLLVSL L I Y    SSASNEGL+RIGLKKIKV+KN  LKA +ESKK   EF         N+LGES ++D+VALKNY+DAQYYGEIG+GTPP
Subjt:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF---------NNLGESTDTDVVALKNYLDAQYYGEIGMGTPP

Query:  QKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGL
        QKFTVIFDTGSSNLWVPSSKCVFSIACFFH RYQS +S+TY++NGTSA+IQYGSGAI+GFFSYD+VQVGD+VVRNQQFIE TS+SSMTFIAAKFDGILGL
Subjt:  QKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGL

Query:  GFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSL
        GFQEIS G+AVPVWYNM+KQKLVKEPVFSFWLNRNAEEEEGGE+VFGGVDPKHFKGQHTYVPVT KGYWQFNIGDILIG KPTEYCA GCSAIADSGTSL
Subjt:  GFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSL

Query:  LAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQE
        LAGPS IVTLIN+AIGA+     ECK +VSQHG++IMDLLLAK QPEKICSKIGVC FDG+HGVS +IESV+NEKDG SSG FSDAMCSACEMAVSWM +
Subjt:  LAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQE

Query:  QLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVF
        +LKQNKT++ +I+YV++LCDR  N+GETLVDC RIS+MPTVSFTIGDK+FEL++ DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMGRYHTVF
Subjt:  QLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVF

Query:  DSGKVRVGFAEAA
        D GK+RVGFAEAA
Subjt:  DSGKVRVGFAEAA

XP_023544281.1 aspartic proteinase-like [Cucurbita pepo subsp. pepo]2.8e-24280.82Show/hide
Query:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK
        MRNSL+PLLVSL L I Y    SSASNEGL+RIGLKKIKV+KN  LKA +ESK       K  +  N++GES ++D+VALKNY+DAQYYGEIG+GTPPQK
Subjt:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK

Query:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF
        FTVIFDTGSSNLWVPSSKCVFSIACFFH RYQS +S+TY++NGTSA+IQYG+GAI+GFFSYD+VQVGD+VVR+QQFIE TS+SSMTFIAAKFDGILGLGF
Subjt:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF

Query:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA
        QEIS G+AVPVWYNM+ QKLVKEPVFSFWLNRNA+EEEGGE+VFGGVDPKHFKGQHTYVPVT KGYWQFNIGDILIG +PTEYCA GCSAIADSGTSLLA
Subjt:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA

Query:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL
        GPS IVTLIN+AIGA+     ECKA+VSQHG++IMDLLLAK QPEKICSKIGVC FDGTHGVSM+IESV NEKDGRSSG FSDAMCSACEMAVSWM ++L
Subjt:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL

Query:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS
        KQNKT++ +I+YV++LCDR SNQGETLVDC RIS+MPTVSFTIGDK+FEL++ DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMGRYHTVFD 
Subjt:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS

Query:  GKVRVGFAEAA
        GK+RVGFAEAA
Subjt:  GKVRVGFAEAA

TrEMBL top hitse value%identityAlignment
A0A1S3B040 aspartic proteinase-like isoform X21.7e-22978.02Show/hide
Query:  SLRPLLVSLFLFISY-----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF------------NNLGESTDTDVVALKNYLDAQYYGEIGMGTP
        S   LLVSL L I +     +SASNEG LRIGLKKI+ D+N+R KA LESKK   EF            NNLGES + D V LKNYLDAQYYGEIG+GTP
Subjt:  SLRPLLVSLFLFISY-----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF------------NNLGESTDTDVVALKNYLDAQYYGEIGMGTP

Query:  PQKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILG
        PQKFTVIFDTGSSNLWVPSSKCVFS+ACFFH RYQS +S+TYKKNGTSA+IQYGSGAIAGFFS D+V+VGD+VVRNQ  IEATS+SSMTF+AAKFDGILG
Subjt:  PQKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILG

Query:  LGFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTS
        LGFQEIS G AVPVWYNM+KQKLVKE VFSFWLNRNA+EEEGGELVFGGVDPKHFKGQHTYVPVT KGYWQF+IGDILIG + T+YCA GCSAIADSGTS
Subjt:  LGFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTS

Query:  LLAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQ
        LLAGPS+IV LIN+AIGA+A AH ECKAIVSQHG+ IMDLLLAKAQPEKICS IGVCTFD T  VS++IE+V+++KDGRSSG FS+AMCSACEMAVSW+Q
Subjt:  LLAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQ

Query:  EQLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTV
        ++L+QNKT++DII+ V+ELCDRGSNQ ETLVDC RIS+MP+VSFTIGD++FEL S DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMG YHTV
Subjt:  EQLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTV

Query:  FDSGKVRVGFAEAA
        FD GK RVGFA+AA
Subjt:  FDSGKVRVGFAEAA

A0A5D3CRY9 Aspartic proteinase-like isoform X21.7e-22978.02Show/hide
Query:  SLRPLLVSLFLFISY-----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF------------NNLGESTDTDVVALKNYLDAQYYGEIGMGTP
        S   LLVSL L I +     +SASNEG LRIGLKKI+ D+N+R KA LESKK   EF            NNLGES + D V LKNYLDAQYYGEIG+GTP
Subjt:  SLRPLLVSLFLFISY-----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF------------NNLGESTDTDVVALKNYLDAQYYGEIGMGTP

Query:  PQKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILG
        PQKFTVIFDTGSSNLWVPSSKCVFS+ACFFH RYQS +S+TYKKNGTSA+IQYGSGAIAGFFS D+V+VGD+VVRNQ  IEATS+SSMTF+AAKFDGILG
Subjt:  PQKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILG

Query:  LGFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTS
        LGFQEIS G AVPVWYNM+KQKLVKE VFSFWLNRNA+EEEGGELVFGGVDPKHFKGQHTYVPVT KGYWQF+IGDILIG + T+YCA GCSAIADSGTS
Subjt:  LGFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTS

Query:  LLAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQ
        LLAGPS+IV LIN+AIGA+A AH ECKAIVSQHG+ IMDLLLAKAQPEKICS IGVCTFD T  VS++IE+V+++KDGRSSG FS+AMCSACEMAVSW+Q
Subjt:  LLAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQ

Query:  EQLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTV
        ++L+QNKT++DII+ V+ELCDRGSNQ ETLVDC RIS+MP+VSFTIGD++FEL S DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMG YHTV
Subjt:  EQLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTV

Query:  FDSGKVRVGFAEAA
        FD GK RVGFA+AA
Subjt:  FDSGKVRVGFAEAA

A0A6J1CL31 aspartic proteinase-like3.8e-29099.8Show/hide
Query:  MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSN
        MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSN
Subjt:  MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSN

Query:  LWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPV
        LWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPV
Subjt:  LWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPV

Query:  WYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINK
        WYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINK
Subjt:  WYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINK

Query:  AIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIIN
        AIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIIN
Subjt:  AIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIIN

Query:  YVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
        YVDELCDRGSNQGETLV+CDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
Subjt:  YVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

A0A6J1GDT6 aspartic proteinase-like1.6e-24080.82Show/hide
Query:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK
        M NSL+PLLVSL + I Y    SSASNEGL+RIGLKKIKV+KN  LKA +ESK       K  +  N++GES ++D+VALKNY+DAQYYGEIG+GTPPQK
Subjt:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESK-------KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQK

Query:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF
        FTVIFDTGSSNLWVPSSKCVFSIACFFH RYQS +S+TY++NGTSA+IQYGSGAI+GFFSYD+VQVGD+VVR+QQFIE TS+SSMTFIAAKFDGILGLGF
Subjt:  FTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGF

Query:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA
        QEIS G+AVPVWYNM+ QKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVT KGYWQFNIGDILIG KPTEYCA GCSAIADSGTSLLA
Subjt:  QEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLA

Query:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL
        GPS IVTLIN+AIGA+     ECKA+VSQHG++IMDLLLAK QPEKICSKIGVCTFDGTHGVSM+IES++NEK GRSSG FSDAMCSACEMAVSWM ++L
Subjt:  GPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQL

Query:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS
        KQNKT++ +I+YV++LCDR  NQGETLVDC RIS+MPTVSFTIGDK+FEL++ DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMGRYHTVFD 
Subjt:  KQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDS

Query:  GKVRVGFAEAA
        GK+RVGFAEAA
Subjt:  GKVRVGFAEAA

A0A6J1IKS1 aspartic proteinase-like3.9e-24280.9Show/hide
Query:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF---------NNLGESTDTDVVALKNYLDAQYYGEIGMGTPP
        MRNSL+PLLVSL L I Y    SSASNEGL+RIGLKKIKV+KN  LKA +ESKK   EF         N+LGES ++D+VALKNY+DAQYYGEIG+GTPP
Subjt:  MRNSLRPLLVSLFLFISY----SSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREF---------NNLGESTDTDVVALKNYLDAQYYGEIGMGTPP

Query:  QKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGL
        QKFTVIFDTGSSNLWVPSSKCVFSIACFFH RYQS +S+TY++NGTSA+IQYGSGAI+GFFSYD+VQVGD+VVRNQQFIE TS+SSMTFIAAKFDGILGL
Subjt:  QKFTVIFDTGSSNLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGL

Query:  GFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSL
        GFQEIS G+AVPVWYNM+KQKLVKEPVFSFWLNRNAEEEEGGE+VFGGVDPKHFKGQHTYVPVT KGYWQFNIGDILIG KPTEYCA GCSAIADSGTSL
Subjt:  GFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSL

Query:  LAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQE
        LAGPS IVTLIN+AIGA+     ECK +VSQHG++IMDLLLAK QPEKICSKIGVC FDG+HGVS +IESV+NEKDG SSG FSDAMCSACEMAVSWM +
Subjt:  LAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQE

Query:  QLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVF
        +LKQNKT++ +I+YV++LCDR  N+GETLVDC RIS+MPTVSFTIGDK+FEL++ DYILKV +G+ AQCISGFIPLDIPPPRGPLWILGD+FMGRYHTVF
Subjt:  QLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVF

Query:  DSGKVRVGFAEAA
        D GK+RVGFAEAA
Subjt:  DSGKVRVGFAEAA

SwissProt top hitse value%identityAlignment
O04057 Aspartic proteinase3.0e-20769.72Show/hide
Query:  LFLFISY---SSASNEGLLRIGLKKIKVDKNNRLKARLES------KKRLREFN---NLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS
        LFL +S+   SSASN+GLLR+GLKKIK+D  NRL AR+ES      K   R++N   NLGES+DTD+VALKNYLDAQYYGEI +GTPPQKFTVIFDTGSS
Subjt:  LFLFISY---SSASNEGLLRIGLKKIKVDKNNRLKARLES------KKRLREFN---NLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS

Query:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP
        NLWV   +C+FS+AC FH RY+S +S++YKKNGTSASI+YG+GA++GFFSYD+V+VGDLVV+ Q FIEAT   S+TF+ AKFDG+LGLGFQEI+VGNAVP
Subjt:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP

Query:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN
        VWYNM++Q LVKEPVFSFWLNRN EEEEGGE+VFGGVDPKH++G+HTYVPVT+KGYWQF++GD+LI  +PT +C  GCSAIADSGTSLLAGP+ ++T+IN
Subjt:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN

Query:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII
         AIGA     Q+CKA+V+Q+GQTIMDLLL++A P+KICS+I +CTFDGT GVSM IESV++E  G+SS S  D MCS CEM V WMQ QL+QN+T++ II
Subjt:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII

Query:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE
        NY++ELCDR  S  G++ VDC ++S MPTVSFTIG KIF+L   +YILKV +G  AQCISGF   DIPPPRGPLWILGD+FMGRYHTVFD GK+RVG AE
Subjt:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE

Query:  AA
        AA
Subjt:  AA

O65390 Aspartic proteinase A19.4e-20167.54Show/hide
Query:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESK--KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPS
        L+VS  L  S  +  N+G  R+GLKK+K+D  NRL AR+ESK  K LR +  LG+S D DVV LKNYLDAQYYGEI +GTPPQKFTV+FDTGSSNLWVPS
Subjt:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESK--KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPS

Query:  SKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMI
        SKC FS+AC  H +Y+S +S+TY+KNG +A+I YG+GAIAGFFS D+V VGDLVV++Q+FIEAT    +TF+ AKFDGILGLGFQEISVG A PVWYNM+
Subjt:  SKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMI

Query:  KQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGAS
        KQ L+KEPVFSFWLNRNA+EEEGGELVFGGVDP HFKG+HTYVPVT+KGYWQF++GD+LIG  PT +C  GCSAIADSGTSLLAGP+ I+T+IN AIGA+
Subjt:  KQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGAS

Query:  AAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDEL
            Q+CK +V Q+GQTI+DLLL++ QP+KICS+IG+CTFDGT GVSM IESV+++++ + S    DA CSACEMAV W+Q QL+QN T++ I+NYV+EL
Subjt:  AAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDEL

Query:  CDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
        C+R  S  GE+ VDC ++S MPTVS TIG K+F+L   +Y+LKV +G  AQCISGFI LD+ PPRGPLWILGD+FMG+YHTVFD G  +VGFAEAA
Subjt:  CDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

P42210 Phytepsin1.7e-18664.2Show/hide
Query:  SSASNEGLLRIGLKKIKVDKNNRLKARL---ESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPSSKCVFSIACF
        +++  EGL+RI LKK  +D+N+R+   L   E +  L   N L    + D+VALKNY++AQY+GEIG+GTPPQKFTVIFDTGSSNLWVPS+KC FSIAC+
Subjt:  SSASNEGLLRIGLKKIKVDKNNRLKARL---ESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPSSKCVFSIACF

Query:  FHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMIKQKLVKEPVF
         H RY++  S+TYKKNG  A+IQYG+G+IAG+FS DSV VGDLVV++Q+FIEAT    +TF+ AKFDGILGLGF+EISVG AVPVWY MI+Q LV +PVF
Subjt:  FHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMIKQKLVKEPVF

Query:  SFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGASAAAHQECKAI
        SFWLNR+ +E EGGE++FGG+DPKH+ G+HTYVPVT+KGYWQF++GD+L+G K T +CA GC+AIADSGTSLLAGP+ I+T IN+ IGA+    QECK I
Subjt:  SFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGASAAAHQECKAI

Query:  VSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDELCDR-GSNQGE
        VSQ+GQ I+DLLLA+ QP+KICS++G+CTFDGT GVS  I SV++++  +S+G  +D MCSACEMAV WMQ QL QNKT+D I++YV++LC+R  S  GE
Subjt:  VSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDELCDR-GSNQGE

Query:  TLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
        + VDC  +  MP + FTIG K F L   +YILKV +GA AQCISGF  +DIPPPRGPLWILGD+FMG YHTVFD GK+R+GFA+AA
Subjt:  TLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

Q42456 Aspartic proteinase oryzasin-18.8e-19164.6Show/hide
Query:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKK-----RLREFNNL-GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNL
        LL ++ L     +++ EGL+RI LKK  +D+N+R+ ARL  ++      LR  N+L G   + D+VALKNY++AQY+GEIG+GTPPQKFTVIFDTGSSNL
Subjt:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKK-----RLREFNNL-GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNL

Query:  WVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVW
        WVPS+KC FSIACFFH RY+S +S+TY+KNG  A+IQYG+G+IAGFFS DSV VGDLVV++Q+FIEAT    +TF+ AKFDGILGLGFQEISVG+AVPVW
Subjt:  WVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVW

Query:  YNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKA
        Y M++Q LV EPVFSFW NR+++E EGGE+VFGG+DP H+KG HTYVPV++KGYWQF +GD+LIG K T +CA GCSAIADSGTSLLAGP+ I+T IN+ 
Subjt:  YNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKA

Query:  IGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINY
        IGA+    QECK +VSQ+GQ I+DLLLA+ QP KICS++G+CTFDG HGVS  I+SV++++ G S+G  S  MC+ACEMAV WMQ QL QNKT+D I+NY
Subjt:  IGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINY

Query:  VDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
        +++LCD+  S  GE+ VDC  ++ MP +SFTIG K F L   +YILKV +GA AQCISGF  +DIPPPRGPLWILGD+FMG YHTVFD GK+RVGFA++A
Subjt:  VDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

Q8VYL3 Aspartic proteinase A25.7e-19864.47Show/hide
Query:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKR------LREFNNL--GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS
        + VS  LF +  S  N+G  R+GLKK+K+D NNRL  R  SK+       LR +NN   G+S D D+V LKNYLDAQYYGEI +GTPPQKFTVIFDTGSS
Subjt:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKR------LREFNNL--GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS

Query:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP
        NLWVPS KC FS++C+FH +Y+S +S+TYKK+G  A+I YGSG+I+GFFSYD+V VGDLVV++Q+FIE TS   +TF+ AKFDG+LGLGFQEI+VGNA P
Subjt:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP

Query:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN
        VWYNM+KQ L+K PVFSFWLNR+ + EEGGE+VFGGVDPKHF+G+HT+VPVT++GYWQF++G++LI  + T YC  GCSAIADSGTSLLAGP+ +V +IN
Subjt:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN

Query:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII
        KAIGAS    Q+CK +V Q+GQTI+DLLLA+ QP+KICS+IG+C +DGTHGVSM IESV+++++ RSS    DA C ACEMAV W+Q QL+QN T++ I+
Subjt:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII

Query:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE
        NY++E+C+R  S  GE+ VDC ++S+MPTVSFTIG K+F+L   +Y+LK+ +G  AQCISGF  LDIPPPRGPLWILGD+FMG+YHTVFD G  +VGFAE
Subjt:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE

Query:  A
        A
Subjt:  A

Arabidopsis top hitse value%identityAlignment
AT1G11910.1 aspartic proteinase A16.7e-20267.54Show/hide
Query:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESK--KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPS
        L+VS  L  S  +  N+G  R+GLKK+K+D  NRL AR+ESK  K LR +  LG+S D DVV LKNYLDAQYYGEI +GTPPQKFTV+FDTGSSNLWVPS
Subjt:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESK--KRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPS

Query:  SKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMI
        SKC FS+AC  H +Y+S +S+TY+KNG +A+I YG+GAIAGFFS D+V VGDLVV++Q+FIEAT    +TF+ AKFDGILGLGFQEISVG A PVWYNM+
Subjt:  SKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMI

Query:  KQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGAS
        KQ L+KEPVFSFWLNRNA+EEEGGELVFGGVDP HFKG+HTYVPVT+KGYWQF++GD+LIG  PT +C  GCSAIADSGTSLLAGP+ I+T+IN AIGA+
Subjt:  KQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGAS

Query:  AAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDEL
            Q+CK +V Q+GQTI+DLLL++ QP+KICS+IG+CTFDGT GVSM IESV+++++ + S    DA CSACEMAV W+Q QL+QN T++ I+NYV+EL
Subjt:  AAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDEL

Query:  CDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
        C+R  S  GE+ VDC ++S MPTVS TIG K+F+L   +Y+LKV +G  AQCISGFI LD+ PPRGPLWILGD+FMG+YHTVFD G  +VGFAEAA
Subjt:  CDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

AT1G62290.1 Saposin-like aspartyl protease family protein4.0e-19964.47Show/hide
Query:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKR------LREFNNL--GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS
        + VS  LF +  S  N+G  R+GLKK+K+D NNRL  R  SK+       LR +NN   G+S D D+V LKNYLDAQYYGEI +GTPPQKFTVIFDTGSS
Subjt:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKR------LREFNNL--GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS

Query:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP
        NLWVPS KC FS++C+FH +Y+S +S+TYKK+G  A+I YGSG+I+GFFSYD+V VGDLVV++Q+FIE TS   +TF+ AKFDG+LGLGFQEI+VGNA P
Subjt:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP

Query:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN
        VWYNM+KQ L+K PVFSFWLNR+ + EEGGE+VFGGVDPKHF+G+HT+VPVT++GYWQF++G++LI  + T YC  GCSAIADSGTSLLAGP+ +V +IN
Subjt:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN

Query:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII
        KAIGAS    Q+CK +V Q+GQTI+DLLLA+ QP+KICS+IG+C +DGTHGVSM IESV+++++ RSS    DA C ACEMAV W+Q QL+QN T++ I+
Subjt:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII

Query:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE
        NY++E+C+R  S  GE+ VDC ++S+MPTVSFTIG K+F+L   +Y+LK+ +G  AQCISGF  LDIPPPRGPLWILGD+FMG+YHTVFD G  +VGFAE
Subjt:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE

Query:  A
        A
Subjt:  A

AT1G62290.2 Saposin-like aspartyl protease family protein4.0e-19964.47Show/hide
Query:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKR------LREFNNL--GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS
        + VS  LF +  S  N+G  R+GLKK+K+D NNRL  R  SK+       LR +NN   G+S D D+V LKNYLDAQYYGEI +GTPPQKFTVIFDTGSS
Subjt:  LLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKR------LREFNNL--GESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSS

Query:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP
        NLWVPS KC FS++C+FH +Y+S +S+TYKK+G  A+I YGSG+I+GFFSYD+V VGDLVV++Q+FIE TS   +TF+ AKFDG+LGLGFQEI+VGNA P
Subjt:  NLWVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVP

Query:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN
        VWYNM+KQ L+K PVFSFWLNR+ + EEGGE+VFGGVDPKHF+G+HT+VPVT++GYWQF++G++LI  + T YC  GCSAIADSGTSLLAGP+ +V +IN
Subjt:  VWYNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLIN

Query:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII
        KAIGAS    Q+CK +V Q+GQTI+DLLLA+ QP+KICS+IG+C +DGTHGVSM IESV+++++ RSS    DA C ACEMAV W+Q QL+QN T++ I+
Subjt:  KAIGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDII

Query:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE
        NY++E+C+R  S  GE+ VDC ++S+MPTVSFTIG K+F+L   +Y+LK+ +G  AQCISGF  LDIPPPRGPLWILGD+FMG+YHTVFD G  +VGFAE
Subjt:  NYVDELCDR-GSNQGETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAE

Query:  A
        A
Subjt:  A

AT4G04460.1 Saposin-like aspartyl protease family protein5.5e-18060Show/hide
Query:  LVSLFLFISYSSA--SNEGLLRIGLKKIKVDKNNRLKARLESKKR-----LREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNL
        L+S  + IS +S   + +G +RIGLKK K+D++NRL ++L  K R      + +  L +  + D+V LKNYLDAQYYG+I +GTPPQKFTVIFDTGSSNL
Subjt:  LVSLFLFISYSSA--SNEGLLRIGLKKIKVDKNNRLKARLESKKR-----LREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNL

Query:  WVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVW
        W+PS+KC  S+AC+FH +Y++ +S++Y+KNG  ASI+YG+GAI+G+FS D V+VGD+VV+ Q+FIEATS   +TF+ AKFDGILGLGF+EISVGN+ PVW
Subjt:  WVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVW

Query:  YNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKA
        YNM+++ LVKEP+FSFWLNRN ++ EGGE+VFGGVDPKHFKG+HT+VPVT KGYWQF++GD+ I  KPT YCA GCSAIADSGTSLL GPS ++T+IN A
Subjt:  YNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKA

Query:  IGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINY
        IGA     +ECKA+V Q+G+T+++ LLA+  P+K+CS+IGVC +DGT  VSM I+SV+   D  +SG  + AMCSACEMA  WM+ +L QN+T++ I+ Y
Subjt:  IGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINY

Query:  VDELCDRGSNQG-ETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
          ELCD    Q  ++ VDC R+S MP V+F+IG + F+L   DYI K+ +G ++QC SGF  +DI PPRGPLWILGDIFMG YHTVFD GK RVGFA+AA
Subjt:  VDELCDRGSNQG-ETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA

AT4G04460.2 Saposin-like aspartyl protease family protein1.9e-17759.8Show/hide
Query:  LVSLFLFISYSSA--SNEGLLRIGLKKIKVDKNNRLKARLESKKR-----LREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNL
        L+S  + IS +S   + +G +RIGLKK K+D++NRL ++L  K R      + +  L +  + D+V LKNYLDAQYYG+I +GTPPQKFTVIFDTGSSNL
Subjt:  LVSLFLFISYSSA--SNEGLLRIGLKKIKVDKNNRLKARLESKKR-----LREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNL

Query:  WVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVW
        W+PS+KC  S+AC+FH +Y++ +S++Y+KNG  ASI+YG+GAI+G+FS D V+VGD+VV+ Q+FIEATS   +TF+ AKFDGILGLGF+EISVGN+ PVW
Subjt:  WVPSSKCVFSIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVW

Query:  YNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKA
        YNM+++ LVKEP+FSFWLNRN ++ EGGE+VFGGVDPKHFKG+HT+VPVT KGYWQF++GD+ I  KPT YCA GCSAIADSGTSLL GPS ++T+IN A
Subjt:  YNMIKQKLVKEPVFSFWLNRNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKA

Query:  IGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINY
        IGA     +ECKA+V Q+G+T+++ LLA    +K+CS+IGVC +DGT  VSM I+SV+   D  +SG  + AMCSACEMA  WM+ +L QN+T++ I+ Y
Subjt:  IGASAAAHQECKAIVSQHGQTIMDLLLAKAQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINY

Query:  VDELCDRGSNQG-ETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA
          ELCD    Q  ++ VDC R+S MP V+F+IG + F+L   DYI K+ +G ++QC SGF  +DI PPRGPLWILGDIFMG YHTVFD GK RVGFA+AA
Subjt:  VDELCDRGSNQG-ETLVDCDRISEMPTVSFTIGDKIFELDSTDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAATAGCCTTAGGCCTCTCCTGGTTTCTCTGTTCCTTTTCATATCATATTCTTCTGCATCTAACGAAGGGCTGCTAAGGATTGGACTGAAAAAGATCAAAGTGGA
TAAAAACAATCGGTTAAAAGCACGGCTTGAGTCAAAAAAAAGACTAAGAGAGTTTAATAATCTAGGAGAATCTACGGATACTGATGTTGTAGCATTAAAGAATTATTTGG
ATGCTCAATACTATGGAGAGATTGGCATGGGCACTCCACCTCAAAAGTTCACTGTAATTTTTGACACTGGAAGCTCAAATTTGTGGGTGCCATCTTCGAAATGCGTTTTC
TCGATTGCTTGCTTTTTCCATGTCAGGTATCAATCAAAGAAGTCAAACACATACAAAAAAAATGGAACATCTGCTTCTATCCAGTATGGTTCAGGAGCTATTGCCGGTTT
CTTTAGTTATGACAGCGTTCAAGTTGGTGATCTCGTTGTTCGTAATCAGCAATTCATTGAGGCAACTAGCCTGTCTAGTATGACATTCATAGCTGCTAAGTTTGATGGTA
TATTGGGACTTGGATTTCAAGAGATCTCGGTCGGTAATGCTGTTCCAGTGTGGTATAACATGATTAAACAAAAACTTGTCAAGGAACCAGTTTTCTCATTTTGGCTCAAT
CGCAATGCCGAGGAGGAAGAAGGAGGTGAACTTGTATTTGGCGGGGTCGATCCCAAGCACTTCAAAGGCCAGCATACATACGTGCCTGTGACAAAGAAAGGGTATTGGCA
GTTCAACATCGGCGATATTCTTATTGGTGATAAACCAACTGAATATTGTGCTCATGGTTGCTCCGCTATTGCTGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTAATA
TAGTGACATTAATAAATAAAGCCATTGGAGCTTCTGCAGCTGCTCATCAAGAATGCAAGGCAATTGTTTCACAACATGGACAGACTATTATGGATTTGCTTTTAGCTAAG
GCACAACCAGAGAAGATTTGCTCCAAAATTGGGGTTTGTACCTTTGATGGAACCCATGGCGTTAGTATGAGAATTGAGAGTGTGCTGAATGAGAAAGATGGTAGATCATC
TGGTAGTTTCTCCGATGCGATGTGCTCTGCTTGTGAGATGGCTGTTTCCTGGATGCAAGAACAGCTGAAGCAGAACAAAACTCGAGATGATATTATTAATTACGTCGATG
AGTTATGTGATCGAGGCTCAAACCAGGGAGAAACATTGGTCGACTGTGATCGGATCTCTGAAATGCCTACCGTGTCCTTCACCATTGGCGACAAAATTTTTGAACTTGAC
TCAACAGATTACATTCTCAAGGTGAGTGATGGAGCTCAAGCTCAATGCATCAGTGGATTCATACCTTTGGACATTCCTCCTCCTCGTGGACCCCTCTGGATCTTGGGAGA
CATCTTCATGGGACGCTACCACACAGTCTTCGATTCTGGCAAAGTGAGAGTCGGATTCGCCGAGGCCGCT
mRNA sequenceShow/hide mRNA sequence
ATGAGAAATAGCCTTAGGCCTCTCCTGGTTTCTCTGTTCCTTTTCATATCATATTCTTCTGCATCTAACGAAGGGCTGCTAAGGATTGGACTGAAAAAGATCAAAGTGGA
TAAAAACAATCGGTTAAAAGCACGGCTTGAGTCAAAAAAAAGACTAAGAGAGTTTAATAATCTAGGAGAATCTACGGATACTGATGTTGTAGCATTAAAGAATTATTTGG
ATGCTCAATACTATGGAGAGATTGGCATGGGCACTCCACCTCAAAAGTTCACTGTAATTTTTGACACTGGAAGCTCAAATTTGTGGGTGCCATCTTCGAAATGCGTTTTC
TCGATTGCTTGCTTTTTCCATGTCAGGTATCAATCAAAGAAGTCAAACACATACAAAAAAAATGGAACATCTGCTTCTATCCAGTATGGTTCAGGAGCTATTGCCGGTTT
CTTTAGTTATGACAGCGTTCAAGTTGGTGATCTCGTTGTTCGTAATCAGCAATTCATTGAGGCAACTAGCCTGTCTAGTATGACATTCATAGCTGCTAAGTTTGATGGTA
TATTGGGACTTGGATTTCAAGAGATCTCGGTCGGTAATGCTGTTCCAGTGTGGTATAACATGATTAAACAAAAACTTGTCAAGGAACCAGTTTTCTCATTTTGGCTCAAT
CGCAATGCCGAGGAGGAAGAAGGAGGTGAACTTGTATTTGGCGGGGTCGATCCCAAGCACTTCAAAGGCCAGCATACATACGTGCCTGTGACAAAGAAAGGGTATTGGCA
GTTCAACATCGGCGATATTCTTATTGGTGATAAACCAACTGAATATTGTGCTCATGGTTGCTCCGCTATTGCTGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTAATA
TAGTGACATTAATAAATAAAGCCATTGGAGCTTCTGCAGCTGCTCATCAAGAATGCAAGGCAATTGTTTCACAACATGGACAGACTATTATGGATTTGCTTTTAGCTAAG
GCACAACCAGAGAAGATTTGCTCCAAAATTGGGGTTTGTACCTTTGATGGAACCCATGGCGTTAGTATGAGAATTGAGAGTGTGCTGAATGAGAAAGATGGTAGATCATC
TGGTAGTTTCTCCGATGCGATGTGCTCTGCTTGTGAGATGGCTGTTTCCTGGATGCAAGAACAGCTGAAGCAGAACAAAACTCGAGATGATATTATTAATTACGTCGATG
AGTTATGTGATCGAGGCTCAAACCAGGGAGAAACATTGGTCGACTGTGATCGGATCTCTGAAATGCCTACCGTGTCCTTCACCATTGGCGACAAAATTTTTGAACTTGAC
TCAACAGATTACATTCTCAAGGTGAGTGATGGAGCTCAAGCTCAATGCATCAGTGGATTCATACCTTTGGACATTCCTCCTCCTCGTGGACCCCTCTGGATCTTGGGAGA
CATCTTCATGGGACGCTACCACACAGTCTTCGATTCTGGCAAAGTGAGAGTCGGATTCGCCGAGGCCGCT
Protein sequenceShow/hide protein sequence
MRNSLRPLLVSLFLFISYSSASNEGLLRIGLKKIKVDKNNRLKARLESKKRLREFNNLGESTDTDVVALKNYLDAQYYGEIGMGTPPQKFTVIFDTGSSNLWVPSSKCVF
SIACFFHVRYQSKKSNTYKKNGTSASIQYGSGAIAGFFSYDSVQVGDLVVRNQQFIEATSLSSMTFIAAKFDGILGLGFQEISVGNAVPVWYNMIKQKLVKEPVFSFWLN
RNAEEEEGGELVFGGVDPKHFKGQHTYVPVTKKGYWQFNIGDILIGDKPTEYCAHGCSAIADSGTSLLAGPSNIVTLINKAIGASAAAHQECKAIVSQHGQTIMDLLLAK
AQPEKICSKIGVCTFDGTHGVSMRIESVLNEKDGRSSGSFSDAMCSACEMAVSWMQEQLKQNKTRDDIINYVDELCDRGSNQGETLVDCDRISEMPTVSFTIGDKIFELD
STDYILKVSDGAQAQCISGFIPLDIPPPRGPLWILGDIFMGRYHTVFDSGKVRVGFAEAA