; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G04830 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G04830
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAspartic proteinase
Genome locationClcChr03:4697192..4701963
RNA-Seq ExpressionClc03G04830
SyntenyClc03G04830
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0005773 - vacuole (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR007856 - Saposin-like type B, region 1
IPR008139 - Saposin B type domain
IPR011001 - Saposin-like
IPR021109 - Aspartic peptidase domain superfamily
IPR033121 - Peptidase family A1 domain
IPR033869 - Phytepsin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056841.1 aspartic proteinase isoform X2 [Cucumis melo var. makuwa]7.0e-26890.47Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVSFNIVSS SNDGLLRVGLKKIKLDPENRLAARLESKD EILKAAFRKYNPNGNL ESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      A+PKKICSQIKLCTFDGT+GVSMGIESV+DEN GKSSDGLRDGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCGKLSSMPSVSFTIG KVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVGFAEAA
Subjt:  FDYGKLRVGFAEAA

KAE8652662.1 hypothetical protein Csa_013207 [Cucumis sativus]2.9e-26689.88Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVS NIVSS SNDGLLRVGLKKI LDPENRLAARLESKD EILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREP LTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDG+PTGYCEGGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      A+PKKICSQIKLCTFDGTRGVSMGIESVVDEN GKSSDGLRDGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCG LSSMPSVSFTIG KVFDLAPEEYILKVGEG AAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVGFAEAA
Subjt:  FDYGKLRVGFAEAA

XP_008440898.1 PREDICTED: aspartic proteinase isoform X2 [Cucumis melo]1.8e-26890.66Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVSFNIVSS SNDGLLRVGLKKIKLDPENRLAARLESKD EILKAAFRKYNPNGNL ESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      ADPKKICSQIKLCTFDGT+GVSMGIESV+DEN GKSSDGLRDGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCGKLSSMPSVSFTIG KVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVGFAEAA
Subjt:  FDYGKLRVGFAEAA

XP_011652603.1 aspartic proteinase [Cucumis sativus]7.7e-26790.08Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVS NIVSS SNDGLLRVGLKKI LDPENRLAARLESKD EILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREP LTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDG+PTGYCEGGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      ADPKKICSQIKLCTFDGTRGVSMGIESVVDEN GKSSDGLRDGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCG LSSMPSVSFTIG KVFDLAPEEYILKVGEG AAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVGFAEAA
Subjt:  FDYGKLRVGFAEAA

XP_016899738.1 PREDICTED: aspartic proteinase isoform X1 [Cucumis melo]8.2e-26990.5Show/hide
Query:  LSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT
        L MASYHSKAAFLCLFLLVSFNIVSS SNDGLLRVGLKKIKLDPENRLAARLESKD EILKAAFRKYNPNGNL ESSDTDIVALKNYLDAQYYGEIAIGT
Subjt:  LSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT

Query:  PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL
        PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL
Subjt:  PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL

Query:  GLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT
        GLGFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT
Subjt:  GLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT

Query:  SLLAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWM
        SLLAGPT                                      ADPKKICSQIKLCTFDGT+GVSMGIESV+DEN GKSSDGLRDGMCSVCEMTVVWM
Subjt:  SLLAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWM

Query:  QNQLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYH
        QNQLRQNQTKERIINYINELC+RMPSPMGQSAVDCGKLSSMPSVSFTIG KVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYH
Subjt:  QNQLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYH

Query:  TVFDYGKLRVGFAEAA
        TVFD+GKLRVGFAEAA
Subjt:  TVFDYGKLRVGFAEAA

TrEMBL top hitse value%identityAlignment
A0A0A0LRZ8 Aspartic proteinase3.5e-27397.07Show/hide
Query:  LSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT
        L MASYHSKAAFLCLFLLVS NIVSS SNDGLLRVGLKKI LDPENRLAARLESKD EILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT
Subjt:  LSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT

Query:  PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL
        PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREP LTFLVAKFDGLL
Subjt:  PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL

Query:  GLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT
        GLGFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDG+PTGYCEGGCSAIADSGT
Subjt:  GLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT

Query:  SLLAGPTADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCERMPSPMGQSAVDCGKL
        SLLAGPTADPKKICSQIKLCTFDGTRGVSMGIESVVDEN GKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELC+RMPSPMGQSAVDCG L
Subjt:  SLLAGPTADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCERMPSPMGQSAVDCGKL

Query:  SSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA
        SSMPSVSFTIG KVFDLAPEEYILKVGEG AAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFD+GKLRVGFAEAA
Subjt:  SSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA

A0A1S3B267 aspartic proteinase isoform X28.9e-26990.66Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVSFNIVSS SNDGLLRVGLKKIKLDPENRLAARLESKD EILKAAFRKYNPNGNL ESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      ADPKKICSQIKLCTFDGT+GVSMGIESV+DEN GKSSDGLRDGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCGKLSSMPSVSFTIG KVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVGFAEAA
Subjt:  FDYGKLRVGFAEAA

A0A1S4DUU4 aspartic proteinase isoform X14.0e-26990.5Show/hide
Query:  LSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT
        L MASYHSKAAFLCLFLLVSFNIVSS SNDGLLRVGLKKIKLDPENRLAARLESKD EILKAAFRKYNPNGNL ESSDTDIVALKNYLDAQYYGEIAIGT
Subjt:  LSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGT

Query:  PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL
        PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL
Subjt:  PPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLL

Query:  GLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT
        GLGFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT
Subjt:  GLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGT

Query:  SLLAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWM
        SLLAGPT                                      ADPKKICSQIKLCTFDGT+GVSMGIESV+DEN GKSSDGLRDGMCSVCEMTVVWM
Subjt:  SLLAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWM

Query:  QNQLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYH
        QNQLRQNQTKERIINYINELC+RMPSPMGQSAVDCGKLSSMPSVSFTIG KVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYH
Subjt:  QNQLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYH

Query:  TVFDYGKLRVGFAEAA
        TVFD+GKLRVGFAEAA
Subjt:  TVFDYGKLRVGFAEAA

A0A5A7UTL2 Aspartic proteinase isoform X23.4e-26890.47Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVSFNIVSS SNDGLLRVGLKKIKLDPENRLAARLESKD EILKAAFRKYNPNGNL ESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVG+AVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      A+PKKICSQIKLCTFDGT+GVSMGIESV+DEN GKSSDGLRDGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCGKLSSMPSVSFTIG KVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVGFAEAA
Subjt:  FDYGKLRVGFAEAA

A0A6J1EB32 aspartic proteinase isoform X17.0e-26681.22Show/hide
Query:  MPFFGIRAFK---PFRSEAWTTLQESREEKTTLPPPKR--TAQIKSIFFLCFKPLNPN----------------RLSMASYHSKAAFLCLFLLVSFNIVS
        M FFGIR  K   P R      L + R  K  L P K   T QI S  F     LN N                RL MASYHSKAAFLCLFLLVSFNIV 
Subjt:  MPFFGIRAFK---PFRSEAWTTLQESREEKTTLPPPKR--TAQIKSIFFLCFKPLNPN----------------RLSMASYHSKAAFLCLFLLVSFNIVS

Query:  SASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLF
        SASNDGLLRVGLKKIKLDPENRLAAR+ESKD EILKAAFRKYNP GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLF
Subjt:  SASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLF

Query:  SVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLV
        SVACHFHARYKSSRSS+YKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVK Q+FIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLV
Subjt:  SVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLV

Query:  KEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPT------------------
        KEPVFSFWLNRN EEEEGGEIVFGGVDPKHY+GKHTYVPVTQKGYWQFDMGDVLIDGEPTG+C+GGCSAIADSGTSLLAGPT                  
Subjt:  KEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPT------------------

Query:  --------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCERMP
                            ADPKKICSQI LCTFDGTRGVSMGIESVVDEN GKSSD LRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELC+RMP
Subjt:  --------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCERMP

Query:  SPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA
        SPMGQSAVDCG+LSSMP+VSFTIGGK+FDLAPEEYILKVGEG  AQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFD+GKLRVGFAEAA
Subjt:  SPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA

SwissProt top hitse value%identityAlignment
O04057 Aspartic proteinase4.5e-26287.55Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAAR+ESKD EILKAAFRKYNP GNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWV   +CLFSVACHFHARYKSSRSS+YKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVK Q+FIEATREPSLTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRN EEEEGGEIVFGGVDPKHY+GKHTYVPVTQKGYWQFDMGDVLIDGEPTG+C+GGCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPT                                      ADPKKICSQI LCTFDGTRGVSMGIESVVDEN GKSSD L DGMCSVCEMTVVWMQN
Subjt:  LAGPT--------------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQNQTKERIINYINELC+RMPSPMGQSAVDCG+LSSMP+VSFTIGGK+FDLAPEEYILKVGEG  AQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEAA
        FD+GKLRVG AEAA
Subjt:  FDYGKLRVGFAEAA

O65390 Aspartic proteinase A15.1e-20569.78Show/hide
Query:  LLVSFNIVSSA---SNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGS
        L+VSF +  SA    NDG  RVGLKK+KLD +NRLAAR+ESK  + L+A          LG+S D D+V LKNYLDAQYYGEIAIGTPPQKFTV+FDTGS
Subjt:  LLVSFNIVSSA---SNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGS

Query:  SNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAV
        SNLWVPS+KC FS+AC  H +YKSSRSSTY+KNG +A+I YGTGA++GFFS D V VGDLVVK+Q FIEAT+EP +TF+VAKFDG+LGLGFQEI+VG A 
Subjt:  SNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAV

Query:  PVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPT------
        PVWYNM++QGL+KEPVFSFWLNRNA+EEEGGE+VFGGVDP H+KGKHTYVPVTQKGYWQFDMGDVLI G PTG+CE GCSAIADSGTSLLAGPT      
Subjt:  PVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPT------

Query:  --------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERI
                                          PKKICSQI LCTFDGTRGVSMGIESVVD+   K S+G+ D  CS CEM VVW+Q+QLRQN T+ERI
Subjt:  --------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERI

Query:  INYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFA
        +NY+NELCER+PSPMG+SAVDC +LS+MP+VS TIGGKVFDLAPEEY+LKVGEG  AQCISGF A D+ PPRGPLWILGDVFMG+YHTVFD+G  +VGFA
Subjt:  INYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFA

Query:  EAA
        EAA
Subjt:  EAA

P42210 Phytepsin3.9e-18963.83Show/hide
Query:  AFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFD
        A L   LL+   + +++  +GL+R+ LKK  +D  +R+A  L   + + L +     NP   L    + DIVALKNY++AQY+GEI +GTPPQKFTVIFD
Subjt:  AFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFD

Query:  TGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVG
        TGSSNLWVPSAKC FS+AC+ H+RYK+  SSTYKKNG  A+I+YGTG+++G+FS D+V VGDLVVK+Q FIEAT+EP +TFLVAKFDG+LGLGF+EI+VG
Subjt:  TGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVG

Query:  NAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPTA--
         AVPVWY M+EQGLV +PVFSFWLNR+ +E EGGEI+FGG+DPKHY G+HTYVPVTQKGYWQFDMGDVL+ G+ TG+C GGC+AIADSGTSLLAGPTA  
Subjt:  NAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPTA--

Query:  ------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTK
                                             PKKICSQ+ LCTFDGTRGVS GI SVVD+   KS+    D MCS CEM VVWMQNQL QN+T+
Subjt:  ------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTK

Query:  ERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRV
        + I++Y+N+LC R+PSPMG+SAVDCG L SMP + FTIGGK F L PEEYILKVGEG AAQCISGFTA DIPPPRGPLWILGDVFMG YHTVFDYGKLR+
Subjt:  ERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRV

Query:  GFAEAA
        GFA+AA
Subjt:  GFAEAA

Q42456 Aspartic proteinase oryzasin-11.3e-18965.25Show/hide
Query:  IVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAK
        ++ +++ +GL+R+ LKK  +D  +R+AARL  ++    +   R  N  G  G   + DIVALKNY++AQY+GEI +GTPPQKFTVIFDTGSSNLWVPSAK
Subjt:  IVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAK

Query:  CLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQ
        C FS+AC FH+RYKS +SSTY+KNG  A+I+YGTG+++GFFS D+V VGDLVVK+Q FIEAT+EP LTF+VAKFDG+LGLGFQEI+VG+AVPVWY MVEQ
Subjt:  CLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQ

Query:  GLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPTA--------------
        GLV EPVFSFW NR+++E EGGEIVFGG+DP HYKG HTYVPV+QKGYWQF+MGDVLI G+ TG+C  GCSAIADSGTSLLAGPTA              
Subjt:  GLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPTA--------------

Query:  ------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDG-MCSVCEMTVVWMQNQLRQNQTKERIINYINELC
                                 P KICSQ+ LCTFDG  GVS GI+SVVD+  G+ S+GL+ G MC+ CEM VVWMQNQL QN+T++ I+NYIN+LC
Subjt:  ------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDG-MCSVCEMTVVWMQNQLRQNQTKERIINYINELC

Query:  ERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA
        +++PSPMG+S+VDCG L+SMP +SFTIGGK F L PEEYILKVGEG AAQCISGFTA DIPPPRGPLWILGDVFMG YHTVFDYGK+RVGFA++A
Subjt:  ERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA

Q8VYL3 Aspartic proteinase A29.3e-20769.01Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        M  Y    AF      + F    S  NDG  RVGLKK+KLDP NRLA R  SK  E L+++ R YN N   G+S D DIV LKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPS KC FS++C+FHA+YKSSRSSTYKK+G  A+I YG+G++SGFFSYD V VGDLVVK+Q FIE T EP LTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVGNA PVWYNM++QGL+K PVFSFWLNR+ + EEGGEIVFGGVDPKH++G+HT+VPVTQ+GYWQFDMG+VLI GE TGYC  GCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPTA                                       PKKICSQI LC +DGT GVSMGIESVVD+   +SS GLRD  C  CEM VVW+Q+
Subjt:  LAGPTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQN T+ERI+NYINE+CERMPSP G+SAVDC +LS MP+VSFTIGGKVFDLAPEEY+LK+GEG  AQCISGFTA DIPPPRGPLWILGDVFMG+YHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEA
        FD+G  +VGFAEA
Subjt:  FDYGKLRVGFAEA

Arabidopsis top hitse value%identityAlignment
AT1G11910.1 aspartic proteinase A13.6e-20669.78Show/hide
Query:  LLVSFNIVSSA---SNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGS
        L+VSF +  SA    NDG  RVGLKK+KLD +NRLAAR+ESK  + L+A          LG+S D D+V LKNYLDAQYYGEIAIGTPPQKFTV+FDTGS
Subjt:  LLVSFNIVSSA---SNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGS

Query:  SNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAV
        SNLWVPS+KC FS+AC  H +YKSSRSSTY+KNG +A+I YGTGA++GFFS D V VGDLVVK+Q FIEAT+EP +TF+VAKFDG+LGLGFQEI+VG A 
Subjt:  SNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAV

Query:  PVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPT------
        PVWYNM++QGL+KEPVFSFWLNRNA+EEEGGE+VFGGVDP H+KGKHTYVPVTQKGYWQFDMGDVLI G PTG+CE GCSAIADSGTSLLAGPT      
Subjt:  PVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAGPT------

Query:  --------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERI
                                          PKKICSQI LCTFDGTRGVSMGIESVVD+   K S+G+ D  CS CEM VVW+Q+QLRQN T+ERI
Subjt:  --------------------------------ADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERI

Query:  INYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFA
        +NY+NELCER+PSPMG+SAVDC +LS+MP+VS TIGGKVFDLAPEEY+LKVGEG  AQCISGF A D+ PPRGPLWILGDVFMG+YHTVFD+G  +VGFA
Subjt:  INYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFA

Query:  EAA
        EAA
Subjt:  EAA

AT1G62290.1 Saposin-like aspartyl protease family protein6.6e-20869.01Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        M  Y    AF      + F    S  NDG  RVGLKK+KLDP NRLA R  SK  E L+++ R YN N   G+S D DIV LKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPS KC FS++C+FHA+YKSSRSSTYKK+G  A+I YG+G++SGFFSYD V VGDLVVK+Q FIE T EP LTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVGNA PVWYNM++QGL+K PVFSFWLNR+ + EEGGEIVFGGVDPKH++G+HT+VPVTQ+GYWQFDMG+VLI GE TGYC  GCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPTA                                       PKKICSQI LC +DGT GVSMGIESVVD+   +SS GLRD  C  CEM VVW+Q+
Subjt:  LAGPTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQN T+ERI+NYINE+CERMPSP G+SAVDC +LS MP+VSFTIGGKVFDLAPEEY+LK+GEG  AQCISGFTA DIPPPRGPLWILGDVFMG+YHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEA
        FD+G  +VGFAEA
Subjt:  FDYGKLRVGFAEA

AT1G62290.2 Saposin-like aspartyl protease family protein6.6e-20869.01Show/hide
Query:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP
        M  Y    AF      + F    S  NDG  RVGLKK+KLDP NRLA R  SK  E L+++ R YN N   G+S D DIV LKNYLDAQYYGEIAIGTPP
Subjt:  MASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPP

Query:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL
        QKFTVIFDTGSSNLWVPS KC FS++C+FHA+YKSSRSSTYKK+G  A+I YG+G++SGFFSYD V VGDLVVK+Q FIE T EP LTFLVAKFDGLLGL
Subjt:  QKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGL

Query:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL
        GFQEIAVGNA PVWYNM++QGL+K PVFSFWLNR+ + EEGGEIVFGGVDPKH++G+HT+VPVTQ+GYWQFDMG+VLI GE TGYC  GCSAIADSGTSL
Subjt:  GFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSL

Query:  LAGPTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN
        LAGPTA                                       PKKICSQI LC +DGT GVSMGIESVVD+   +SS GLRD  C  CEM VVW+Q+
Subjt:  LAGPTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQN

Query:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV
        QLRQN T+ERI+NYINE+CERMPSP G+SAVDC +LS MP+VSFTIGGKVFDLAPEEY+LK+GEG  AQCISGFTA DIPPPRGPLWILGDVFMG+YHTV
Subjt:  QLRQNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTV

Query:  FDYGKLRVGFAEA
        FD+G  +VGFAEA
Subjt:  FDYGKLRVGFAEA

AT4G04460.1 Saposin-like aspartyl protease family protein5.6e-18361.45Show/hide
Query:  AFLCLFLLVSFNIVSSAS----NDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLG-ESSDTDIVALKNYLDAQYYGEIAIGTPPQKF
        +FL +FLL    ++S+AS     DG +R+GLKK KLD  NRLA++L       LK     ++P         + D+V LKNYLDAQYYG+I IGTPPQKF
Subjt:  AFLCLFLLVSFNIVSSAS----NDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLG-ESSDTDIVALKNYLDAQYYGEIAIGTPPQKF

Query:  TVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQ
        TVIFDTGSSNLW+PS KC  SVAC+FH++YK+S+SS+Y+KNG  ASIRYGTGA+SG+FS D+VKVGD+VVK Q FIEAT EP +TFL+AKFDG+LGLGF+
Subjt:  TVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQ

Query:  EIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAG
        EI+VGN+ PVWYNMVE+GLVKEP+FSFWLNRN ++ EGGEIVFGGVDPKH+KG+HT+VPVT KGYWQFDMGD+ I G+PTGYC  GCSAIADSGTSLL G
Subjt:  EIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAG

Query:  PTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLR
        P+                                       DPKK+CSQI +C +DGT+ VSMGI+SVVD+    +S  L   MCS CEM  VWM+++L 
Subjt:  PTA--------------------------------------DPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLR

Query:  QNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDY
        QNQT+ERI+ Y  ELC+ +P+   QSAVDCG++SSMP V+F+IGG+ FDL P++YI K+GEGV +QC SGFTA DI PPRGPLWILGD+FMG YHTVFDY
Subjt:  QNQTKERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDY

Query:  GKLRVGFAEAA
        GK RVGFA+AA
Subjt:  GKLRVGFAEAA

AT4G04460.2 Saposin-like aspartyl protease family protein5.3e-18161.34Show/hide
Query:  AFLCLFLLVSFNIVSSAS----NDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLG-ESSDTDIVALKNYLDAQYYGEIAIGTPPQKF
        +FL +FLL    ++S+AS     DG +R+GLKK KLD  NRLA++L       LK     ++P         + D+V LKNYLDAQYYG+I IGTPPQKF
Subjt:  AFLCLFLLVSFNIVSSAS----NDGLLRVGLKKIKLDPENRLAARLESKDVEILKAAFRKYNPNGNLG-ESSDTDIVALKNYLDAQYYGEIAIGTPPQKF

Query:  TVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQ
        TVIFDTGSSNLW+PS KC  SVAC+FH++YK+S+SS+Y+KNG  ASIRYGTGA+SG+FS D+VKVGD+VVK Q FIEAT EP +TFL+AKFDG+LGLGF+
Subjt:  TVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQ

Query:  EIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAG
        EI+VGN+ PVWYNMVE+GLVKEP+FSFWLNRN ++ EGGEIVFGGVDPKH+KG+HT+VPVT KGYWQFDMGD+ I G+PTGYC  GCSAIADSGTSLL G
Subjt:  EIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGEPTGYCEGGCSAIADSGTSLLAG

Query:  PTAD----------------------------------PKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQT
        P+                                     +K+CSQI +C +DGT+ VSMGI+SVVD+    +S  L   MCS CEM  VWM+++L QNQT
Subjt:  PTAD----------------------------------PKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQT

Query:  KERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLR
        +ERI+ Y  ELC+ +P+   QSAVDCG++SSMP V+F+IGG+ FDL P++YI K+GEGV +QC SGFTA DI PPRGPLWILGD+FMG YHTVFDYGK R
Subjt:  KERIINYINELCERMPSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLR

Query:  VGFAEAA
        VGFA+AA
Subjt:  VGFAEAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTCTTCGGGATCCGAGCATTCAAACCCTTCCGGTCCGAAGCGTGGACGACTCTACAGGAATCCCGCGAAGAAAAGACAACTCTGCCCCCTCCAAAACGGACGGC
CCAGATCAAATCCATCTTCTTTCTCTGTTTCAAACCATTGAATCCGAACCGGTTATCGATGGCTTCCTACCACTCCAAAGCAGCTTTCTTGTGTTTGTTCTTGTTGGTTT
CATTTAATATTGTCTCGTCTGCATCGAATGATGGGTTGCTTAGAGTTGGACTGAAGAAGATTAAGTTAGACCCAGAGAACCGGCTAGCTGCCCGCCTTGAGTCCAAGGAT
GTTGAGATTTTGAAAGCTGCTTTTAGGAAGTATAACCCCAATGGTAATCTTGGAGAATCTTCTGATACTGATATTGTTGCGTTAAAGAACTACCTGGATGCTCAGTACTA
TGGTGAGATTGCCATTGGTACACCCCCACAAAAGTTCACTGTGATTTTTGACACGGGCAGCTCCAATCTGTGGGTGCCTTCTGCGAAGTGCTTGTTCTCTGTGGCTTGTC
ATTTCCATGCCAGATACAAGTCAAGCCGCTCGAGTACATACAAGAAAAATGGGACGTCTGCTTCAATTCGGTATGGCACTGGAGCAGTCTCTGGTTTCTTTAGTTATGAC
AATGTCAAAGTTGGAGACCTAGTCGTAAAGAATCAGTTGTTCATTGAGGCAACCAGAGAACCTAGTCTTACATTTCTGGTCGCCAAGTTTGATGGGTTGTTGGGACTTGG
TTTTCAAGAGATCGCAGTTGGTAATGCTGTCCCAGTATGGTATAACATGGTTGAACAAGGTCTTGTTAAGGAACCTGTCTTTTCCTTTTGGCTCAATCGCAATGCTGAGG
AGGAGGAAGGAGGCGAAATTGTGTTTGGTGGGGTTGACCCAAAGCATTATAAGGGCAAGCATACTTATGTTCCTGTCACACAGAAAGGTTATTGGCAGTTTGACATGGGT
GATGTTCTCATAGATGGTGAACCAACTGGATATTGCGAAGGTGGTTGCTCAGCAATAGCAGATTCTGGAACTTCACTTTTGGCTGGTCCAACTGCAGATCCAAAGAAAAT
CTGTTCTCAAATTAAGTTGTGTACTTTTGATGGAACTCGAGGAGTTAGTATGGGAATCGAGAGTGTCGTAGATGAGAATGTCGGTAAATCATCTGATGGTCTAAGGGATG
GCATGTGCTCTGTATGTGAGATGACAGTTGTCTGGATGCAAAATCAACTTCGTCAGAATCAAACCAAAGAACGCATAATAAACTATATCAACGAGCTATGTGAGCGAATG
CCTAGTCCAATGGGACAATCAGCTGTCGACTGTGGAAAACTTTCTTCCATGCCTAGTGTTTCCTTCACCATTGGTGGCAAAGTTTTTGACCTTGCCCCAGAAGAGTATAT
ACTCAAGGTGGGTGAGGGTGTTGCAGCTCAGTGCATCAGTGGATTCACAGCATTTGATATTCCTCCTCCTCGTGGACCCCTCTGGATCTTGGGAGACGTCTTCATGGGCC
GCTACCACACAGTATTTGATTATGGCAAGCTGAGAGTCGGATTTGCAGAGGCAGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTTCTTCGGGATCCGAGCATTCAAACCCTTCCGGTCCGAAGCGTGGACGACTCTACAGGAATCCCGCGAAGAAAAGACAACTCTGCCCCCTCCAAAACGGACGGC
CCAGATCAAATCCATCTTCTTTCTCTGTTTCAAACCATTGAATCCGAACCGGTTATCGATGGCTTCCTACCACTCCAAAGCAGCTTTCTTGTGTTTGTTCTTGTTGGTTT
CATTTAATATTGTCTCGTCTGCATCGAATGATGGGTTGCTTAGAGTTGGACTGAAGAAGATTAAGTTAGACCCAGAGAACCGGCTAGCTGCCCGCCTTGAGTCCAAGGAT
GTTGAGATTTTGAAAGCTGCTTTTAGGAAGTATAACCCCAATGGTAATCTTGGAGAATCTTCTGATACTGATATTGTTGCGTTAAAGAACTACCTGGATGCTCAGTACTA
TGGTGAGATTGCCATTGGTACACCCCCACAAAAGTTCACTGTGATTTTTGACACGGGCAGCTCCAATCTGTGGGTGCCTTCTGCGAAGTGCTTGTTCTCTGTGGCTTGTC
ATTTCCATGCCAGATACAAGTCAAGCCGCTCGAGTACATACAAGAAAAATGGGACGTCTGCTTCAATTCGGTATGGCACTGGAGCAGTCTCTGGTTTCTTTAGTTATGAC
AATGTCAAAGTTGGAGACCTAGTCGTAAAGAATCAGTTGTTCATTGAGGCAACCAGAGAACCTAGTCTTACATTTCTGGTCGCCAAGTTTGATGGGTTGTTGGGACTTGG
TTTTCAAGAGATCGCAGTTGGTAATGCTGTCCCAGTATGGTATAACATGGTTGAACAAGGTCTTGTTAAGGAACCTGTCTTTTCCTTTTGGCTCAATCGCAATGCTGAGG
AGGAGGAAGGAGGCGAAATTGTGTTTGGTGGGGTTGACCCAAAGCATTATAAGGGCAAGCATACTTATGTTCCTGTCACACAGAAAGGTTATTGGCAGTTTGACATGGGT
GATGTTCTCATAGATGGTGAACCAACTGGATATTGCGAAGGTGGTTGCTCAGCAATAGCAGATTCTGGAACTTCACTTTTGGCTGGTCCAACTGCAGATCCAAAGAAAAT
CTGTTCTCAAATTAAGTTGTGTACTTTTGATGGAACTCGAGGAGTTAGTATGGGAATCGAGAGTGTCGTAGATGAGAATGTCGGTAAATCATCTGATGGTCTAAGGGATG
GCATGTGCTCTGTATGTGAGATGACAGTTGTCTGGATGCAAAATCAACTTCGTCAGAATCAAACCAAAGAACGCATAATAAACTATATCAACGAGCTATGTGAGCGAATG
CCTAGTCCAATGGGACAATCAGCTGTCGACTGTGGAAAACTTTCTTCCATGCCTAGTGTTTCCTTCACCATTGGTGGCAAAGTTTTTGACCTTGCCCCAGAAGAGTATAT
ACTCAAGGTGGGTGAGGGTGTTGCAGCTCAGTGCATCAGTGGATTCACAGCATTTGATATTCCTCCTCCTCGTGGACCCCTCTGGATCTTGGGAGACGTCTTCATGGGCC
GCTACCACACAGTATTTGATTATGGCAAGCTGAGAGTCGGATTTGCAGAGGCAGCATGAAGAAGCACCTCGTTTGGTGGCTTAGTTCGGATGCCTTCAATGTTTATACAC
ATGAGCCATCTTTAGTTTCAATGGAACCTAACTTTACTTTATGTTCATGGATTTATGGAGTGTTAAATGCGGTTTGCTTCCCTGCAAAAGGTTGAAACCCAAGATTGTAA
ATGCTTGCTACTGTTTTATCTTTTATAGACACAGCTTGTACAGTTGCTTTAGCATCTCTTCTGATTTAAATATATACAGAGGCTGTTATATATTTGGAGATGGAGATAAG
AGAACAAATGAATTAAATAATATGTGTTATCTTGATTTACAAGATTGATCTCATGAGTTAAAGGAGATTTACAACTTAGAAAA
Protein sequenceShow/hide protein sequence
MPFFGIRAFKPFRSEAWTTLQESREEKTTLPPPKRTAQIKSIFFLCFKPLNPNRLSMASYHSKAAFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARLESKD
VEILKAAFRKYNPNGNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACHFHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYD
NVKVGDLVVKNQLFIEATREPSLTFLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMG
DVLIDGEPTGYCEGGCSAIADSGTSLLAGPTADPKKICSQIKLCTFDGTRGVSMGIESVVDENVGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCERM
PSPMGQSAVDCGKLSSMPSVSFTIGGKVFDLAPEEYILKVGEGVAAQCISGFTAFDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFAEAA