; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G010210 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G010210
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPeptidase A1 domain-containing protein
Genome locationchr02:10626070..10627617
RNA-Seq ExpressionLsi02G010210
SyntenyLsi02G010210
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus]1.2e-26788.01Show/hide
Query:  MPSIST-SIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEM-DSDDNVIEPLREIRDGYLMSLTLGTP
        MPSIS+ S ATKFLS FLLLV+VS +TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGYN +S+KRMKAM+  D DDNVIEPLREIRDGYLMSL++GTP
Subjt:  MPSIST-SIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEM-DSDDNVIEPLREIRDGYLMSLTLGTP

Query:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
        PQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAYT
Subjt:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT

Query:  YGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
        YG+SGVV G+LTRD+L  HGN  N+  + KQIPRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
Subjt:  YGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK

Query:  DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYK
        D +LQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYK
Subjt:  DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYK

Query:  VPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQ
        VPCKNNN SF+DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGSFQQQN+EVVYDLEKERLGFQ
Subjt:  VPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQ

Query:  PMDCASVAATEGLHKNV
        PMDC SVAA +GLHKNV
Subjt:  PMDCASVAATEGLHKNV

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]7.2e-27088.68Show/hide
Query:  MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTP
        MPSI STSIATKFLS FLLLV+ S++TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGYN +S+KRMKAM +MD DDNVIEPLREIRDGYLMSL++GTP
Subjt:  MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTP

Query:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
        PQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
Subjt:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT

Query:  YGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        YG+SGVV G+LTRD+L MHG    NN N+  + KQ+PRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  YGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD
        ISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD
Subjt:  ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD

Query:  LCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKER
        LCYKVPCKNNN SF+DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGSFQQQNL+VVYDLEKER
Subjt:  LCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKER

Query:  LGFQPMDCASVAATEGLHKNV
        LGFQ MDC SVAA +GLHKNV
Subjt:  LGFQPMDCASVAATEGLHKNV

XP_022142611.1 probable aspartyl protease At4g16563 [Momordica charantia]1.5e-24382.06Show/hide
Query:  SISTSIATKFLSFFLLLV-YVSRKTLASNPKTNNFP-TDSLVIGLAHSRTSLLTPKKGYNS---MSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGT
        S +T+I++K L+FFLLL+  +S    A+ P  NNFP TDSLV+GL HSRTSLLTPK+GY S    S    K ME    DNVIEPLREIRDGYL+SLTLGT
Subjt:  SISTSIATKFLSFFLLLV-YVSRKTLASNPKTNNFP-TDSLVIGLAHSRTSLLTPKKGYNS---MSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS
        TYG+SGVV GTLT+D++LMHG    SP ST QIPRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Subjt:  TYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS

Query:  KDGHLQFTPLLKSPIYPNYYYIGLESITIGN--GNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDL
        KD  LQFTPLLKSP+YPNYYYIGLES+TIG+  GNNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDL
Subjt:  KDGHLQFTPLLKSPIYPNYYYIGLESITIGN--GNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDL

Query:  CYKVPCKNN-NF-SFIDD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDL
        CYK+PCKNN NF S +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTV+KCLLFQSMD G GGD D +   DGPAGIFGSFQQQN+EVVYDL
Subjt:  CYKVPCKNN-NF-SFIDD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDL

Query:  EKERLGFQPMDCASVAATEGLHKN
        +KER+GFQ MDCAS AA++GLHKN
Subjt:  EKERLGFQPMDCASVAATEGLHKN

XP_023520027.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]7.0e-24181.68Show/hide
Query:  ISTSIATKFLSFFLLLVYVSRKTLA---SNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQ
        +++ +A  F    L+LV VS + +    +NPKT  F  DSLV+GL HSRTSLLTPK+GYNS+SRKR+K MEM +DD VIEPLREIRDGYLMSLTLGTPPQ
Subjt:  ISTSIATKFLSFFLLLVYVSRKTLA---SNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQ

Query:  VIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYG
        VIQVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYG
Subjt:  VIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYG

Query:  SSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDG
        +SG+VIGTLT+D++ +HG   NSP S+++IP+FCFGCVGA+YREPIGIAGFGRGLLSLP+QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+ 
Subjt:  SSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDG

Query:  HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVP
        HL+FTPLLKSP YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+ISYPRAK+ ELNTGFDLCYKVP
Subjt:  HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVP

Query:  CKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPM
         KNN F F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAP NSTV+KCLLFQSMD    GD       DGPAGIFGSFQQQNLEVVYDLEKERLGF+ M
Subjt:  CKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPM

Query:  DCASVAATEGLHK
        DCASVA ++GLHK
Subjt:  DCASVAATEGLHK

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida]2.5e-27893.02Show/hide
Query:  MPSISTSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQ
        M SISTS A K LS+FLLLVYVSRKTLA+NPKTN  P DSLVIGL HSRT+LLTPKKGYN +SRKRMKAMEM  DDNVIEPLREIRDGYLMSLTLGTPPQ
Subjt:  MPSISTSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQ

Query:  VIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYG
        VIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSFAYTYG
Subjt:  VIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYG

Query:  SSGVVIGTLTRDILLMHGNNINSP-ISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
        +SGVVIGTLTRD+LLMH NNINSP  STK+ PRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA+SSKD
Subjt:  SSGVVIGTLTRDILLMHGNNINSP-ISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD

Query:  GHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKV
         HLQFTPLLKSPIYPNYYYIGLESITIGNGN+NFRFGVSF LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKV
Subjt:  GHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKV

Query:  PCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQP
        PCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQ NNFYAMAAPINSTV+KCLLFQSM DGVGGD DDD  RDGPAGIFGSFQQQNLEVVYDLEKERLGFQP
Subjt:  PCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQP

Query:  MDCASVAATEGLHKNV
        MDCA VAAT+GLHKNV
Subjt:  MDCASVAATEGLHKNV

TrEMBL top hitse value%identityAlignment
A0A0A0LYP0 Peptidase A1 domain-containing protein5.6e-26888.01Show/hide
Query:  MPSIST-SIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEM-DSDDNVIEPLREIRDGYLMSLTLGTP
        MPSIS+ S ATKFLS FLLLV+VS +TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGYN +S+KRMKAM+  D DDNVIEPLREIRDGYLMSL++GTP
Subjt:  MPSIST-SIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEM-DSDDNVIEPLREIRDGYLMSLTLGTP

Query:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
        PQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAYT
Subjt:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT

Query:  YGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
        YG+SGVV G+LTRD+L  HGN  N+  + KQIPRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK
Subjt:  YGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK

Query:  DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYK
        D +LQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYK
Subjt:  DGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYK

Query:  VPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQ
        VPCKNNN SF+DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGSFQQQN+EVVYDLEKERLGFQ
Subjt:  VPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQ

Query:  PMDCASVAATEGLHKNV
        PMDC SVAA +GLHKNV
Subjt:  PMDCASVAATEGLHKNV

A0A1S3CAK9 aspartic proteinase nepenthesin-23.5e-27088.68Show/hide
Query:  MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTP
        MPSI STSIATKFLS FLLLV+ S++TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGYN +S+KRMKAM +MD DDNVIEPLREIRDGYLMSL++GTP
Subjt:  MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTP

Query:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
        PQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
Subjt:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT

Query:  YGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        YG+SGVV G+LTRD+L MHG    NN N+  + KQ+PRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  YGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD
        ISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD
Subjt:  ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD

Query:  LCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKER
        LCYKVPCKNNN SF+DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGSFQQQNL+VVYDLEKER
Subjt:  LCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKER

Query:  LGFQPMDCASVAATEGLHKNV
        LGFQ MDC SVAA +GLHKNV
Subjt:  LGFQPMDCASVAATEGLHKNV

A0A5A7TNC9 Aspartic proteinase nepenthesin-23.5e-27088.68Show/hide
Query:  MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTP
        MPSI STSIATKFLS FLLLV+ S++TLA+NPKT NFP DSLV+GL HSRTSLLTPKKGYN +S+KRMKAM +MD DDNVIEPLREIRDGYLMSL++GTP
Subjt:  MPSI-STSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAM-EMDSDDNVIEPLREIRDGYLMSLTLGTP

Query:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
        PQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT
Subjt:  PQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYT

Query:  YGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        YG+SGVV G+LTRD+L MHG    NN N+  + KQ+PRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  YGSSGVVIGTLTRDILLMHG----NNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD
        ISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD
Subjt:  ISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFD

Query:  LCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKER
        LCYKVPCKNNN SF+DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTV+KCLL+QSM DGVG DND D+  +GPAGIFGSFQQQNL+VVYDLEKER
Subjt:  LCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKER

Query:  LGFQPMDCASVAATEGLHKNV
        LGFQ MDC SVAA +GLHKNV
Subjt:  LGFQPMDCASVAATEGLHKNV

A0A6J1CMP8 probable aspartyl protease At4g165637.3e-24482.06Show/hide
Query:  SISTSIATKFLSFFLLLV-YVSRKTLASNPKTNNFP-TDSLVIGLAHSRTSLLTPKKGYNS---MSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGT
        S +T+I++K L+FFLLL+  +S    A+ P  NNFP TDSLV+GL HSRTSLLTPK+GY S    S    K ME    DNVIEPLREIRDGYL+SLTLGT
Subjt:  SISTSIATKFLSFFLLLV-YVSRKTLASNPKTNNFP-TDSLVIGLAHSRTSLLTPKKGYNS---MSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS
        TYG+SGVV GTLT+D++LMHG    SP ST QIPRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Subjt:  TYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS

Query:  KDGHLQFTPLLKSPIYPNYYYIGLESITIGN--GNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDL
        KD  LQFTPLLKSP+YPNYYYIGLES+TIG+  GNNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTGFDL
Subjt:  KDGHLQFTPLLKSPIYPNYYYIGLESITIGN--GNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDL

Query:  CYKVPCKNN-NF-SFIDD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDL
        CYK+PCKNN NF S +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTV+KCLLFQSMD G GGD D +   DGPAGIFGSFQQQN+EVVYDL
Subjt:  CYKVPCKNN-NF-SFIDD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDL

Query:  EKERLGFQPMDCASVAATEGLHKN
        +KER+GFQ MDCAS AA++GLHKN
Subjt:  EKERLGFQPMDCASVAATEGLHKN

A0A6J1EHM1 probable aspartyl protease At4g165632.7e-23881.69Show/hide
Query:  ATKFLSFFLLLVYVSRKTLA---SNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVY
        A  +    L+LV VS + +    +NPKT  F  DSLV+GL HSRTSLLTPK+GYNS+  KR+K MEM +DD VIEPLREIRDGYLMSLTLGTPPQVIQVY
Subjt:  ATKFLSFFLLLVYVSRKTLA---SNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVY

Query:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVV
        MDTGSDLTWVPCGNLSFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYG+SG+V
Subjt:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVV

Query:  IGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFT
        IGTLT+D++ +HG   NSP S+++IP+FCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+ HL+FT
Subjt:  IGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFT

Query:  PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN
        P LKSP YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+ISYPRAK+ ELNTGFDLCYKVP KNN 
Subjt:  PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNN

Query:  FSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
        F F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAP NSTV+KCLLFQSMD    GD       DGPAGIFGSFQQQNLEVVYDLEKERLGF+ MDCASV
Subjt:  FSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

Query:  AATEGLHK
        A ++GLHK
Subjt:  AATEGLHK

SwissProt top hitse value%identityAlignment
Q6XBF8 Aspartic proteinase CDR18.6e-3228.79Show/hide
Query:  NSMSRKRMKAMEMDSDDNVIEPLREIRDG---YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCG
        N++ R   +       DN  +P  ++      YLM++++GTPP  I    DTGSDL W  C      C DC    + +  PK        SST    +C 
Subjt:  NSMSRKRMKAMEMDSDDNVIEPLREIRDG---YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCG

Query:  SSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFG
        SS C  + +          A CS       TC     S++ +YG +    G +  D L + G++   P+  K I     GC     G   ++  GI G G
Subjt:  SSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFG

Query:  RGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGG
         G +SL  QLG S  G FS+C +P   ++  + +S +  G  AI S  G +  TPL+       +YY+ L+SI++G+    +    S           G 
Subjt:  RGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGG

Query:  MLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLL
        ++IDSGTT T LP   YS+L   + S  S    K+ +  +G  LCY         S   D ++P IT HF +   V L   N F  +     S  + C  
Subjt:  MLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLL

Query:  FQSMDDGVGGDNDDDNGRDGPA-GIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
        F                R  P+  I+G+  Q N  V YD   + + F+P DCA +
Subjt:  FQSMDDGVGGDNDDDNGRDGPA-GIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

Q766C2 Aspartic proteinase nepenthesin-23.5e-3327.07Show/hide
Query:  KRMKAME--MDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMD
        +RM+++   + S   +  P+      YLM++ +GTP       MDTGSDL W  C      C  C      +       F P  SS+     C S +C D
Subjt:  KRMKAME--MDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMD

Query:  IHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSL
        +     P + C    C                + Y YG      G +  +              T  +P   FGC     G       G+ G G G LSL
Subjt:  IHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSL

Query:  PFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT
        P QLG     FS+C   +  S+     S L LG+ A    +G    T L+ S + P YYYI L+ IT+G  N     G+     ++   G GGM+IDSGT
Subjt:  PFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT

Query:  TYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDD-
        T T+LP+  Y+ +       I+ P     E ++G   C++ P   +        Q+P I+  F   V  +  Q      + +P    +  CL   S    
Subjt:  TYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDD-

Query:  GVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS
        G+               IFG+ QQQ  +V+YDL+   + F P  C +
Subjt:  GVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS

Q766C3 Aspartic proteinase nepenthesin-12.3e-3229.08Show/hide
Query:  SRKRMKAMEMDSDDNVIEPLREIRDG-YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCM
        SR+  +   M +  + +E      DG YLM+L++GTP Q     MDTGSDL W         CQ C +  N         F P  SS+     C S  C 
Subjt:  SRKRMKAMEMDSDDNVIEPLREIRDG-YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCM

Query:  DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLS
         + S                     TC      + Y YG      G++  + L            +  IP   FGC     G       G+ G GRG LS
Subjt:  DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLS

Query:  LPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSG
        LP QL  +   FS+C  P   S   N    L+LG+LA S   G    T L++S   P +YYI L  +++G+         +F L      G GG++IDSG
Subjt:  LPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSG

Query:  TTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDD
        TT T+     Y  +     S I+ P       ++GFDLC++ P   +N       Q+P+   HF +   + LP  N F    +P N  +  CL   S   
Subjt:  TTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDD

Query:  GVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS
        G+               IFG+ QQQN+ VVYD     + F    C +
Subjt:  GVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS

Q940R4 Probable aspartyl protease At4g165632.7e-5432.72Show/hide
Query:  HSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTH
        HS + L   K   +  S +  +         +  P+    D YL+SL++G+    + +Y+DTGSDL W PC    F C  CE   +  L P   + L   
Subjt:  HSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTH

Query:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYR
        SS++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G ++  L  D L          + +  +  F FGC   +  
Subjt:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYR

Query:  EPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA------ISSKDGH------------LQFTPLLKSPIYPNYYYIG
        EPIG+AGFGRG LSLP QL     H G  FS+C +   F S+     SPLILG         + + D H              FT +L++P +P +Y + 
Subjt:  EPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA------ISSKDGH------------LQFTPLLKSPIYPNYYYIG

Query:  LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS--YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT
        L+ I+IG  N          LR ID  G GG+++DSGTT+T LP   Y+ ++   +S +   + RA +VE ++G   CY +             ++P++ 
Subjt:  LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS--YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT

Query:  FHFL-NNVSVVLPQGNNFYAMA----APINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
         HF  N  SV LP+ N FY              I CL+  +     GGD  +  G  G   I G++QQQ  EVVYDL   R+GF    CAS+
Subjt:  FHFL-NNVSVVLPQGNNFYAMA----APINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

Q9LNJ3 Aspartyl protease family protein 21.7e-3228.31Show/hide
Query:  NVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIA
        +V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S          A
Subjt:  NVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIA

Query:  GCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC--------VGASYREPIGIAGFGRGLLSLPFQLG--FSH
        GC+     + TC      +  +YG     +G  + + L    N +              GC        VGA+     G+ G G+G LS P Q G  F+ 
Subjt:  GCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC--------VGASYREPIGIAGFGRGLLSLPFQLG--FSH

Query:  KGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP
        K FS+C +    S+ P   S ++ GN A+S      +FTPLL +P    +YY+GL  I++G        GV+  L ++D  GNGG++IDSGT+ T L  P
Subjt:  KGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP

Query:  LYSQLISNLE-SVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDD
         Y  +         +  RA    L   FD C+       + S +++ ++P++  HF     V LP  N       P+++    C  F             
Subjt:  LYSQLISNLE-SVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDD

Query:  DNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA
          G  G   I G+ QQQ   VVYDL   R+GF P  CA
Subjt:  DNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA

Arabidopsis top hitse value%identityAlignment
AT2G03200.1 Eukaryotic aspartyl protease family protein8.2e-3828.71Show/hide
Query:  STSIATKFLSFFLLL------VYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPK---------KGYNSMSRKRMKAM-----EMDSDDNVIEPLRE
        S+S ++    FFL+L      V  SR++L       N P     + L H  +     K         +G++ ++R    A+     + D  +N+  P   
Subjt:  STSIATKFLSFFLLL------VYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPK---------KGYNSMSRKRMKAM-----EMDSDDNVIEPLRE

Query:  IRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLV
            +LM L++G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  +  S+   D             
Subjt:  IRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLV

Query:  KGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSN
        K  C      + YTYG      G L  +       N  S I         FGC     G  + +  G+ G GRG LSL  QL      FS+C    + S 
Subjt:  KGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSN

Query:  NPNFSSPLILGNLAI-------SSKDGHLQFT-PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLI
            SS L +G+LA        +S DG +  T  LL++P  P++YY+ L+ IT+G      R  V     E+   G GGM+IDSGTT T+L E  +  L 
Subjt:  NPNFSSPLILGNLAI-------SSKDGHLQFT-PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLI

Query:  SNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGP
            S +S P       +TG DLC+K+P    N +      +P + FHF     + LP G N+  M A  +ST + CL                 G    
Subjt:  SNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGP

Query:  AGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
          IFG+ QQQN  V++DLEKE + F P +C  +
Subjt:  AGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

AT3G25700.1 Eukaryotic aspartyl protease family protein2.6e-3929.91Show/hide
Query:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        Y + L +G PPQ + +  DTGSDL WV C      C++C  +           F P HSST     C    C  +   D        A     T +  TC
Subjt:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC---------VGASYREPIGIAGFGRGLLSLPFQLG--FSHKGFSHCFLPFK
              + Y Y    +  G   R+   +      S     ++    FGC          G S+    G+ G GRG +S   QLG  F +K FS+C + + 
Subjt:  PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGC---------VGASYREPIGIAGFGRGLLSLPFQLG--FSHKGFSHCFLPFK

Query:  FSNNPNFSSPLILGNLAISSKDG--HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL
         S  P  +S LI+GN      DG   L FTPLL +P+ P +YY+ L+S+ +    N  +  +   + EID  GNGG ++DSGTT   L EP Y  +I+ +
Subjt:  FSNNPNFSSPLILGNLAISSKDG--HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL

Query:  ESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGI
           +  P A    L  GFDLC  V           +  LP + F F      V P  N F           I+CL  QS+D  VG              +
Subjt:  ESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGI

Query:  FGSFQQQNLEVVYDLEKERLGFQPMDCA
         G+  QQ     +D ++ RLGF    CA
Subjt:  FGSFQQQNLEVVYDLEKERLGFQPMDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein1.5e-4433.1Show/hide
Query:  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-AAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG
        GY +SL+ GTP Q I    DTGS L W+PC +  + C  C+    + L P L   F+P +SS+S    C S  C  ++    P   C   GC   T    
Subjt:  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-AAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG

Query:  TCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSS
         C   CP +   YG  G   G L  +        ++ P  T  +P F  GC   S R+P GIAGFGRG +SLP Q+    K FSHC +  +F ++ N ++
Subjt:  TCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSS

Query:  PLILGNLA---ISSKDGHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV
         L L   +     SK   L +TP  K+P   N     YYY+ L  I +G  +      + +K     T G+GG ++DSG+T+T +  P++  +     S 
Subjt:  PLILGNLA---ISSKDGHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV

Query:  IS-YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFG
        +S Y R K +E  TG   C+ +  K        D  +P + F F     + LP  N F      + +T   CL        V     + +G  GPA I G
Subjt:  IS-YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFG

Query:  SFQQQNLEVVYDLEKERLGFQPMDCA
        SFQQQN  V YDLE +R GF    C+
Subjt:  SFQQQNLEVVYDLEKERLGFQPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein1.9e-5532.72Show/hide
Query:  HSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTH
        HS + L   K   +  S +  +         +  P+    D YL+SL++G+    + +Y+DTGSDL W PC    F C  CE   +  L P   + L   
Subjt:  HSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTH

Query:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYR
        SS++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G ++  L  D L          + +  +  F FGC   +  
Subjt:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGSSGVVIGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYR

Query:  EPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA------ISSKDGH------------LQFTPLLKSPIYPNYYYIG
        EPIG+AGFGRG LSLP QL     H G  FS+C +   F S+     SPLILG         + + D H              FT +L++P +P +Y + 
Subjt:  EPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA------ISSKDGH------------LQFTPLLKSPIYPNYYYIG

Query:  LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS--YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT
        L+ I+IG  N          LR ID  G GG+++DSGTT+T LP   Y+ ++   +S +   + RA +VE ++G   CY +             ++P++ 
Subjt:  LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS--YPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSIT

Query:  FHFL-NNVSVVLPQGNNFYAMA----APINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
         HF  N  SV LP+ N FY              I CL+  +     GGD  +  G  G   I G++QQQ  EVVYDL   R+GF    CAS+
Subjt:  FHFL-NNVSVVLPQGNNFYAMA----APINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

AT5G45120.1 Eukaryotic aspartyl protease family protein1.1e-17058.98Show/hide
Query:  TKFLSFFLLLVYVSRKTLASNPKTNNFPTDS----LVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVY
        T  L  FLL+  +   T  +  + +  P+ S    LV+ L  S  SL TPK    S +++R+K   + S D V+EPLRE+RDGYL++L +GTPPQ +QVY
Subjt:  TKFLSFFLLLVYVSRKTLASNPKTNNFPTDS----LVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVY

Query:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVV
        +DTGSDLTWVPCGNLSFDC +C + +NN L    + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPSFAYTYG  G++
Subjt:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVV

Query:  IGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDGHLQF
         G LTRDIL            T+ +PRF FGCV ++YREPIGIAGFGRGLLSLP QLGF  KGFSHCFLPFKF NNPN SSPLILG  A+S +    LQF
Subjt:  IGTLTRDILLMHGNNINSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDGHLQF

Query:  TPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNN
        TP+L +P+YPN YYIGLESITI  G N     V   LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ L+S I+YPRA + E  TGFDLCYKVPC NN
Subjt:  TPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNN

Query:  NFSFIDDSQL---PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMD
        N + +++  +   PSITFHFLNN +++LPQGN+FYAM+AP + +V++CLLFQ+M+D          G  GPAG+FGSFQQQN++VVYDLEKER+GFQ MD
Subjt:  NFSFIDDSQL---PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMD

Query:  CASVAATEGLHK
        C   AA+ GL++
Subjt:  CASVAATEGLHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCAATATCAACCTCCATTGCAACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTTCAAACCCTAAAACCAATAATTTCCC
CACAGATTCTCTAGTTATTGGTCTTGCTCATTCAAGAACATCCCTCCTTACCCCTAAAAAAGGCTATAATTCCATGTCAAGGAAGAGAATGAAGGCAATGGAAATGGATA
GTGATGATAATGTAATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACAGGAAGT
GATCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTC
TTCTACTTCTATTAGAGACACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTG
TGAAGGGCACTTGCCCTAGACCATGCCCTTCATTTGCTTACACTTATGGGTCAAGTGGGGTTGTAATTGGAACTTTAACAAGAGATATCCTTTTAATGCATGGAAATAAT
ATTAATTCTCCAATTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGTATAGCTGGTTTTGGTAGAGGTTTACTTTC
TCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTG
CTATTTCTTCAAAAGATGGCCATTTACAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAGTCAATCACTATAGGAAATGGGAAT
AATAATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACCTATACTCATTTACCTGAACCTTT
GTATTCACAACTTATTTCAAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAACTCAATACAGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAAACA
ACAATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCT
CCAATTAACTCCACTGTGATTAAATGCTTGTTGTTTCAAAGCATGGATGACGGTGTTGGTGGCGATAACGATGACGATAACGGCCGAGATGGGCCGGCGGGCATTTTCGG
AAGCTTCCAACAACAAAATTTAGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTGAAGGACTCCACAAGA
ATGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCAATATCAACCTCCATTGCAACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTTCAAACCCTAAAACCAATAATTTCCC
CACAGATTCTCTAGTTATTGGTCTTGCTCATTCAAGAACATCCCTCCTTACCCCTAAAAAAGGCTATAATTCCATGTCAAGGAAGAGAATGAAGGCAATGGAAATGGATA
GTGATGATAATGTAATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACAGGAAGT
GATCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTC
TTCTACTTCTATTAGAGACACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTG
TGAAGGGCACTTGCCCTAGACCATGCCCTTCATTTGCTTACACTTATGGGTCAAGTGGGGTTGTAATTGGAACTTTAACAAGAGATATCCTTTTAATGCATGGAAATAAT
ATTAATTCTCCAATTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGTATAGCTGGTTTTGGTAGAGGTTTACTTTC
TCTTCCTTTTCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGCTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTG
CTATTTCTTCAAAAGATGGCCATTTACAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAGTCAATCACTATAGGAAATGGGAAT
AATAATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACCTATACTCATTTACCTGAACCTTT
GTATTCACAACTTATTTCAAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAACTCAATACAGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAAACA
ACAATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCCCAAGGGAACAATTTCTATGCCATGGCTGCT
CCAATTAACTCCACTGTGATTAAATGCTTGTTGTTTCAAAGCATGGATGACGGTGTTGGTGGCGATAACGATGACGATAACGGCCGAGATGGGCCGGCGGGCATTTTCGG
AAGCTTCCAACAACAAAATTTAGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTGAAGGACTCCACAAGA
ATGTTTGA
Protein sequenceShow/hide protein sequence
MPSISTSIATKFLSFFLLLVYVSRKTLASNPKTNNFPTDSLVIGLAHSRTSLLTPKKGYNSMSRKRMKAMEMDSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGS
DLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGSSGVVIGTLTRDILLMHGNN
INSPISTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDGHLQFTPLLKSPIYPNYYYIGLESITIGNGN
NNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVELNTGFDLCYKVPCKNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAA
PINSTVIKCLLFQSMDDGVGGDNDDDNGRDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATEGLHKNV