CuGenDBv2

Sgr020810 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020810
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Peptidase A1 domain-containing protein
Genome location: tig00153574:116197..117741
RNA-Seq Expression: Sgr020810
Synteny: Sgr020810
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
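The genome location listed above follows a `contig:start..end` pattern. As a minimal sketch, the Python below splits such a string and computes the span length. The function name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end, length).

    Assumes 1-based, inclusive coordinates (an assumption; the page
    does not document its coordinate convention).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Inclusive span: both endpoints count toward the length.
    return contig, start, end, end - start + 1

contig, start, end, length = parse_location("tig00153574:116197..117741")
print(contig, start, end, length)
```

Under the inclusive-coordinate assumption, the span for Sgr020810 works out to 1545 bp.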


Homology
GenBank top hits | e value | % identity | Alignment
KAG6583807.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.9e-236 | % identity: 81.67
Query:  ILSFFLLLVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMA--DVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGS
        +L   L+LV VSG  +GQ L      F  DSLVLGLV SRTSLL PKRGY  +SR R KPMEM   DVIEPLRE+RDGYL+SLTLGTP QV+QVYMDTGS
Subjt:  ILSFFLLLVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMA--DVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGS

Query:  DLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLT
        DLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSS+SIRDTCGSSFCIDIHSSDNPFDPCTI+GCSLATLVK TCPRPCPSF+YTYGASG+V GTLT
Subjt:  DLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLT

Query:  RDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYP
        +DVI +HG+SPNSSR+IP+FCFGCVGATYREP+GIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+HL+FTPLLKSP YP
Subjt:  RDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYP

Query:  NYYYIGLESITIGNGNN-SRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQL
        NYYYIGLESITIGNG N SRFGVSL+LRE DTKGNGG+LIDSGTTYTHLPEPLYSQ++SNLES+ISYPRAK+ E+NTGFDLCYKVP +NNT  S D+ +L
Subjt:  NYYYIGLESITIGNGNN-SRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQL

Query:  PSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGL
        PSITFHFLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMDG G             DGPAGIFGSFQQQN+EVVYDLEKER+GF+ +DCAS A SQGL
Subjt:  PSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGL

Query:  HK
        HK
Subjt:  HK

XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus] | e value: 7.3e-238 | % identity: 80
Query:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT
        MP+ ++  T A K LS FLLLV+VS     Q L   PK +FP DSLVLGLV SRTSLL PK+GY +IS+ R K M+  D    VIEPLRE+RDGYL+SL+
Subjt:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT

Query:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS
        +GTP QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LAAFLPTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLA+LVK TCPRPCPS
Subjt:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS

Query:  FAYTYGASGVVTGTLTRDVIVMHG---SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        FAYTYGASGVVTG+LTRDV+  HG   ++ N+++QIPRFCFGCVGATYREP+GIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
Subjt:  FAYTYGASGVVTGTLTRDVIVMHG---SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNG-NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFD
        ISSKD +LQFTPLLKSP+YPNYYYIGLESITIGNG NN RFGVS KLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLE VI YPRAKQVE+NTGFD
Subjt:  ISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNG-NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFD

Query:  LCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLE
        LCYKVPC+NN +S  DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD      DN+  ++GPAGIFGSFQQQN+EVVYDLE
Subjt:  LCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLE

Query:  KERIGFQSVDCASAAASQGLHKNFR
        KER+GFQ +DC S AA QGLHKN R
Subjt:  KERIGFQSVDCASAAASQGLHKNFR

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] | e value: 2.3e-239 | % identity: 79.77
Query:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT
        MP+ ++T + A K LS FLLLV+ S     Q L   PK +FP DSLVLGLV SRTSLL PK+GY +IS+ R K M+  D    VIEPLRE+RDGYL+SL+
Subjt:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT

Query:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS
        +GTP QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLATLVK TCPRPCPS
Subjt:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS

Query:  FAYTYGASGVVTGTLTRDVIVMHG-------SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        FAYTYGASGVVTG+LTRDV+ MHG       ++ N+++Q+PRFCFGCVGATYREP+GIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  FAYTYGASGVVTGTLTRDVIVMHG-------SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNS-RFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN
        G+LAISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNN+ RFGVS KLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLESVISYPRAKQVE+N
Subjt:  GNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNS-RFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN

Query:  TGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVV
        TGFDLCYKVPC+NN +S  DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD      DN+  ++GPAGIFGSFQQQN++VV
Subjt:  TGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVV

Query:  YDLEKERIGFQSVDCASAAASQGLHKNFR
        YDLEKER+GFQ++DC S AA+QGLHKN R
Subjt:  YDLEKERIGFQSVDCASAAASQGLHKNFR

XP_022142611.1 probable aspartyl protease At4g16563 [Momordica charantia] | e value: 1.2e-243 | % identity: 81.27
Query:  MPAT-TATPTFAAKILSFFLLLVYVSGIKLGQNLVAKP--KSFP-TDSLVLGLVRSRTSLLFPKRGYY-----ISRTRKPME---MADVIEPLREVRDGY
        MP T ++  T ++K+L+FFLLL+ +  +       A+P   +FP TDSLVLGLV SRTSLL PKRGY+      S   KPME     +VIEPLRE+RDGY
Subjt:  MPAT-TATPTFAAKILSFFLLLVYVSGIKLGQNLVAKP--KSFP-TDSLVLGLVRSRTSLLFPKRGYY-----ISRTRKPME---MADVIEPLREVRDGY

Query:  LISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCP
        LISLTLGTP QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLA F PTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLATLVK TCP
Subjt:  LISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCP

Query:  RPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
        RPCPSFAYTYGASGVVTGTLT+DVI+MHG SPNS+ QIPRFCFGCVGATYREP+GIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+
Subjt:  RPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN

Query:  LAISSKDH-LQFTPLLKSPIYPNYYYIGLESITIGNG---NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN
        LAISSKDH LQFTPLLKSP+YPNYYYIGLES+TIG+G   NNSRFGVSLKLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLESV++YPRAKQVEIN
Subjt:  LAISSKDH-LQFTPLLKSPIYPNYYYIGLESITIGNG---NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN

Query:  TGFDLCYKVPCQNNT--TSSADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD-GSGDENGNGNDNNDGEDGPAGIFGSFQQ
        TGFDLCYK+PC+NNT  +SS DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD G GD +GNG+D     DGPAGIFGSFQQ
Subjt:  TGFDLCYKVPCQNNT--TSSADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD-GSGDENGNGNDNNDGEDGPAGIFGSFQQ

Query:  QNMEVVYDLEKERIGFQSVDCASAAASQGLHKNF
        QNMEVVYDL+KERIGFQ++DCAS+AASQGLHKNF
Subjt:  QNMEVVYDLEKERIGFQSVDCASAAASQGLHKNF

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida] | e value: 1.7e-239 | % identity: 83.24
Query:  TFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD-VIEPLREVRDGYLISLTLGTPAQVVQVY
        +FA KILS+FLLLVYVS     + L   PK + P DSLV+GLV SRT+LL PK+GY +ISR R K MEM D VIEPLRE+RDGYL+SLTLGTP QV+QVY
Subjt:  TFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD-VIEPLREVRDGYLISLTLGTPAQVVQVY

Query:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVV
        MDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLATLVKATCPRPCPSFAYTYGASGVV
Subjt:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVV

Query:  TGTLTRDVIVMH---GSSPNSS-RQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD-HLQF
         GTLTRDV++MH    +SPNSS ++ PRFCFGCVGA+YREP+GIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA+SSKD HLQF
Subjt:  TGTLTRDVIVMH---GSSPNSS-RQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD-HLQF

Query:  TPLLKSPIYPNYYYIGLESITIGNGN-NSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNN
        TPLLKSPIYPNYYYIGLESITIGNGN N RFGVS  LRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLESVISYPRAKQVE+NTGFDLCYKVPC+NN
Subjt:  TPLLKSPIYPNYYYIGLESITIGNGN-NSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNN

Query:  TTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVD
          S  DDSQLPSITFHFLNNVSVVLPQ NNFYAMAAP NSTVVKCLLFQSMDG       G D +D  DGPAGIFGSFQQQN+EVVYDLEKER+GFQ +D
Subjt:  TTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVD

Query:  CASAAASQGLHKN
        CA  AA+QGLHKN
Subjt:  CASAAASQGLHKN

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LYP0 Peptidase A1 domain-containing protein | e value: 3.5e-238 | % identity: 80
Query:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT
        MP+ ++  T A K LS FLLLV+VS     Q L   PK +FP DSLVLGLV SRTSLL PK+GY +IS+ R K M+  D    VIEPLRE+RDGYL+SL+
Subjt:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT

Query:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS
        +GTP QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LAAFLPTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLA+LVK TCPRPCPS
Subjt:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS

Query:  FAYTYGASGVVTGTLTRDVIVMHG---SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        FAYTYGASGVVTG+LTRDV+  HG   ++ N+++QIPRFCFGCVGATYREP+GIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
Subjt:  FAYTYGASGVVTGTLTRDVIVMHG---SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNG-NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFD
        ISSKD +LQFTPLLKSP+YPNYYYIGLESITIGNG NN RFGVS KLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLE VI YPRAKQVE+NTGFD
Subjt:  ISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNG-NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFD

Query:  LCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLE
        LCYKVPC+NN +S  DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD      DN+  ++GPAGIFGSFQQQN+EVVYDLE
Subjt:  LCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLE

Query:  KERIGFQSVDCASAAASQGLHKNFR
        KER+GFQ +DC S AA QGLHKN R
Subjt:  KERIGFQSVDCASAAASQGLHKNFR

A0A1S3CAK9 aspartic proteinase nepenthesin-2 | e value: 1.1e-239 | % identity: 79.77
Query:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT
        MP+ ++T + A K LS FLLLV+ S     Q L   PK +FP DSLVLGLV SRTSLL PK+GY +IS+ R K M+  D    VIEPLRE+RDGYL+SL+
Subjt:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT

Query:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS
        +GTP QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLATLVK TCPRPCPS
Subjt:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS

Query:  FAYTYGASGVVTGTLTRDVIVMHG-------SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        FAYTYGASGVVTG+LTRDV+ MHG       ++ N+++Q+PRFCFGCVGATYREP+GIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  FAYTYGASGVVTGTLTRDVIVMHG-------SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNS-RFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN
        G+LAISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNN+ RFGVS KLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLESVISYPRAKQVE+N
Subjt:  GNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNS-RFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN

Query:  TGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVV
        TGFDLCYKVPC+NN +S  DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD      DN+  ++GPAGIFGSFQQQN++VV
Subjt:  TGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVV

Query:  YDLEKERIGFQSVDCASAAASQGLHKNFR
        YDLEKER+GFQ++DC S AA+QGLHKN R
Subjt:  YDLEKERIGFQSVDCASAAASQGLHKNFR

A0A5A7TNC9 Aspartic proteinase nepenthesin-2 | e value: 1.1e-239 | % identity: 79.77
Query:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT
        MP+ ++T + A K LS FLLLV+ S     Q L   PK +FP DSLVLGLV SRTSLL PK+GY +IS+ R K M+  D    VIEPLRE+RDGYL+SL+
Subjt:  MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPK-SFPTDSLVLGLVRSRTSLLFPKRGY-YISRTR-KPMEMAD----VIEPLREVRDGYLISLT

Query:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS
        +GTP QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLATLVK TCPRPCPS
Subjt:  LGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPS

Query:  FAYTYGASGVVTGTLTRDVIVMHG-------SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        FAYTYGASGVVTG+LTRDV+ MHG       ++ N+++Q+PRFCFGCVGATYREP+GIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  FAYTYGASGVVTGTLTRDVIVMHG-------SSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNS-RFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN
        G+LAISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNN+ RFGVS KLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLESVISYPRAKQVE+N
Subjt:  GNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNS-RFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN

Query:  TGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVV
        TGFDLCYKVPC+NN +S  DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD      DN+  ++GPAGIFGSFQQQN++VV
Subjt:  TGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVV

Query:  YDLEKERIGFQSVDCASAAASQGLHKNFR
        YDLEKER+GFQ++DC S AA+QGLHKN R
Subjt:  YDLEKERIGFQSVDCASAAASQGLHKNFR

A0A6J1CMP8 probable aspartyl protease At4g16563 | e value: 5.6e-244 | % identity: 81.27
Query:  MPAT-TATPTFAAKILSFFLLLVYVSGIKLGQNLVAKP--KSFP-TDSLVLGLVRSRTSLLFPKRGYY-----ISRTRKPME---MADVIEPLREVRDGY
        MP T ++  T ++K+L+FFLLL+ +  +       A+P   +FP TDSLVLGLV SRTSLL PKRGY+      S   KPME     +VIEPLRE+RDGY
Subjt:  MPAT-TATPTFAAKILSFFLLLVYVSGIKLGQNLVAKP--KSFP-TDSLVLGLVRSRTSLLFPKRGYY-----ISRTRKPME---MADVIEPLREVRDGY

Query:  LISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCP
        LISLTLGTP QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLA F PTHSS+SIRDTCGSSFC+DIHSSDNPFDPCTI+GCSLATLVK TCP
Subjt:  LISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCP

Query:  RPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
        RPCPSFAYTYGASGVVTGTLT+DVI+MHG SPNS+ QIPRFCFGCVGATYREP+GIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+
Subjt:  RPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN

Query:  LAISSKDH-LQFTPLLKSPIYPNYYYIGLESITIGNG---NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN
        LAISSKDH LQFTPLLKSP+YPNYYYIGLES+TIG+G   NNSRFGVSLKLRE DTKGNGGMLIDSGTTYTHLPEPLYSQL+SNLESV++YPRAKQVEIN
Subjt:  LAISSKDH-LQFTPLLKSPIYPNYYYIGLESITIGNG---NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEIN

Query:  TGFDLCYKVPCQNNT--TSSADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD-GSGDENGNGNDNNDGEDGPAGIFGSFQQ
        TGFDLCYK+PC+NNT  +SS DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD G GD +GNG+D     DGPAGIFGSFQQ
Subjt:  TGFDLCYKVPCQNNT--TSSADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMD-GSGDENGNGNDNNDGEDGPAGIFGSFQQ

Query:  QNMEVVYDLEKERIGFQSVDCASAAASQGLHKNF
        QNMEVVYDL+KERIGFQ++DCAS+AASQGLHKNF
Subjt:  QNMEVVYDLEKERIGFQSVDCASAAASQGLHKNF

A0A6J1EHM1 probable aspartyl protease At4g16563 | e value: 2.1e-235 | % identity: 81.69
Query:  LLLVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGY--YISRTRKPMEMA--DVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWV
        L+LV VSG  +GQ L      F  DSLVLGLV SRTSLL PKRGY   +++  KPMEM   DVIEPLRE+RDGYL+SLTLGTP QV+QVYMDTGSDLTWV
Subjt:  LLLVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGY--YISRTRKPMEMA--DVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWV

Query:  PCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIV
        PCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSS+SIRDTCGSSFCIDIHSSDNPFDPCTI+GCSLATLVK TCPRPCPSF+YTYGASG+V GTLT+DVI 
Subjt:  PCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIV

Query:  MHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYI
        +HG+SPNSSR+IP+FCFGCVGATYREP+GIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+HL+FTP LKSP YPNYYYI
Subjt:  MHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYI

Query:  GLESITIGNGNN-SRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITF
        GLESITIGNG N SRFGVSL+LRE DTKGNGG+LIDSGTTYTHLPEPLYSQL+SNLES+ISYPRAK+ E+NTGFDLCYKVP +NNT  S D+ +LPSITF
Subjt:  GLESITIGNGNN-SRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITF

Query:  HFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGLHK
        HFLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMDG G             DGPAGIFGSFQQQN+EVVYDLEKER+GF+++DCAS A SQGLHK
Subjt:  HFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGLHK

SwissProt top hits | e value | % identity | Alignment
Q766C2 Aspartic proteinase nepenthesin-2 | e value: 2.0e-33 | % identity: 28.4
Query:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC
        YL+++ +GTP       MDTGSDL W  C      C  C      +       F P  SSS     C S +C D+     P + C  + C          
Subjt:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC

Query:  PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC----VGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
              + Y YG      G +  +      SS      +P   FGC     G       G+ G G G LSLPSQLG     FS+C   +  S+     S 
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC----VGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEI
        L LG+ A    +    T L+ S + P YYYI L+ IT+G  N    G+     +    G GGM+IDSGTT T+LP+  Y+ +       I+ P     E 
Subjt:  LILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEI

Query:  NTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEV
        ++G   C++ P   +T       Q+P I+  F   V  +  Q      + +P    +  CL      GS  + G              IFG+ QQQ  +V
Subjt:  NTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEV

Query:  VYDLEKERIGFQSVDCASA
        +YDL+   + F    C ++
Subjt:  VYDLEKERIGFQSVDCASA
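In BLAST-style pairwise alignments such as those on this page, the middle line annotates each column: a letter where the query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below, assuming that convention, counts identities and positives from a midline string. `midline_stats` is a hypothetical helper, and the input is the midline of the final block of the Q766C2 alignment as it appears here.

```python
def midline_stats(midline):
    """Count identical and positive columns in a BLAST-style midline.

    Letters mark identical residues; '+' marks a conservative
    substitution. Positives include identities, as in BLAST summaries.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline aligned under the query fragment VYDLEKERIGFQSVDCASA.
midline = "+YDL+   + F    C ++"
identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Trailing spaces in a midline are easily lost when text is copied, so the column count is only reliable if the midline is padded to the length of the query line. The % identity values reported on this page cover each full alignment, not a single block.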

Q766C3 Aspartic proteinase nepenthesin-1 | e value: 1.2e-33 | % identity: 30.02
Query:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC
        YL++L++GTPAQ     MDTGSDL W         CQ C +  N         F P  SSS     C S  C                      L   TC
Subjt:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC

Query:  PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC----VGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
              + Y YG      G++  + +         S  IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S   N    
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC----VGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNG----NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAK
        L+LG+LA S       T L++S   P +YYI L  +++G+     + S F ++         G GG++IDSGTT T+     Y  +     S I+ P   
Subjt:  LILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNG----NNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAK

Query:  QVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQ
            ++GFDLC++ P      S   + Q+P+   HF +   + LP  N F    +P+N  +  CL      GS  +                IFG+ QQQ
Subjt:  QVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQ

Query:  NMEVVYDLEKERIGFQSVDCASA
        NM VVYD     + F S  C ++
Subjt:  NMEVVYDLEKERIGFQSVDCASA

Q7XV21 Aspartyl protease 37 | e value: 5.6e-23 | % identity: 26.06
Query:  PLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSL
        P+      YL+ L +GTP       +DT SDL W  C      C  C    + +  P++++       SS  DTC     +D+H   +  D         
Subjt:  PLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSL

Query:  ATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCV-----GATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKF
              +C      + YTY  +    GTL  D +V+     ++ R +    FGC      GA   +  G+ G GRG LSL SQL  S + F++C LP   
Subjt:  ATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCV-----GATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKF

Query:  SNNPNFSSPLILGNLAISSKDHLQ--FTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNG--------------------GMLIDSGT
        S  P     L+LG  A ++++       P+ + P YP+YYY+ L+ + IG+   S    +                                GM+ID  +
Subjt:  SNNPNFSSPLILGNLAISSKDHLQ--FTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNG--------------------GMLIDSGT

Query:  TYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGS
        T T L   LY +L+++LE  I  PR     +  G DLC+ +P       + D   +P++   F +   + L +   F    A    + + CL+       
Subjt:  TYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGS

Query:  GDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCAS
                     E G   I G+FQQQNM+V+Y+L + R+ F    C +
Subjt:  GDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCAS

Q940R4 Probable aspartyl protease At4g16563 | e value: 2.7e-54 | % identity: 34.82
Query:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC
        YLISL++G+ +  V +Y+DTGSDL W PC    F C  CE      S P       + SSS+   +C S  C   HSS    D C IS C L  +    C
Subjt:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC

Query:  ---PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN
             PCP F Y YG  G +   L  D +    S P+ S  +  F FGC   T  EP+G+AGFGRG LSLP+QL     H G  FS+C +   F S+   
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN

Query:  FSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEP
          SPLILG                         K+   FT +L++P +P +Y + L+ I+IG  N         LR  D  G GG+++DSGTT+T LP  
Subjt:  FSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEP

Query:  LYSQLLSNLESVIS--YPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLLFQSMDGSG
         Y+ ++   +S +   + RA +VE ++G   CY +   N T       ++P++  HF  N  SV LP+ N FY              + CL+        
Subjt:  LYSQLLSNLESVIS--YPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLLFQSMDGSG

Query:  DENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCAS
            NG D ++   G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  DENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCAS

Q9LNJ3 Aspartyl protease family protein 2 | e value: 5.5e-31 | % identity: 27.57
Query:  ADVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTI
        + V+  L +    Y   L +GTPA+ V + +DTGSD+ W+ C      C+ C    + +       F P  S +     C S  C  + S          
Subjt:  ADVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTI

Query:  SGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPF
        +GC+     + TC      +  +YG      G  + + +       N  + +   C       +    G+ G G+G LS P Q G  F+ K FS+C +  
Subjt:  SGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPF

Query:  KFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESV
          S+ P   S ++ GN A+S     +FTPLL +P    +YY+GL  I++G       GV+  L + D  GNGG++IDSGT+ T L  P Y  +       
Subjt:  KFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESV

Query:  ISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGI
        +     K+    + FD C+ +       S+ ++ ++P++  HF     V LP  N  Y +   TN     C  F                  G  G   I
Subjt:  ISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGI

Query:  FGSFQQQNMEVVYDLEKERIGFQSVDCA
         G+ QQQ   VVYDL   R+GF    CA
Subjt:  FGSFQQQNMEVVYDLEKERIGFQSVDCA

Arabidopsis top hits | e value | % identity | Alignment
AT2G03200.1 Eukaryotic aspartyl protease family protein | e value: 5.0e-35 | % identity: 29.8
Query:  KPMEMADVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPF
        KP +  ++  P       +L+ L++G PA      +DTGSDL W  C      C +C +    +       F P  SSS  +  C S  C  +  S+   
Subjt:  KPMEMADVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPF

Query:  DPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC----VGATYREPVGIAGFGRGLLSLPSQLGFSHKGF
        D             K  C      + YTYG      G L  +              I    FGC     G  + +  G+ G GRG LSL SQL      F
Subjt:  DPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC----VGATYREPVGIAGFGRGLLSLPSQLGFSHKGF

Query:  SHCFLPFKFSNNPNFSSPLILGNLA--ISSK-------DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTH
        S+C    + S     SS L +G+LA  I +K       +  +   LL++P  P++YY+ L+ IT+G     R  V     E    G GGM+IDSGTT T+
Subjt:  SHCFLPFKFSNNPNFSSPLILGNLA--ISSK-------DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDEN
        L E  +  L     S +S P       +TG DLC+K+P       +A +  +P + FHF     + LP  N   A     +ST V CL   S +G     
Subjt:  LPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDEN

Query:  GNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDC
                       IFG+ QQQN  V++DLEKE + F   +C
Subjt:  GNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDC

AT3G25700.1 Eukaryotic aspartyl protease family protein | e value: 1.8e-37 | % identity: 27.31
Query:  FAAKILSFFLL----LVYVSGIKLGQNLVAKPKS-FPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPMEM--ADVIEPLREVRDGYLISLTLGTPAQVVQ
        F    LS FLL    +  VS       L    KS FP+ +  L         L  +R +++S  RKP+    + V+         Y + L +G P Q + 
Subjt:  FAAKILSFFLL----LVYVSGIKLGQNLVAKPKS-FPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPMEM--ADVIEPLREVRDGYLISLTLGTPAQVVQ

Query:  VYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASG
        +  DTGSDL WV C      C++C  +           F P HSS+     C    C  +   D        +     T + +TC      + Y Y    
Subjt:  VYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASG

Query:  VVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC---------VGATYREPVGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        + +G   R+   +  SS   +R +    FGC          G ++    G+ G GRG +S  SQLG  F +K FS+C + +  S  P  +S LI+GN   
Subjt:  VVTGTLTRDVIVMHGSSPNSSRQIPRFCFGC---------VGATYREPVGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCY
             L FTPLL +P+ P +YY+ L+S+ +   N ++  +   + E D  GNGG ++DSGTT   L EP Y  +++ +   +  P A    +  GFDLC 
Subjt:  SSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCY

Query:  KVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKER
         V     +  +  +  LP + F F      V P  N F           ++CL  QS+D                     + G+  QQ     +D ++ R
Subjt:  KVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKER

Query:  IGFQSVDCA
        +GF    CA
Subjt:  IGFQSVDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein | e value: 2.6e-44 | % identity: 31.05
Query:  AAKILSFFLL-LVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPME---------MADVIEPLREVRD--GYLISLTLGTPA
        A+ I  FFL+ L  VS +KL  +  +     P D   L L R   S +          + KP E          A V++     +   GY +SL+ GTP+
Subjt:  AAKILSFFLL-LVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPME---------MADVIEPLREVRD--GYLISLTLGTPA

Query:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTY
        Q +    DTGS L W+PC +  + C  C+   + +    +  F+P +SSSS    C S  C  ++    P   C   GC   T     C   CP +   Y
Subjt:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTY

Query:  GASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL----GNLAISSK
        G  G   G L  + +      P+ +  +P F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N ++ L L    G+ + S  
Subjt:  GASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL----GNLAISSK

Query:  DHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVIS-YPRAKQVEINTGFD
          L +TP  K+P   N     YYY+ L  I +G        +  K     T G+GG ++DSG+T+T +  P++  +     S +S Y R K +E  TG  
Subjt:  DHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVIS-YPRAKQVEINTGFD

Query:  LCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLE
         C+ +       S   D  +P + F F     + LP  N F  +     +T   CL   S         +   N  G  GPA I GSFQQQN  V YDLE
Subjt:  LCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLE

Query:  KERIGFQSVDCA
         +R GF    C+
Subjt:  KERIGFQSVDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein | e value: 1.9e-55 | % identity: 34.82
Query:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC
        YLISL++G+ +  V +Y+DTGSDL W PC    F C  CE      S P       + SSS+   +C S  C   HSS    D C IS C L  +    C
Subjt:  YLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATC

Query:  ---PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN
             PCP F Y YG  G +   L  D +    S P+ S  +  F FGC   T  EP+G+AGFGRG LSLP+QL     H G  FS+C +   F S+   
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN

Query:  FSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEP
          SPLILG                         K+   FT +L++P +P +Y + L+ I+IG  N         LR  D  G GG+++DSGTT+T LP  
Subjt:  FSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVSLKLREFDTKGNGGMLIDSGTTYTHLPEP

Query:  LYSQLLSNLESVIS--YPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLLFQSMDGSG
         Y+ ++   +S +   + RA +VE ++G   CY +   N T       ++P++  HF  N  SV LP+ N FY              + CL+        
Subjt:  LYSQLLSNLESVIS--YPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLLFQSMDGSG

Query:  DENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCAS
            NG D ++   G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  DENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCAS

AT5G45120.1 Eukaryotic aspartyl protease family protein (e value: 1.1e-172; %identity: 62.37)
Query:  PKSFPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPMEMADVI-EPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN-VS
        P S  +  LVL L +S  SL  PK      R +KP+   DV+ EPLREVRDGYLI+L +GTP Q VQVY+DTGSDLTWVPCGNLSFDC +C + +NN + 
Subjt:  PKSFPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPMEMADVI-EPLREVRDGYLISLTLGTPAQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN-VS

Query:  GPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCV
         P  + F P HSS+S RD+C SSFC++IHSSDNPFDPC ++GCS++ L+K+TC RPCPSFAYTYG  G+++G LTRD++         +R +PRF FGCV
Subjt:  GPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSPNSSRQIPRFCFGCV

Query:  GATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVS
         +TYREP+GIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG   L+I+  D LQFTP+L +P+YPN YYIGLESITIG  N +   V 
Subjt:  GATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFGVS

Query:  LKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQL---PSITFHFLNNVSVVLPQGNNF
        L LR+FD++GNGGML+DSGTTYTHLPEP YSQLL+ L+S I+YPRA + E  TGFDLCYKVPC NN  +S ++  +   PSITFHFLNN +++LPQGN+F
Subjt:  LKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQL---PSITFHFLNNVSVVLPQGNNF

Query:  YAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGLHK
        YAM+AP++ +VV+CLLFQ+M+             DG+ GPAG+FGSFQQQN++VVYDLEKERIGFQ++DC   AAS GL++
Subjt:  YAMAAPTNSTVVKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGLHK


Sequences
CDS sequence
ATGCCTGCAACAACAGCAACCCCCACTTTTGCAGCTAAAATCTTGAGTTTCTTCCTTCTTCTTGTGTATGTCTCAGGAATAAAGTTGGGACAAAACCTAGTAGCAAAACC
TAAAAGTTTCCCCACAGATTCTTTAGTTCTTGGTCTGGTTCGTTCAAGAACTTCCCTTCTTTTTCCCAAAAGAGGTTACTACATTTCCAGGACGAGAAAGCCAATGGAAA
TGGCAGATGTGATAGAGCCATTGAGGGAGGTTAGAGATGGTTATCTGATTTCTCTTACATTAGGGACCCCCGCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGAT
CTCACTTGGGTTCCATGTGGAAACCTCTCTTTTGATTGCCAAGATTGTGAAGAATATCAGAACAATGTTTCTGGGCCAAAGTTGGCTGCTTTTTTGCCTACCCATTCTTC
TTCCTCCATTAGAGACACTTGTGGTAGCTCCTTCTGCATTGACATCCATAGCTCTGATAACCCTTTTGATCCATGCACAATATCAGGTTGTTCTCTTGCTACCCTTGTGA
AGGCCACTTGCCCTAGGCCATGCCCTTCATTTGCTTACACTTATGGGGCAAGTGGGGTTGTCACTGGGACTTTAACCAGAGATGTCATTGTTATGCATGGGAGCTCCCCA
AATTCCAGTAGGCAAATTCCAAGGTTTTGTTTTGGGTGTGTTGGTGCAACTTATAGAGAGCCAGTTGGCATTGCTGGTTTCGGCAGAGGTCTACTTTCTCTCCCTTCCCA
ACTTGGGTTCTCTCATAAGGGTTTCTCTCACTGCTTCTTGCCCTTTAAGTTCTCGAATAACCCCAACTTTTCAAGCCCTTTGATACTTGGGAACCTAGCCATTTCTTCAA
AAGACCATTTGCAGTTCACTCCTTTGTTGAAAAGTCCAATTTACCCTAACTACTACTATATTGGTCTTGAGTCAATCACCATTGGCAATGGCAATAATTCTAGGTTTGGA
GTTTCTTTGAAATTGAGAGAGTTTGATACAAAGGGTAATGGAGGAATGTTGATTGATTCTGGTACAACTTACACTCATTTACCAGAACCATTGTATTCACAGCTTCTGTC
GAATCTTGAGTCAGTAATAAGCTATCCCAGAGCCAAACAAGTTGAAATCAATACTGGGTTTGATCTTTGTTACAAAGTCCCTTGTCAAAACAACACTACTTCTTCTGCAG
ATGATTCTCAGCTCCCTTCCATAACCTTCCACTTTTTGAACAATGTCAGTGTTGTTTTACCCCAAGGAAACAACTTCTATGCCATGGCTGCTCCAACCAACTCCACTGTG
GTGAAATGCTTGCTGTTTCAGAGCATGGACGGCAGCGGTGACGAAAACGGAAACGGCAACGACAACAACGACGGAGAAGACGGGCCGGCGGGCATTTTCGGAAGCTTCCA
ACAGCAAAATATGGAGGTTGTTTATGACTTGGAGAAGGAAAGAATAGGGTTTCAATCAGTGGACTGTGCTTCGGCTGCAGCCTCTCAGGGACTCCACAAAAATTTCAGAA
GCTAA
mRNA sequence
ATGCCTGCAACAACAGCAACCCCCACTTTTGCAGCTAAAATCTTGAGTTTCTTCCTTCTTCTTGTGTATGTCTCAGGAATAAAGTTGGGACAAAACCTAGTAGCAAAACC
TAAAAGTTTCCCCACAGATTCTTTAGTTCTTGGTCTGGTTCGTTCAAGAACTTCCCTTCTTTTTCCCAAAAGAGGTTACTACATTTCCAGGACGAGAAAGCCAATGGAAA
TGGCAGATGTGATAGAGCCATTGAGGGAGGTTAGAGATGGTTATCTGATTTCTCTTACATTAGGGACCCCCGCACAAGTTGTTCAAGTGTATATGGACACTGGAAGTGAT
CTCACTTGGGTTCCATGTGGAAACCTCTCTTTTGATTGCCAAGATTGTGAAGAATATCAGAACAATGTTTCTGGGCCAAAGTTGGCTGCTTTTTTGCCTACCCATTCTTC
TTCCTCCATTAGAGACACTTGTGGTAGCTCCTTCTGCATTGACATCCATAGCTCTGATAACCCTTTTGATCCATGCACAATATCAGGTTGTTCTCTTGCTACCCTTGTGA
AGGCCACTTGCCCTAGGCCATGCCCTTCATTTGCTTACACTTATGGGGCAAGTGGGGTTGTCACTGGGACTTTAACCAGAGATGTCATTGTTATGCATGGGAGCTCCCCA
AATTCCAGTAGGCAAATTCCAAGGTTTTGTTTTGGGTGTGTTGGTGCAACTTATAGAGAGCCAGTTGGCATTGCTGGTTTCGGCAGAGGTCTACTTTCTCTCCCTTCCCA
ACTTGGGTTCTCTCATAAGGGTTTCTCTCACTGCTTCTTGCCCTTTAAGTTCTCGAATAACCCCAACTTTTCAAGCCCTTTGATACTTGGGAACCTAGCCATTTCTTCAA
AAGACCATTTGCAGTTCACTCCTTTGTTGAAAAGTCCAATTTACCCTAACTACTACTATATTGGTCTTGAGTCAATCACCATTGGCAATGGCAATAATTCTAGGTTTGGA
GTTTCTTTGAAATTGAGAGAGTTTGATACAAAGGGTAATGGAGGAATGTTGATTGATTCTGGTACAACTTACACTCATTTACCAGAACCATTGTATTCACAGCTTCTGTC
GAATCTTGAGTCAGTAATAAGCTATCCCAGAGCCAAACAAGTTGAAATCAATACTGGGTTTGATCTTTGTTACAAAGTCCCTTGTCAAAACAACACTACTTCTTCTGCAG
ATGATTCTCAGCTCCCTTCCATAACCTTCCACTTTTTGAACAATGTCAGTGTTGTTTTACCCCAAGGAAACAACTTCTATGCCATGGCTGCTCCAACCAACTCCACTGTG
GTGAAATGCTTGCTGTTTCAGAGCATGGACGGCAGCGGTGACGAAAACGGAAACGGCAACGACAACAACGACGGAGAAGACGGGCCGGCGGGCATTTTCGGAAGCTTCCA
ACAGCAAAATATGGAGGTTGTTTATGACTTGGAGAAGGAAAGAATAGGGTTTCAATCAGTGGACTGTGCTTCGGCTGCAGCCTCTCAGGGACTCCACAAAAATTTCAGAA
GCTAA
Protein sequence
MPATTATPTFAAKILSFFLLLVYVSGIKLGQNLVAKPKSFPTDSLVLGLVRSRTSLLFPKRGYYISRTRKPMEMADVIEPLREVRDGYLISLTLGTPAQVVQVYMDTGSD
LTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSSSIRDTCGSSFCIDIHSSDNPFDPCTISGCSLATLVKATCPRPCPSFAYTYGASGVVTGTLTRDVIVMHGSSP
NSSRQIPRFCFGCVGATYREPVGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNSRFG
VSLKLREFDTKGNGGMLIDSGTTYTHLPEPLYSQLLSNLESVISYPRAKQVEINTGFDLCYKVPCQNNTTSSADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTV
VKCLLFQSMDGSGDENGNGNDNNDGEDGPAGIFGSFQQQNMEVVYDLEKERIGFQSVDCASAAASQGLHKNFRS