; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0014619 (gene) of Chayote v1 genome

Gene IDSed0014619
OrganismSechium edule (Chayote v1)
DescriptionPeptidase A1 domain-containing protein
Genome locationLG01:14437592..14439151
RNA-Seq ExpressionSed0014619
SyntenySed0014619
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583807.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia]9.9e-23582.3Show/hide
Query:  SIAAK---VLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI-NDNVVEPLREIRDGYLMSLTLG
        SIAA+   VL   L+LV VS   +GQTL AN K+K   +  SLVLGL+HSRTSLLTPK+GY NS+SRK+  IKPME+ ND+V+EPLREIRDGYLMSLTLG
Subjt:  SIAAK---VLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI-NDNVVEPLREIRDGYLMSLTLG

Query:  APPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFA
         PPQV+QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNVLGPKLA FLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSF+
Subjt:  APPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFA

Query:  YTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS
        YTYGASG+VIGTLT+D I IHG  NS NS+++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS
Subjt:  YTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS

Query:  KDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLC
        K+H L+FTPLLKSP YPNYYYIGLESITIGNG NYSRFGVS  +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQ+ISNLE++ISYPRAK+ E+NTGFDLC
Subjt:  KDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLC

Query:  YRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERL
        Y+VP KNNT F  D+F +LPSITFHFLNNVSVVLPQGN+FYAMAAPSNSTVVKCLLFQSMDG           GDGPAGIFGSFQQQNLEVVYDLE ERL
Subjt:  YRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERL

Query:  GFQPMDCASAAASQ
        GF+ MDCAS A SQ
Subjt:  GFQPMDCASAAASQ

XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus]5.6e-23882.23Show/hide
Query:  SIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLTLGAPP
        S A K LS FLLLV+VS     QTLA N K+  P +  SLVLGL+HSRTSLLTPKKGY N IS+K+   +   + +DNV+EPLREIRDGYLMSL++G PP
Subjt:  SIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLTLGAPP

Query:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTY
        QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GP+LA FLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVK TCPRPCPSFAYTY
Subjt:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTY

Query:  GASGVVIGTLTRDSISIHGNF-NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
        GASGVV G+LTRD +  HGN+ N+ N+ KQIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
Subjt:  GASGVVIGTLTRDSISIHGNF-NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD

Query:  HNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYR
         NLQFTPLLKSP+YPNYYYIGLESITIGNG+N  RFGVS FKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVE+NTGFDLCY+
Subjt:  HNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYR

Query:  VPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGF
        VPCKNN     DD  QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD   D ++  +GPAGIFGSFQQQN+EVVYDLE ERLGF
Subjt:  VPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGF

Query:  QPMDCASAAASQ
        QPMDC S AA Q
Subjt:  QPMDCASAAASQ

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]3.9e-23981.38Show/hide
Query:  SSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLT
        S +  SIA K LS FLLLV+ S     QTLA N K+  P +  SLVLGL+HSRTSLLTPKKGY N IS+K+   +  M+ +DNV+EPLREIRDGYLMSL+
Subjt:  SSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLT

Query:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS
        +G PPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLA FLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPS
Subjt:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS

Query:  FAYTYGASGVVIGTLTRDSISIHGNF-----NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        FAYTYGASGVV G+LTRD + +HGN+     N+ N+ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  FAYTYGASGVVIGTLTRDSISIHGNF-----NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEM
        G+LAISSKD NLQFTPLLKSPIYPNYYYIGLESITIGNGNN  RFGVS FKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE+VISYPRAKQVE+
Subjt:  GNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEM

Query:  NTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVY
        NTGFDLCY+VPCKNN     DD  QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD   D ++  +GPAGIFGSFQQQNL+VVY
Subjt:  NTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVY

Query:  DLENERLGFQPMDCASAAASQ
        DLE ERLGFQ MDC S AA+Q
Subjt:  DLENERLGFQPMDCASAAASQ

XP_022142611.1 probable aspartyl protease At4g16563 [Momordica charantia]6.0e-24080.65Show/hide
Query:  STPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI--NDNVVEPLREIRDGYLMSLT
        S+  +I++KVL+FFLLL+ +  +++ +T A  H++  P N  SLVLGL+HSRTSLLTPK+GY +      S+ KPME   +DNV+EPLREIRDGYL+SLT
Subjt:  STPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI--NDNVVEPLREIRDGYLMSLT

Query:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS
        LG PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPS
Subjt:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS

Query:  FAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        FAYTYGASGVV GTLT+D I +HG   S NST QIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAI
Subjt:  FAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKDHNLQFTPLLKSPIYPNYYYIGLESITIGN--GNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTG
        SSKDH+LQFTPLLKSP+YPNYYYIGLES+TIG+  GNN SRFGVS  KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE+V++YPRAKQVE+NTG
Subjt:  SSKDHNLQFTPLLKSPIYPNYYYIGLESITIGN--GNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTG

Query:  FDLCYRVPCKNNTLFGG--DDF--DQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVV
        FDLCY++PCKNNT F    DD+  + LPSITFHFLNNVSVVLPQGNNFYAMAAP+NSTVVKCLLFQSMDGGG GD D     DGPAGIFGSFQQQN+EVV
Subjt:  FDLCYRVPCKNNTLFGG--DDF--DQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVV

Query:  YDLENERLGFQPMDCASAAASQ
        YDL+ ER+GFQ MDCAS+AASQ
Subjt:  YDLENERLGFQPMDCASAAASQ

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida]1.1e-24183.43Show/hide
Query:  MSSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLT
        M+S   S A K+LS+FLLLVYVS     +TLA N K+  P +  SLV+GL+HSRT+LLTPKKGY N ISRK+  +K ME++DNV+EPLREIRDGYLMSLT
Subjt:  MSSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLT

Query:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS
        LG PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLA FLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK+TCPRPCPS
Subjt:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS

Query:  FAYTYGASGVVIGTLTRDSISIH-GNFNSQN-STKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL
        FAYTYGASGVVIGTLTRD + +H  N NS N STK+ PRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL
Subjt:  FAYTYGASGVVIGTLTRDSISIH-GNFNSQN-STKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL

Query:  AISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTG
        A+SSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGN+  RFGVS F LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE+VISYPRAKQVE+NTG
Subjt:  AISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTG

Query:  FDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDG-GGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDL
        FDLCY+VPCKNN  F   D  QLPSITFHFLNNVSVVLPQ NNFYAMAAP NSTVVKCLLFQSMDG GGD D D     DGPAGIFGSFQQQNLEVVYDL
Subjt:  FDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDG-GGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDL

Query:  ENERLGFQPMDCASAAASQ
        E ERLGFQPMDCA  AA+Q
Subjt:  ENERLGFQPMDCASAAASQ

TrEMBL top hitse value%identityAlignment
A0A0A0LYP0 Peptidase A1 domain-containing protein2.7e-23882.23Show/hide
Query:  SIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLTLGAPP
        S A K LS FLLLV+VS     QTLA N K+  P +  SLVLGL+HSRTSLLTPKKGY N IS+K+   +   + +DNV+EPLREIRDGYLMSL++G PP
Subjt:  SIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLTLGAPP

Query:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTY
        QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GP+LA FLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVK TCPRPCPSFAYTY
Subjt:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTY

Query:  GASGVVIGTLTRDSISIHGNF-NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
        GASGVV G+LTRD +  HGN+ N+ N+ KQIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
Subjt:  GASGVVIGTLTRDSISIHGNF-NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD

Query:  HNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYR
         NLQFTPLLKSP+YPNYYYIGLESITIGNG+N  RFGVS FKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVE+NTGFDLCY+
Subjt:  HNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYR

Query:  VPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGF
        VPCKNN     DD  QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD   D ++  +GPAGIFGSFQQQN+EVVYDLE ERLGF
Subjt:  VPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGF

Query:  QPMDCASAAASQ
        QPMDC S AA Q
Subjt:  QPMDCASAAASQ

A0A1S3CAK9 aspartic proteinase nepenthesin-21.9e-23981.38Show/hide
Query:  SSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLT
        S +  SIA K LS FLLLV+ S     QTLA N K+  P +  SLVLGL+HSRTSLLTPKKGY N IS+K+   +  M+ +DNV+EPLREIRDGYLMSL+
Subjt:  SSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLT

Query:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS
        +G PPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLA FLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPS
Subjt:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS

Query:  FAYTYGASGVVIGTLTRDSISIHGNF-----NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        FAYTYGASGVV G+LTRD + +HGN+     N+ N+ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  FAYTYGASGVVIGTLTRDSISIHGNF-----NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEM
        G+LAISSKD NLQFTPLLKSPIYPNYYYIGLESITIGNGNN  RFGVS FKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE+VISYPRAKQVE+
Subjt:  GNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEM

Query:  NTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVY
        NTGFDLCY+VPCKNN     DD  QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD   D ++  +GPAGIFGSFQQQNL+VVY
Subjt:  NTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVY

Query:  DLENERLGFQPMDCASAAASQ
        DLE ERLGFQ MDC S AA+Q
Subjt:  DLENERLGFQPMDCASAAASQ

A0A5A7TNC9 Aspartic proteinase nepenthesin-21.9e-23981.38Show/hide
Query:  SSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLT
        S +  SIA K LS FLLLV+ S     QTLA N K+  P +  SLVLGL+HSRTSLLTPKKGY N IS+K+   +  M+ +DNV+EPLREIRDGYLMSL+
Subjt:  SSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKK-SLIKPMEINDNVVEPLREIRDGYLMSLT

Query:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS
        +G PPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLA FLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPS
Subjt:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS

Query:  FAYTYGASGVVIGTLTRDSISIHGNF-----NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        FAYTYGASGVV G+LTRD + +HGN+     N+ N+ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  FAYTYGASGVVIGTLTRDSISIHGNF-----NSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEM
        G+LAISSKD NLQFTPLLKSPIYPNYYYIGLESITIGNGNN  RFGVS FKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE+VISYPRAKQVE+
Subjt:  GNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEM

Query:  NTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVY
        NTGFDLCY+VPCKNN     DD  QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG GD   D ++  +GPAGIFGSFQQQNL+VVY
Subjt:  NTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVY

Query:  DLENERLGFQPMDCASAAASQ
        DLE ERLGFQ MDC S AA+Q
Subjt:  DLENERLGFQPMDCASAAASQ

A0A6J1CMP8 probable aspartyl protease At4g165632.9e-24080.65Show/hide
Query:  STPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI--NDNVVEPLREIRDGYLMSLT
        S+  +I++KVL+FFLLL+ +  +++ +T A  H++  P N  SLVLGL+HSRTSLLTPK+GY +      S+ KPME   +DNV+EPLREIRDGYL+SLT
Subjt:  STPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI--NDNVVEPLREIRDGYLMSLT

Query:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS
        LG PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPS
Subjt:  LGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPS

Query:  FAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        FAYTYGASGVV GTLT+D I +HG   S NST QIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAI
Subjt:  FAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKDHNLQFTPLLKSPIYPNYYYIGLESITIGN--GNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTG
        SSKDH+LQFTPLLKSP+YPNYYYIGLES+TIG+  GNN SRFGVS  KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE+V++YPRAKQVE+NTG
Subjt:  SSKDHNLQFTPLLKSPIYPNYYYIGLESITIGN--GNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTG

Query:  FDLCYRVPCKNNTLFGG--DDF--DQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVV
        FDLCY++PCKNNT F    DD+  + LPSITFHFLNNVSVVLPQGNNFYAMAAP+NSTVVKCLLFQSMDGGG GD D     DGPAGIFGSFQQQN+EVV
Subjt:  FDLCYRVPCKNNTLFGG--DDF--DQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVV

Query:  YDLENERLGFQPMDCASAAASQ
        YDL+ ER+GFQ MDCAS+AASQ
Subjt:  YDLENERLGFQPMDCASAAASQ

A0A6J1EHM1 probable aspartyl protease At4g165632.6e-23381.32Show/hide
Query:  SIAAKVLSFF---LLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI-NDNVVEPLREIRDGYLMSLTLG
        SIAA+  S+F   L+LV VS   +GQTL AN K+K   +  SLVLGL+HSRTSLLTPK+GY + ++++   IKPME+ ND+V+EPLREIRDGYLMSLTLG
Subjt:  SIAAKVLSFF---LLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEI-NDNVVEPLREIRDGYLMSLTLG

Query:  APPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFA
         PPQV+QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNVLGPKLA FLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSF+
Subjt:  APPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFA

Query:  YTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS
        YTYGASG+VIGTLT+D I IHG  NS NS+++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS
Subjt:  YTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISS

Query:  KDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLC
        K+H L+FTP LKSP YPNYYYIGLESITIGNG NYSRFGVS  +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLE++ISYPRAK+ E+NTGFDLC
Subjt:  KDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLC

Query:  YRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERL
        Y+VP KNNT F  D+F +LPSITFHFLNNVSVVLPQGN+FYAMAAPSNSTVVKCLLFQSMDG           GDGPAGIFGSFQQQNLEVVYDLE ERL
Subjt:  YRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERL

Query:  GFQPMDCASAAASQ
        GF+ MDCAS A SQ
Subjt:  GFQPMDCASAAASQ

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-21.4e-3427.51Show/hide
Query:  KKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDT
        K+  +    R +S+   ++ +  +  P+      YLM++ +G P       MDTGSDL W  C      C  C      +       F P  SS+     
Subjt:  KKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDT

Query:  CGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC----VGATYREPIGIAGF
        C S +C D+     P + C    C                + Y YG      G +  ++ +          T  +P   FGC     G       G+ G 
Subjt:  CGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC----VGATYREPIGIAGF

Query:  GRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGV--SNFKLREIDTKG
        G G LSLPSQLG     FS+C   +  S+     S L LG+ A S        T L+ S + P YYYI L+ IT+G  N     G+  S F+L++    G
Subjt:  GRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGV--SNFKLREIDTKG

Query:  NGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVV
         GGM+IDSGTT T+LP+  Y+ +       I+ P     E ++G   C++ P   +T+       Q+P I+  F   V  +  Q      + +P+   + 
Subjt:  NGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVV

Query:  KCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCASA
         CL   S    G               IFG+ QQQ  +V+YDL+N  + F P  C ++
Subjt:  KCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCASA

Q766C3 Aspartic proteinase nepenthesin-13.2e-3427.18Show/hide
Query:  NSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSI---SRKKSLIKPMEINDNVVEPLREIRDG-YLMSLTL
        +S+ + +L+  ++ ++V+  +     A NH+ +       ++L  + S  + LT  +  + +I   SR+   ++ M    + VE      DG YLM+L++
Subjt:  NSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSI---SRKKSLIKPMEINDNVVEPLREIRDG-YLMSLTL

Query:  GAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSF
        G P Q     MDTGSDL W         CQ C +  N         F P  SS+     C S  C  + S                     TC      +
Subjt:  GAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSF

Query:  AYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
         Y YG      G++  ++++          +  IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S   N    L+LG+
Subjt:  AYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN

Query:  LAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNT
        LA +S       T L++S   P +YYI L  +++G+    +R  +           G GG++IDSGTT T+     Y  +     + I+ P       ++
Subjt:  LAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNT

Query:  GFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDL
        GFDLC++ P   + L       Q+P+   HF +   + LP  N F    +PSN  +  CL   S   G                IFG+ QQQN+ VVYD 
Subjt:  GFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDL

Query:  ENERLGFQPMDCASA
         N  + F    C ++
Subjt:  ENERLGFQPMDCASA

Q8S9J6 Aspartyl protease family protein At5g107701.7e-2425.87Show/hide
Query:  TLAANHKSKIPTNNHSLVLGLLHSRTSLL---TPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNL
        T +  +  K  + +H  +L L  +R + +     KK   + +S  KS   P +    +          Y++++ LG P   + +  DTGSDLTW  C   
Subjt:  TLAANHKSKIPTNNHSLVLGLLHSRTSLL---TPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNL

Query:  SFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNF
           C D +E            F P+ S++    +C S+ C  + S+      C+ + C                +   YG     +G L ++  ++    
Subjt:  SFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNF

Query:  NSQNSTKQIPRFCFGC---VGATYREPIGIAGFGRGLLSLPSQLGFSH-KGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYY
            ++       FGC       +    G+ G GR  LS PSQ   ++ K FS+C LP    ++ +++  L  G+  IS    +++FTP+       ++Y
Subjt:  NSQNSTKQIPRFCFGC---VGATYREPIGIAGFGRGLLSLPSQLGFSH-KGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYY

Query:  YIGLESITIGNGNNYSRFGVSNFKLREIDTK-GNGGMLIDSGTTYTHLPEPLYSQLISNLETVIS-YPRAKQVE-MNTGFDLCYRVPCKNNTLFGGDDFD
         + + +IT+G             KL    T     G LIDSGT  T LP   Y+ L S+ +  +S YP    V  ++T FDL             G    
Subjt:  YIGLESITIGNGNNYSRFGVSNFKLREIDTK-GNGGMLIDSGTTYTHLPEPLYSQLISNLETVIS-YPRAKQVE-MNTGFDLCYRVPCKNNTLFGGDDFD

Query:  QLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCA
         +P + F F     V L     FY            CL F        G+SD     D  A IFG+ QQQ LEVVYD    R+GF P  C+
Subjt:  QLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCA

Q940R4 Probable aspartyl protease At4g165631.9e-5535.11Show/hide
Query:  YLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTC
        YL+SL++G+    V +Y+DTGSDL W PC    F C  CE   +  L P   + L   SS++   +C S  C   HSS    D C I+ C L  +    C
Subjt:  YLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTC

Query:  ---PRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNN
             PCP F Y YG  G ++  L  DS+S+         +  +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+ 
Subjt:  ---PRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNN

Query:  PNFSSPLILGNLA------------------ISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTH
            SPLILG                        K +   FT +L++P +P +Y + L+ I+IG  N       +   LR ID  G GG+++DSGTT+T 
Subjt:  PNFSSPLILGNLA------------------ISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLISNLETVIS--YPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFL-NNVSVVLPQGNNFYAMA----APSNSTVVKCLLFQS
        LP   Y+ ++   ++ +   + RA +VE ++G   CY +   N T+       ++P++  HF  N  SV LP+ N FY              + CL+   
Subjt:  LPEPLYSQLISNLETVIS--YPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFL-NNVSVVLPQGNNFYAMA----APSNSTVVKCLLFQS

Query:  MDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCAS
        M+G   GD     GG G   I G++QQQ  EVVYDL N R+GF    CAS
Subjt:  MDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCAS

Q9LNJ3 Aspartyl protease family protein 22.7e-3328.15Show/hide
Query:  KPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNP
        +P   + +VV  L +    Y   L +G P + V + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S    
Subjt:  KPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNP

Query:  FDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLG--FSHKG
              AGC+     + TC      +  +YG     +G  + ++++       +N  K +   C       +    G+ G G+G LS P Q G  F+ K 
Subjt:  FDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLG--FSHKG

Query:  FSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPL
        FS+C +    S+ P   S ++ GN A+S      +FTPLL +P    +YY+GL  I++G G        S FKL +I   GNGG++IDSGT+ T L  P 
Subjt:  FSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPL

Query:  YSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVE
        Y  +       +     K+    + FD C+ +   N          ++P++  HF     V LP  N  Y +   +N     C  F    GG        
Subjt:  YSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVE

Query:  NGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCA
                I G+ QQQ   VVYDL + R+GF P  CA
Subjt:  NGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCA

Arabidopsis top hitse value%identityAlignment
AT2G03200.1 Eukaryotic aspartyl protease family protein8.6e-3531.25Show/hide
Query:  KPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNP
        KP + N N+  P       +LM L++G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  +  S+  
Subjt:  KPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNP

Query:  FDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSH
         D             K  C      + YTYG      G L  ++ +    F  +NS   I    FGC     G  + +  G+ G GRG LSL SQL    
Subjt:  FDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSH

Query:  KGFSHCFLPFKFSNNPNFSSPLILGNLAI-------SSKDHNLQFT-PLLKSPIYPNYYYIGLESITIGNGNNYSRFGV--SNFKLREIDTKGNGGMLID
          FS+C    + S     SS L +G+LA        +S D  +  T  LL++P  P++YY+ L+ IT+G      R  V  S F+L E    G GGM+ID
Subjt:  KGFSHCFLPFKFSNNPNFSSPLILGNLAI-------SSKDHNLQFT-PLLKSPIYPNYYYIGLESITIGNGNNYSRFGV--SNFKLREIDTKGNGGMLID

Query:  SGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQS
        SGTT T+L E  +  L     + +S P       +TG DLC+++P     +        +P + FHF     + LP G N+  M A S ST V CL   S
Subjt:  SGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQS

Query:  MDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDC
         +G                 IFG+ QQQN  V++DLE E + F P +C
Subjt:  MDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDC

AT3G25700.1 Eukaryotic aspartyl protease family protein1.6e-3629.14Show/hide
Query:  YLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTC
        Y + L +G PPQ + +  DTGSDL WV C      C++C  +           F P HSST     C    C  +   D        A     T + STC
Subjt:  YLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTC

Query:  PRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKF
              + Y Y    +  G   R++ S+  +   +   K +    FGC          G ++    G+ G GRG +S  SQLG  F +K FS+C + +  
Subjt:  PRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKF

Query:  SNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLET
        S  P  +S LI+GN         L FTPLL +P+ P +YY+ L+S+ +    N ++  +    + EID  GNGG ++DSGTT   L EP Y  +I+ +  
Subjt:  SNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLET

Query:  VISYPRAKQVEMNTGFDLCYRVP--CKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAG
         +  P A    +  GFDLC  V    K   +        LP + F F      V P  N F           ++CL  QS+D                  
Subjt:  VISYPRAKQVEMNTGFDLCYRVP--CKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAG

Query:  IFGSFQQQNLEVVYDLENERLGFQPMDCA
        + G+  QQ     +D +  RLGF    CA
Subjt:  IFGSFQQQNLEVVYDLENERLGFQPMDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein9.1e-4532.48Show/hide
Query:  GYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-ATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKS
        GY +SL+ G P Q +    DTGS L W+PC +  + C  C+    + L P L   F+P +SS+S    C S  C  ++    P   C   GC   T    
Subjt:  GYLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-ATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKS

Query:  TCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
         C   CP +   YG  G   G L  + +              +P F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N ++ 
Subjt:  TCPRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGNLA---ISSKDHNLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETV
        L L   +     SK   L +TP  K+P   N     YYY+ L  I +G      +     +K     T G+GG ++DSG+T+T +  P++  +     + 
Subjt:  LILGNLA---ISSKDHNLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETV

Query:  IS-YPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVE-NGGDGPAGI
        +S Y R K +E  TG   C+ +  K +          +P + F F     + LP  N F  +     +T   CL   S       D  V  +GG GPA I
Subjt:  IS-YPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVE-NGGDGPAGI

Query:  FGSFQQQNLEVVYDLENERLGFQPMDCA
         GSFQQQN  V YDLEN+R GF    C+
Subjt:  FGSFQQQNLEVVYDLENERLGFQPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein1.4e-5635.11Show/hide
Query:  YLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTC
        YL+SL++G+    V +Y+DTGSDL W PC    F C  CE   +  L P   + L   SS++   +C S  C   HSS    D C I+ C L  +    C
Subjt:  YLMSLTLGAPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTC

Query:  ---PRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNN
             PCP F Y YG  G ++  L  DS+S+         +  +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+ 
Subjt:  ---PRPCPSFAYTYGASGVVIGTLTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNN

Query:  PNFSSPLILGNLA------------------ISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTH
            SPLILG                        K +   FT +L++P +P +Y + L+ I+IG  N       +   LR ID  G GG+++DSGTT+T 
Subjt:  PNFSSPLILGNLA------------------ISSKDHNLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLISNLETVIS--YPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFL-NNVSVVLPQGNNFYAMA----APSNSTVVKCLLFQS
        LP   Y+ ++   ++ +   + RA +VE ++G   CY +   N T+       ++P++  HF  N  SV LP+ N FY              + CL+   
Subjt:  LPEPLYSQLISNLETVIS--YPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFL-NNVSVVLPQGNNFYAMA----APSNSTVVKCLLFQS

Query:  MDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCAS
        M+G   GD     GG G   I G++QQQ  EVVYDL N R+GF    CAS
Subjt:  MDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCAS

AT5G45120.1 Eukaryotic aspartyl protease family protein7.0e-17059.52Show/hide
Query:  FFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDT
        F  LL+ +      +T A  HK+   +++  LVL L  S  SL TPK   Q  I       KP+   D V+EPLRE+RDGYL++L +G PPQ VQVY+DT
Subjt:  FFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQVYMDT

Query:  GSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGT
        GSDLTWVPCGNLSFDC +C + +NN L    + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+KSTC RPCPSFAYTYG  G++ G 
Subjt:  GSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGT

Query:  LTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDHNLQFTPLL
        LTRD +           T+ +PRF FGCV +TYREPIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG  A+S +   +LQFTP+L
Subjt:  LTRDSISIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDHNLQFTPLL

Query:  KSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTL-
         +P+YPN YYIGLESITIG     ++  ++   LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ L++ I+YPRA + E  TGFDLCY+VPC NN L 
Subjt:  KSPIYPNYYYIGLESITIGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTL-

Query:  -FGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCAS
            D     PSITFHFLNN +++LPQGN+FYAM+APS+ +VV+CLLFQ+M          E+G  GPAG+FGSFQQQN++VVYDLE ER+GFQ MDC  
Subjt:  -FGGDDFDQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCAS

Query:  AAAS
         AAS
Subjt:  AAAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTCAACACCAAACTCCATTGCAGCCAAAGTCTTGAGCTTTTTCCTCCTTCTTGTGTATGTCTCATGTATAAACTTGGGACAAACCCTAGCAGCAAACCATAAAAG
CAAAATCCCAACAAACAATCATTCTCTAGTTCTTGGTCTTCTTCATTCTAGAACCTCCCTTCTCACCCCCAAAAAAGGCTACCAAAATTCCATTTCAAGGAAGAAATCAT
TAATCAAGCCAATGGAAATTAATGATAATGTGGTAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCCTAACATTAGGGGCACCCCCACAAGTTGTCCAAGTC
TATATGGACACTGGAAGTGACCTTACATGGGTGCCTTGTGGGAACCTCTCCTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGTCCAAAGTTAGCAAC
TTTTTTGCCTACTCATTCTTCAACCTCAATTAGAGACACTTGTGGAAGCTCCTTTTGTATGGATATTCATAGCTCAGATAACCCTTTTGATCCTTGTACAATTGCTGGCT
GTTCCCTTGCTACCCTTGTTAAGAGCACCTGCCCTAGACCTTGCCCTTCTTTTGCTTACACTTATGGTGCAAGTGGGGTTGTGATTGGAACCCTAACAAGAGATTCCATT
TCTATCCATGGAAATTTTAATTCCCAAAATTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCTACTTATAGAGAGCCAATTGGCATTGCTGGCTTTGG
GAGAGGTTTACTTTCTCTTCCTTCTCAATTAGGGTTTTCTCATAAGGGCTTTTCTCATTGCTTTTTGCCCTTTAAATTCTCTAATAACCCTAATTTTTCAAGTCCTTTGA
TTCTTGGGAACCTTGCTATTTCTTCAAAAGACCATAATTTGCAATTCACCCCTTTGTTGAAAAGTCCAATTTACCCCAATTATTACTATATTGGGCTTGAGTCAATTACA
ATTGGGAATGGGAACAATTATTCTAGATTTGGGGTTTCTAATTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCTGGCACTACTTATAC
TCATTTGCCTGAACCATTGTATTCACAATTGATTTCTAATCTTGAGACAGTGATAAGCTATCCTAGAGCCAAACAAGTTGAAATGAACACTGGATTTGATCTTTGTTACA
GAGTTCCTTGTAAAAACAACACCTTGTTTGGTGGTGATGACTTTGATCAACTTCCTTCTATAACATTCCATTTCTTGAACAATGTTAGTGTTGTTTTGCCTCAAGGGAAC
AACTTTTATGCCATGGCTGCTCCAAGTAACTCCACTGTGGTGAAATGCTTGTTGTTTCAAAGCATGGATGGCGGCGGTGACGGCGATAGTGATGTCGAGAATGGCGGAGA
TGGGCCGGCGGGGATATTTGGAAGCTTTCAACAGCAGAATTTGGAGGTTGTTTATGACTTGGAGAATGAAAGGTTAGGGTTTCAACCAATGGATTGTGCTTCTGCTGCTG
CCTCTCAAAGGACTCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCTCAACACCAAACTCCATTGCAGCCAAAGTCTTGAGCTTTTTCCTCCTTCTTGTGTATGTCTCATGTATAAACTTGGGACAAACCCTAGCAGCAAACCATAAAAG
CAAAATCCCAACAAACAATCATTCTCTAGTTCTTGGTCTTCTTCATTCTAGAACCTCCCTTCTCACCCCCAAAAAAGGCTACCAAAATTCCATTTCAAGGAAGAAATCAT
TAATCAAGCCAATGGAAATTAATGATAATGTGGTAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCCTAACATTAGGGGCACCCCCACAAGTTGTCCAAGTC
TATATGGACACTGGAAGTGACCTTACATGGGTGCCTTGTGGGAACCTCTCCTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGTCCAAAGTTAGCAAC
TTTTTTGCCTACTCATTCTTCAACCTCAATTAGAGACACTTGTGGAAGCTCCTTTTGTATGGATATTCATAGCTCAGATAACCCTTTTGATCCTTGTACAATTGCTGGCT
GTTCCCTTGCTACCCTTGTTAAGAGCACCTGCCCTAGACCTTGCCCTTCTTTTGCTTACACTTATGGTGCAAGTGGGGTTGTGATTGGAACCCTAACAAGAGATTCCATT
TCTATCCATGGAAATTTTAATTCCCAAAATTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCTACTTATAGAGAGCCAATTGGCATTGCTGGCTTTGG
GAGAGGTTTACTTTCTCTTCCTTCTCAATTAGGGTTTTCTCATAAGGGCTTTTCTCATTGCTTTTTGCCCTTTAAATTCTCTAATAACCCTAATTTTTCAAGTCCTTTGA
TTCTTGGGAACCTTGCTATTTCTTCAAAAGACCATAATTTGCAATTCACCCCTTTGTTGAAAAGTCCAATTTACCCCAATTATTACTATATTGGGCTTGAGTCAATTACA
ATTGGGAATGGGAACAATTATTCTAGATTTGGGGTTTCTAATTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCTGGCACTACTTATAC
TCATTTGCCTGAACCATTGTATTCACAATTGATTTCTAATCTTGAGACAGTGATAAGCTATCCTAGAGCCAAACAAGTTGAAATGAACACTGGATTTGATCTTTGTTACA
GAGTTCCTTGTAAAAACAACACCTTGTTTGGTGGTGATGACTTTGATCAACTTCCTTCTATAACATTCCATTTCTTGAACAATGTTAGTGTTGTTTTGCCTCAAGGGAAC
AACTTTTATGCCATGGCTGCTCCAAGTAACTCCACTGTGGTGAAATGCTTGTTGTTTCAAAGCATGGATGGCGGCGGTGACGGCGATAGTGATGTCGAGAATGGCGGAGA
TGGGCCGGCGGGGATATTTGGAAGCTTTCAACAGCAGAATTTGGAGGTTGTTTATGACTTGGAGAATGAAAGGTTAGGGTTTCAACCAATGGATTGTGCTTCTGCTGCTG
CCTCTCAAAGGACTCCATAA
Protein sequenceShow/hide protein sequence
MSSTPNSIAAKVLSFFLLLVYVSCINLGQTLAANHKSKIPTNNHSLVLGLLHSRTSLLTPKKGYQNSISRKKSLIKPMEINDNVVEPLREIRDGYLMSLTLGAPPQVVQV
YMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLATFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKSTCPRPCPSFAYTYGASGVVIGTLTRDSI
SIHGNFNSQNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHNLQFTPLLKSPIYPNYYYIGLESIT
IGNGNNYSRFGVSNFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLETVISYPRAKQVEMNTGFDLCYRVPCKNNTLFGGDDFDQLPSITFHFLNNVSVVLPQGN
NFYAMAAPSNSTVVKCLLFQSMDGGGDGDSDVENGGDGPAGIFGSFQQQNLEVVYDLENERLGFQPMDCASAAASQRTP