CuGenDBv2

Lag0025339 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025339
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Peptidase A1 domain-containing protein
Genome location: chr10:11587493..11589097
RNA-Seq Expression: Lag0025339
Synteny: Lag0025339
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6583807.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia]: e-value 3.3e-241, identity 83.3%
Query:  SIAAK---VLSFFLLLVYVSA-------INPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPP
        SIAA+   VL   L+LV VS         NPK+ F  D SLVLGLVHSRTSLLTPKRG YNS+SRKRIKPMEM G+DDVIEPLREIRDGYLMSL LGTPP
Subjt:  SIAAK---VLSFFLLLVYVSA-------INPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPP

Query:  QIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTY
        Q++QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTY
Subjt:  QIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTY

Query:  GASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS
        GASG+V GTLT+D I +HGNS NSS      ++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS
Subjt:  GASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS

Query:  SKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCY
        SKEHL+FTPLLKSP YPNYYYIGLESITIGNG N SRFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQ+ISNLES+ISYPRAK+ E+NTGFDLCY
Subjt:  SKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCY

Query:  KVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQ
        KVP KNN T F+D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAPSN TVVKCLLFQSMDG          GDGPAGIFGSFQQQNLEVVYDLEKERLGF+
Subjt:  KVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQ

Query:  PMDCASAAASQGLQK
         MDCAS A SQGL K
Subjt:  PMDCASAAASQGLQK

XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus]: e-value 9.8e-254, identity 84.57%
Query:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEM-GGDDDVIEPLREIRDGYLMSLKLGT
        MPS ++ S A K LS FLLLV+VS    A NPK+NFP D SLVLGLVHSRTSLLTPK+G YN IS+KR+K M+   GDD+VIEPLREIRDGYLMSL +GT
Subjt:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEM-GGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ+VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        TYGASGVVTG+LTRD +  HGN +N+++++   KQIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
Subjt:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFD
        ISSK E+LQFTPLLKSP+YPNYYYIGLESITIGNG+NN RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVE+NTGFD
Subjt:  ISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFD

Query:  LCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERL
        LCYKVPCKNN +SF DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAP N TVVKCLL+QSMDG G D+D DD  +GPAGIFGSFQQQN+EVVYDLEKERL
Subjt:  LCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERL

Query:  GFQPMDCASAAASQGLQKNFRRNNA
        GFQPMDC S AA QGL KN RRN +
Subjt:  GFQPMDCASAAASQGLQKNFRRNNA

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]: e-value 3.3e-257, identity 86.07%
Query:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPM-EMGGDDDVIEPLREIRDGYLMSLKLGT
        MPS ++TSIA K LS FLLLV+ S    A NPK+NFP D SLVLGLVHSRTSLLTPK+G YN IS+KR+K M +M GDD+VIEPLREIRDGYLMSL +GT
Subjt:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPM-EMGGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ+VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGN-SSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL
        TYGASGVVTG+LTRD + MHGN  +N++++S++ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG+L
Subjt:  TYGASGVVTGTLTRDSISMHGN-SSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL

Query:  AISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGF
        AISSK E+LQFTPLLKSPIYPNYYYIGLESITIGNGNNN RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVE+NTGF
Subjt:  AISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGF

Query:  DLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKER
        DLCYKVPCKNN +SF DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP N TVVKCLL+QSMDG G D+D DD  +GPAGIFGSFQQQNL+VVYDLEKER
Subjt:  DLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKER

Query:  LGFQPMDCASAAASQGLQKNFRRN
        LGFQ MDC S AA+QGL KN RRN
Subjt:  LGFQPMDCASAAASQGLQKNFRRN

XP_022142611.1 probable aspartyl protease At4g16563 [Momordica charantia]: e-value 3.1e-247, identity 82.16%
Query:  STATSIAAKVLSFFLLLVYVSAINP------KSNFPTDSSLVLGLVHSRTSLLTPKRGYYN--SISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGT
        S+AT+I++KVL+FFLLL+ + +++       ++NFP   SLVLGLVHSRTSLLTPKRGY++    S    KPME  G D+VIEPLREIRDGYL+SL LGT
Subjt:  STATSIAAKVLSFFLLLVYVSAINP------KSNFPTDSSLVLGLVHSRTSLLTPKRGYYN--SISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ++QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        TYGASGVVTGTLT+D I MHG S N      ST QIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKEH-LQFTPLLKSPIYPNYYYIGLESITIGN--GNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTG
        ISSK+H LQFTPLLKSP+YPNYYYIGLES+TIG+  GNNNSRFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTG
Subjt:  ISSKEH-LQFTPLLKSPIYPNYYYIGLESITIGN--GNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTG

Query:  FDLCYKVPCKNNT--TSFADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVY
        FDLCYK+PCKNNT  +S  DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP+N TVVKCLLFQSMDGGGGD D +   DGPAGIFGSFQQQN+EVVY
Subjt:  FDLCYKVPCKNNT--TSFADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVY

Query:  DLEKERLGFQPMDCASAAASQGLQKNF
        DL+KER+GFQ MDCAS+AASQGL KNF
Subjt:  DLEKERLGFQPMDCASAAASQGLQKNF

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida]: e-value 3.4e-254, identity 86.85%
Query:  MPSTATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPP
        M S +TS A K+LS+FLLLVYVS    A NPK+N P D SLV+GLVHSRT+LLTPK+G YN ISRKR+K MEM  DD+VIEPLREIRDGYLMSL LGTPP
Subjt:  MPSTATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPP

Query:  QIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTY
        Q++QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSFAYTY
Subjt:  QIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTY

Query:  GASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS
        GASGVV GTLTRD + MH N  N +S +SSTK+ PRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA+S
Subjt:  GASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS

Query:  SK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLC
        SK EHLQFTPLLKSPIYPNYYYIGLESITIGNGN+N RFGVSF LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVE+NTGFDLC
Subjt:  SK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLC

Query:  YKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGF
        YKVPCKNN  SF DDSQLPSITFHFLNNVSVVLPQ NNFYAMAAP N TVVKCLLFQSMDG GGD+  DD  DGPAGIFGSFQQQNLEVVYDLEKERLGF
Subjt:  YKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGF

Query:  QPMDCASAAASQGLQKN
        QPMDCA  AA+QGL KN
Subjt:  QPMDCASAAASQGLQKN

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LYP0 Peptidase A1 domain-containing protein: e-value 4.7e-254, identity 84.57%
Query:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEM-GGDDDVIEPLREIRDGYLMSLKLGT
        MPS ++ S A K LS FLLLV+VS    A NPK+NFP D SLVLGLVHSRTSLLTPK+G YN IS+KR+K M+   GDD+VIEPLREIRDGYLMSL +GT
Subjt:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEM-GGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ+VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        TYGASGVVTG+LTRD +  HGN +N+++++   KQIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
Subjt:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFD
        ISSK E+LQFTPLLKSP+YPNYYYIGLESITIGNG+NN RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVE+NTGFD
Subjt:  ISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFD

Query:  LCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERL
        LCYKVPCKNN +SF DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAP N TVVKCLL+QSMDG G D+D DD  +GPAGIFGSFQQQN+EVVYDLEKERL
Subjt:  LCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERL

Query:  GFQPMDCASAAASQGLQKNFRRNNA
        GFQPMDC S AA QGL KN RRN +
Subjt:  GFQPMDCASAAASQGLQKNFRRNNA

A0A1S3CAK9 aspartic proteinase nepenthesin-2: e-value 1.6e-257, identity 86.07%
Query:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPM-EMGGDDDVIEPLREIRDGYLMSLKLGT
        MPS ++TSIA K LS FLLLV+ S    A NPK+NFP D SLVLGLVHSRTSLLTPK+G YN IS+KR+K M +M GDD+VIEPLREIRDGYLMSL +GT
Subjt:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPM-EMGGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ+VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGN-SSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL
        TYGASGVVTG+LTRD + MHGN  +N++++S++ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG+L
Subjt:  TYGASGVVTGTLTRDSISMHGN-SSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL

Query:  AISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGF
        AISSK E+LQFTPLLKSPIYPNYYYIGLESITIGNGNNN RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVE+NTGF
Subjt:  AISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGF

Query:  DLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKER
        DLCYKVPCKNN +SF DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP N TVVKCLL+QSMDG G D+D DD  +GPAGIFGSFQQQNL+VVYDLEKER
Subjt:  DLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKER

Query:  LGFQPMDCASAAASQGLQKNFRRN
        LGFQ MDC S AA+QGL KN RRN
Subjt:  LGFQPMDCASAAASQGLQKNFRRN

A0A5A7TNC9 Aspartic proteinase nepenthesin-2: e-value 1.6e-257, identity 86.07%
Query:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPM-EMGGDDDVIEPLREIRDGYLMSLKLGT
        MPS ++TSIA K LS FLLLV+ S    A NPK+NFP D SLVLGLVHSRTSLLTPK+G YN IS+KR+K M +M GDD+VIEPLREIRDGYLMSL +GT
Subjt:  MPS-TATSIAAKVLSFFLLLVYVS----AINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPM-EMGGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ+VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGN-SSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL
        TYGASGVVTG+LTRD + MHGN  +N++++S++ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG+L
Subjt:  TYGASGVVTGTLTRDSISMHGN-SSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNL

Query:  AISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGF
        AISSK E+LQFTPLLKSPIYPNYYYIGLESITIGNGNNN RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVE+NTGF
Subjt:  AISSK-EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGF

Query:  DLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKER
        DLCYKVPCKNN +SF DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP N TVVKCLL+QSMDG G D+D DD  +GPAGIFGSFQQQNL+VVYDLEKER
Subjt:  DLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKER

Query:  LGFQPMDCASAAASQGLQKNFRRN
        LGFQ MDC S AA+QGL KN RRN
Subjt:  LGFQPMDCASAAASQGLQKNFRRN

A0A6J1CMP8 probable aspartyl protease At4g16563: e-value 1.5e-247, identity 82.16%
Query:  STATSIAAKVLSFFLLLVYVSAINP------KSNFPTDSSLVLGLVHSRTSLLTPKRGYYN--SISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGT
        S+AT+I++KVL+FFLLL+ + +++       ++NFP   SLVLGLVHSRTSLLTPKRGY++    S    KPME  G D+VIEPLREIRDGYL+SL LGT
Subjt:  STATSIAAKVLSFFLLLVYVSAINP------KSNFPTDSSLVLGLVHSRTSLLTPKRGYYN--SISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGT

Query:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQ++QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
        TYGASGVVTGTLT+D I MHG S N      ST QIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  TYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKEH-LQFTPLLKSPIYPNYYYIGLESITIGN--GNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTG
        ISSK+H LQFTPLLKSP+YPNYYYIGLES+TIG+  GNNNSRFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTG
Subjt:  ISSKEH-LQFTPLLKSPIYPNYYYIGLESITIGN--GNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTG

Query:  FDLCYKVPCKNNT--TSFADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVY
        FDLCYK+PCKNNT  +S  DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP+N TVVKCLLFQSMDGGGGD D +   DGPAGIFGSFQQQN+EVVY
Subjt:  FDLCYKVPCKNNT--TSFADD---SQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVY

Query:  DLEKERLGFQPMDCASAAASQGLQKNF
        DL+KER+GFQ MDCAS+AASQGL KNF
Subjt:  DLEKERLGFQPMDCASAAASQGLQKNF

A0A6J1KLG7 probable aspartyl protease At4g16563: e-value 5.1e-240, identity 83.2%
Query:  SIAAKVLSFFLLLVYVSA-------INPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPPQIV
        SIAA+   F L+LV VS         NPK+ F  D SLVLGLVHSRTSLLTPKRG YNS+SRKRIKPMEM GDDDVIEPLREIRDGYLMSL LGTPPQ++
Subjt:  SIAAKVLSFFLLLVYVSA-------INPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPPQIV

Query:  QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGAS
        QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGAS
Subjt:  QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGAS

Query:  GVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKE
        G+V GTLT+D+I +HGNS NSS      ++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNP FSSPLILGNLAISSKE
Subjt:  GVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKE

Query:  HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVP
        HL+FTPLLKSP YPNYYYIGLESITIGNG N SRFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLIS LES+ISYPRAK+ E+NTGFDLCYKVP
Subjt:  HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVP

Query:  CKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMD
         KNN T F+D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAPSN TVVKCLLFQSMDG          GDGPAGIFGSFQQQNLEVVYDLEKERLGF+ MD
Subjt:  CKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMD

Query:  CASAAASQGLQK
        CAS A SQGL K
Subjt:  CASAAASQGLQK

SwissProt top hits (e-value, % identity, alignment)
O04496 Aspartyl protease AED3: e-value 1.6e-28, identity 25.87%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        Y++  KLGTPPQ++ + +DT +D  W+PC      C  C                 +++STS      S++             C+ A C+ A  +  TC
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCP-----SFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQLGFSHKG-FSHCFLPF
        P   P     SF  +YG     + +L +D++++            +   IP F FGC+ +       P G+ G GRG +SL SQ    + G FS+C   F
Subjt:  PRPCP-----SFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQLGFSHKG-FSHCFLPF

Query:  KFSNNPNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES
        +   +  FS  L LG   +   + +++TPLL++P  P+ YY+ L  +++G    + +  V       D     G +IDSGT  T   +P+Y  +      
Subjt:  KFSNNPNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES

Query:  VISYPRAKQVEMNT-----GFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPA
               KQV +++      FD C+         S  +++  P IT H + ++ + LP  N     +A +    + CL   SM G           +   
Subjt:  VISYPRAKQVEMNT-----GFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPA

Query:  GIFGSFQQQNLEVVYDLEKERLGFQPMDC
         +  + QQQNL +++D+   R+G  P  C
Subjt:  GIFGSFQQQNLEVVYDLEKERLGFQPMDC

Q766C2 Aspartic proteinase nepenthesin-2: e-value 5.6e-34, identity 28.2%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLM++ +GTP       MDTGSDL W  C      C  C      +       F P  SS+     C S +C D+     P + C    C          
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN
              + Y YG      G +  ++ +              T  +P   FGC     G       G+ G G G LSLPSQLG     FS+C   +  S+ 
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN

Query:  PNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYP
            S L LG+ A    E    T L+ S + P YYYI L+ IT+G  N     G+     ++   G GGM+IDSGTT T+LP+  Y+ +       I+ P
Subjt:  PNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYP

Query:  RAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQN
             E ++G   C++ P   +T       Q+P I+  F   V  +  Q      + +P+   +  CL   S    G              IFG+ QQQ 
Subjt:  RAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQN

Query:  LEVVYDLEKERLGFQPMDCASA
         +V+YDL+   + F P  C ++
Subjt:  LEVVYDLEKERLGFQPMDCASA

Q766C3 Aspartic proteinase nepenthesin-1: e-value 6.8e-32, identity 29.18%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLM+L +GTP Q     MDTGSDL W         CQ C +  N         F P  SS+     C S  C  + S                     TC
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN
              + Y YG      G++  ++++              +  IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S  
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN

Query:  PNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGN---NNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVI
         N    L+LG+LA S       T L++S   P +YYI L  +++G+     + S F ++         G GG++IDSGTT T+     Y  +     S I
Subjt:  PNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGN---NNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVI

Query:  SYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQ
        + P       ++GFDLC++ P      S   + Q+P+   HF +   + LP  N F    +PSN  +  CL   S   G               IFG+ Q
Subjt:  SYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQ

Query:  QQNLEVVYDLEKERLGFQPMDCASA
        QQN+ VVYD     + F    C ++
Subjt:  QQNLEVVYDLEKERLGFQPMDCASA

Q940R4 Probable aspartyl protease At4g16563: e-value 1.7e-54, identity 35.35%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        YL+SL +G+    V +Y+DTGSDL W PC    F C  CE      S P       + SS++   +C S  C   HSS    D C I+ C L  +  G C
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  ---PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF
             PCP F Y YG  G +   L  DS+S+   S            +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF

Query:  -SNNPNFSSPLILGNLA-------------------ISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTT
         S+     SPLILG                         K    FT +L++P +P +Y + L+ I+IG  N          LR ID  G GG+++DSGTT
Subjt:  -SNNPNFSSPLILGNLA-------------------ISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTT

Query:  YTHLPEPLYSQLISNLESVIS--YPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFL-NNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMD
        +T LP   Y+ ++   +S +   + RA +VE ++G   CY +   N T       ++P++  HF  N  SV LP+ N FY      +    K  +   M 
Subjt:  YTHLPEPLYSQLISNLESVIS--YPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFL-NNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMD

Query:  GGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS
          GGD     GG G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  GGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS

Q9LNJ3 Aspartyl protease family protein 2: e-value 3.4e-31, identity 27.22%
Query:  SNFPTDSSLVLGLVH--SRTSLLTPKRGYYNSISR--KRIK-----------------PMEMGGDDDVIEPLREIRDGYLMSLKLGTPPQIVQVYMDTGS
        S+  + SS+ L L H  + +S  TP   + + + R  +R+K                 P   G    V+  L +    Y   L +GTP + V + +DTGS
Subjt:  SNFPTDSSLVLGLVH--SRTSLLTPKRGYYNSISR--KRIK-----------------PMEMGGDDDVIEPLREIRDGYLMSLKLGTPPQIVQVYMDTGS

Query:  DLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLT
        D+ W+ C      C+ C    + +       F P  S T     C S  C  + S          AGC+     + TC      +  +YG      G  +
Subjt:  DLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLT

Query:  RDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLQFTP
         ++++   N     +          F    VGA      G+ G G+G LS P Q G  F+ K FS+C +    S+ P   S ++ GN A+S     +FTP
Subjt:  RDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLQFTP

Query:  LLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVPCKNNTT
        LL +P    +YY+GL  I++G        GV+  L ++D  GNGG++IDSGT+ T L  P Y  +       +     K+    + FD C+ +       
Subjt:  LLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVPCKNNTT

Query:  SFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA
        S  ++ ++P++  HF     V LP  N    +     F    C  F    GG               I G+ QQQ   VVYDL   R+GF P  CA
Subjt:  SFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA

Arabidopsis top hits (e-value, % identity, alignment)
AT2G03200.1 Eukaryotic aspartyl protease family protein: e-value 4.4e-34, identity 31.07%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        +LM L +G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  +  S+   D             K  C
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN
              + YTYG      G L  ++ +    +S S              FGC     G  + +  G+ G GRG LSL SQL      FS+C    + S  
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN

Query:  PNFSSPLILGNLA--ISSK-------EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLIS
           SS L +G+LA  I +K       E  +   LL++P  P++YY+ L+ IT+G      R  V     E+   G GGM+IDSGTT T+L E  +  L  
Subjt:  PNFSSPLILGNLA--ISSK-------EHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLIS

Query:  NLESVISYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAG
           S +S P       +TG DLC+K+P        A +  +P + FHF     + LP G N+  M A S+ T V CL   S +G                
Subjt:  NLESVISYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAG

Query:  IFGSFQQQNLEVVYDLEKERLGFQPMDC
        IFG+ QQQN  V++DLEKE + F P +C
Subjt:  IFGSFQQQNLEVVYDLEKERLGFQPMDC

AT3G25700.1 Eukaryotic aspartyl protease family protein: e-value 1.6e-39, identity 29.74%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        Y + L++G PPQ + +  DTGSDL WV C      C++C  +           F P HSST     C    C  +   D        A     T +  TC
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFL
              + Y Y    + +G   R++ S+        +SS    ++    FGC          G ++    G+ G GRG +S  SQLG  F +K FS+C +
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFL

Query:  PFKFSNNPNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL
         +  S  P  +S LI+GN        L FTPLL +P+ P +YY+ L+S+ +    N ++  +   + EID  GNGG ++DSGTT   L EP Y  +I+ +
Subjt:  PFKFSNNPNFSSPLILGNLAISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNL

Query:  ESVISYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIF
           +  P A    +  GFDLC  V   +  T    +  LP + F F      V P  N F           ++CL  QS+D   G S           + 
Subjt:  ESVISYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIF

Query:  GSFQQQNLEVVYDLEKERLGFQPMDCA
        G+  QQ     +D ++ RLGF    CA
Subjt:  GSFQQQNLEVVYDLEKERLGFQPMDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein: e-value 2.7e-44, identity 31.85%
Query:  GYLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGT
        GY +SL  GTP Q +    DTGS L W+PC +  + C  C+   + +    +  F+P +SS+S    C S  C  ++    P   C   GC   T     
Subjt:  GYLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGT

Query:  CPRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNF
        C   CP +   YG  G   G L  + +                  +P F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N 
Subjt:  CPRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNF

Query:  SSPLIL----GNLAISSKEHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE
        ++ L L    G+ + S    L +TP  K+P   N     YYY+ L  I +G         + +K     T G+GG ++DSG+T+T +  P++  +     
Subjt:  SSPLIL----GNLAISSKEHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE

Query:  SVIS-YPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIF
        S +S Y R K +E  TG   C+ +  K + T       +P + F F     + LP  N F  +      T   CL   S       +    GG GPA I 
Subjt:  SVIS-YPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIF

Query:  GSFQQQNLEVVYDLEKERLGFQPMDCA
        GSFQQQN  V YDLE +R GF    C+
Subjt:  GSFQQQNLEVVYDLEKERLGFQPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein: e-value 1.2e-55, identity 35.35%
Query:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC
        YL+SL +G+    V +Y+DTGSDL W PC    F C  CE      S P       + SS++   +C S  C   HSS    D C I+ C L  +  G C
Subjt:  YLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  ---PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF
             PCP F Y YG  G +   L  DS+S+   S            +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF

Query:  -SNNPNFSSPLILGNLA-------------------ISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTT
         S+     SPLILG                         K    FT +L++P +P +Y + L+ I+IG  N          LR ID  G GG+++DSGTT
Subjt:  -SNNPNFSSPLILGNLA-------------------ISSKEHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTT

Query:  YTHLPEPLYSQLISNLESVIS--YPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFL-NNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMD
        +T LP   Y+ ++   +S +   + RA +VE ++G   CY +   N T       ++P++  HF  N  SV LP+ N FY      +    K  +   M 
Subjt:  YTHLPEPLYSQLISNLESVIS--YPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFL-NNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMD

Query:  GGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS
          GGD     GG G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  GGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS

AT5G45120.1 Eukaryotic aspartyl protease family protein (e value: 1.9e-170; % identity: 61.99)
Query:  NPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCE
        NP S+  + S LVL L  S  SL TPK     S +++RIK   +   D V+EPLRE+RDGYL++L +GTPPQ VQVY+DTGSDLTWVPCGNLSFDC +C 
Subjt:  NPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGTPPQIVQVYMDTGSDLTWVPCGNLSFDCQDCE

Query:  EYQNN-VSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSS
        + +NN +  P  + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPSFAYTYG  G+++G LTRD +             
Subjt:  EYQNN-VSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDSISMHGNSSNSSSSS

Query:  SSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKEHLQFTPLLKSPIYPNYYYIGLES
        + T+ +PRF FGCV +TYREPIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG   L+I+  + LQFTP+L +P+YPN YYIGLES
Subjt:  SSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKEHLQFTPLLKSPIYPNYYYIGLES

Query:  ITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVPC-KNNTTSFADDSQL--PSITFH
        ITI  G N +   V   LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ L+S I+YPRA + E  TGFDLCYKVPC  NN TS  +D  +  PSITFH
Subjt:  ITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVPC-KNNTTSFADDSQL--PSITFH

Query:  FLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASAAASQGLQK
        FLNN +++LPQGN+FYAM+APS+ +VV+CLLFQ+M         +DG  GPAG+FGSFQQQN++VVYDLEKER+GFQ MDC   AAS GL +
Subjt:  FLNNVSVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASAAASQGLQK
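
The % identity values listed for each hit can be approximated from a pairwise alignment by counting identical columns. The following is a minimal sketch, not the method this database uses: the function name `percent_identity` is hypothetical, the two aligned fragments are illustrative strings rather than rows copied from the alignments above, and the denominator (full alignment length, gaps included) is an assumption, since tools differ on that choice.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of alignment columns where both rows carry the same residue.

    Assumes both rows are already aligned (equal length, '-' for gaps).
    The denominator is the full alignment length, gaps included.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    # A column counts as identical only if the residues match and are not gaps.
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)


# Illustrative (hypothetical) aligned fragments, 10 columns, 8 identical.
example_identity = percent_identity("YLMSLKLGTP", "YLISLNLGTP")
print(f"{example_identity:.2f}")
```

Dividing by the shorter ungapped sequence length instead would give a different figure, so values computed this way may not reproduce the table exactly.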


Sequences
CDS sequence
ATGAACTTATCTCATTGCCCCTACTTCACAAACAGCTACAAATTAAAAATGCCTTCAACAGCAACCTCCATTGCAGCCAAAGTCTTGAGCTTTTTCCTTCTTCTTGTGTA
TGTCTCAGCCATAAACCCTAAAAGCAATTTCCCCACAGATTCTTCTCTAGTTCTTGGTCTTGTTCACTCAAGAACTTCCCTCCTCACTCCCAAAAGAGGCTACTACAATT
CCATTTCAAGGAAGAGAATCAAGCCAATGGAAATGGGAGGAGATGATGATGTGATAGAGCCATTGAGAGAGATTAGGGATGGTTATTTGATGTCCCTTAAATTAGGGACA
CCCCCACAAATTGTTCAAGTGTATATGGACACTGGAAGTGACCTTACATGGGTTCCTTGTGGGAACCTCTCCTTTGATTGCCAAGATTGTGAAGAGTATCAAAACAATGT
TTCGGGTCCAAAGTTGGCAGCTTTTTTGCCAACCCATTCTTCTACTTCCATTAGAGACACTTGTGGCAGCTCCTTTTGCTTGGATATCCATAGCTCTGATAACCCTTTTG
ACCCTTGCACAATTGCAGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACCTGCCCTAGACCTTGCCCTTCCTTTGCTTACACTTATGGGGCAAGTGGGGTTGTCACAGGA
ACCCTAACAAGAGATTCCATTTCTATGCATGGAAATTCCTCAAATTCTTCTTCTTCTTCTTCTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCCAC
TTATAGAGAGCCAATTGGCATTGCTGGCTTTGGGAGAGGCTTGCTTTCTCTCCCTTCCCAATTAGGGTTTTCCCATAAGGGCTTCTCCCATTGCTTCTTGCCCTTTAAAT
TCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGGAACCTTGCCATTTCTTCAAAAGAGCATTTGCAATTCACCCCTTTGTTGAAAAGTCCCATTTACCCCAAC
TATTACTATATTGGGCTTGAGTCAATCACCATTGGGAATGGTAATAACAACTCTAGATTTGGGGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGAAATGGTGGAAT
GTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAGCTTATTTCCAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAGA
TGAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTGTAAAAACAACACCACTTCTTTTGCTGATGACTCTCAGCTCCCTTCTATAACTTTCCATTTCTTGAATAATGTT
AGTGTTGTTTTGCCCCAAGGAAACAACTTCTATGCCATGGCTGCTCCAAGTAACTTCACTGTGGTGAAATGCTTGTTGTTTCAAAGCATGGACGGCGGCGGTGGCGATAG
CGACTACGACGATGGCGGAGACGGGCCGGCAGGCATTTTCGGAAGCTTTCAACAGCAAAATTTGGAGGTTGTGTATGACTTGGAGAAGGAGAGATTAGGGTTTCAACCAA
TGGACTGTGCTTCTGCTGCTGCCTCTCAAGGACTACAAAAGAATTTTAGAAGGAATAATGCATGA
mRNA sequence
ATGAACTTATCTCATTGCCCCTACTTCACAAACAGCTACAAATTAAAAATGCCTTCAACAGCAACCTCCATTGCAGCCAAAGTCTTGAGCTTTTTCCTTCTTCTTGTGTA
TGTCTCAGCCATAAACCCTAAAAGCAATTTCCCCACAGATTCTTCTCTAGTTCTTGGTCTTGTTCACTCAAGAACTTCCCTCCTCACTCCCAAAAGAGGCTACTACAATT
CCATTTCAAGGAAGAGAATCAAGCCAATGGAAATGGGAGGAGATGATGATGTGATAGAGCCATTGAGAGAGATTAGGGATGGTTATTTGATGTCCCTTAAATTAGGGACA
CCCCCACAAATTGTTCAAGTGTATATGGACACTGGAAGTGACCTTACATGGGTTCCTTGTGGGAACCTCTCCTTTGATTGCCAAGATTGTGAAGAGTATCAAAACAATGT
TTCGGGTCCAAAGTTGGCAGCTTTTTTGCCAACCCATTCTTCTACTTCCATTAGAGACACTTGTGGCAGCTCCTTTTGCTTGGATATCCATAGCTCTGATAACCCTTTTG
ACCCTTGCACAATTGCAGGCTGTTCCCTTGCTACCCTTGTGAAGGGCACCTGCCCTAGACCTTGCCCTTCCTTTGCTTACACTTATGGGGCAAGTGGGGTTGTCACAGGA
ACCCTAACAAGAGATTCCATTTCTATGCATGGAAATTCCTCAAATTCTTCTTCTTCTTCTTCTTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCCAC
TTATAGAGAGCCAATTGGCATTGCTGGCTTTGGGAGAGGCTTGCTTTCTCTCCCTTCCCAATTAGGGTTTTCCCATAAGGGCTTCTCCCATTGCTTCTTGCCCTTTAAAT
TCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGGAACCTTGCCATTTCTTCAAAAGAGCATTTGCAATTCACCCCTTTGTTGAAAAGTCCCATTTACCCCAAC
TATTACTATATTGGGCTTGAGTCAATCACCATTGGGAATGGTAATAACAACTCTAGATTTGGGGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGAAATGGTGGAAT
GTTGATTGATTCTGGTACTACTTATACTCATTTACCTGAACCATTGTATTCACAGCTTATTTCCAATCTTGAGTCAGTGATAAGCTATCCAAGAGCCAAACAAGTTGAGA
TGAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTGTAAAAACAACACCACTTCTTTTGCTGATGACTCTCAGCTCCCTTCTATAACTTTCCATTTCTTGAATAATGTT
AGTGTTGTTTTGCCCCAAGGAAACAACTTCTATGCCATGGCTGCTCCAAGTAACTTCACTGTGGTGAAATGCTTGTTGTTTCAAAGCATGGACGGCGGCGGTGGCGATAG
CGACTACGACGATGGCGGAGACGGGCCGGCAGGCATTTTCGGAAGCTTTCAACAGCAAAATTTGGAGGTTGTGTATGACTTGGAGAAGGAGAGATTAGGGTTTCAACCAA
TGGACTGTGCTTCTGCTGCTGCCTCTCAAGGACTACAAAAGAATTTTAGAAGGAATAATGCATGA
Protein sequence
MNLSHCPYFTNSYKLKMPSTATSIAAKVLSFFLLLVYVSAINPKSNFPTDSSLVLGLVHSRTSLLTPKRGYYNSISRKRIKPMEMGGDDDVIEPLREIRDGYLMSLKLGT
PPQIVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCLDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTG
TLTRDSISMHGNSSNSSSSSSSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLQFTPLLKSPIYPN
YYYIGLESITIGNGNNNSRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVISYPRAKQVEMNTGFDLCYKVPCKNNTTSFADDSQLPSITFHFLNNV
SVVLPQGNNFYAMAAPSNFTVVKCLLFQSMDGGGGDSDYDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASAAASQGLQKNFRRNNA
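
The protein sequence above corresponds to a translation of the CDS with the standard genetic code. As a rough check, the sketch below translates a nucleotide string codon by codon and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here, and the use of the standard (nuclear) code is an assumption about how the record was generated.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGAACTTATCTCATTGC"))  # prints MNLSHC
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two records agree.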