; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020571 (gene) of Snake gourd v1 genome

Gene IDTan0020571
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationLG05:80039733..80041295
RNA-Seq ExpressionTan0020571
SyntenyTan0020571
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583807.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia]5.4e-24984.71Show/hide
Query:  SIAAKVLSFFLLLLVHVSAINLGQALAN--SNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPP
        SIAA+  SFF+L+LV V  +  G+A+    +NPK+K   DSLVLGLVHSRTSLLTPKR Y NS+SRKRIKPMEM +DDVIEPLREIRDGYLMSLTLGTPP
Subjt:  SIAAKVLSFFLLLLVHVSAINLGQALAN--SNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPP

Query:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTY
        QV+QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTY
Subjt:  QVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTY

Query:  GASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQ
        GASG+V GTLT+D I +HGNSPNS+++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+HL+
Subjt:  GASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQ

Query:  FTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCK
        FTPLLKSP YPNYYYIGLESITIGNG NYSRFGV S +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQ+ISN++S+I+YPRAK+ E+NTGFDLCYKVP K
Subjt:  FTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCK

Query:  NNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCA
        NN  TFF+D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAPSNSTVVKCLLFQSMDG         GDGPAGIFGSFQQQNLEVVYDLEKERLGF  MDCA
Subjt:  NNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCA

Query:  SVAASQGLHK
        SVA SQGLHK
Subjt:  SVAASQGLHK

XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus]8.4e-25084.09Show/hide
Query:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS
        MPS ++ S A K LS F LLLVHVS     Q LA +NPK+  P DSLVLGLVHSRTSLLTPK+ Y N IS+KR+K M+    DD+VIEPLREIRDGYLMS
Subjt:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS

Query:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
        L++GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPC
Subjt:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC

Query:  PSFAYTYGASGVVTGTLTRDAISMHG---NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
        PSFAYTYGASGVVTG+LTRD +  HG   N+ N+ KQIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
Subjt:  PSFAYTYGASGVVTGTLTRDAISMHG---NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN

Query:  LAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINT
        LAISSKD +LQFTPLLKSP+YPNYYYIGLESITIGNG+N  RFGV SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISN++ VI YPRAKQVE+NT
Subjt:  LAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINT

Query:  GFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEK
        GFDLCYKVPCKNN ++ F DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D+D D  +GPAGIFGSFQQQN+EVVYDLEK
Subjt:  GFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEK

Query:  ERLGFLPMDCASVAASQGLHKNFRRNES
        ERLGF PMDC SVAA QGLHKN RRNES
Subjt:  ERLGFLPMDCASVAASQGLHKNFRRNES

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]1.7e-25084.15Show/hide
Query:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS
        MPS ++TSIA K LS F LLLVH S     Q LA +NPK+  P DSLVLGLVHSRTSLLTPK+ Y N IS+KR+K M+    DD+VIEPLREIRDGYLMS
Subjt:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS

Query:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
        L++GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
Subjt:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC

Query:  PSFAYTYGASGVVTGTLTRDAISMHG-------NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL
        PSFAYTYGASGVVTG+LTRD + MHG       N+ N+ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPL
Subjt:  PSFAYTYGASGVVTGTLTRDAISMHG-------NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL

Query:  ILGNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQV
        ILG+LAISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNN  RFGV SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISN++SVI+YPRAKQV
Subjt:  ILGNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQV

Query:  EINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVY
        E+NTGFDLCYKVPCKNN ++ F DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D+D D  +GPAGIFGSFQQQNL+VVY
Subjt:  EINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVY

Query:  DLEKERLGFLPMDCASVAASQGLHKNFRRN
        DLEKERLGF  MDC SVAA+QGLHKN RRN
Subjt:  DLEKERLGFLPMDCASVAASQGLHKNFRRN

XP_023000974.1 probable aspartyl protease At4g16563 [Cucurbita maxima]1.2e-24885.43Show/hide
Query:  SIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPPQV
        SIAA+  SFF+L+LV VS   +GQ LA  NPK+K   DSLVLGLVHSRTSLLTPKR Y NS+SRKRIKPMEM DDDVIEPLREIRDGYLMSLTLGTPPQV
Subjt:  SIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPPQV

Query:  VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGA
        +QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGA
Subjt:  VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGA

Query:  SGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFT
        SG+V GTLT+DAI +HGNSPNS+++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNP FSSPLILGNLAISSK+HL+FT
Subjt:  SGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFT

Query:  PLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNN
        PLLKSP YPNYYYIGLESITIGNG NYSRFGV S +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLIS ++S+I+YPRAK+ E+NTGFDLCYKVP KNN
Subjt:  PLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNN

Query:  TTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV
          TFF+D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAPSNSTVVKCLLFQSMDG         GDGPAGIFGSFQQQNLEVVYDLEKERLGF  MDCASV
Subjt:  TTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV

Query:  AASQGLHK
        A SQGLHK
Subjt:  AASQGLHK

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida]4.0e-25286.13Show/hide
Query:  MPSTATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLG
        M S +TS A K+LS+F LLLV+VS   L      +NPK+  P DSLV+GLVHSRT+LLTPK+ Y N ISRKR+K MEMDD+VIEPLREIRDGYLMSLTLG
Subjt:  MPSTATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLG

Query:  TPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFA
        TPPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSFA
Subjt:  TPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFA

Query:  YTYGASGVVTGTLTRDAISMH---GNSPN-STKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        YTYGASGVV GTLTRD + MH    NSPN STK+ PRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA+
Subjt:  YTYGASGVVTGTLTRDAISMH---GNSPN-STKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFD
        SSKD HLQFTPLLKSPIYPNYYYIGLESITIGNGN+  RFGV SF LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISN++SVI+YPRAKQVE+NTGFD
Subjt:  SSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFD

Query:  LCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERL
        LCYKVPCKNN  +F  DDSQLPSITFHFLNNVSVVLPQ NNFYAMAAP NSTVVKCLLFQSMDG GGD+DDD  DGPAGIFGSFQQQNLEVVYDLEKERL
Subjt:  LCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERL

Query:  GFLPMDCASVAASQGLHKN
        GF PMDCA VAA+QGLHKN
Subjt:  GFLPMDCASVAASQGLHKN

TrEMBL top hitse value%identityAlignment
A0A0A0LYP0 Peptidase A1 domain-containing protein4.0e-25084.09Show/hide
Query:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS
        MPS ++ S A K LS F LLLVHVS     Q LA +NPK+  P DSLVLGLVHSRTSLLTPK+ Y N IS+KR+K M+    DD+VIEPLREIRDGYLMS
Subjt:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS

Query:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
        L++GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPC
Subjt:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC

Query:  PSFAYTYGASGVVTGTLTRDAISMHG---NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
        PSFAYTYGASGVVTG+LTRD +  HG   N+ N+ KQIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN
Subjt:  PSFAYTYGASGVVTGTLTRDAISMHG---NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGN

Query:  LAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINT
        LAISSKD +LQFTPLLKSP+YPNYYYIGLESITIGNG+N  RFGV SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISN++ VI YPRAKQVE+NT
Subjt:  LAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINT

Query:  GFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEK
        GFDLCYKVPCKNN ++ F DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D+D D  +GPAGIFGSFQQQN+EVVYDLEK
Subjt:  GFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEK

Query:  ERLGFLPMDCASVAASQGLHKNFRRNES
        ERLGF PMDC SVAA QGLHKN RRNES
Subjt:  ERLGFLPMDCASVAASQGLHKNFRRNES

A0A1S3CAK9 aspartic proteinase nepenthesin-28.2e-25184.15Show/hide
Query:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS
        MPS ++TSIA K LS F LLLVH S     Q LA +NPK+  P DSLVLGLVHSRTSLLTPK+ Y N IS+KR+K M+    DD+VIEPLREIRDGYLMS
Subjt:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS

Query:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
        L++GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
Subjt:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC

Query:  PSFAYTYGASGVVTGTLTRDAISMHG-------NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL
        PSFAYTYGASGVVTG+LTRD + MHG       N+ N+ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPL
Subjt:  PSFAYTYGASGVVTGTLTRDAISMHG-------NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL

Query:  ILGNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQV
        ILG+LAISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNN  RFGV SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISN++SVI+YPRAKQV
Subjt:  ILGNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQV

Query:  EINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVY
        E+NTGFDLCYKVPCKNN ++ F DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D+D D  +GPAGIFGSFQQQNL+VVY
Subjt:  EINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVY

Query:  DLEKERLGFLPMDCASVAASQGLHKNFRRN
        DLEKERLGF  MDC SVAA+QGLHKN RRN
Subjt:  DLEKERLGFLPMDCASVAASQGLHKNFRRN

A0A5A7TNC9 Aspartic proteinase nepenthesin-28.2e-25184.15Show/hide
Query:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS
        MPS ++TSIA K LS F LLLVH S     Q LA +NPK+  P DSLVLGLVHSRTSLLTPK+ Y N IS+KR+K M+    DD+VIEPLREIRDGYLMS
Subjt:  MPS-TATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM---DDDVIEPLREIRDGYLMS

Query:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
        L++GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC
Subjt:  LTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPC

Query:  PSFAYTYGASGVVTGTLTRDAISMHG-------NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL
        PSFAYTYGASGVVTG+LTRD + MHG       N+ N+ KQ+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPL
Subjt:  PSFAYTYGASGVVTGTLTRDAISMHG-------NSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL

Query:  ILGNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQV
        ILG+LAISSKD +LQFTPLLKSPIYPNYYYIGLESITIGNGNN  RFGV SFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISN++SVI+YPRAKQV
Subjt:  ILGNLAISSKD-HLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQV

Query:  EINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVY
        E+NTGFDLCYKVPCKNN ++ F DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D+D D  +GPAGIFGSFQQQNL+VVY
Subjt:  EINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVY

Query:  DLEKERLGFLPMDCASVAASQGLHKNFRRN
        DLEKERLGF  MDC SVAA+QGLHKN RRN
Subjt:  DLEKERLGFLPMDCASVAASQGLHKNFRRN

A0A6J1EHM1 probable aspartyl protease At4g165631.1e-24784.65Show/hide
Query:  SIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPPQV
        SIAA+     +L+LV VS   +GQ LA  NPK+K   DSLVLGLVHSRTSLLTPKR Y NS+  KRIKPMEM +DDVIEPLREIRDGYLMSLTLGTPPQV
Subjt:  SIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPPQV

Query:  VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGA
        +QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGA
Subjt:  VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGA

Query:  SGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFT
        SG+V GTLT+D I +HGNSPNS+++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK+HL+FT
Subjt:  SGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFT

Query:  PLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNN
        P LKSP YPNYYYIGLESITIGNG NYSRFGV S +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISN++S+I+YPRAK+ E+NTGFDLCYKVP KNN
Subjt:  PLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNN

Query:  TTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV
          TFF+D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAPSNSTVVKCLLFQSMDG         GDGPAGIFGSFQQQNLEVVYDLEKERLGF  MDCASV
Subjt:  TTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV

Query:  AASQGLHK
        A SQGLHK
Subjt:  AASQGLHK

A0A6J1KLG7 probable aspartyl protease At4g165635.8e-24985.43Show/hide
Query:  SIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPPQV
        SIAA+  SFF+L+LV VS   +GQ LA  NPK+K   DSLVLGLVHSRTSLLTPKR Y NS+SRKRIKPMEM DDDVIEPLREIRDGYLMSLTLGTPPQV
Subjt:  SIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEM-DDDVIEPLREIRDGYLMSLTLGTPPQV

Query:  VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGA
        +QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGA
Subjt:  VQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGA

Query:  SGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFT
        SG+V GTLT+DAI +HGNSPNS+++IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNP FSSPLILGNLAISSK+HL+FT
Subjt:  SGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFT

Query:  PLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNN
        PLLKSP YPNYYYIGLESITIGNG NYSRFGV S +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLIS ++S+I+YPRAK+ E+NTGFDLCYKVP KNN
Subjt:  PLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNN

Query:  TTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV
          TFF+D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAPSNSTVVKCLLFQSMDG         GDGPAGIFGSFQQQNLEVVYDLEKERLGF  MDCASV
Subjt:  TTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV

Query:  AASQGLHK
        A SQGLHK
Subjt:  AASQGLHK

SwissProt top hitse value%identityAlignment
Q3EBM5 Probable aspartic protease At2g356151.9e-2631.09Show/hide
Query:  SYQNSISRKRIKPMEMDDDVIEPLREIRDG-YLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGS
        ++  S+SR R    ++    ++      DG + MS+T+GTPP  V    DTGSDLTWV C      CQ C  Y+ N  GP    F    SST   + C S
Subjt:  SYQNSISRKRIKPMEMDDDVIEPLREIRDG-YLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGS

Query:  SFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC---VGATYREP-IGIAGFGRGLL
          C  + S++         GC  +  +       C  + Y+YG      G +  + +S+   S  S    P   FGC    G T+ E   GI G G G L
Subjt:  SFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC---VGATYREP-IGIAGFGRGLL

Query:  SLPSQLGFS-HKGFSHCFLPFKFSNNPNFSSPLILGNLAIS---SKDH-LQFTPLL-KSPIYPNYYYIGLESITIG------NGNNYSRFGVGSFKLREI
        SL SQLG S  K FS+C L  K S   N +S + LG  +I    SKD  +  TPL+ K P+   YYY+ LE+I++G       G++Y+    G      I
Subjt:  SLPSQLGFS-HKGFSHCFLPFKFSNNPNFSSPLILGNLAIS---SKDH-LQFTPLL-KSPIYPNYYYIGLESITIG------NGNNYSRFGVGSFKLREI

Query:  DTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTG-FDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPS
         ++ +G ++IDSGTT T L    + +  S ++  +T   AK+V    G    C+K           + +  LP IT HF     V L   N F  +    
Subjt:  DTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTG-FDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPS

Query:  NSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCAS
         S  + CL                      I+G+F Q +  V YDLE   + F  MDC++
Subjt:  NSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCAS

Q766C2 Aspartic proteinase nepenthesin-22.9e-3528.67Show/hide
Query:  KRSYQNSISRKR-IKPMEMDDDVIEPLREIRDG-YLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDT
        KR+ +    R R I  M      IE      DG YLM++ +GTP       MDTGSDL W  C      C  C      +       F P  SS+     
Subjt:  KRSYQNSISRKR-IKPMEMDDDVIEPLREIRDG-YLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDT

Query:  CGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC----VGATYREPIGIAGFGR
        C S +C D+     P + C    C                + Y YG      G +  +  +        T  +P   FGC     G       G+ G G 
Subjt:  CGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC----VGATYREPIGIAGFGR

Query:  GLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGML
        G LSLPSQLG     FS+C   +  S+     S L LG+ A    +    T L+ S + P YYYI L+ IT+G  N     G+ S    ++   G GGM+
Subjt:  GLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGML

Query:  IDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLF
        IDSGTT T+LP+  Y+ +       I  P     E ++G   C++ P   +T        Q+P I+  F   V  +  Q      + +P+   +  CL  
Subjt:  IDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLF

Query:  QSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCAS
         S    G             IFG+ QQQ  +V+YDL+   + F+P  C +
Subjt:  QSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCAS

Q766C3 Aspartic proteinase nepenthesin-17.8e-3327.5Show/hide
Query:  ATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSL----LTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDG-YLMSLTL
        A+S+ + +L+  ++ +      +  +   N   ++K+    ++L  V S  +L    L  +   + S   +R++ M      +E      DG YLM+L++
Subjt:  ATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSL----LTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDG-YLMSLTL

Query:  GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
        GTP Q     MDTGSDL W         CQ C +  N         F P  SS+     C S  C  + S                     TC      +
Subjt:  GTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF

Query:  AYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA
         Y YG      G++  + ++        +  IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S   N    L+LG+LA
Subjt:  AYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA

Query:  ISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFD
         S       T L++S   P +YYI L  +++G+    +R  +           G GG++IDSGTT T+     Y  +     S I  P       ++GFD
Subjt:  ISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFD

Query:  LCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERL
        LC++ P           + Q+P+   HF +   + LP  N F    +PSN  +  CL   S   S G S          IFG+ QQQN+ VVYD     +
Subjt:  LCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERL

Query:  GFLPMDCAS
         F    C +
Subjt:  GFLPMDCAS

Q940R4 Probable aspartyl protease At4g165632.3e-5634.43Show/hide
Query:  HSRTSLLTPKRSYQNSISR-KRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTH
        HS + L   K S   S +R +R    +    +  P+    D YL+SL++G+    V +Y+DTGSDL W PC    F C  CE      S P       + 
Subjt:  HSRTSLLTPKRSYQNSISR-KRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTH

Query:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPI
        SS++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +   L  D++S+   S      +  F FGC   T  EPI
Subjt:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPI

Query:  GIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLES
        G+AGFGRG LSLP+QL     H G  FS+C +   F S+     SPLILG                         K+   FT +L++P +P +Y + L+ 
Subjt:  GIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLES

Query:  ITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVI--TYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITF
        I+IG  N           LR ID  G GG+++DSGTT+T LP   Y+ ++   DS +   + RA +VE ++G   CY +   N T        ++P++  
Subjt:  ITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVI--TYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITF

Query:  HFL-NNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV
        HF  N  SV LP+ N FY      +    K  +   M  +GGD  +  G G   I G++QQQ  EVVYDL   R+GF    CAS+
Subjt:  HFL-NNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV

Q9LNJ3 Aspartyl protease family protein 21.1e-3128.8Show/hide
Query:  KPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNP
        +P      V+  L +    Y   L +GTP + V + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S    
Subjt:  KPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNP

Query:  FDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG-
              AGC+     + TC      +  +YG      G  + + ++   N      ++     GC        VGA      G+ G G+G LS P Q G 
Subjt:  FDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG-

Query:  -FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTH
         F+ K FS+C +    S+ P   S ++ GN A+S     +FTPLL +P    +YY+GL  I++G G          FKL +I   GNGG++IDSGT+ T 
Subjt:  -FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLISNID-SVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGG
        L  P Y  +         T  RA    +   FD C+ +   N        + ++P++  HF     V LP  N  Y +   +N     C  F        
Subjt:  LPEPLYSQLISNID-SVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGG

Query:  DSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCA
             G  G   I G+ QQQ   VVYDL   R+GF P  CA
Subjt:  DSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCA

Arabidopsis top hitse value%identityAlignment
AT2G03200.1 Eukaryotic aspartyl protease family protein1.1e-3431.07Show/hide
Query:  YLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        +LM L++G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  +  S+   D             K  C
Subjt:  YLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
              + YTYG      G L  +  +    +      I    FGC     G  + +  G+ G GRG LSL SQL      FS+C    + S     SS 
Subjt:  PRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGNLA--ISSK-------DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGV--GSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNID
        L +G+LA  I +K       +  +   LL++P  P++YY+ L+ IT+G      R  V   +F+L E    G GGM+IDSGTT T+L E  +  L     
Subjt:  LILGNLA--ISSK-------DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGV--GSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNID

Query:  SVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFG
        S ++ P       +TG DLC+K+P         A +  +P + FHF     + LP G N+  M A S ST V CL   S +G               IFG
Subjt:  SVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFG

Query:  SFQQQNLEVVYDLEKERLGFLPMDCASV
        + QQQN  V++DLEKE + F+P +C  +
Subjt:  SFQQQNLEVVYDLEKERLGFLPMDCASV

AT3G25700.1 Eukaryotic aspartyl protease family protein1.7e-3828.54Show/hide
Query:  SFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTG
        SF  L L+  S I    A++N N   K+P          ++   L  +R +  S+ RK I  ++    V+         Y + L +G PPQ + +  DTG
Subjt:  SFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTG

Query:  SDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTL
        SDL WV C      C++C  +           F P HSST     C    C  +   D        A     T +  TC      + Y Y    + +G  
Subjt:  SDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTL

Query:  TRDAISMHGNSPNSTKQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHL
         R+  S+  +S    + +    FGC          G ++    G+ G GRG +S  SQLG  F +K FS+C + +  S  P  +S LI+GN        L
Subjt:  TRDAISMHGNSPNSTKQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHL

Query:  QFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPC
         FTPLL +P+ P +YY+ L+S+ +    N ++  +    + EID  GNGG ++DSGTT   L EP Y  +I+ +   +  P A    +  GFDLC     
Subjt:  QFTPLLKSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPC

Query:  KNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDC
          N +     +  LP + F F      V P  N F           ++CL  QS+D   G S          + G+  QQ     +D ++ RLGF    C
Subjt:  KNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDC

Query:  A
        A
Subjt:  A

AT3G52500.1 Eukaryotic aspartyl protease family protein7.0e-4530.53Show/hide
Query:  IAAKVLSFFLLLLVHVSAINLG-QALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISR----KRIKPMEMDDDVI------------EPL-REI
        +A+ +  FFL+ L  VSA+ L     ++S+   K P  SL              +R  ++SI+R    K    ++ D+D +             PL  + 
Subjt:  IAAKVLSFFLLLLVHVSAINLG-QALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISR----KRIKPMEMDDDVI------------EPL-REI

Query:  RDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK
          GY +SL+ GTP Q +    DTGS L W+PC +  + C  C+   + +    +  F+P +SS+S    C S  C  ++    P   C   GC   T   
Subjt:  RDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK

Query:  GTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL
          C   CP +   YG  G   G L  + +      P+ T  +P F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N ++ L
Subjt:  GTCPRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPL

Query:  IL----GNLAISSKDHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVI
         L    G+ + S    L +TP  K+P   N     YYY+ L  I +G      +     +K     T G+GG ++DSG+T+T +  P++  +     S +
Subjt:  IL----GNLAISSKDHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVI

Query:  T-YPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSF
        + Y R K +E  TG   C+ +  K + T        +P + F F     + LP  N F  +     +T   CL       S    +  GG GPA I GSF
Subjt:  T-YPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSF

Query:  QQQNLEVVYDLEKERLGFLPMDCA
        QQQN  V YDLE +R GF    C+
Subjt:  QQQNLEVVYDLEKERLGFLPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein1.6e-5734.43Show/hide
Query:  HSRTSLLTPKRSYQNSISR-KRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTH
        HS + L   K S   S +R +R    +    +  P+    D YL+SL++G+    V +Y+DTGSDL W PC    F C  CE      S P       + 
Subjt:  HSRTSLLTPKRSYQNSISR-KRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTH

Query:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPI
        SS++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +   L  D++S+   S      +  F FGC   T  EPI
Subjt:  SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFAYTYGASGVVTGTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPI

Query:  GIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLES
        G+AGFGRG LSLP+QL     H G  FS+C +   F S+     SPLILG                         K+   FT +L++P +P +Y + L+ 
Subjt:  GIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA-------------------ISSKDHLQFTPLLKSPIYPNYYYIGLES

Query:  ITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVI--TYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITF
        I+IG  N           LR ID  G GG+++DSGTT+T LP   Y+ ++   DS +   + RA +VE ++G   CY +   N T        ++P++  
Subjt:  ITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVI--TYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITF

Query:  HFL-NNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV
        HF  N  SV LP+ N FY      +    K  +   M  +GGD  +  G G   I G++QQQ  EVVYDL   R+GF    CAS+
Subjt:  HFL-NNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASV

AT5G45120.1 Eukaryotic aspartyl protease family protein4.2e-17560.95Show/hide
Query:  VLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMD
        VL  FLL+ + ++  N  QA  + NP S   +  LVL L  S  SL TPK   Q  I     KP+   D V+EPLRE+RDGYL++L +GTPPQ VQVY+D
Subjt:  VLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYMD

Query:  TGSDLTWVPCGNLSFDCQDCEEYQNN-VSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVT
        TGSDLTWVPCGNLSFDC +C + +NN +  P  + F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPSFAYTYG  G+++
Subjt:  TGSDLTWVPCGNLSFDCQDCEEYQNN-VSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVT

Query:  GTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKDHLQFTPLL
        G LTRD +         T+ +PRF FGCV +TYREPIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG   L+I+  D LQFTP+L
Subjt:  GTLTRDAISMHGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKDHLQFTPLL

Query:  KSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNNTTT
         +P+YPN YYIGLESITIG     ++       LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ + S ITYPRA + E  TGFDLCYKVPC NN  T
Subjt:  KSPIYPNYYYIGLESITIGNGNNYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNNTTT

Query:  FFADDSQL--PSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASVA
           +D  +  PSITFHFLNN +++LPQGN+FYAM+APS+ +VV+CLLFQ+M        +DG  GPAG+FGSFQQQN++VVYDLEKER+GF  MDC   A
Subjt:  FFADDSQL--PSITFHFLNNVSVVLPQGNNFYAMAAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASVA

Query:  ASQGLHK
        AS GL++
Subjt:  ASQGLHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCAACAGCAACCTCCATTGCAGCCAAAGTCTTGAGCTTTTTCCTTCTTCTTCTTGTGCATGTCTCAGCCATAAACTTGGGACAAGCCCTAGCTAATTCAAACCC
TAAAAGCAAAATCCCCACAGATTCTCTAGTTCTTGGTCTTGTTCATTCTAGAACTTCCCTCCTCACCCCCAAAAGAAGCTACCAAAATTCCATTTCAAGGAAGAGAATTA
AGCCAATGGAAATGGATGATGATGTGATAGAGCCATTGAGGGAGATTAGGGATGGTTATTTGATGTCCCTCACATTAGGGACACCCCCACAAGTTGTTCAAGTGTATATG
GACACTGGGAGTGACCTAACATGGGTTCCTTGTGGGAACCTCTCCTTTGATTGCCAAGATTGTGAAGAGTATCAAAACAATGTTTCAGGTCCAAAGTTGGCTGCTTTTTT
GCCAACCCATTCTTCAACCTCCATTAGAGACACTTGTGGGAGCTCCTTTTGCATGGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCAGGATGTTCCC
TTGCTACCCTTGTGAAGGGCACCTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAGAGATGCCATTTCTATG
CATGGAAATTCCCCAAATTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGTATTGCTGGTTTTGGTAGAGGCTTACT
TTCTCTCCCTTCCCAATTAGGGTTTTCCCATAAGGGCTTCTCTCATTGCTTCTTGCCCTTTAAATTCTCAAATAACCCTAACTTCTCAAGCCCTTTGATTCTTGGGAACC
TTGCCATTTCTTCAAAAGACCATTTGCAATTCACCCCTTTGTTGAAAAGTCCAATTTACCCCAACTATTACTATATTGGGCTTGAGTCAATCACCATTGGAAATGGTAAC
AATTATTCTAGATTTGGGGTTGGTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACTTATACTCATTTACCTGAACC
ATTATATTCACAACTTATTTCTAATATTGATTCAGTGATAACCTATCCTAGAGCCAAACAAGTAGAAATCAATACTGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAA
ATAACACTACTACTTTTTTTGCTGATGATTCTCAACTTCCTTCTATAACATTCCATTTCTTGAACAATGTTAGCGTTGTTTTGCCTCAAGGAAATAACTTCTATGCCATG
GCTGCTCCAAGTAACTCCACTGTGGTGAAATGCTTGTTGTTTCAAAGCATGGACGGTAGCGGCGGCGATAGTGATGACGATGGCGGAGACGGGCCGGCGGGGATTTTCGG
AAGCTTCCAACAGCAAAATTTGGAGGTTGTTTATGACTTGGAGAAGGAAAGGTTAGGGTTTCTACCGATGGATTGTGCTTCTGTTGCTGCCTCTCAAGGACTCCACAAGA
ATTTTAGAAGGAATGAGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCAACAGCAACCTCCATTGCAGCCAAAGTCTTGAGCTTTTTCCTTCTTCTTCTTGTGCATGTCTCAGCCATAAACTTGGGACAAGCCCTAGCTAATTCAAACCC
TAAAAGCAAAATCCCCACAGATTCTCTAGTTCTTGGTCTTGTTCATTCTAGAACTTCCCTCCTCACCCCCAAAAGAAGCTACCAAAATTCCATTTCAAGGAAGAGAATTA
AGCCAATGGAAATGGATGATGATGTGATAGAGCCATTGAGGGAGATTAGGGATGGTTATTTGATGTCCCTCACATTAGGGACACCCCCACAAGTTGTTCAAGTGTATATG
GACACTGGGAGTGACCTAACATGGGTTCCTTGTGGGAACCTCTCCTTTGATTGCCAAGATTGTGAAGAGTATCAAAACAATGTTTCAGGTCCAAAGTTGGCTGCTTTTTT
GCCAACCCATTCTTCAACCTCCATTAGAGACACTTGTGGGAGCTCCTTTTGCATGGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCAGGATGTTCCC
TTGCTACCCTTGTGAAGGGCACCTGCCCTAGACCTTGCCCTTCTTTTGCTTATACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAGAGATGCCATTTCTATG
CATGGAAATTCCCCAAATTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGTATTGCTGGTTTTGGTAGAGGCTTACT
TTCTCTCCCTTCCCAATTAGGGTTTTCCCATAAGGGCTTCTCTCATTGCTTCTTGCCCTTTAAATTCTCAAATAACCCTAACTTCTCAAGCCCTTTGATTCTTGGGAACC
TTGCCATTTCTTCAAAAGACCATTTGCAATTCACCCCTTTGTTGAAAAGTCCAATTTACCCCAACTATTACTATATTGGGCTTGAGTCAATCACCATTGGAAATGGTAAC
AATTATTCTAGATTTGGGGTTGGTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGAATGTTGATTGATTCGGGTACTACTTATACTCATTTACCTGAACC
ATTATATTCACAACTTATTTCTAATATTGATTCAGTGATAACCTATCCTAGAGCCAAACAAGTAGAAATCAATACTGGATTTGATCTTTGTTATAAAGTTCCTTGTAAAA
ATAACACTACTACTTTTTTTGCTGATGATTCTCAACTTCCTTCTATAACATTCCATTTCTTGAACAATGTTAGCGTTGTTTTGCCTCAAGGAAATAACTTCTATGCCATG
GCTGCTCCAAGTAACTCCACTGTGGTGAAATGCTTGTTGTTTCAAAGCATGGACGGTAGCGGCGGCGATAGTGATGACGATGGCGGAGACGGGCCGGCGGGGATTTTCGG
AAGCTTCCAACAGCAAAATTTGGAGGTTGTTTATGACTTGGAGAAGGAAAGGTTAGGGTTTCTACCGATGGATTGTGCTTCTGTTGCTGCCTCTCAAGGACTCCACAAGA
ATTTTAGAAGGAATGAGAGCTGA
Protein sequenceShow/hide protein sequence
MPSTATSIAAKVLSFFLLLLVHVSAINLGQALANSNPKSKIPTDSLVLGLVHSRTSLLTPKRSYQNSISRKRIKPMEMDDDVIEPLREIRDGYLMSLTLGTPPQVVQVYM
DTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTRDAISM
HGNSPNSTKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDHLQFTPLLKSPIYPNYYYIGLESITIGNGN
NYSRFGVGSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNIDSVITYPRAKQVEINTGFDLCYKVPCKNNTTTFFADDSQLPSITFHFLNNVSVVLPQGNNFYAM
AAPSNSTVVKCLLFQSMDGSGGDSDDDGGDGPAGIFGSFQQQNLEVVYDLEKERLGFLPMDCASVAASQGLHKNFRRNES