CuGenDBv2

CmUC01G009090 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC01G009090
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Peptidase A1 domain-containing protein
Genome location: CmU531Chr01:10844264..10845796
RNA-Seq Expression: CmUC01G009090
Synteny: CmUC01G009090
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6583807.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.6e-245 | % identity: 84.37
Query:  LSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGS
        L   L+LV VS + +    ANPKT F  DSLVLGLVHSRTSLL PK+GYNS+SRKR+K MEMG+DD VIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGS
Subjt:  LSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGS

Query:  DLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLT
        DLTWVPCGNLSFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGASG+VIGTLT
Subjt:  DLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLT

Query:  RDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKS
        +DV+F+HG   NSPNS+++IP+FCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK +HL+FTPLLKS
Subjt:  RDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKS

Query:  PIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFID
        P YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQ+ISNLES+I+YPRAK+ ELNTGFDLCYKVP +NN F F D
Subjt:  PIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFID

Query:  DSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQGLHK
        + +LPSITFHFLNNVSVVLPQGN+FYAMAAP NSTVVKCLLFQ+MDG      D DGPAGIFGSFQQQNLEVVYDLEKERLGF+ MDCASVA +QGLHK
Subjt:  DSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQGLHK

XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus] | e value: 1.7e-271 | % identity: 88.89
Query:  MPSIST-SIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEM-GSDDNVIEPLREIRDGYLMSLTLGTPP
        MPSIS+ S ATKFLS FLLLV+VS +TLA NPKTNFP DSLVLGLVHSRTSLL PKKGYN IS+KR+K M+    DDNVIEPLREIRDGYLMSL++GTPP
Subjt:  MPSIST-SIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEM-GSDDNVIEPLREIRDGYLMSLTLGTPP

Query:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY
        QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GP+LAAFLPTHSSTSIR+TCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKG CPRPCPSFAYTY
Subjt:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY

Query:  GASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
        GASGVV G+LTRDVLF HGN  N+ N+ KQIPRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
Subjt:  GASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD

Query:  DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKV
        ++LQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKV
Subjt:  DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKV

Query:  PCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDC
        PC+NNN SF+DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLL+Q+MDGVG DND D +GPAGIFGSFQQQN+EVVYDLEKERLGFQPMDC
Subjt:  PCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDC

Query:  ASVAATQGLHKNV
         SVAA QGLHKNV
Subjt:  ASVAATQGLHKNV

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] | e value: 2.4e-273 | % identity: 89.36
Query:  MPSI-STSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTM-EMGSDDNVIEPLREIRDGYLMSLTLGTPP
        MPSI STSIATKFLS FLLLV+ S++TLA NPKTNFP DSLVLGLVHSRTSLL PKKGYN IS+KR+K M +M  DDNVIEPLREIRDGYLMSL++GTPP
Subjt:  MPSI-STSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTM-EMGSDDNVIEPLREIRDGYLMSLTLGTPP

Query:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY
        QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIR+TCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSFAYTY
Subjt:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY

Query:  GASGVVIGTLTRDVLFMHG----NNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        GASGVV G+LTRDVLFMHG    NN N+ N+ KQ+PRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAI
Subjt:  GASGVVIGTLTRDVLFMHG----NNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDL
        SSKD++LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVI+YPRAKQVELNTGFDL
Subjt:  SSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDL

Query:  CYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQ
        CYKVPC+NNN SF+DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLL+Q+MDGVG DND D +GPAGIFGSFQQQNL+VVYDLEKERLGFQ
Subjt:  CYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQ

Query:  PMDCASVAATQGLHKNV
         MDC SVAA QGLHKNV
Subjt:  PMDCASVAATQGLHKNV

XP_023520027.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo] | e value: 1.1e-246 | % identity: 83.27
Query:  ISTSIATKFLSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQV
        +++ +A  F    L+LV VS + +    ANPKT F  DSLVLGLVHSRTSLL PK+GYNS+SRKR+K MEMG+DD VIEPLREIRDGYLMSLTLGTPPQV
Subjt:  ISTSIATKFLSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQV

Query:  IQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGA
        IQVYMDTGSDLTWVPCGNLSFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGA
Subjt:  IQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGA

Query:  SGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDH
        SG+VIGTLT+DV+F+HG   NSPNS+++IP+FCFGCVGA+YREPIGIAGFGRGLLSLP+QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK +H
Subjt:  SGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDH

Query:  LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPC
        L+FTPLLKSP YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+I+YPRAK+ ELNTGFDLCYKVP 
Subjt:  LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPC

Query:  RNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
        +NN F F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAP NSTVVKCLLFQ+MDG      D DGPAGIFGSFQQQNLEVVYDLEKERLGF+ MDCASV
Subjt:  RNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

Query:  AATQGLHK
        A +QGLHK
Subjt:  AATQGLHK

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida] | e value: 2.9e-279 | % identity: 92.77
Query:  MPSISTSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQV
        M SISTS A K LS+FLLLVYVSRKTLA NPKTN P DSLV+GLVHSRT+LL PKKGYN ISRKR+K MEM  DDNVIEPLREIRDGYLMSLTLGTPPQV
Subjt:  MPSISTSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQV

Query:  IQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGA
        IQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNV GPKLAAFLPTHSSTSIR+TCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK  CPRPCPSFAYTYGA
Subjt:  IQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGA

Query:  SGVVIGTLTRDVLFMHGNNINSPN-STKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDD
        SGVVIGTLTRDVL MH NNINSPN STK+ PRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLA+SSKD+
Subjt:  SGVVIGTLTRDVLFMHGNNINSPN-STKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDD

Query:  HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVP
        HLQFTPLLKSPIYPNYYYIGLESITIGNGN+NFRFGVSF LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVI+YPRAKQVELNTGFDLCYKVP
Subjt:  HLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVP

Query:  CRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGD-NDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA
        C+NNNFSFIDDSQLPSITFHFLNNVSVVLPQ NNFYAMAAPINSTVVKCLLFQ+MDGVGGD +DD DGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA
Subjt:  CRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGD-NDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA

Query:  SVAATQGLHKNV
         VAATQGLHKNV
Subjt:  SVAATQGLHKNV

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LYP0 Peptidase A1 domain-containing protein | e value: 8.2e-272 | % identity: 88.89
Query:  MPSIST-SIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEM-GSDDNVIEPLREIRDGYLMSLTLGTPP
        MPSIS+ S ATKFLS FLLLV+VS +TLA NPKTNFP DSLVLGLVHSRTSLL PKKGYN IS+KR+K M+    DDNVIEPLREIRDGYLMSL++GTPP
Subjt:  MPSIST-SIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEM-GSDDNVIEPLREIRDGYLMSLTLGTPP

Query:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY
        QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GP+LAAFLPTHSSTSIR+TCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKG CPRPCPSFAYTY
Subjt:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY

Query:  GASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
        GASGVV G+LTRDVLF HGN  N+ N+ KQIPRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD
Subjt:  GASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKD

Query:  DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKV
        ++LQFTPLLKSP+YPNYYYIGLESITIGNG+NNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE VI YPRAKQVELNTGFDLCYKV
Subjt:  DHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKV

Query:  PCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDC
        PC+NNN SF+DD+QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLL+Q+MDGVG DND D +GPAGIFGSFQQQN+EVVYDLEKERLGFQPMDC
Subjt:  PCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDC

Query:  ASVAATQGLHKNV
         SVAA QGLHKNV
Subjt:  ASVAATQGLHKNV

A0A1S3CAK9 aspartic proteinase nepenthesin-2 | e value: 1.1e-273 | % identity: 89.36
Query:  MPSI-STSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTM-EMGSDDNVIEPLREIRDGYLMSLTLGTPP
        MPSI STSIATKFLS FLLLV+ S++TLA NPKTNFP DSLVLGLVHSRTSLL PKKGYN IS+KR+K M +M  DDNVIEPLREIRDGYLMSL++GTPP
Subjt:  MPSI-STSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTM-EMGSDDNVIEPLREIRDGYLMSLTLGTPP

Query:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY
        QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIR+TCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSFAYTY
Subjt:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY

Query:  GASGVVIGTLTRDVLFMHG----NNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        GASGVV G+LTRDVLFMHG    NN N+ N+ KQ+PRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAI
Subjt:  GASGVVIGTLTRDVLFMHG----NNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDL
        SSKD++LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVI+YPRAKQVELNTGFDL
Subjt:  SSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDL

Query:  CYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQ
        CYKVPC+NNN SF+DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLL+Q+MDGVG DND D +GPAGIFGSFQQQNL+VVYDLEKERLGFQ
Subjt:  CYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQ

Query:  PMDCASVAATQGLHKNV
         MDC SVAA QGLHKNV
Subjt:  PMDCASVAATQGLHKNV

A0A5A7TNC9 Aspartic proteinase nepenthesin-2 | e value: 1.1e-273 | % identity: 89.36
Query:  MPSI-STSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTM-EMGSDDNVIEPLREIRDGYLMSLTLGTPP
        MPSI STSIATKFLS FLLLV+ S++TLA NPKTNFP DSLVLGLVHSRTSLL PKKGYN IS+KR+K M +M  DDNVIEPLREIRDGYLMSL++GTPP
Subjt:  MPSI-STSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTM-EMGSDDNVIEPLREIRDGYLMSLTLGTPP

Query:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY
        QV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+ GPKLAAFLPTHSSTSIR+TCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSFAYTY
Subjt:  QVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTY

Query:  GASGVVIGTLTRDVLFMHG----NNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI
        GASGVV G+LTRDVLFMHG    NN N+ N+ KQ+PRFCFGCVGA+YREPIGIAGFGRGLLSLPFQLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAI
Subjt:  GASGVVIGTLTRDVLFMHG----NNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI

Query:  SSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDL
        SSKD++LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVI+YPRAKQVELNTGFDL
Subjt:  SSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDL

Query:  CYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQ
        CYKVPC+NNN SF+DDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLL+Q+MDGVG DND D +GPAGIFGSFQQQNL+VVYDLEKERLGFQ
Subjt:  CYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDND-DSDGPAGIFGSFQQQNLEVVYDLEKERLGFQ

Query:  PMDCASVAATQGLHKNV
         MDC SVAA QGLHKNV
Subjt:  PMDCASVAATQGLHKNV

A0A6J1EHM1 probable aspartyl protease At4g16563 | e value: 4.3e-244 | % identity: 83.3
Query:  ATKFLSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYM
        A  +    L+LV VS + +    ANPKT F  DSLVLGLVHSRTSLL PK+GYNS+  KR+K MEMG+DD VIEPLREIRDGYLMSLTLGTPPQVIQVYM
Subjt:  ATKFLSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYM

Query:  DTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVI
        DTGSDLTWVPCGNLSFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIR+TCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGASG+VI
Subjt:  DTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVI

Query:  GTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTP
        GTLT+DV+F+HG   NSPNS+++IP+FCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK +HL+FTP
Subjt:  GTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTP

Query:  LLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNF
         LKSP YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+I+YPRAK+ ELNTGFDLCYKVP +NN F
Subjt:  LLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNF

Query:  SFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQG
         F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAP NSTVVKCLLFQ+MDG      D DGPAGIFGSFQQQNLEVVYDLEKERLGF+ MDCASVA +QG
Subjt:  SFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQG

Query:  LHK
        LHK
Subjt:  LHK

A0A6J1KLG7 probable aspartyl protease At4g16563 | e value: 2.5e-244 | % identity: 84.51
Query:  FFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDL
        F L+LV VS + +    ANPKT F  DSLVLGLVHSRTSLL PK+GYNS+SRKR+K MEMG DD+VIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDL
Subjt:  FFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDL

Query:  TWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRD
        TWVPCGNLSFDCQDC+EYQNNVLGPKLAAFLPTHSSTSIRETCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSF+YTYGASG+VIGTLT+D
Subjt:  TWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRD

Query:  VLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPI
         +F+HG   NSPNS+++IP+FCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNP FSSPLILGNLAISSK +HL+FTPLLKSP 
Subjt:  VLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPI

Query:  YPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDS
        YPNYYYIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLIS LES+I+YPRAK+ ELNTGFDLCYKVP +NN F F D+ 
Subjt:  YPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDS

Query:  QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQGLHK
        +LPSITFHFLNNVSVVLPQGN+FYAMAAP NSTVVKCLLFQ+MDG      D DGPAGIFGSFQQQNLEVVYDLEKERLGF+ MDCASVA +QGLHK
Subjt:  QLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQGLHK

SwissProt top hits (columns: hit, e value, % identity, alignment)
Q6XBF8 Aspartic proteinase CDR1 | e value: 2.1e-30 | % identity: 28.27
Query:  IATKFLSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIP---------KKGYNSISRKRLKTMEMGSDDNVIEPLREIRDG---YLMSL
        +A+ F S  L L  +S   L+   A PK  F  D     L+H R S   P         ++  N+I R   +       DN  +P  ++      YLM++
Subjt:  IATKFLSFFLLLVYVSRKTLA---ANPKTNFPTDSLVLGLVHSRTSLLIP---------KKGYNSISRKRLKTMEMGSDDNVIEPLREIRDG---YLMSL

Query:  TLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCP
        ++GTPP  I    DTGSDL W  C      C DC    + +  PK        SST    +C SS C  + +          A CS        C     
Subjt:  TLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCP

Query:  SFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPL
        S++ +YG +    G +  D L + G++   P   K I     GC     G   ++  GI G G G +SL  QLG S  G FS+C +P   ++  + +S +
Subjt:  SFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPL

Query:  ILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVE
          G  AI S    +  TPL+       +YY+ L+SI++G+    +    S           G ++IDSGTT T LP   YS+L   + S I     K+ +
Subjt:  ILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVE

Query:  LNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEK
          +G  LCY         S   D ++P IT HF +   V L   N F  +     S  + C  F+                 I+G+  Q N  V YD   
Subjt:  LNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEK

Query:  ERLGFQPMDCASV
        + + F+P DCA +
Subjt:  ERLGFQPMDCASV

Q766C2 Aspartic proteinase nepenthesin-2 | e value: 2.6e-33 | % identity: 27.88
Query:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNC
        YLM++ +GTP       MDTGSDL W  C      C  C      +       F P  SS+     C S +C D+     P + C    C          
Subjt:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNC

Query:  PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNF
              + Y YG      G +  +              T  +P   FGC     G       G+ G G G LSLP QLG     FS+C   +  S+    
Subjt:  PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNF

Query:  SSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRA
         S L LG+ A S   +    T L+ S + P YYYI L+ IT+G  N     G+     ++   G GGM+IDSGTT T+LP+  Y+ +       I  P  
Subjt:  SSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRA

Query:  KQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVY
           E ++G   C++ P   +        Q+P I+  F   V  +  Q      + +P    +  CL   +   +G           IFG+ QQQ  +V+Y
Subjt:  KQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVY

Query:  DLEKERLGFQPMDCAS
        DL+   + F P  C +
Subjt:  DLEKERLGFQPMDCAS

Q766C3 Aspartic proteinase nepenthesin-1 | e value: 4.7e-30 | % identity: 28.96
Query:  KRLKTME-MGSDDNVIEPLREIRDG-YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMD
        +RL+ +E M +  + +E      DG YLM+L++GTP Q     MDTGSDL W         CQ C +  N         F P  SS+     C S  C  
Subjt:  KRLKTME-MGSDDNVIEPLREIRDG-YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMD

Query:  IHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSL
        + S                      C      + Y YG      G++  + L            +  IP   FGC     G       G+ G GRG LSL
Subjt:  IHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSL

Query:  PFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT
        P QL  +   FS+C  P   S   N    L+LG+LA +S       T L++S   P +YYI L  +++G+         +F L      G GG++IDSGT
Subjt:  PFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGT

Query:  TYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGV
        T T+     Y  +     S I  P       ++GFDLC++ P   +N       Q+P+   HF +   + LP  N F    +P N  +  CL        
Subjt:  TYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGV

Query:  GGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS
              S     IFG+ QQQN+ VVYD     + F    C +
Subjt:  GGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCAS

Q940R4 Probable aspartyl protease At4g16563 | e value: 3.8e-56 | % identity: 34.45
Query:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNC
        YL+SL++G+    + +Y+DTGSDL W PC    F C  CE   +  L P   + L   SS++   +C S  C   HSS    D C I+ C L  +  G+C
Subjt:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNC

Query:  ---PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SN
             PCP F Y YG  G ++  L  D L +   ++++         F FGC   +  EPIG+AGFGRG LSLP QL     H G  FS+C +   F S+
Subjt:  ---PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SN

Query:  NPNFSSPLILGNLA------ISSKDDH------------LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTH
             SPLILG         + + DDH              FT +L++P +P +Y + L+ I+IG  N          LR ID  G GG+++DSGTT+T 
Subjt:  NPNFSSPLILGNLA------ISSKDDH------------LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLISNLESVI--AYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APINSTVVKCLLFQTM
        LP   Y+ ++   +S +   + RA +VE ++G   CY +             ++P++  HF  N  SV LP+ N FY              + CL+    
Subjt:  LPEPLYSQLISNLESVI--AYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APINSTVVKCLLFQTM

Query:  DGVGGDNDDSDGPAG-IFGSFQQQNLEVVYDLEKERLGFQPMDCASV
           GGD  +  G  G I G++QQQ  EVVYDL   R+GF    CAS+
Subjt:  DGVGGDNDDSDGPAG-IFGSFQQQNLEVVYDLEKERLGFQPMDCASV

Q9LNJ3 Aspartyl protease family protein 2 | e value: 3.4e-33 | % identity: 28.31
Query:  GSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDP
        G   +V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S       
Subjt:  GSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDP

Query:  CTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC--------VGASYREPIGIAGFGRGLLSLPFQLG-
           AGC        N  R    +  +YG     +G  + + L    N +              GC        VGA+     G+ G G+G LS P Q G 
Subjt:  CTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC--------VGASYREPIGIAGFGRGLLSLPFQLG-

Query:  -FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTH
         F+ K FS+C +    S+ P   S ++ GN A+S      +FTPLL +P    +YY+GL  I++G        GV+  L ++D  GNGG++IDSGT+ T 
Subjt:  -FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQ-TMDGVGGD
        L  P Y  +       +     K+    + FD C+       + S +++ ++P++  HF     V LP  N       P+++    C  F  TM G+   
Subjt:  LPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQ-TMDGVGGD

Query:  NDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA
                 I G+ QQQ   VVYDL   R+GF P  CA
Subjt:  NDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCA

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G03200.1 Eukaryotic aspartyl protease family protein | e value: 5.8e-36 | % identity: 29.32
Query:  STSIATKFLSFFLLL------VYVSRKTLAAN--PKTNFPTDSLVLGLVH--SRTSLLIPKKGYNSISRKRLKTMEMGS----------DD--NVIEPLR
        S+S ++    FFL+L      V  SR++L     PK N P     L L H  S  +L   +K    I+R   +   +G+          DD  N+  P  
Subjt:  STSIATKFLSFFLLL------VYVSRKTLAAN--PKTNFPTDSLVLGLVH--SRTSLLIPKKGYNSISRKRLKTMEMGS----------DD--NVIEPLR

Query:  EIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATL
             +LM L++G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C                      L
Subjt:  EIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATL

Query:  VKGNC--PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFK
         + NC   +    + YTYG      G L  +       N         I    FGC     G  + +  G+ G GRG LSL  QL      FS+C    +
Subjt:  VKGNC--PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC----VGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFK

Query:  FSNNPNFSSPLILGNLAI-------SSKDDHLQFT-PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYS
         S     SS L +G+LA        +S D  +  T  LL++P  P++YY+ L+ IT+G      R  V     E+   G GGM+IDSGTT T+L E  + 
Subjt:  FSNNPNFSSPLILGNLAI-------SSKDDHLQFT-PLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYS

Query:  QLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPA
         L     S ++ P       +TG DLC+K+P    N +      +P + FHF     + LP G N+  M A  +ST V CL   + +G+           
Subjt:  QLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPA

Query:  GIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
         IFG+ QQQN  V++DLEKE + F P +C  +
Subjt:  GIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

AT3G25700.1 Eukaryotic aspartyl protease family protein | e value: 2.4e-37 | % identity: 28.06
Query:  FLSFFLL----LVYVSRKT----LAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQV
        FLS FLL    +  VS       L    K+ FP+ +  L L   R   L       S+ RK +  ++      V+         Y + L +G PPQ + +
Subjt:  FLSFFLL----LVYVSRKT----LAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQV

Query:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGV
          DTGSDL WV C      C++C  +           F P HSST     C    C  +   D        A     T +   C      + Y Y    +
Subjt:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGV

Query:  VIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC---------VGASYREPIGIAGFGRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNL
          G   R+   +      S     ++    FGC          G S+    G+ G GRG +S   QLG  F +K FS+C + +  S  P  +S LI+GN 
Subjt:  VIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGC---------VGASYREPIGIAGFGRGLLSLPFQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNL

Query:  AISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGF
                L FTPLL +P+ P +YY+ L+S+ +    N  +  +   + EID  GNGG ++DSGTT   L EP Y  +I+ +   +  P A    L  GF
Subjt:  AISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGF

Query:  DLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGF
        DLC  V           +  LP + F F      V P  N F           ++CL  Q++D   G          + G+  QQ     +D ++ RLGF
Subjt:  DLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGF

Query:  QPMDCA
            CA
Subjt:  QPMDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein | e value: 4.8e-46 | % identity: 32.94
Query:  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-AAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG
        GY +SL+ GTP Q I    DTGS L W+PC +  + C  C+    + L P L   F+P +SS+S    C S  C  ++    P   C   GC   T    
Subjt:  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKL-AAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG

Query:  NCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSS
        NC   CP +   YG  G   G L  +        ++ P+ T  +P F  GC   S R+P GIAGFGRG +SLP Q+    K FSHC +  +F ++ N ++
Subjt:  NCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSS

Query:  PLILGNLA---ISSKDDHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV
         L L   +     SK   L +TP  K+P   N     YYY+ L  I +G  +      + +K     T G+GG ++DSG+T+T +  P++  +     S 
Subjt:  PLILGNLA---ISSKDDHLQFTPLLKSPIYPN-----YYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV

Query:  IA-YPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQ
        ++ Y R K +E  TG   C+       N S   D  +P + F F     + LP  +N++      ++  +  +  +T++  GG      GPA I GSFQQ
Subjt:  IA-YPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQ

Query:  QNLEVVYDLEKERLGFQPMDCA
        QN  V YDLE +R GF    C+
Subjt:  QNLEVVYDLEKERLGFQPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein | e value: 2.7e-57 | % identity: 34.45
Query:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNC
        YL+SL++G+    + +Y+DTGSDL W PC    F C  CE   +  L P   + L   SS++   +C S  C   HSS    D C I+ C L  +  G+C
Subjt:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNC

Query:  ---PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SN
             PCP F Y YG  G ++  L  D L +   ++++         F FGC   +  EPIG+AGFGRG LSLP QL     H G  FS+C +   F S+
Subjt:  ---PRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGF--SHKG--FSHCFLPFKF-SN

Query:  NPNFSSPLILGNLA------ISSKDDH------------LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTH
             SPLILG         + + DDH              FT +L++P +P +Y + L+ I+IG  N          LR ID  G GG+++DSGTT+T 
Subjt:  NPNFSSPLILGNLA------ISSKDDH------------LQFTPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTH

Query:  LPEPLYSQLISNLESVI--AYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APINSTVVKCLLFQTM
        LP   Y+ ++   +S +   + RA +VE ++G   CY +             ++P++  HF  N  SV LP+ N FY              + CL+    
Subjt:  LPEPLYSQLISNLESVI--AYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFL-NNVSVVLPQGNNFYAMA----APINSTVVKCLLFQTM

Query:  DGVGGDNDDSDGPAG-IFGSFQQQNLEVVYDLEKERLGFQPMDCASV
           GGD  +  G  G I G++QQQ  EVVYDL   R+GF    CAS+
Subjt:  DGVGGDNDDSDGPAG-IFGSFQQQNLEVVYDLEKERLGFQPMDCASV

AT5G45120.1 Eukaryotic aspartyl protease family protein (e value: 2.6e-169; % identity: 59.65)
Query:  TKFLSFFL---LLVYVSRKTLAANPKTNFPTDS--LVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVY
        T  L  FL   LL+  + KT A   K    + S  LVL L  S  SL  PK    S +++R+K   + S D V+EPLRE+RDGYL++L +GTPPQ +QVY
Subjt:  TKFLSFFL---LLVYVSRKTLAANPKTNFPTDS--LVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVY

Query:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVV
        +DTGSDLTWVPCGNLSFDC +C + +NN L    + F P HSSTS R++C SSFC++IHSSDNPFDPC +AGCS++ L+K  C RPCPSFAYTYG  G++
Subjt:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVV

Query:  IGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDDHLQF
         G LTRD+L            T+ +PRF FGCV ++YREPIGIAGFGRGLLSLP QLGF  KGFSHCFLPFKF NNPN SSPLILG  A+S +  D LQF
Subjt:  IGTLTRDVLFMHGNNINSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAIS-SKDDHLQF

Query:  TPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNN
        TP+L +P+YPN YYIGLESITI  G N     V   LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ L+S I YPRA + E  TGFDLCYKVPC NN
Subjt:  TPLLKSPIYPNYYYIGLESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNN

Query:  NFSFIDDSQL---PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV
        N + +++  +   PSITFHFLNN +++LPQGN+FYAM+AP + +VV+CLLFQ M+      D   GPAG+FGSFQQQN++VVYDLEKER+GFQ MDC   
Subjt:  NFSFIDDSQL---PSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASV

Query:  AATQGLHK
        AA+ GL++
Subjt:  AATQGLHK


Sequences
CDS sequence
ATGCCTTCAATATCAACTTCCATTGCCACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTGCAAACCCTAAAACCAATTTTCCCAC
AGATTCTCTAGTTCTTGGCCTTGTTCATTCAAGAACATCCCTCCTTATCCCTAAAAAAGGCTATAATTCCATTTCAAGGAAGAGACTGAAGACAATGGAAATGGGTAGTG
ATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGAT
CTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTC
TACTTCTATTAGAGAAACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCCTTGCTACCCTTGTGA
AGGGCAATTGCCCTAGGCCATGCCCTTCCTTTGCTTACACTTATGGGGCGAGTGGGGTTGTAATTGGAACTTTAACAAGAGATGTCCTTTTCATGCATGGAAATAATATT
AATTCTCCAAATTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGTAGAGGTTTACTTTCTCT
TCCTTTCCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGTTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTGCTA
TTTCTTCAAAAGATGACCATTTGCAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAATCAATCACTATAGGAAACGGAAATAAT
AATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGTATGTTGATTGATTCTGGTACTACTTACACTCATTTACCTGAACCTTTGTA
TTCACAACTTATTTCAAATCTTGAGTCAGTGATAGCCTATCCAAGAGCCAAACAAGTTGAACTCAATACTGGATTTGATCTTTGTTATAAAGTTCCTTGTAGAAACAACA
ATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCACAAGGAAATAACTTCTATGCCATGGCTGCTCCA
ATTAACTCCACTGTGGTTAAATGCTTGTTGTTTCAAACCATGGACGGTGTCGGTGGCGATAACGACGACAGCGACGGGCCGGCTGGCATTTTTGGAAGCTTCCAACAACA
AAATTTAGAGGTTGTTTATGACTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTCAAGGACTCCACAAGAATGTTTGA
mRNA sequence
ATGCCTTCAATATCAACTTCCATTGCCACAAAATTCTTGAGCTTTTTCCTTCTTCTTGTATATGTCTCAAGAAAAACCCTAGCTGCAAACCCTAAAACCAATTTTCCCAC
AGATTCTCTAGTTCTTGGCCTTGTTCATTCAAGAACATCCCTCCTTATCCCTAAAAAAGGCTATAATTCCATTTCAAGGAAGAGACTGAAGACAATGGAAATGGGTAGTG
ATGATAATGTGATAGAGCCATTGAGAGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGAT
CTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGTCAAGATTGTGAGGAGTATCAAAACAATGTTTTAGGCCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTC
TACTTCTATTAGAGAAACTTGTGGTAGCTCCTTTTGTATGGATATTCATAGCTCTGATAACCCTTTTGATCCTTGCACAATTGCTGGTTGTTCCCTTGCTACCCTTGTGA
AGGGCAATTGCCCTAGGCCATGCCCTTCCTTTGCTTACACTTATGGGGCGAGTGGGGTTGTAATTGGAACTTTAACAAGAGATGTCCTTTTCATGCATGGAAATAATATT
AATTCTCCAAATTCCACTAAGCAAATCCCTAGGTTTTGTTTTGGATGTGTTGGTGCAAGTTATAGAGAGCCAATTGGGATTGCTGGTTTTGGTAGAGGTTTACTTTCTCT
TCCTTTCCAATTAGGGTTTTCTCATAAGGGGTTTTCTCATTGTTTCTTGCCTTTTAAATTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATTCTTGGTAACCTTGCTA
TTTCTTCAAAAGATGACCATTTGCAATTTACTCCTTTGTTGAAAAGTCCAATTTACCCCAACTACTACTATATTGGGCTTGAATCAATCACTATAGGAAACGGAAATAAT
AATTTTAGATTTGGTGTTTCTTTCAAATTGAGAGAGATTGATACAAAGGGTAATGGGGGTATGTTGATTGATTCTGGTACTACTTACACTCATTTACCTGAACCTTTGTA
TTCACAACTTATTTCAAATCTTGAGTCAGTGATAGCCTATCCAAGAGCCAAACAAGTTGAACTCAATACTGGATTTGATCTTTGTTATAAAGTTCCTTGTAGAAACAACA
ATTTTTCTTTTATTGATGACTCTCAACTCCCTTCTATAACATTCCATTTTTTGAATAATGTTAGTGTTGTTTTGCCACAAGGAAATAACTTCTATGCCATGGCTGCTCCA
ATTAACTCCACTGTGGTTAAATGCTTGTTGTTTCAAACCATGGACGGTGTCGGTGGCGATAACGACGACAGCGACGGGCCGGCTGGCATTTTTGGAAGCTTCCAACAACA
AAATTTAGAGGTTGTTTATGACTTGGAGAAGGAAAGATTAGGGTTTCAACCAATGGATTGTGCTTCTGTTGCTGCCACTCAAGGACTCCACAAGAATGTTTGA
Protein sequence
MPSISTSIATKFLSFFLLLVYVSRKTLAANPKTNFPTDSLVLGLVHSRTSLLIPKKGYNSISRKRLKTMEMGSDDNVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSD
LTWVPCGNLSFDCQDCEEYQNNVLGPKLAAFLPTHSSTSIRETCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGNCPRPCPSFAYTYGASGVVIGTLTRDVLFMHGNNI
NSPNSTKQIPRFCFGCVGASYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDDHLQFTPLLKSPIYPNYYYIGLESITIGNGNN
NFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIAYPRAKQVELNTGFDLCYKVPCRNNNFSFIDDSQLPSITFHFLNNVSVVLPQGNNFYAMAAP
INSTVVKCLLFQTMDGVGGDNDDSDGPAGIFGSFQQQNLEVVYDLEKERLGFQPMDCASVAATQGLHKNV
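
The protein sequence above appears to be the direct translation of the CDS under the standard genetic code. The following minimal Python sketch shows one way to check that correspondence. It assumes the standard codon table, and the names `translate`, `cds_start`, and `cds_end` are illustrative, not part of CuGenDB. Only the first 30 and last 15 nucleotides of the CDS are used here; the full sequence can be pasted in the same way.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGCCTTCAATATCAACTTCCATTGCCACA"
cds_end = "CACAAGAATGTTTGA"

print(translate(cds_start))  # MPSISTSIAT
print(translate(cds_end))    # HKNV
```

The two outputs match the first ten and last four residues of the protein sequence (`MPSISTSIAT...` and `...HKNV`), with the final `TGA` acting as the stop codon.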