CuGenDBv2

Carg21439 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg21439
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Peptidase A1 domain-containing protein
Genome location: Carg_Chr13:6044844..6056922
RNA-Seq Expression: Carg21439
Synteny: Carg21439
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR006634 - TRAM/LAG1/CLN8 homology domain
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
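
The genome location string in this record (Carg_Chr13:6044844..6056922) follows a `sequence:start..end` pattern. A minimal sketch of splitting it into its parts is given here; the function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"(?P<seq>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seq"], int(match["start"]), int(match["end"])


# Location copied from the record above.
seq_id, start, end = parse_location("Carg_Chr13:6044844..6056922")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seq_id, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` should be dropped.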


Homology
GenBank top hits | e value | % identity | Alignment
EOY27492.1 Aspartyl protease family protein [Theobroma cacao] | e value: 4.6e-291 | % identity: 68.04
Query:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS
        A+  + VMAIKSY SQA  LV+NYLLAD  IPYTS+LGGILACK+ YDLTQ++SNFY K+Y  LTKIQRVEWNNRG+STIHAI++S +SLYFVFWSDLFS
Subjt:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS

Query:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY
        DQ+ AGLV F+SS LSTF LG+SVGYFL DLG+I+WLYPSLGGMEYV+HHS+SG+AVAY++F+GE QLYTYMVLISE+TTPEINMRWYLDTAGMKRS AY
Subjt:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY

Query:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK
        LINGVVIFFAWL+AR+LLFGY FYHVYLHYDQ  +   +G  L       L                   +S+VLGL  S TS   PK   +  SRKR+ 
Subjt:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK

Query:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP
         +     D++E LR +RDGYL++L +GTP QVIQVYMDTGSDLTWVPCGN+SFDC DCD+Y+NN L   +  F P+HSS+++RD+CGSSFCIDIHSSDN 
Subjt:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP

Query:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC
        FDPC  AGCSL+TL+K TC RPCPSF+YTYG  GLV G LT+D + +HG+SP  +R IP+F FGCVG+TYREPIGIAGFG+G+LS+PSQLGF  KGFSHC
Subjt:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC

Query:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS
        FL FK++NNPN SSPL +G++AISS ++L+FTP+LKSP +PNYYYIGLE+IT+G   N S   V L LRE D++GNGG+LIDSGTTYTHLPEP YSQL+S
Subjt:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS

Query:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL
         L+S+++YPRA + E  TGFDLCY+VP  NN F +D F  P+ITFHFLNNVS+VLPQ N FYAM+APSNST VKCLLFQSMD    GPAG+FG+FQQQN+
Subjt:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL

Query:  EVVYDLEKERLGFEGMDCASVAVSQGLHK
        +VVYDLEKER+GF+ MDCA+ A SQGLHK
Subjt:  EVVYDLEKERLGFEGMDCASVAVSQGLHK

KAG6583807.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.2e-285 | % identity: 99.79
Query:  GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF
        GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF
Subjt:  GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF

Query:  DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN
        DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN
Subjt:  DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN

Query:  SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI
        SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI
Subjt:  SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI

Query:  GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV
        GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQ+ISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV
Subjt:  GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV

Query:  VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE
        VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE
Subjt:  VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE

KAG7019432.1 putative aspartyl protease [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 100
Query:  MAMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLF
        MAMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLF
Subjt:  MAMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLF

Query:  SDQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCA
        SDQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCA
Subjt:  SDQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCA

Query:  YLINGVVIFFAWLVARILLFGYTFYHVYLHYDQKKGEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRD
        YLINGVVIFFAWLVARILLFGYTFYHVYLHYDQKKGEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRD
Subjt:  YLINGVVIFFAWLVARILLFGYTFYHVYLHYDQKKGEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRD

Query:  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGT
        GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGT
Subjt:  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGT

Query:  CPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
        CPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL
Subjt:  CPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLIL

Query:  GNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNT
        GNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNT
Subjt:  GNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNT

Query:  GFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDC
        GFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDC
Subjt:  GFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDC

Query:  ASVAVSQGLHKKE
        ASVAVSQGLHKKE
Subjt:  ASVAVSQGLHKKE

XP_011460458.1 PREDICTED: probable aspartic protease At2g35615 isoform X1 [Fragaria vesca subsp. vesca] | e value: 1.3e-296 | % identity: 69.3
Query:  MAMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLF
        MA+  KT MAIKSYQ+QA+ LV++YLLAD  +PYTSV+GGI  CKLVYDLTQ++S FY KSY  LTKIQR+EWN+RG+S+IHAI+I+++SLYFVFWSDLF
Subjt:  MAMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLF

Query:  SDQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCA
        SDQ+ AGLVTF+SS+LS F LG+SVGYF ADLG+++WLYPSLGGMEYV+HHSL+G+AVAYS+FSGEGQLYTYM+LISEITTPEINMRWYLDTAGMKRS A
Subjt:  SDQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCA

Query:  YLINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTL--ANPKT----------KFLK-----DSLVLGLVHSRTSLLTPKRGYNSLSRKRIK
        YLING+VIFF+WLV RILLFGY FYHVYLHYDQ  +    G  L  A P            K +K      SLVLG+ HSR+S+ +P    NS   K+I 
Subjt:  YLINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTL--ANPKT----------KFLK-----DSLVLGLVHSRTSLLTPKRGYNSLSRKRIK

Query:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP
           +   D++EPLRE+RDGYL+SL LGTPPQVIQVYMDTGSDLTWVPCGNLSF C DCD+Y+N +L P    F P+ SS+S+RD CGS FC DIHSSDNP
Subjt:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP

Query:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNS---PNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGF
         DPCTIAGCSL+TL+KGTCPRPCPSF+YTYGA G+V+GTL++D + +HG S    N + +IP FCFGC+G+T+REPIGIAGFGRG LSLPSQLGF  KGF
Subjt:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNS---PNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGF

Query:  SHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQ
        SHCFL FK+ NNPN SSPL++G++AISSK++L+FTP+LKSP YPN YYIGLE+ITIGN    S   V L LRE D++GNGG+LIDSGTTYTHLPEP YS 
Subjt:  SHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQ

Query:  LISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQ
        ++S L+SLI+YPRAKE E+ T FDLCYKVPY  N F  + F  PSITFHFLNNVS+ LPQGN FYAM AP NSTVVKCLLFQ+MD    GPAG+FGSFQQ
Subjt:  LISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQ

Query:  QNLEVVYDLEKERLGFEGMDCASVAVSQGLHKK
        QN+EVVYDL+K+R+GF+ MDCAS A SQGLHK+
Subjt:  QNLEVVYDLEKERLGFEGMDCASVAVSQGLHKK

XP_021294342.1 probable aspartyl protease At4g16563 isoform X1 [Herrania umbratica] | e value: 1.3e-288 | % identity: 67.35
Query:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS
        A+  +TVMAIKSY +QA  LV+NYLLAD  IPYTS+LGGILACK+ YDLTQ++SNFY K+Y  LTKIQRVEWNNRG+STIHAI++S +S+YFVFWSDLFS
Subjt:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS

Query:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY
        DQ+ AGLV F+SS LSTF LG+SVGYFL DLG+I+WLYPSLGGMEYV+HHSLSG+AVAY++ +GE QLYTYMVLISE+TTPEINMRWYLDTAGMKRS AY
Subjt:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY

Query:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK
        LINGVVIFFAWL+AR+LLFGY FYHVYLHYDQ  +    G  L       L                   +S+VLGL  S TS   PK   +  SRKR+ 
Subjt:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK

Query:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP
         +     D++E LR +RDGYL+SL +GTP QVIQVYMDTGSDLTWVPCGN+SFDC DCD+Y+NN L   +  F P+HSS+++RD+CGS+FC+DIHSSDN 
Subjt:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP

Query:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC
        FDPC  AGCSL+TL+K TC RPCPSF+YTYG  GLV GTLT+D + +HG+S   +R IP+F FGCVG+TYREPIGIAGFG+G+LS+PSQLGF  KGFS+C
Subjt:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC

Query:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS
        FL FK++NNPN SSPL +G++A+SS ++L+FTP+LKSP +PNYYYIGLE+IT+G   N S   V L LRE D++GNGG+LIDSGTTYTHLPEP YSQ++S
Subjt:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS

Query:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL
         L+S+++YPRA + E  TGFDLCY+VP  NN F +D F  PSITFHFLNNVS+VLPQ N FYAM+APSNST VKCLLFQ MD    GPAG+FG+FQQQN+
Subjt:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL

Query:  EVVYDLEKERLGFEGMDCASVAVSQGLHK
        +VVYDLEKER+GF+ MDCA+ A SQGLHK
Subjt:  EVVYDLEKERLGFEGMDCASVAVSQGLHK

TrEMBL top hits | e value | % identity | Alignment
A0A061GEA6 Aspartyl protease family protein | e value: 2.2e-291 | % identity: 68.04
Query:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS
        A+  + VMAIKSY SQA  LV+NYLLAD  IPYTS+LGGILACK+ YDLTQ++SNFY K+Y  LTKIQRVEWNNRG+STIHAI++S +SLYFVFWSDLFS
Subjt:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS

Query:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY
        DQ+ AGLV F+SS LSTF LG+SVGYFL DLG+I+WLYPSLGGMEYV+HHS+SG+AVAY++F+GE QLYTYMVLISE+TTPEINMRWYLDTAGMKRS AY
Subjt:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY

Query:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK
        LINGVVIFFAWL+AR+LLFGY FYHVYLHYDQ  +   +G  L       L                   +S+VLGL  S TS   PK   +  SRKR+ 
Subjt:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK

Query:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP
         +     D++E LR +RDGYL++L +GTP QVIQVYMDTGSDLTWVPCGN+SFDC DCD+Y+NN L   +  F P+HSS+++RD+CGSSFCIDIHSSDN 
Subjt:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP

Query:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC
        FDPC  AGCSL+TL+K TC RPCPSF+YTYG  GLV G LT+D + +HG+SP  +R IP+F FGCVG+TYREPIGIAGFG+G+LS+PSQLGF  KGFSHC
Subjt:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC

Query:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS
        FL FK++NNPN SSPL +G++AISS ++L+FTP+LKSP +PNYYYIGLE+IT+G   N S   V L LRE D++GNGG+LIDSGTTYTHLPEP YSQL+S
Subjt:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS

Query:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL
         L+S+++YPRA + E  TGFDLCY+VP  NN F +D F  P+ITFHFLNNVS+VLPQ N FYAM+APSNST VKCLLFQSMD    GPAG+FG+FQQQN+
Subjt:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL

Query:  EVVYDLEKERLGFEGMDCASVAVSQGLHK
        +VVYDLEKER+GF+ MDCA+ A SQGLHK
Subjt:  EVVYDLEKERLGFEGMDCASVAVSQGLHK

A0A5A7TNC9 Aspartic proteinase nepenthesin-2 | e value: 1.4e-237 | % identity: 82.35
Query:  AMGQTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEM--GNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLS
        A  QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN +S+KR+K M+   G+D+VIEPLREIRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGNLS
Subjt:  AMGQTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEM--GNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLS

Query:  FDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHG---
        FDCQDC+EYQNN+ GPKLAAFLPTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG+V G+LT+DV+F+HG   
Subjt:  FDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHG---

Query:  ----NSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK-EHLKFTPLLKSPFYPNYY
            N+ N+++++P+FCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAISSK E+L+FTPLLKSP YPNYY
Subjt:  ----NSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK-EHLKFTPLLKSPFYPNYY

Query:  YIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSI
        YIGLESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+ISYPRAK+ ELNTGFDLCYKVP K NN+ F D+ +LPSI
Subjt:  YIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSI

Query:  TFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHK
        TFHFLNNVSVVLPQGN+FYAMAAP NSTVVKCLL+QSMDG GD       GPAGIFGSFQQQNL+VVYDLEKERLGF+ MDC SVA +QGLHK
Subjt:  TFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHK

A0A6J1B4R0 probable aspartyl protease At4g16563 isoform X1 | e value: 6.1e-289 | % identity: 67.35
Query:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS
        A+  +TVMAIKSY +QA  LV+NYLLAD  IPYTS+LGGILACK+ YDLTQ++SNFY K+Y  LTKIQRVEWNNRG+STIHAI++S +S+YFVFWSDLFS
Subjt:  AMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFS

Query:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY
        DQ+ AGLV F+SS LSTF LG+SVGYFL DLG+I+WLYPSLGGMEYV+HHSLSG+AVAY++ +GE QLYTYMVLISE+TTPEINMRWYLDTAGMKRS AY
Subjt:  DQRHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAY

Query:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK
        LINGVVIFFAWL+AR+LLFGY FYHVYLHYDQ  +    G  L       L                   +S+VLGL  S TS   PK   +  SRKR+ 
Subjt:  LINGVVIFFAWLVARILLFGYTFYHVYLHYDQK-KGEAMGQTLANPKTKFL------------------KDSLVLGLVHSRTSLLTPKRGYNSLSRKRIK

Query:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP
         +     D++E LR +RDGYL+SL +GTP QVIQVYMDTGSDLTWVPCGN+SFDC DCD+Y+NN L   +  F P+HSS+++RD+CGS+FC+DIHSSDN 
Subjt:  PMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNP

Query:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC
        FDPC  AGCSL+TL+K TC RPCPSF+YTYG  GLV GTLT+D + +HG+S   +R IP+F FGCVG+TYREPIGIAGFG+G+LS+PSQLGF  KGFS+C
Subjt:  FDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHC

Query:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS
        FL FK++NNPN SSPL +G++A+SS ++L+FTP+LKSP +PNYYYIGLE+IT+G   N S   V L LRE D++GNGG+LIDSGTTYTHLPEP YSQ++S
Subjt:  FLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS

Query:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL
         L+S+++YPRA + E  TGFDLCY+VP  NN F +D F  PSITFHFLNNVS+VLPQ N FYAM+APSNST VKCLLFQ MD    GPAG+FG+FQQQN+
Subjt:  NLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNL

Query:  EVVYDLEKERLGFEGMDCASVAVSQGLHK
        +VVYDLEKER+GF+ MDCA+ A SQGLHK
Subjt:  EVVYDLEKERLGFEGMDCASVAVSQGLHK

A0A6J1EHM1 probable aspartyl protease At4g16563 | e value: 2.2e-283 | % identity: 99.16
Query:  GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF
        GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSL  KRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF
Subjt:  GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF

Query:  DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN
        DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN
Subjt:  DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN

Query:  SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI
        SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTP LKSPFYPNYYYIGLESITI
Subjt:  SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI

Query:  GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV
        GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV
Subjt:  GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV

Query:  VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE
        VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFE MDCASVAVSQGLHKKE
Subjt:  VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE

A0A6J1KLG7 probable aspartyl protease At4g16563 | e value: 2.1e-281 | % identity: 98.54
Query:  GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF
        GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMG+DDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF
Subjt:  GEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSF

Query:  DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN
        DCQDCDEYQNNVLGPKLAAFLPTHSSTSIR+TCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSFSYTYGASGLVIGTLTKD IFIHGNSPN
Subjt:  DCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPN

Query:  SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI
        SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNP FSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI
Subjt:  SSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITI

Query:  GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV
        GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS LESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV
Subjt:  GNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV

Query:  VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE
        VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFE MDCASVAVSQGLHKKE
Subjt:  VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE

SwissProt top hits | e value | % identity | Alignment
Q3EBM5 Probable aspartic protease At2g35615 | e value: 4.7e-28 | % identity: 31.21
Query:  SLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCI
        S+SR R    ++   D+   L      + MS+T+GTPP  +    DTGSDLTWV C      CQ C  Y+ N  GP    F    SST   + C S  C 
Subjt:  SLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCI

Query:  DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC---VGATYREP-IGIAGFGRGLLSLPS
         + S++         GC  +  +       C  + Y+YG      G +  + + I   S  S    P   FGC    G T+ E   GI G G G LSL S
Subjt:  DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC---VGATYREP-IGIAGFGRGLLSLPS

Query:  QLGFS-HKGFSHCFLPFKFSNNPNFSSPLILGNLAI----SSKEHLKFTPLL-KSPFYPNYYYIGLESITIGNGE-NYSRFGVSLQLREIDTKGNGGILI
        QLG S  K FS+C L  K S   N +S + LG  +I    S    +  TPL+ K P    YYY+ LE+I++G  +  Y+    +     I ++ +G I+I
Subjt:  QLGFS-HKGFSHCFLPFKFSNNPNFSSPLILGNLAI----SSKEHLKFTPLL-KSPFYPNYYYIGLESITIGNGE-NYSRFGVSLQLREIDTKGNGGILI

Query:  DSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSM
        DSGTT T L    + +  S +E  ++  + +  +       C+K         S E  LP IT HF     V L   N+F  +     S  + CL     
Subjt:  DSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSM

Query:  DGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCAS
                 I+G+F Q +  V YDLE   + F+ MDC++
Subjt:  DGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCAS

Q766C2 Aspartic proteinase nepenthesin-2 | e value: 7.4e-34 | % identity: 28.15
Query:  KRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCG
        KRG   +  + I  M   +  +  P+      YLM++ +GTP       MDTGSDL W  C      C  C      +       F P  SS+     C 
Subjt:  KRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCG

Query:  SSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC----VGATYREPIGIAGFGRGL
        S +C D+     P + C    C                ++Y YG      G +  +      +S      +P   FGC     G       G+ G G G 
Subjt:  SSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC----VGATYREPIGIAGFGRGL

Query:  LSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDS
        LSLPSQLG     FS+C   +  S+     S L LG+ A    E    T L+ S   P YYYI L+ IT+G G+N    G+     ++   G GG++IDS
Subjt:  LSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDS

Query:  GTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDG
        GTT T+LP+  Y+ +       I+ P   E   ++G   C++ P   +T      ++P I+  F   V  +  Q      + +P+   +  CL   S   
Subjt:  GTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDG

Query:  DGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCAS
         G     IFG+ QQQ  +V+YDL+   + F    C +
Subjt:  DGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCAS

Q766C3 Aspartic proteinase nepenthesin-1 | e value: 2.6e-34 | % identity: 30.15
Query:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLM+L++GTP Q     MDTGSDL W         CQ C +  N         F P  SS+     C S  C                      L   TC
Subjt:  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
              ++Y YG      G++  + +         S  IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S   N    
Subjt:  PRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGE---NYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAK
        L+LG+LA S       T L++S   P +YYI L  +++G+     + S F ++         G GGI+IDSGTT T+     Y  +     S I+ P   
Subjt:  LILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGE---NYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAK

Query:  EHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLG
         +  ++GFDLC++ P           ++P+   HF +   + LP  N F    +PSN  +  CL      G       IFG+ QQQN+ VVYD     + 
Subjt:  EHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLG

Query:  FEGMDCAS
        F    C +
Subjt:  FEGMDCAS

Q940R4 Probable aspartyl protease At4g16563 | e value: 2.2e-54 | % identity: 31.62
Query:  ILLFGY-TFYHVYLHYDQKKGEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVI
        ++ F Y T    Y H+          +L+ P    L  SL      S    L       S +R R    +     +  P+    D YL+SL++G+    +
Subjt:  ILLFGY-TFYHVYLHYDQKKGEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVI

Query:  QVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFSYTY
         +Y+DTGSDL W PC    F C  C   ++  L P   + L   SS++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y Y
Subjt:  QVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFSYTY

Query:  GASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA---
        G  G ++  L  D + +       S  +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+     SPLILG      
Subjt:  GASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPNFSSPLILGNLA---

Query:  ----------------ISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLI
                           K    FT +L++P +P +Y + L+ I+IG             LR ID  G GG+++DSGTT+T LP   Y+ ++   +S +
Subjt:  ----------------ISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLI

Query:  S--YPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFL-NNVSVVLPQGNSFYAMA----APSNSTVVKCLLFQSMDGDGD---GPAGIFGSFQ
           + RA   E ++G   CY   Y N T      ++P++  HF  N  SV LP+ N FY              + CL+  +   + +   G   I G++Q
Subjt:  S--YPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFL-NNVSVVLPQGNSFYAMA----APSNSTVVKCLLFQSMDGDGD---GPAGIFGSFQ

Query:  QQNLEVVYDLEKERLGFEGMDCASV
        QQ  EVVYDL   R+GF    CAS+
Subjt:  QQNLEVVYDLEKERLGFEGMDCASV

Q9LNJ3 Aspartyl protease family protein 2 | e value: 3.7e-33 | % identity: 29.11
Query:  NDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCT
        +  V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S         
Subjt:  NDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCT

Query:  IAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG--FSHK
         AGC+     + TC      +  +YG     +G  + + +    N      ++     GC        VGA      G+ G G+G LS P Q G  F+ K
Subjt:  IAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG--FSHK

Query:  GFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRF-GVSLQLREIDTKGNGGILIDSGTTYTHLPEPL
         FS+C +    S+ P   S ++ GN A+S     +FTPLL +P    +YY+GL  I++G     +R  GV+  L ++D  GNGG++IDSGT+ T L  P 
Subjt:  GFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRF-GVSLQLREIDTKGNGGILIDSGTTYTHLPEPL

Query:  YSQLISNLE-SLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFG
        Y  +         +  RA +  L   FD C+ +   N      E ++P++  HF     V LP  N  Y +   +N     C  F    G       I G
Subjt:  YSQLISNLE-SLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFG

Query:  SFQQQNLEVVYDLEKERLGFEGMDCA
        + QQQ   VVYDL   R+GF    CA
Subjt:  SFQQQNLEVVYDLEKERLGFEGMDCA

Arabidopsis top hits | e value | % identity | Alignment
AT1G31300.1 TRAM, LAG1 and CLN8 (TLC) lipid-sensing domain containing protein | e value: 1.8e-96 | % identity: 75.22
Query:  SLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQ
        SL+T+ AIKSY  QA  LV+NYLLAD  IPYTSVL GI  CK+VYDL   +SN + K+Y+ LTKIQR+EWNNRG+ST+HAI+IS MSLYFVFWSDLFSD+
Subjt:  SLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQ

Query:  RHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLI
         H  LV F+SS LS+  LGIS+GYFLADLG+I W YPSLGG+EY+VHHSLSG+AVAYS+FSGEGQLYTYMVLISEITTPEIN+RWYLDTAGMK+S AY++
Subjt:  RHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLI

Query:  NGVVIFFAWLVARILLFGYTFYHVYLHYDQ
        NGV IF AWLVARILLF Y FYHVYLHY+Q
Subjt:  NGVVIFFAWLVARILLFGYTFYHVYLHYDQ
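
The unlabeled middle line of each alignment block looks like a BLAST-style midline, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Under that assumption, the sketch below counts identities for a single block; `midline_stats` is a hypothetical helper, not part of the database. The % identity reported for a hit is computed over the whole alignment, so a single block will generally give a different figure.

```python
def midline_stats(midline: str) -> dict:
    """Summarise one BLAST-style midline (letters = identical, '+' = conservative)."""
    aligned = len(midline)                            # columns in this block
    identical = sum(ch.isalpha() for ch in midline)   # letters mark identities
    conservative = midline.count("+")                 # '+' marks similar residues
    pct = round(100.0 * identical / aligned, 2) if aligned else 0.0
    return {
        "aligned": aligned,
        "identical": identical,
        "positives": identical + conservative,
        "pct_identity": pct,
    }


# Final block of the AT1G31300.1 alignment shown above.
query = "NGVVIFFAWLVARILLFGYTFYHVYLHYDQ"
midline = "NGV IF AWLVARILLF Y FYHVYLHY+Q"

stats = midline_stats(midline)
print(stats)
```

To approximate the per-hit figure, the midlines of all blocks of one hit would be concatenated before calling the helper, keeping any trailing spaces so that column counts stay correct.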

AT1G31300.2 TRAM, LAG1 and CLN8 (TLC) lipid-sensing domain containing protein | e value: 1.8e-96 | % identity: 75.22
Query:  SLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQ
        SL+T+ AIKSY  QA  LV+NYLLAD  IPYTSVL GI  CK+VYDL   +SN + K+Y+ LTKIQR+EWNNRG+ST+HAI+IS MSLYFVFWSDLFSD+
Subjt:  SLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQ

Query:  RHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLI
         H  LV F+SS LS+  LGIS+GYFLADLG+I W YPSLGG+EY+VHHSLSG+AVAYS+FSGEGQLYTYMVLISEITTPEIN+RWYLDTAGMK+S AY++
Subjt:  RHAGLVTFQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLI

Query:  NGVVIFFAWLVARILLFGYTFYHVYLHYDQ
        NGV IF AWLVARILLF Y FYHVYLHY+Q
Subjt:  NGVVIFFAWLVARILLFGYTFYHVYLHYDQ

AT4G19645.1 TRAM, LAG1 and CLN8 (TLC) lipid-sensing domain containing protein    1.8e-91    73.01
Query:  MAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQRHAGL
        M IKSYQ+QA+  VE+YLLAD  +PYTSVL GI  CKLVYDLT++ S+ + KSY  LTKI+R+EWNNRG+ST+HAI+IS M+LYF F+SDLFSDQR    
Subjt:  MAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQRHAGL

Query:  VT-FQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLINGVV
        +T F++S LSTF LG+SVGYFLADLG+I WLYPSLGG EY++HH LSG AVAYS+FSGE QLYTYMVLISE+TTPEIN+RWYLD AG+KRS AYL+NGV 
Subjt:  VT-FQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLINGVV

Query:  IFFAWLVARILLFGYTFYHVYLHYDQ
        IFFAWL ARILLF Y FYHVY HYDQ
Subjt:  IFFAWLVARILLFGYTFYHVYLHYDQ

AT4G19645.2 TRAM, LAG1 and CLN8 (TLC) lipid-sensing domain containing protein    1.8e-91    73.01
Query:  MAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQRHAGL
        M IKSYQ+QA+  VE+YLLAD  +PYTSVL GI  CKLVYDLT++ S+ + KSY  LTKI+R+EWNNRG+ST+HAI+IS M+LYF F+SDLFSDQR    
Subjt:  MAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQRHAGL

Query:  VT-FQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLINGVV
        +T F++S LSTF LG+SVGYFLADLG+I WLYPSLGG EY++HH LSG AVAYS+FSGE QLYTYMVLISE+TTPEIN+RWYLD AG+KRS AYL+NGV 
Subjt:  VT-FQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLINGVV

Query:  IFFAWLVARILLFGYTFYHVYLHYDQ
        IFFAWL ARILLF Y FYHVY HYDQ
Subjt:  IFFAWLVARILLFGYTFYHVYLHYDQ

AT5G45120.1 Eukaryotic aspartyl protease family protein    2.5e-170    63.28
Query:  LVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAA
        LVL L  S  SL TPK    S +++RIK      D V+EPLRE+RDGYL++L +GTPPQ +QVY+DTGSDLTWVPCGNLSFDC +C + +NN L    + 
Subjt:  LVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAA

Query:  FLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYRE
        F P HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPSF+YTYG  GL+ G LT+D++         +R +P+F FGCV +TYRE
Subjt:  FLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYRE

Query:  PIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLRE
        PIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG   L+I+  + L+FTP+L +P YPN YYIGLESITIG   N +   V L LR+
Subjt:  PIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG--NLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLRE

Query:  IDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFEL----PSITFHFLNNVSVVLPQGNSFYAMAA
         D++GNGG+L+DSGTTYTHLPEP YSQL++ L+S I+YPRA E E  TGFDLCYKVP  NN   S E ++    PSITFHFLNN +++LPQGNSFYAM+A
Subjt:  IDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFEL----PSITFHFLNNVSVVLPQGNSFYAMAA

Query:  PSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHK
        PS+ +VV+CLLFQ+M+    GPAG+FGSFQQQN++VVYDLEKER+GF+ MDC   A S GL++
Subjt:  PSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHK
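
As a rough illustration of how the % identity column relates to the alignments above, the sketch below counts identical columns from the middle (match) line of one Query/match/Subjt block. It assumes the usual BLAST convention that a letter on the match line marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. The function name `percent_identity` is hypothetical and not part of CuGenDB. The Subjt rows on this page repeat the query text, so the match line is the only usable signal here; the value reported for a hit covers all of its blocks, not a single one.

```python
def percent_identity(query_line: str, match_line: str) -> float:
    """Percent of alignment columns whose match-line character is a letter.

    query_line: the aligned query residues of one block (gaps shown as '-').
    match_line: the line between Query and Subjt for the same block.
    """
    columns = len(query_line)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Trailing blanks on the match line are often lost in text dumps,
    # so pad (or trim) it to the width of the query line.
    match_line = match_line.ljust(columns)[:columns]
    # Letters mark identities; '+' and blanks are not counted.
    identical = sum(1 for ch in match_line if ch.isalpha())
    return round(100.0 * identical / columns, 2)


if __name__ == "__main__":
    # Final block of the AT1G31300.1 alignment listed above.
    query = "NGVVIFFAWLVARILLFGYTFYHVYLHYDQ"
    match = "NGV IF AWLVARILLF Y FYHVYLHY+Q"
    print(percent_identity(query, match))
```

To approximate the figure reported for a whole hit, the query and match lines of all its blocks would be concatenated before calling the function.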


Sequences
CDS sequence
ATGGCAATGTCTTTGAAGACCGTAATGGCTATTAAATCTTACCAAAGTCAAGCTGATGCATTAGTAGAGAACTACTTATTAGCAGACCGTTTGATCCCGTACACCTCTGT
CCTTGGAGGCATACTTGCTTGTAAATTGGTCTATGATCTTACTCAAGTAGTTAGTAATTTTTACTTCAAGAGTTATCTGGGTCTCACGAAAATCCAACGAGTCGAGTGGA
ATAACCGCGGCATGTCCACTATTCATGCAATCTATATCTCAATTATGTCATTGTACTTCGTTTTCTGGTCAGATCTCTTCTCTGACCAGCGTCATGCTGGCCTCGTTACC
TTTCAAAGTTCAACGTTGTCTACTTTCGTATTGGGGATTTCAGTTGGATACTTCTTGGCTGATCTTGGATTGATTGTTTGGCTGTATCCTTCTTTAGGTGGGATGGAGTA
TGTGGTCCACCACTCTCTTTCTGGACTAGCAGTAGCATATTCTGTTTTTTCTGGAGAAGGGCAACTCTACACGTACATGGTCCTCATTTCGGAGATTACGACTCCCGAGA
TTAATATGAGATGGTATCTTGACACAGCTGGTATGAAGAGGTCCTGTGCATATCTGATTAATGGCGTTGTAATATTTTTTGCATGGCTGGTTGCTCGCATACTGCTGTTT
GGTTACACATTCTATCATGTTTACCTGCACTATGATCAGAAGAAAGGCGAAGCTATGGGACAAACCCTAGCAAACCCTAAAACCAAATTCCTTAAAGATTCTCTAGTTCT
TGGTCTTGTTCATTCAAGAACTTCCCTCCTAACCCCAAAAAGAGGCTATAATTCCCTTTCAAGGAAGAGAATTAAGCCAATGGAAATGGGTAATGATGATGTGATAGAGC
CATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGACCTTACATGGGTACCTTGT
GGGAACCTCTCATTTGATTGCCAAGATTGTGATGAGTATCAAAACAATGTTTTAGGTCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCCATTAGAGACAC
TTGTGGGAGCTCCTTTTGCATTGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTAAAGGGCACTTGCCCTAGAC
CATGCCCTTCATTCTCTTACACTTATGGGGCTAGTGGCCTTGTAATTGGAACCCTAACTAAAGATGTCATTTTTATCCATGGAAATTCCCCAAATTCCTCAAGAAAAATC
CCTAAATTTTGCTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGGATTGCTGGCTTTGGTAGAGGCCTTCTTTCTTTACCTTCTCAATTAGGGTTTTCTCATAA
GGGTTTCTCTCATTGTTTCTTGCCCTTTAAATTCTCAAATAACCCTAATTTTTCAAGCCCTTTGATTCTTGGTAATCTAGCTATTTCTTCTAAAGAACATTTGAAATTCA
CCCCTTTGTTGAAAAGTCCATTTTACCCTAATTATTACTATATTGGGCTTGAGTCAATCACTATTGGAAATGGTGAAAATTACTCTAGATTTGGAGTTTCCTTGCAATTG
AGAGAGATTGACACAAAGGGTAATGGTGGAATTTTGATTGATTCTGGTACTACTTATACTCATTTACCAGAACCATTATATTCACAGCTTATTTCAAATCTTGAGTCATT
AATAAGCTATCCAAGAGCTAAAGAACATGAACTCAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTATAAAAACAACACCTTTTTTAGTGATGAATTTGAGCTTCCTT
CTATAACATTTCATTTTTTAAACAATGTTAGTGTTGTTTTGCCTCAAGGGAACAGTTTTTATGCCATGGCTGCTCCTAGTAACTCCACTGTTGTGAAATGCTTGCTGTTT
CAAAGCATGGACGGCGACGGAGATGGGCCGGCGGGCATTTTCGGGAGCTTTCAACAGCAAAATTTGGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTGAAGG
AATGGATTGTGCTTCTGTTGCTGTGTCTCAAGGACTTCATAAGAAGGAATGA
mRNA sequence
AAAATCGTCGGCTAAGGCGCGAAGTTATCATATTCCTTTGGTTACTACTGCAAGGATCGGATAGAATTCGATTCAATCCACAAGAACACGACGATCCCAGCTCTGGATAC
TCAAATCCATTGCTGATTACCAAAACCCACTTCCCTTATCGCTCTTATAATTCCTAATTTCTCTTCGATTTTTTGGGTTGCGGTTGGAGATTCCTAACGCTTCACATTGA
TTGAACTTCACGATTCTTTGAACCCATCGGCTGTTGCATTTCCTTTACAGATTGTTTTAATTTGTTTTGTTCCTATTTTTTGTTATCTTTCTAAGAGGAAAAACCGCTTC
GGCATCTCCTCGTGATCTGGGGGCTTCATTTATGAGGGATAACAACATGGAGCGGGGAGCAAATAGTTAATTCAGTCGAAGGCAGCGATCTGCCGAGTGTTATGTAGTCA
AACTACACAATGGCAATGTCTTTGAAGACCGTAATGGCTATTAAATCTTACCAAAGTCAAGCTGATGCATTAGTAGAGAACTACTTATTAGCAGACCGTTTGATCCCGTA
CACCTCTGTCCTTGGAGGCATACTTGCTTGTAAATTGGTCTATGATCTTACTCAAGTAGTTAGTAATTTTTACTTCAAGAGTTATCTGGGTCTCACGAAAATCCAACGAG
TCGAGTGGAATAACCGCGGCATGTCCACTATTCATGCAATCTATATCTCAATTATGTCATTGTACTTCGTTTTCTGGTCAGATCTCTTCTCTGACCAGCGTCATGCTGGC
CTCGTTACCTTTCAAAGTTCAACGTTGTCTACTTTCGTATTGGGGATTTCAGTTGGATACTTCTTGGCTGATCTTGGATTGATTGTTTGGCTGTATCCTTCTTTAGGTGG
GATGGAGTATGTGGTCCACCACTCTCTTTCTGGACTAGCAGTAGCATATTCTGTTTTTTCTGGAGAAGGGCAACTCTACACGTACATGGTCCTCATTTCGGAGATTACGA
CTCCCGAGATTAATATGAGATGGTATCTTGACACAGCTGGTATGAAGAGGTCCTGTGCATATCTGATTAATGGCGTTGTAATATTTTTTGCATGGCTGGTTGCTCGCATA
CTGCTGTTTGGTTACACATTCTATCATGTTTACCTGCACTATGATCAGAAGAAAGGCGAAGCTATGGGACAAACCCTAGCAAACCCTAAAACCAAATTCCTTAAAGATTC
TCTAGTTCTTGGTCTTGTTCATTCAAGAACTTCCCTCCTAACCCCAAAAAGAGGCTATAATTCCCTTTCAAGGAAGAGAATTAAGCCAATGGAAATGGGTAATGATGATG
TGATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGACCTTACATGG
GTACCTTGTGGGAACCTCTCATTTGATTGCCAAGATTGTGATGAGTATCAAAACAATGTTTTAGGTCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCCAT
TAGAGACACTTGTGGGAGCTCCTTTTGCATTGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTAAAGGGCACTT
GCCCTAGACCATGCCCTTCATTCTCTTACACTTATGGGGCTAGTGGCCTTGTAATTGGAACCCTAACTAAAGATGTCATTTTTATCCATGGAAATTCCCCAAATTCCTCA
AGAAAAATCCCTAAATTTTGCTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGGATTGCTGGCTTTGGTAGAGGCCTTCTTTCTTTACCTTCTCAATTAGGGTT
TTCTCATAAGGGTTTCTCTCATTGTTTCTTGCCCTTTAAATTCTCAAATAACCCTAATTTTTCAAGCCCTTTGATTCTTGGTAATCTAGCTATTTCTTCTAAAGAACATT
TGAAATTCACCCCTTTGTTGAAAAGTCCATTTTACCCTAATTATTACTATATTGGGCTTGAGTCAATCACTATTGGAAATGGTGAAAATTACTCTAGATTTGGAGTTTCC
TTGCAATTGAGAGAGATTGACACAAAGGGTAATGGTGGAATTTTGATTGATTCTGGTACTACTTATACTCATTTACCAGAACCATTATATTCACAGCTTATTTCAAATCT
TGAGTCATTAATAAGCTATCCAAGAGCTAAAGAACATGAACTCAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTATAAAAACAACACCTTTTTTAGTGATGAATTTG
AGCTTCCTTCTATAACATTTCATTTTTTAAACAATGTTAGTGTTGTTTTGCCTCAAGGGAACAGTTTTTATGCCATGGCTGCTCCTAGTAACTCCACTGTTGTGAAATGC
TTGCTGTTTCAAAGCATGGACGGCGACGGAGATGGGCCGGCGGGCATTTTCGGGAGCTTTCAACAGCAAAATTTGGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGG
GTTTGAAGGAATGGATTGTGCTTCTGTTGCTGTGTCTCAAGGACTTCATAAGAAGGAATGA
Protein sequence
MAMSLKTVMAIKSYQSQADALVENYLLADRLIPYTSVLGGILACKLVYDLTQVVSNFYFKSYLGLTKIQRVEWNNRGMSTIHAIYISIMSLYFVFWSDLFSDQRHAGLVT
FQSSTLSTFVLGISVGYFLADLGLIVWLYPSLGGMEYVVHHSLSGLAVAYSVFSGEGQLYTYMVLISEITTPEINMRWYLDTAGMKRSCAYLINGVVIFFAWLVARILLF
GYTFYHVYLHYDQKKGEAMGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPC
GNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKI
PKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQL
REIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLF
QSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE
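
The CDS and protein sequences listed above are related by translation, and a short sketch can be used to check one against the other. It assumes the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 nucleotides of the CDS listed above.
    print(translate("ATGGCAATGTCTTTGAAGACC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence gives a quick consistency check; the same CDS also appears inside the mRNA sequence, preceded by the 5' untranslated region.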