CuGenDBv2

MC01g0666 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0666
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: MC01:12885335..12887443
RNA-Seq Expression: MC01g0666
Synteny: MC01g0666
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
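
The genome location field has the form `chrom:start..end`. As a minimal sketch of how it could be read programmatically, the Python snippet below splits the string into its parts and computes the span. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC01:12885335..12887443")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```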


Homology
GenBank top hits (e value, % identity, alignment)
KAG7034471.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 7.09e-251; % identity 74.89
Query:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLK-NRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS
        MEF  IPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HLK  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS
Subjt:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLK-NRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS

Query:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET
         VFDTGSSLVWFPCTA Y CS CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD CPGYGIQYGSG TAGFLLSET
Subjt:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET

Query:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY
        LD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Subjt:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY

Query:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA
        L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KYPRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LA
Subjt:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA

Query:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        LPP+NY ALVA++ VVC+TM+TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Subjt:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

XP_011657732.1 probable aspartyl protease At4g16563 [Cucumis sativus]; e value 6.04e-249; % identity 75
Query:  MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVF
        MEFLPIPFLFSIFLLL TSSSSS T LPLT FPS    DP+K +N L SAS+ RA HLK     S+   ++ S L PRSYGAYSVS+ FGTPPQNLSF+F
Subjt:  MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVF

Query:  DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDL
        DTGSSLVWFPCTA Y CSRCSFP V+ ATI+KF+PKLSSS ++VGC N KCAWIF PN++SRCRNC   SR CSD+CPGYG+QYGSG TAG LLSETLDL
Subjt:  DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDL

Query:  PDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL
         +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKRFS+CL SR FDDSPVSSPLVLD GS+S ++ T   IY+PFRENPS S+AAFREYYYLSL
Subjt:  PDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL

Query:  RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALP
        RRILIGGKPVKFPYKYL PDSTGNGG IIDSGSTFT LDKPIFEA+A+E EKQL+KYPRA  VEA+SGLRPCFN+ KE+ + EFP++VLKFKGG +L+L 
Subjt:  RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALP

Query:  PANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
          NY A+V + GVVC+TM+TD+  VGG   GGGPAIILGAFQQQN+LVEYDLAK RIGFRKQ+C
Subjt:  PANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

XP_022925946.1 probable aspartyl protease At4g16563 [Cucurbita moschata]; e value 3.52e-251; % identity 74.89
Query:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS
        MEF  IPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HLK  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS
Subjt:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS

Query:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET
         VFDTGSSLVWFPCTA Y CS CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD CPGYGIQYGSG TAGFLLSET
Subjt:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET

Query:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY
        LD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Subjt:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY

Query:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA
        L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KYPRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LA
Subjt:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA

Query:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        LPP+NY ALVA++ VVC+TM+TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Subjt:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

XP_022979057.1 probable aspartyl protease At4g16563 [Cucurbita maxima]; e value 1.11e-254; % identity 75.97
Query:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS
        MEF PI FL SI LLLS SSSSS   +TLPLTAFPS     PWKN+ +L SAS+ RA HLK  + KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS
Subjt:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS

Query:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET
         VFDTGSSLVWFPCTA Y CS CSFPNV+ ATI KFIPKLSSSARI+GC NRKC+WIF PN++S CR+C+P SR CSD CPGYGIQYGSG TAGFLLSET
Subjt:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET

Query:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY
        LD P+KRVPDFLVGCSVLSVHQPAGI GFGRGP+SLPSQM LKRFS+CL  RQFDDSPVSSPLVLD   +SGD+ TN LIY+PFRENPS S+AAFREYYY
Subjt:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY

Query:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA
        L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KYPRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LA
Subjt:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA

Query:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        LPPANY ALV ++GVVC+TM+TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Subjt:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

XP_023543736.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]; e value 4.52e-254; % identity 75.75
Query:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS
        MEF PIPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HLK  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS
Subjt:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS

Query:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET
         VFDTGSSLVWFPCTA Y CS CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN++S CR+C+P SR CSD CPGYGIQYGSG TAGFLLSET
Subjt:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET

Query:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY
        LD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Subjt:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY

Query:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA
        L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KYPRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LA
Subjt:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA

Query:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        LPPANY ALV ++GVVC+TM+TD   +GG   GGGPAII GAFQQQN+LV+YDLAKDRIGFRKQRC
Subjt:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KHK2 Peptidase A1 domain-containing protein; e value 2.92e-249; % identity 75
Query:  MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVF
        MEFLPIPFLFSIFLLL TSSSSS T LPLT FPS    DP+K +N L SAS+ RA HLK     S+   ++ S L PRSYGAYSVS+ FGTPPQNLSF+F
Subjt:  MEFLPIPFLFSIFLLLSTSSSSSIT-LPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVF

Query:  DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDL
        DTGSSLVWFPCTA Y CSRCSFP V+ ATI+KF+PKLSSS ++VGC N KCAWIF PN++SRCRNC   SR CSD+CPGYG+QYGSG TAG LLSETLDL
Subjt:  DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDL

Query:  PDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL
         +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKRFS+CL SR FDDSPVSSPLVLD GS+S ++ T   IY+PFRENPS S+AAFREYYYLSL
Subjt:  PDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSL

Query:  RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALP
        RRILIGGKPVKFPYKYL PDSTGNGG IIDSGSTFT LDKPIFEA+A+E EKQL+KYPRA  VEA+SGLRPCFN+ KE+ + EFP++VLKFKGG +L+L 
Subjt:  RRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALP

Query:  PANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
          NY A+V + GVVC+TM+TD+  VGG   GGGPAIILGAFQQQN+LVEYDLAK RIGFRKQ+C
Subjt:  PANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

A0A1S3CHV2 aspartic proteinase nepenthesin-2-like; e value 1.55e-246; % identity 73.59
Query:  MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFD
        MEFLPIPFLFSIFLLL TSSSSSITLPL  FPS    DP K +N+L SAS+ RA HLK+    S+   ++ S L PRSYGAY+VS+ FGTPPQNLSF+FD
Subjt:  MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFD

Query:  TGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDLP
        TGSSLVWFPCTA Y C+ CSFP+V+ ATI+KF+PKLSSS +IVGC N KCAWIF PN++SRCRNC P SR CSD+CPGYGIQYGSG TAG LLSETLDL 
Subjt:  TGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDLP

Query:  DKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR
        +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKRFS+CL  R FDDSPVSSPLVLD G +S ++ T   IY+PF+ENPS S+ AFREYYYLSLR
Subjt:  DKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR

Query:  RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPP
        RILIGGKPVKFPYKYL PDSTG GG IIDSGSTFT LDKPIFEA+A E EKQL+KYPRA  +EA++GLRPCFN+SKE+ + EFPE+ LKFKGG +L+LPP
Subjt:  RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPP

Query:  ANYFALVAESGVVCMTMLTD-DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
         NY  +V ++ VVC+TM+T+ +V G  VGGGPAII GAFQQQN+LVEYDLAK RIGFRKQ+C
Subjt:  ANYFALVAESGVVCMTMLTD-DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

A0A5A7SGF9 Aspartic proteinase nepenthesin-2-like; e value 1.55e-246; % identity 73.59
Query:  MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFD
        MEFLPIPFLFSIFLLL TSSSSSITLPL  FPS    DP K +N+L SAS+ RA HLK+    S+   ++ S L PRSYGAY+VS+ FGTPPQNLSF+FD
Subjt:  MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFD

Query:  TGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDLP
        TGSSLVWFPCTA Y C+ CSFP+V+ ATI+KF+PKLSSS +IVGC N KCAWIF PN++SRCRNC P SR CSD+CPGYGIQYGSG TAG LLSETLDL 
Subjt:  TGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDLP

Query:  DKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR
        +KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQMRLKRFS+CL  R FDDSPVSSPLVLD G +S ++ T   IY+PF+ENPS S+ AFREYYYLSLR
Subjt:  DKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLR

Query:  RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPP
        RILIGGKPVKFPYKYL PDSTG GG IIDSGSTFT LDKPIFEA+A E EKQL+KYPRA  +EA++GLRPCFN+SKE+ + EFPE+ LKFKGG +L+LPP
Subjt:  RILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEK-TVEFPELVLKFKGGLELALPP

Query:  ANYFALVAESGVVCMTMLTD-DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
         NY  +V ++ VVC+TM+T+ +V G  VGGGPAII GAFQQQN+LVEYDLAK RIGFRKQ+C
Subjt:  ANYFALVAESGVVCMTMLTD-DVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

A0A6J1EDJ0 probable aspartyl protease At4g16563; e value 1.70e-251; % identity 74.89
Query:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS
        MEF  IPFL SI LLLS SSSSS   +TLPLT FPS     PWKN+ +L SAS+ RA HLK  R KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS
Subjt:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS

Query:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET
         VFDTGSSLVWFPCTA Y CS CSFPNV+ ATI KFIPKLSSSA+I+GC NRKC+WIF PN+++ CR+C+P SR CSD CPGYGIQYGSG TAGFLLSET
Subjt:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET

Query:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY
        LD P+KRVPDFLVGCSV+SVHQPAGI GFGRGP+SLPSQM LKRFS+CL  RQFDDSPVSSPLVLD  S+SG++  N LIY+PFRENPS S+AAFREYYY
Subjt:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY

Query:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA
        L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KYPRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LA
Subjt:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA

Query:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        LPP+NY ALVA++ VVC+TM+TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Subjt:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

A0A6J1IMR7 probable aspartyl protease At4g16563; e value 5.38e-255; % identity 75.97
Query:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS
        MEF PI FL SI LLLS SSSSS   +TLPLTAFPS     PWKN+ +L SAS+ RA HLK  + KS+  +     AL PRSYGAYS+S+ FGTPPQ+LS
Subjt:  MEFLPIPFLFSIFLLLSTSSSSS---ITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKN-RNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLS

Query:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET
         VFDTGSSLVWFPCTA Y CS CSFPNV+ ATI KFIPKLSSSARI+GC NRKC+WIF PN++S CR+C+P SR CSD CPGYGIQYGSG TAGFLLSET
Subjt:  FVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSET

Query:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY
        LD P+KRVPDFLVGCSVLSVHQPAGI GFGRGP+SLPSQM LKRFS+CL  RQFDDSPVSSPLVLD   +SGD+ TN LIY+PFRENPS S+AAFREYYY
Subjt:  LDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYY

Query:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA
        L+LRRILIG KPVKFPYKYL P+S GNGG IIDSGSTFT LDKPIFEAVAEE EKQL+KYPRA GVEA SGLRPCF++SKE++VEFPEL+LKFKGG  LA
Subjt:  LSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELA

Query:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        LPPANY ALV ++GVVC+TM+TD   +GG   GGGPAII GAFQQQN+LV+YDLAK+RIGFRKQRC
Subjt:  LPPANYFALVAESGVVCMTMLTDD--VGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2; e value 1.1e-35; % identity 31.65
Query:  KNRNKSSDFVHKSKSALTPRSY---GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIF
        + R +S + + +S S +    Y   G Y +++  GTP  + S + DTGS L+W  C     C++C      +     F P+ SSS   + C ++ C  + 
Subjt:  KNRNKSSDFVHKSKSALTPRSY---GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIF

Query:  DPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQ
                      S  C++    Y   YG G  T G++ +ET       VP+   GC            AG++G G GP SLPSQ+ + +FSYC+ S  
Subjt:  DPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQ

Query:  FDDSPVSSPLVLDFGSKS-----GDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEA
              SSP  L  GS +     G  +T  LI+S    NP+        YYY++L+ I +GG  +  P         G GG IIDSG+T T L +  + A
Subjt:  FDDSPVSSPLVLDFGSKS-----GDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEA

Query:  VAEEFEKQLIKYPRATGVEARSGLRPCF-NVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNIL
        VA+ F  Q I  P  T  E+ SGL  CF   S   TV+ PE+ ++F GG+ L L   N     AE GV+C+ M +    G         I G  QQQ   
Subjt:  VAEEFEKQLIKYPRATGVEARSGLRPCF-NVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNIL

Query:  VEYDLAKDRIGFRKQRC
        V YDL    + F   +C
Subjt:  VEYDLAKDRIGFRKQRC

Q766C3 Aspartic proteinase nepenthesin-1; e value 5.1e-36; % identity 30.26
Query:  GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY
        G Y +++  GTP Q  S + DTGS L+W  C     C   S P  N        P+ SSS   + C ++ C  +  P               CS+    Y
Subjt:  GAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGY

Query:  GIQYGSGL-TAGFLLSETLDLPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPV--SSPLVLDFGSKSGDTNT
           YG G  T G + +ETL      +P+   GC            AG+VG GRGP SLPSQ+ + +FSYC+       +P+  S+P  L  GS +     
Subjt:  GIQYGSGL-TAGFLLSETLDLPDKRVPDFLVGCSV----LSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPV--SSPLVLDFGSKSGDTNT

Query:  NGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKF-PYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPC
             SP         +    +YY++L  + +G   +   P  +    + G GG IIDSG+T T      +++V +EF  Q I  P   G  + SG   C
Subjt:  NGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKF-PYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPC

Query:  FNV-SKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        F   S    ++ P  V+ F GG +L LP  NYF +   +G++C+ M +   G          I G  QQQN+LV YD     + F   +C
Subjt:  FNV-SKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

Q8S9J6 Aspartyl protease family protein At5g10770; e value 4.1e-33; % identity 30.39
Query:  LASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA------YSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSS
        L  A +   H   ++  ++D V +SKS   P   G+      Y V++G GTP  +LS +FDTGS L W  C     C R  +          F P  S+S
Subjt:  LASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA------YSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSS

Query:  ARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYG-SGLTAGFLLSETLDLPDKRVPD-FLVGCSVLS---VHQPAGIVGFGRGPQSLPS
           V C +  C  +          + T N+ +CS +   YGIQYG    + GFL  E   L +  V D    GC   +       AG++G GR   S PS
Subjt:  ARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYG-SGLTAGFLLSETLDLPDKRVPD-FLVGCSVLS---VHQPAGIVGFGRGPQSLPS

Query:  QMRL---KRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSG
        Q      K FSYCL       S  S    L FGS          I    +  P ++      +Y L++  I +GG+ +  P    S       G +IDSG
Subjt:  QMRL---KRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSG

Query:  STFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTML--TDDVGGEKVGGG
        +  T L    + A+   F+ ++ KYP  +GV   S L  CF++S  KTV  P++   F GG  + L     F  V +   VC+     +DD         
Subjt:  STFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTML--TDDVGGEKVGGG

Query:  PAIILGAFQQQNILVEYDLAKDRIGFRKQRC
         A I G  QQQ + V YD A  R+GF    C
Subjt:  PAIILGAFQQQNILVEYDLAKDRIGFRKQRC

Q940R4 Probable aspartyl protease At4g16563; e value 9.0e-49; % identity 31.54
Query:  LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTG
        L  P L  +   LSTS  SS  L L    S+R           +SA   R HH + + + S           P S G+ Y +S+  G+    +S   DTG
Subjt:  LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTG

Query:  SSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCSDACPGYGIQYGSGLTAGFLL
        S LVWFPC   + C  C    +  +  +     LSSSA  V C +  C+        S      NC      T +    S  CP +   YG G     L 
Subjt:  SSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCSDACPGYGIQYGSGLTAGFLL

Query:  SETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN---------
        S++L LP   V +F  GC+  ++ +P G+ GFGRG  SLP+Q+ +        FSYCL S  FD   V  P  L  G        + G T+         
Subjt:  SETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN---------

Query:  --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGL
           N  +++   ENP         +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT+L    + +V EEF+ ++ + + RA  VE  SG+
Subjt:  --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGL

Query:  RPCFNVSKEKTVEFPELVLKFKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQR
         PC+ ++  +TV+ P LVL F G    + LP  NYF    + G        + C+ ML +     ++ GG   ILG +QQQ   V YDL   R+GF K++
Subjt:  RPCFNVSKEKTVEFPELVLKFKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQR

Query:  C
        C
Subjt:  C

Q9LNJ3 Aspartyl protease family protein 2; e value 4.8e-34; % identity 29.27
Query:  TSSSSSITLPL---TAFPSTRAPDPW---------KNLNYLAS-ASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSS
        + SSSSITL L    A  S + PD           + +  +A+ A+ I   ++ +  +   F     S L+  S G Y   +G GTP + +  V DTGS 
Subjt:  TSSSSSITLPL---TAFPSTRAPDPW---------KNLNYLAS-ASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSS

Query:  LVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKR
        +VW  C     C RC      + +   F P+ S +   + C +  C  +      +R + C             Y + YG G  T G   +ETL     R
Subjt:  LVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKR

Query:  VPDFLVGCSVLSVHQ-------PAGIVGFGRGPQSLPSQMRLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFRE
        V    +GC     H         AG++G G+G  S P Q   +   +FSYCL  R    S  S P  + FG        N  +    R  P  S+     
Subjt:  VPDFLVGCSVLSVHQ-------PAGIVGFGRGPQSLPSQMRLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFRE

Query:  YYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGG
        +YY+ L  I +GG  V          D  GNGG IIDSG++ T L +P + A+ + F        RA      S    CF++S    V+ P +VL F+G 
Subjt:  YYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGG

Query:  LELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
         +++LP  NY   V  +G  C       +GG         I+G  QQQ   V YDLA  R+GF    C
Subjt:  LELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

Arabidopsis top hits (e value, % identity, alignment)
AT1G25510.1 Eukaryotic aspartyl protease family protein; e value 9.3e-41; % identity 31.97
Query:  TPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSD
        T +  G Y   +G G P + +  V DTGS + W  CT    C+ C        T   F P  SSS   + C   +C    +    S CRN T     C  
Subjt:  TPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSD

Query:  ACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSVLS---VHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFG-SKSGD
            Y + YG G  T G   +ETL +    V +  VGC   +       AG++G G G  +LPSQ+    FSYCL  R  D     S   +DFG S S D
Subjt:  ACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSVLS---VHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFG-SKSGD

Query:  TNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLR
              + +P   N          +YYL L  I +GG+ ++ P      D +G+GG IIDSG+  T L   I+ ++ + F K  +   +A GV   +   
Subjt:  TNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLR

Query:  PCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
         C+N+S + TVE P +   F GG  LALP  NY   V   G  C+                  I+G  QQQ   V +DLA   IGF   +C
Subjt:  PCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

AT3G52500.1 Eukaryotic aspartyl protease family protein; e value 2.3e-140; % identity 53.68
Query:  FLLLSTSSSSSITLPLTAFP-STRAP-DPWKNLNYLASASIIRAHHLKNRNK---SSDFVHKS--------KSALTPRSYGAYSVSIGFGTPPQNLSFVF
        F L+  S  S++ LPL+ F  S ++P DP+ +L  LA +SI RAH LK+        D +  +        KS L+ +SYG YSVS+ FGTP Q + FVF
Subjt:  FLLLSTSSSSSITLPLTAFP-STRAP-DPWKNLNYLASASIIRAHHLKNRNK---SSDFVHKS--------KSALTPRSYGAYSVSIGFGTPPQNLSFVF

Query:  DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDL
        DTGSSLVW PCT+RYLCS C F  ++   I +FIPK SSS++I+GC + KC +++ PNV  +CR C PN+RNC+  CP Y +QYG G TAG L++E LD 
Subjt:  DTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDL

Query:  PDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGS-KSGDTNTNGLIYSPFRENPSASDAAFREYYYLS
        PD  VPDF+VGCS++S  QPAGI GFGRGP SLPSQM LKRFS+CL SR+FDD+ V++ L LD GS  +  + T GL Y+PFR+NP+ S+ AF EYYYL+
Subjt:  PDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGS-KSGDTNTNGLIYSPFRENPSASDAAFREYYYLS

Query:  LRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALP
        LRRI +G K VK PYKYL+P + G+GG+I+DSGSTFT +++P+FE VAEEF  Q+  Y R   +E  +GL PCFN+S +  V  PEL+ +FKGG +L LP
Subjt:  LRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALP

Query:  PANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
         +NYF  V  +  VC+T+++D       G GPAIILG+FQQQN LVEYDL  DR GF K++C
Subjt:  PANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

AT3G61820.1 Eukaryotic aspartyl protease family protein; e value 1.1e-41; % identity 32.18
Query:  KNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSA
        K++  LA+ S  R    +    +  F     S L+  S G Y + +G GTP  N+  V DTGS +VW  C+    C  C        T   F PK S + 
Subjt:  KNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSA

Query:  RIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQ-------PAGIVGFGRGPQSL
          V CG+R C  + D    S C   T  S+ C      Y + YG G  T G   +ETL     RV    +GC     H         AG++G GRG  S 
Subjt:  RIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSG-LTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQ-------PAGIVGFGRGPQSL

Query:  PSQMRLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTII
        PSQ + +   +FSYCL  R    S    P  + FG+ +    +   +++P   NP         +YYL L  I +GG  V          D+TGNGG II
Subjt:  PSQMRLK---RFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVK-FPYKYLSPDSTGNGGTII

Query:  DSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGG
        DSG++ T L +P + A+ + F     K  RA    + S    CF++S   TV+ P +V  F GG E++LP +NY   V   G  C               
Subjt:  DSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGG

Query:  GPAIILGAFQQQNILVEYDLAKDRIGFRKQRC
        G   I+G  QQQ   V YDL   R+GF  + C
Subjt:  GPAIILGAFQQQNILVEYDLAKDRIGFRKQRC

AT4G16563.1 Eukaryotic aspartyl protease family protein; e value 6.4e-50; % identity 31.54
Query:  LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTG
        L  P L  +   LSTS  SS  L L    S+R           +SA   R HH + + + S           P S G+ Y +S+  G+    +S   DTG
Subjt:  LPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGA-YSVSIGFGTPPQNLSFVFDTG

Query:  SSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCSDACPGYGIQYGSGLTAGFLL
        S LVWFPC   + C  C    +  +  +     LSSSA  V C +  C+        S      NC      T +    S  CP +   YG G     L 
Subjt:  SSLVWFPCTARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRS---RCRNC------TPNSRNCSDACPGYGIQYGSGLTAGFLL

Query:  SETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN---------
        S++L LP   V +F  GC+  ++ +P G+ GFGRG  SLP+Q+ +        FSYCL S  FD   V  P  L  G        + G T+         
Subjt:  SETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL------KRFSYCLASRQFDDSPVSSPLVLDFG-------SKSGDTN---------

Query:  --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGL
           N  +++   ENP         +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT+L    + +V EEF+ ++ + + RA  VE  SG+
Subjt:  --TNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIK-YPRATGVEARSGL

Query:  RPCFNVSKEKTVEFPELVLKFKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQR
         PC+ ++  +TV+ P LVL F G    + LP  NYF    + G        + C+ ML +     ++ GG   ILG +QQQ   V YDL   R+GF K++
Subjt:  RPCFNVSKEKTVEFPELVLKFKGG-LELALPPANYFALVAESG--------VVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGFRKQR

Query:  C
        C
Subjt:  C

AT5G45120.1 Eukaryotic aspartyl protease family protein; e value 1.2e-51; % identity 33.99
Query:  YSVSIGFGTPPQNLSFVFDTGSSLVWFPC-TARYLCSRC-SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWI------FDPNVRSRCRNCTPNSRNCS
        Y +++  GTPPQ +    DTGS L W PC    + C  C    N +  + + F P  SS++    C +  C  I      FDP   + C         C 
Subjt:  YSVSIGFGTPPQNLSFVFDTGSSLVWFPC-TARYLCSRC-SFPNVNTATITKFIPKLSSSARIVGCGNRKCAWI------FDPNVRSRCRNCTPNSRNCS

Query:  DACPGYGIQYG-SGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL--KRFSYCLASRQFDDSP-VSSPLVLDFGSKSGD
          CP +   YG  GL +G L  + L    + VP F  GC   +  +P GI GFGRG  SLPSQ+    K FS+C    +F ++P +SSPL+L   + S +
Subjt:  DACPGYGIQYG-SGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPAGIVGFGRGPQSLPSQMRL--KRFSYCLASRQFDDSP-VSSPLVLDFGSKSGD

Query:  TNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGK--PVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSG
          T+ L ++P    P      +   YY+ L  I IG    P + P      DS GNGG ++DSG+T+T L +P +  +    +   I YPRAT  E+R+G
Subjt:  TNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGK--PVKFPYKYLSPDSTGNGGTIIDSGSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSG

Query:  LRPCFNV----------SKEKTVEFPELVLKFKGGLELALPPAN-YFALVAES-GVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGF
           C+ V            +  + FP +   F     L LP  N ++A+ A S G V   +L  ++  E    GPA + G+FQQQN+ V YDL K+RIGF
Subjt:  LRPCFNV----------SKEKTVEFPELVLKFKGGLELALPPAN-YFALVAES-GVVCMTMLTDDVGGEKVGGGPAIILGAFQQQNILVEYDLAKDRIGF

Query:  RKQRCV
        +   CV
Subjt:  RKQRCV
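
The % identity figures reported for each hit can be roughly cross-checked against the alignment blocks. The sketch below assumes that the unlabelled middle line of each block follows the usual BLAST convention (a letter marks an identical residue, `+` marks a similar residue, a space marks a mismatch or gap). The function name `match_stats` is hypothetical and not part of CuGenDB.

```python
def match_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Count identical and similar columns in one alignment block.

    Assumes BLAST-style match lines: letter = identical residue,
    '+' = similar residue, space = mismatch or gap.
    """
    # Trailing spaces are often stripped from the match line, so pad it.
    midline = midline.ljust(len(query))
    identical = sum(1 for c in midline if c.isalpha())
    similar = midline.count("+")
    # The alignment length includes gap columns ('-') in the query line.
    return identical, similar, len(query)


# Last block of the AT5G45120.1 alignment shown above.
identical, similar, length = match_stats("RKQRCV", "+   CV")
print(identical, similar, length)
print(round(100 * identical / length, 2))
```

Summing the counts over every block of a hit and dividing the identities by the total alignment length should approximate the reported % identity, provided the whole alignment is available.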


Sequences
CDS sequence
ATGGAGTTTCTTCCAATTCCCTTTCTGTTTTCCATCTTCCTTCTTCTTTCCACTTCGTCTTCTTCCTCCATCACACTCCCCCTCACCGCCTTCCCTTCAACTCGAGCTCC
AGATCCATGGAAGAATCTCAATTACCTTGCCTCTGCTTCGATCATCAGAGCTCATCACCTCAAGAACCGAAACAAATCAAGCGATTTCGTCCATAAATCCAAATCCGCAC
TCACTCCTCGTAGCTATGGCGCTTACTCGGTTTCTATCGGCTTCGGAACTCCTCCCCAGAATTTATCGTTCGTCTTCGATACTGGAAGTAGTCTCGTGTGGTTCCCCTGC
ACCGCCCGTTATCTCTGCTCCAGGTGTTCGTTTCCCAACGTGAATACTGCGACGATTACGAAATTCATCCCGAAATTATCTTCTTCTGCGAGGATTGTCGGTTGCGGAAA
CCGGAAATGTGCTTGGATTTTTGACCCTAATGTGAGATCTAGGTGTCGAAATTGTACCCCTAATTCTCGAAATTGTTCCGATGCTTGTCCTGGTTATGGAATTCAGTACG
GTTCTGGATTAACCGCTGGATTTCTACTCTCTGAAACGCTTGATTTACCGGATAAACGAGTGCCAGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTCCACCAACCTGCC
GGCATTGTCGGATTTGGCCGCGGTCCTCAATCGTTGCCGTCGCAAATGCGCCTGAAAAGATTCTCCTACTGCCTCGCTTCTCGCCAGTTCGACGACTCGCCGGTGAGCAG
CCCTCTCGTGCTGGACTTCGGATCCAAATCCGGCGACACCAACACTAACGGCCTCATTTACTCGCCGTTCCGAGAGAATCCCTCCGCCTCCGACGCCGCATTTAGAGAAT
ACTATTACCTTTCTCTTCGCAGAATCCTCATCGGCGGAAAACCGGTGAAATTTCCGTACAAGTATCTCTCACCGGACTCCACCGGTAACGGCGGCACGATCATCGATTCC
GGCTCGACCTTTACGATTCTGGATAAGCCAATTTTCGAAGCCGTAGCGGAAGAATTCGAGAAGCAACTTATTAAATATCCCCGAGCTACCGGCGTGGAAGCTCGGTCCGG
TCTAAGGCCGTGCTTCAATGTCTCGAAGGAGAAGACGGTGGAATTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGCCTGGAACTGGCTCTGCCGCCGGCTAATTACTTCG
CGTTGGTGGCGGAGTCCGGCGTGGTGTGCATGACGATGTTGACGGATGACGTCGGCGGTGAGAAGGTCGGCGGTGGACCGGCGATTATACTCGGCGCGTTTCAGCAGCAG
AATATATTGGTGGAGTATGACTTGGCGAAGGACAGAATCGGATTTCGAAAGCAGAGATGCGTATGA
mRNA sequence
GGCTACATTACAATGATAAAGACTGATGAGTCAATAAAATTCTTCAGGATTAGAATAAAATTTTATGCTCAATAAATAAGAAACCATTACTTTATTTTCGTACGCGTTCG
TGTTTGGTTTAAATTCAAAAAAAAAAAAAAAAAGGCAAAATTTGGATCGAGCACAAATTCAAAGCATTTATGAAAACTGAAGGATGAGAAATTGAAAATACCAAGGTTAT
TTATGAAGGTGGGCCACGTGGCATGCAGTTGTCACTCGGTGGATTGCTTGCAGCGGCCACGCCATCGCCATTTTCCTCTGCTTCCATCAACGACTGAACAGAGCATCCAT
TAGCTTCAGAAGCAGAGCACATTCCAATTCCAATTCCAAATCCATGGAGTTTCTTCCAATTCCCTTTCTGTTTTCCATCTTCCTTCTTCTTTCCACTTCGTCTTCTTCCT
CCATCACACTCCCCCTCACCGCCTTCCCTTCAACTCGAGCTCCAGATCCATGGAAGAATCTCAATTACCTTGCCTCTGCTTCGATCATCAGAGCTCATCACCTCAAGAAC
CGAAACAAATCAAGCGATTTCGTCCATAAATCCAAATCCGCACTCACTCCTCGTAGCTATGGCGCTTACTCGGTTTCTATCGGCTTCGGAACTCCTCCCCAGAATTTATC
GTTCGTCTTCGATACTGGAAGTAGTCTCGTGTGGTTCCCCTGCACCGCCCGTTATCTCTGCTCCAGGTGTTCGTTTCCCAACGTGAATACTGCGACGATTACGAAATTCA
TCCCGAAATTATCTTCTTCTGCGAGGATTGTCGGTTGCGGAAACCGGAAATGTGCTTGGATTTTTGACCCTAATGTGAGATCTAGGTGTCGAAATTGTACCCCTAATTCT
CGAAATTGTTCCGATGCTTGTCCTGGTTATGGAATTCAGTACGGTTCTGGATTAACCGCTGGATTTCTACTCTCTGAAACGCTTGATTTACCGGATAAACGAGTGCCAGA
TTTTCTCGTCGGTTGTTCCGTCTTGTCCGTCCACCAACCTGCCGGCATTGTCGGATTTGGCCGCGGTCCTCAATCGTTGCCGTCGCAAATGCGCCTGAAAAGATTCTCCT
ACTGCCTCGCTTCTCGCCAGTTCGACGACTCGCCGGTGAGCAGCCCTCTCGTGCTGGACTTCGGATCCAAATCCGGCGACACCAACACTAACGGCCTCATTTACTCGCCG
TTCCGAGAGAATCCCTCCGCCTCCGACGCCGCATTTAGAGAATACTATTACCTTTCTCTTCGCAGAATCCTCATCGGCGGAAAACCGGTGAAATTTCCGTACAAGTATCT
CTCACCGGACTCCACCGGTAACGGCGGCACGATCATCGATTCCGGCTCGACCTTTACGATTCTGGATAAGCCAATTTTCGAAGCCGTAGCGGAAGAATTCGAGAAGCAAC
TTATTAAATATCCCCGAGCTACCGGCGTGGAAGCTCGGTCCGGTCTAAGGCCGTGCTTCAATGTCTCGAAGGAGAAGACGGTGGAATTTCCGGAACTGGTTTTGAAGTTT
AAAGGCGGCCTGGAACTGGCTCTGCCGCCGGCTAATTACTTCGCGTTGGTGGCGGAGTCCGGCGTGGTGTGCATGACGATGTTGACGGATGACGTCGGCGGTGAGAAGGT
CGGCGGTGGACCGGCGATTATACTCGGCGCGTTTCAGCAGCAGAATATATTGGTGGAGTATGACTTGGCGAAGGACAGAATCGGATTTCGAAAGCAGAGATGCGTATGAT
TTATGAACTATGAAATTAATAATTTAGTGGAAAATAAAAATCGAAAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGTAAATG
TAAAACAAAGTTCACGAATTGAATGTGGTGGTAACTTCTAAGTTTCAATAGACTTCCAAATTTCAAGTTATTTGGTAAAAATTTTTTAATTACAATCAAATATGTATTGG
TAAAAAAGAGAGAAACAATAAAAAATAAAACTATTTTTCTGATTTTTTAAAATTAATTATTTAAATATAAGGGTAAAACTTTTGGGCGATCTAGGTCATTTAATTAAAGT
GTCACTCTTTCAATTTGAG
Protein sequence
MEFLPIPFLFSIFLLLSTSSSSSITLPLTAFPSTRAPDPWKNLNYLASASIIRAHHLKNRNKSSDFVHKSKSALTPRSYGAYSVSIGFGTPPQNLSFVFDTGSSLVWFPC
TARYLCSRCSFPNVNTATITKFIPKLSSSARIVGCGNRKCAWIFDPNVRSRCRNCTPNSRNCSDACPGYGIQYGSGLTAGFLLSETLDLPDKRVPDFLVGCSVLSVHQPA
GIVGFGRGPQSLPSQMRLKRFSYCLASRQFDDSPVSSPLVLDFGSKSGDTNTNGLIYSPFRENPSASDAAFREYYYLSLRRILIGGKPVKFPYKYLSPDSTGNGGTIIDS
GSTFTILDKPIFEAVAEEFEKQLIKYPRATGVEARSGLRPCFNVSKEKTVEFPELVLKFKGGLELALPPANYFALVAESGVVCMTMLTDDVGGEKVGGGPAIILGAFQQQ
NILVEYDLAKDRIGFRKQRCV
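
The CDS and protein sequences listed for this gene can be checked against each other by translating the CDS. The sketch below is a minimal example that assumes the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGGAGTTTCTTCCAATTCCC"))  # MEFLPIP
print(translate("AGATGCGTATGA"))           # RCV
```

Passing the full CDS text (line breaks included) to `translate` should reproduce the protein sequence above, from the initial `MEFLPIP` to the final `RCV` before the `TGA` stop codon.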