; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0005233 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0005233
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptioncysteine proteinase COT44-like
Genome locationContig00104_ERROPOS3811648:81936..85776
RNA-Seq ExpressionPay0005233
SyntenyPay0005233
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR013201 - Cathepsin propeptide inhibitor domain (I29)
IPR038765 - Papain-like cysteine peptidase superfamily
IPR039417 - Papain-like cysteine endopeptidase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027046.1 Oryzain alpha chain [Cucurbita argyrosperma subsp. argyrosperma]2.4e-12075.73Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MA ATT LA LSFF LSI  SAL++R+DGEVREIYD+WLAKHGKAYNGI+EREKRF IFK+NLNF+D+HNS+NRTY VGLNMFADLTN+EYRA +LGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
         PARRVMKAK+ASRRYAVN+ DRLPESVDWR +GAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCD KYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P   L       +E  +  SID YEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYG ENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

XP_004141903.1 cysteine proteinase COT44 [Cucumis sativus]2.8e-13283.01Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENL FIDDHNSENRTYKVGLNMFADLTN+EYRALYLGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
        PPARRVMKAKTASRRYAVNN DRLPES+DWR RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCDKKYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P  +        ++  +  SID+YEDVPANDEE+LKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYGKENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSW
        LVRNSW
Subjt:  LVRNSW

XP_008440309.1 PREDICTED: cysteine proteinase COT44-like [Cucumis melo]8.3e-13785.11Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
        PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P  +        ++  +  SIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYGKENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

XP_023518142.1 cysteine proteinase COT44-like [Cucurbita pepo subsp. pepo]6.4e-12176.38Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MA ATT LA LSFF LSI  SAL++R+DGEVREIYD+WLAKHGKAYNGI+EREKRF IFK+NLNFID+HNS+NRTY VGLNMFADLTN+EYRA +LGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
         PARRVMKAK+ASRRYAVN+ DRLPESVDWR RGAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCD KYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P   L       +E  +  SID YEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYG ENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

XP_038880922.1 cysteine proteinase COT44-like [Benincasa hispida]4.4e-13081.55Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MA+ATT LALLSFFFLSIS+SALS RSD EVREIYDLWLAKHGKAYNGI+EREKRFQIFKENL FIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
        PPARRVMKAKTASRRYAVNNRDRLPES DWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCDKKYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P           ++  +  SID YEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYG ENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

TrEMBL top hitse value%identityAlignment
A0A0A0KGD9 Uncharacterized protein1.3e-13283.01Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENL FIDDHNSENRTYKVGLNMFADLTN+EYRALYLGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
        PPARRVMKAKTASRRYAVNN DRLPES+DWR RGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCDKKYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P  +        ++  +  SID+YEDVPANDEE+LKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYGKENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSW
        LVRNSW
Subjt:  LVRNSW

A0A1S3B0U6 cysteine proteinase COT44-like4.0e-13785.11Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
        PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P  +        ++  +  SIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYGKENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

A0A5D3CLQ0 Cysteine proteinase COT44-like4.0e-13785.11Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
        PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P  +        ++  +  SIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYGKENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

A0A6J1HGZ6 zingipain-2-like1.5e-12075.73Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MA ATT LA LSFF LSI  SAL++R+DGEVREIYD+WLAKHGKAYNGI+EREKRF IFK+NLNF+D+HNS+NRTY VGLNMFADLTN+EYRA +LGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
         PARRVMKAK+ASRRYAVN+ DRLPESVDWR +GAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCD KYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P   L       +E  +  SID YEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYG ENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        LVRNSW  G
Subjt:  LVRNSWAQG

A0A6J1KPD5 cysteine proteinase COT44-like7.7e-12075.73Show/hide
Query:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS
        MA AT SLA LSFF LSI  SAL++RSDGEVREIYD+WLAKHGKAYNGI+E EKRF IFK+NLNFID+HNS NRTY VGLNMFADLTN+EYRA +LGTRS
Subjt:  MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL
         PARRVMKAK+ASRRYAVN+ DRLPESVDWR +GAVAP+KNQGSCGSCWAFSTIAAVEGINQIVTG+LISLSEQELVSCD KYNSGCNGGLM   F   +
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSL

Query:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW
              T    P   L       +E  +  SID YEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTG   + L    VAVGYG ENGVDYW
Subjt:  TMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDYW

Query:  LVRNSWAQG
        L RNSW  G
Subjt:  LVRNSWAQG

SwissProt top hitse value%identityAlignment
P25251 Cysteine proteinase COT44 (Fragment)4.8e-7956.07Show/hide
Query:  IYDLWLAKHGKA---YNG-IDEREKRFQIFKENLNFIDDHNSENR--TYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRY-AVNNRDRLPE
        IY  W  +HGK+    NG I+++++RF IFK+NL FID HN  N+  TYK+GL +FA+LTNDEYR+LYLG R+ P RR+ KAK  + +Y A  N D +P 
Subjt:  IYDLWLAKHGKA---YNG-IDEREKRFQIFKENLNFIDDHNSENR--TYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRY-AVNNRDRLPE

Query:  SVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRK-IPLRSLRWSMRSHQEK
        +VDWR +GAV  +K+QG+CGSCWAFST AAVEGIN+IVTG+L+SLSEQELV CDK YN GCNGGLM   F   +        K  P         S  + 
Subjt:  SVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRK-IPLRSLRWSMRSHQEK

Query:  CQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW
         +  +ID YEDVP+ DE ALK+AV++QPVSVAI+A G A Q YQSG+FTG     +    VAVGYG ENGVDYW+VRNSW
Subjt:  CQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW

P25776 Oryzain alpha chain1.4e-7851.78Show/hide
Query:  ATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSEN----RTYKVGLNMFADLTNDEYRALYLGTR
        A   L LLS     +S  +   RS+ E R +Y  W A+HGK+YN + E E+R+  F++NL +ID+HN+       ++++GLN FADLTN+EYR  YLG R
Subjt:  ATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSEN----RTYKVGLNMFADLTNDEYRALYLGTR

Query:  SPPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSS
        + P R     +  S RY   + + LPESVDWR +GAVA +K+QG CGSCWAFS IAAVEGINQIVTGDLISLSEQELV CD  YN GCNGGLM   F   
Subjt:  SPPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSS

Query:  LTMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDY
        +      T    P +        +++  +  +IDSYEDV  N E +L+KAVA+QPVSVAIEA G A QLY SG+FTG     L     AVGYG ENG DY
Subjt:  LTMAAW-TLRKIPLRSLRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNV-ALLSTMAVAVGYGKENGVDY

Query:  WLVRNSWAQ
        W+VRNSW +
Subjt:  WLVRNSWAQ

P43297 Cysteine proteinase RD21A6.1e-8254.83Show/hide
Query:  SALSRRSDGEVREIYDLWLAKHGKA--YNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAV
        S    RS+ EV  IY+ WL KHGKA   N + E+++RF+IFK+NL F+D+HN +N +Y++GL  FADLTNDEYR+ YLG +          +  S RY  
Subjt:  SALSRRSDGEVREIYDLWLAKHGKA--YNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAV

Query:  NNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRW
           D LPES+DWR +GAVA VK+QG CGSCWAFSTI AVEGINQIVTGDLI+LSEQELV CD  YN GCNGGLM   F   +      T +  P + +  
Subjt:  NNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRW

Query:  SMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSWAQ
        +    ++  +  +IDSYEDVP   EE+LKKAVAHQP+S+AIEA G A QLY SG+F G+    L    VAVGYG ENG DYW+VRNSW +
Subjt:  SMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSWAQ

Q94B08 Germination-specific cysteine protease 11.1e-8658.82Show/hide
Query:  RSDGEVREIYDLWLAKHGKAYNG----IDEREKRFQIFKENLNFIDDHNSENR--TYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRY--A
        R+D EVR IY  W A+HGK  N     I++++KRF IFK+NL FID HN +N+  TYK+GL  F DLTNDEYR LYLG R+ PARR+ KAK  +++Y  A
Subjt:  RSDGEVREIYDLWLAKHGKAYNG----IDEREKRFQIFKENLNFIDDHNSENR--TYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRY--A

Query:  VNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRK-IPLRSLR
        VN ++ +PE+VDWR +GAV P+K+QG+CGSCWAFST AAVEGIN+IVTG+LISLSEQELV CDK YN GCNGGLM   F   +        K  P R   
Subjt:  VNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRK-IPLRSLR

Query:  WSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW
            S  +  +  SID YEDVP  DE ALKKA+++QPVSVAIEA G   Q YQSG+FTG+    L    VAVGYG ENGVDYW+VRNSW
Subjt:  WSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW

Q9FMH8 Probable cysteine protease RD21B2.7e-8257.54Show/hide
Query:  RSDGEVREIYDLWLAKHGKA---YNGID-EREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNNR
        RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFK+NL FID+HN++N +YK+GL  FADLTN+EYR++YLG +  P +RV+K    S RY     
Subjt:  RSDGEVREIYDLWLAKHGKA---YNGID-EREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNNR

Query:  DRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRWSMR
        D LP+SVDWR  GAVA VK+QGSCGSCWAFSTI AVEGIN+IVTGDLISLSEQELV CD  YN GCNGGLM   F   +      T    P ++      
Subjt:  DRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRWSMR

Query:  SHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW
         +++  +  +IDSYEDVP N E +LKKA+AHQP+SVAIEA G A QLY SGVF G     L    VAVGYG ENG DYW+VRNSW
Subjt:  SHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW

Arabidopsis top hitse value%identityAlignment
AT1G47128.1 Granulin repeat cysteine protease family protein4.3e-8354.83Show/hide
Query:  SALSRRSDGEVREIYDLWLAKHGKA--YNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAV
        S    RS+ EV  IY+ WL KHGKA   N + E+++RF+IFK+NL F+D+HN +N +Y++GL  FADLTNDEYR+ YLG +          +  S RY  
Subjt:  SALSRRSDGEVREIYDLWLAKHGKA--YNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAV

Query:  NNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRW
           D LPES+DWR +GAVA VK+QG CGSCWAFSTI AVEGINQIVTGDLI+LSEQELV CD  YN GCNGGLM   F   +      T +  P + +  
Subjt:  NNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRW

Query:  SMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSWAQ
        +    ++  +  +IDSYEDVP   EE+LKKAVAHQP+S+AIEA G A QLY SG+F G+    L    VAVGYG ENG DYW+VRNSW +
Subjt:  SMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSWAQ

AT3G19390.1 Granulin repeat cysteine protease family protein1.7e-7953.16Show/hide
Query:  MATATTS--LALLSFFFLSISASALS------RRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNS-ENRTYKVGLNMFADLTNDEY
        MAT+  S  LALL F  L IS S  S       R++ E R +Y+ WL ++ K YNG+ E+E+RF+IFK+NL F+++H+S  NRTY+VGL  FADLTNDE+
Subjt:  MATATTS--LALLSFFFLSISASALS------RRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNS-ENRTYKVGLNMFADLTNDEY

Query:  RALYLGTRSPPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGL
        RA+YL ++    R  +K +    +Y     D LP+++DWRA+GAV PVK+QGSCGSCWAFS I AVEGINQI TG+LISLSEQELV CD  YN GC GGL
Subjt:  RALYLGTRSPPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGL

Query:  MAMPFSSSLTMAAW-TLRKIPLRSLRWSM-RSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVG
        M   F   +      T    P  +   ++  S ++  +  +ID YEDVP NDE++LKKA+A+QP+SVAIEA G A QLY SGVFTG     L    VAVG
Subjt:  MAMPFSSSLTMAAW-TLRKIPLRSLRWSM-RSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVG

Query:  YGKENGVDYWLVRNSW
        YG E G DYW+VRNSW
Subjt:  YGKENGVDYWLVRNSW

AT3G19400.1 Cysteine proteinases superfamily protein2.8e-7449.84Show/hide
Query:  TSLALLSFFFLSISASALS----RRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNS-ENRTYKVGLNMFADLTNDEYRALYLGTRS
        ++L +LS   LS S    +     R++ EVR +Y+ WL ++ K YNG+ E+E+RF+IFK+NL F+D+HNS  +RT++VGL  FADLTN+E+RA+YL  + 
Subjt:  TSLALLSFFFLSISASALS----RRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNS-ENRTYKVGLNMFADLTNDEYRALYLGTRS

Query:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKY-NSGCNGGLMAMPFSSS
           +  +K    + RY     D LP+ VDWRA GAV  VK+QG+CGSCWAFS + AVEGINQI TG+LISLSEQELV CD+ + N+GC+GG+M   F   
Subjt:  PPARRVMKAKTASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKY-NSGCNGGLMAMPFSSS

Query:  LTMAA-WTLRKIPLRS--LRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGV
        +      T +  P  +  L           +  +ID YEDVP +DE++LKKAVAHQPVSVAIEAS  A QLY+SGV TG   + L    V VGYG  +G 
Subjt:  LTMAA-WTLRKIPLRS--LRWSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGV

Query:  DYWLVRNSW
        DYW++RNSW
Subjt:  DYWLVRNSW

AT4G36880.1 cysteine proteinase11.3e-8758.82Show/hide
Query:  RSDGEVREIYDLWLAKHGKAYNG----IDEREKRFQIFKENLNFIDDHNSENR--TYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRY--A
        R+D EVR IY  W A+HGK  N     I++++KRF IFK+NL FID HN  N+  TYK+GL  F DLTNDEYR LYLG R+ PARR+ KAK  +++Y  A
Subjt:  RSDGEVREIYDLWLAKHGKAYNG----IDEREKRFQIFKENLNFIDDHNSENR--TYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRY--A

Query:  VNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRK-IPLRSLR
        VN ++ +PE+VDWR +GAV P+K+QG+CGSCWAFST AAVEGIN+IVTG+LISLSEQELV CDK YN GCNGGLM   F   +        K  P R   
Subjt:  VNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRK-IPLRSLR

Query:  WSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW
            S  +  +  SID YEDVP  DE ALKKA+++QPVSVAIEA G   Q YQSG+FTG+    L    VAVGYG ENGVDYW+VRNSW
Subjt:  WSMRSHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW

AT5G43060.1 Granulin repeat cysteine protease family protein1.9e-8357.54Show/hide
Query:  RSDGEVREIYDLWLAKHGKA---YNGID-EREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNNR
        RSD EV  IY+ W+ +HGK     NG+  E+++RF+IFK+NL FID+HN++N +YK+GL  FADLTN+EYR++YLG +  P +RV+K    S RY     
Subjt:  RSDGEVREIYDLWLAKHGKA---YNGID-EREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAKTASRRYAVNNR

Query:  DRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRWSMR
        D LP+SVDWR  GAVA VK+QGSCGSCWAFSTI AVEGIN+IVTGDLISLSEQELV CD  YN GCNGGLM   F   +      T    P ++      
Subjt:  DRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAW-TLRKIPLRSLRWSMR

Query:  SHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW
         +++  +  +IDSYEDVP N E +LKKA+AHQP+SVAIEA G A QLY SGVF G     L    VAVGYG ENG DYW+VRNSW
Subjt:  SHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVAL-LSTMAVAVGYGKENGVDYWLVRNSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACTGCCACCACTTCGCTTGCCCTCCTCTCCTTCTTTTTCCTTTCCATTTCCGCCTCCGCCCTCAGCCGCCGAAGCGACGGCGAGGTTAGAGAAATCTACGACCT
ATGGTTGGCGAAGCACGGCAAGGCCTATAACGGAATCGATGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTCAATTTTATCGATGATCATAATTCCGAGAATC
GGACGTATAAGGTTGGATTGAATATGTTCGCCGATTTGACCAACGACGAGTATCGGGCTCTGTATTTGGGGACTAGGTCTCCCCCAGCTCGACGAGTTATGAAGGCCAAG
ACTGCCAGCCGCCGATACGCCGTCAATAACCGCGATCGGTTGCCGGAATCTGTGGATTGGAGGGCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAG
TTGCTGGGCATTCTCGACCATAGCAGCTGTGGAAGGCATAAATCAAATTGTTACCGGAGATTTAATTTCTCTCTCTGAACAAGAGCTTGTTAGTTGTGACAAAAAGTACA
ACTCAGGCTGCAATGGAGGCCTTATGGCTATGCCTTTCAGTTCATCATTGACAATGGCGGCTTGGACACTGAGGAAGATACCCTTACGAAGCCTTCGATGGTCAATGCGA
TCCCACCAGGAAAAATGCCAAGGTTTTAGCATCGACTCTTACGAAGACGTCCCTGCCAACGACGAGGAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGTGTTGC
CATTGAAGCTAGTGGCTTGGCTTTGCAACTCTACCAATCGGGTGTATTCACTGGAAATGTGGCTCTGCTCTCGACCATGGCCGTCGCCGTTGGATATGGTAAAGAAAATG
GAGTTGATTATTGGCTTGTAAGGAACTCATGGGCACAGGGAGTTATCTCACAAAATAGTTATGATTATGAAGCATGTTCTTGTCATGATTTTGCACATTGTTTCTACCTA
ATGCTATCATTGGGTCAACTGTGGAGTGACAAGATGGTGGCAATATTTAATACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACTGCCACCACTTCGCTTGCCCTCCTCTCCTTCTTTTTCCTTTCCATTTCCGCCTCCGCCCTCAGCCGCCGAAGCGACGGCGAGGTTAGAGAAATCTACGACCT
ATGGTTGGCGAAGCACGGCAAGGCCTATAACGGAATCGATGAACGGGAGAAGAGGTTTCAGATCTTCAAGGAGAATCTCAATTTTATCGATGATCATAATTCCGAGAATC
GGACGTATAAGGTTGGATTGAATATGTTCGCCGATTTGACCAACGACGAGTATCGGGCTCTGTATTTGGGGACTAGGTCTCCCCCAGCTCGACGAGTTATGAAGGCCAAG
ACTGCCAGCCGCCGATACGCCGTCAATAACCGCGATCGGTTGCCGGAATCTGTGGATTGGAGGGCCAGAGGTGCCGTTGCTCCAGTCAAAAATCAAGGAAGTTGCGGGAG
TTGCTGGGCATTCTCGACCATAGCAGCTGTGGAAGGCATAAATCAAATTGTTACCGGAGATTTAATTTCTCTCTCTGAACAAGAGCTTGTTAGTTGTGACAAAAAGTACA
ACTCAGGCTGCAATGGAGGCCTTATGGCTATGCCTTTCAGTTCATCATTGACAATGGCGGCTTGGACACTGAGGAAGATACCCTTACGAAGCCTTCGATGGTCAATGCGA
TCCCACCAGGAAAAATGCCAAGGTTTTAGCATCGACTCTTACGAAGACGTCCCTGCCAACGACGAGGAAGCATTGAAGAAGGCTGTTGCTCATCAGCCAGTCAGTGTTGC
CATTGAAGCTAGTGGCTTGGCTTTGCAACTCTACCAATCGGGTGTATTCACTGGAAATGTGGCTCTGCTCTCGACCATGGCCGTCGCCGTTGGATATGGTAAAGAAAATG
GAGTTGATTATTGGCTTGTAAGGAACTCATGGGCACAGGGAGTTATCTCACAAAATAGTTATGATTATGAAGCATGTTCTTGTCATGATTTTGCACATTGTTTCTACCTA
ATGCTATCATTGGGTCAACTGTGGAGTGACAAGATGGTGGCAATATTTAATACTTGA
Protein sequenceShow/hide protein sequence
MATATTSLALLSFFFLSISASALSRRSDGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLNFIDDHNSENRTYKVGLNMFADLTNDEYRALYLGTRSPPARRVMKAK
TASRRYAVNNRDRLPESVDWRARGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVSCDKKYNSGCNGGLMAMPFSSSLTMAAWTLRKIPLRSLRWSMR
SHQEKCQGFSIDSYEDVPANDEEALKKAVAHQPVSVAIEASGLALQLYQSGVFTGNVALLSTMAVAVGYGKENGVDYWLVRNSWAQGVISQNSYDYEACSCHDFAHCFYL
MLSLGQLWSDKMVAIFNT