; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022648 (gene) of Snake gourd v1 genome

Gene IDTan0022648
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationLG01:23201356..23203382
RNA-Seq ExpressionTan0022648
SyntenyTan0022648
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577689.1 putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia]3.2e-25691.72Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS
        MASPVF LFLLC L+SS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHRRRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGS
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS

Query:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA
        DLVWFPCSPFECILCEGKPK+QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSLI +LYRDSLSLPA
Subjt:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA

Query:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY
        PAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+
Subjt:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY

Query:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS
        YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+VA+RASRIEENTGLSPCY Y+ S+EVPRVVLHFVGEKSS
Subjt:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS

Query:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        V LPRKNYFYEFLDGGDG GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_022923540.1 probable aspartyl protease At4g16563 [Cucurbita moschata]5.5e-25691.72Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS
        MASPVF LFLLC L SS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHRRRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGS
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS

Query:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA
        DLVWFPCSPFECILCEGKPK+QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSLI +LYRDSLSLPA
Subjt:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA

Query:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY
        PAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+
Subjt:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY

Query:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS
        YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+VA+RASRIEENTGLSPCY Y+ S+EVPRVVLHFVGEKSS
Subjt:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS

Query:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        V LPRKNYFYEFLDGGDG GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_023007805.1 probable aspartyl protease At4g16563 [Cucurbita maxima]5.5e-25691.48Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS
        MASPVF LFLLC LL S VFSS+ILLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHRRRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGS
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS

Query:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA
        DLVWFPCSPFECILCEGKPK+QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSLI +LYRDSLSLPA
Subjt:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA

Query:  PAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYS
        PAP+PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+YS
Subjt:  PAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYS

Query:  VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVM
        VGLAGISVGSV IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+VA+RAS+IEENTGLSPCYYY+ S+EVPRVVLHFVGEKSSVM
Subjt:  VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVM

Query:  LPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        LPRKNYFYEFLDGGDG GRK KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  LPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_023553227.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]6.5e-25791.93Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS
        MASPVF LFLLC LLSS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHRRRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGS
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS

Query:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA
        DLVWFPCSPFECILCEGKPK+QSPLPKI+ +KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSLI +LYRDSLSLPA
Subjt:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA

Query:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY
        PAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTSLLENPKHPY+
Subjt:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY

Query:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS
        YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+VA+RASRIEENTGLSPCYYY+NS+EVPRVVLHFVGEKSS
Subjt:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS

Query:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        V+LPRKNYFYEFLDGGDG  RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_038905814.1 probable aspartyl protease At4g16563 [Benincasa hispida]1.2e-25390.87Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTG
        MAS VF+L LLC LLSS VFSS++LLLPLSHSLSSS SDFNNTHNLLKSTAARS ARFHHRRRT   +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTG
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTG

Query:  SDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLP
        SDLVWFPCSPFECILCEGKPKVQSPLPKIS  KSVSCSA ACSAAHGGSLS+SHLCAIS+CPLESIEISECSSFSCPPFYYAYGDGSLIA+LYRDSLSLP
Subjt:  SDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLP

Query:  APAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYY
        APAP+PAI+VRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAA+RVRRPSPLILGRYYGGETEF+YTSLLENPKHPY+Y
Subjt:  APAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYY

Query:  SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSV
        SVGL GISVG++ IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLYDSVV  FENRTG+VA RA RIEENTGLSPCYYY+NS+EVPRVVLHFVGEKSSV
Subjt:  SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSV

Query:  MLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        +LP+KNYFYEFLDGGDG G+KRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDL KNRVGFARRQCSTLWDSLNRS
Subjt:  MLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

TrEMBL top hitse value%identityAlignment
A0A0A0L5I7 Pepsin A7.8e-24889.19Show/hide
Query:  SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSD
        SPVF +FLLC LLSS VFSS+I LLPLSHSLSSS SDFNNTHNLLKSTA RS ARFH     HR +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSD
Subjt:  SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSD

Query:  LVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP
        LVWFPCSPFECILCEGKPK+QSPLPKI+  KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL+A+LYRDSLSLP P
Subjt:  LVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP

Query:  APAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSV
        AP+P I+VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY GETEF+YTSLLENPKHPY+YSV
Subjt:  APAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSV

Query:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML
        GLAGISVG++RIPAPEFL +VDEGGSGGVVVDSGTTFTMLPAGLY+SVV +FENRTG+VA RA RIEENTGLSPCYYY+NS+ VPRVVLHFVGEKS+V+L
Subjt:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML

Query:  PRKNYFYEFLDGGDG-TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        PRKNYFYEFLDGGDG  GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  PRKNYFYEFLDGGDG-TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A1S3BK28 aspartic proteinase nepenthesin-11.5e-24688.61Show/hide
Query:  SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSD
        SPVF +FLLC LLSS VFSS+I LLPLSHSLSSS SDFN+THNLLKSTA RS ARFH     HR +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSD
Subjt:  SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSD

Query:  LVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP
        LVWFPCSPFECILCEGKPK+QSPLPKIS  KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL+A+LYRDSLSLP P
Subjt:  LVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP

Query:  APAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSV
        AP+P I+VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+ GETEF+YTSLLENPKHPY+YSV
Subjt:  APAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSV

Query:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML
        GLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY+SVV +FENRTG+VA RA RIEENTGLSPCYYY+NS+ VPRVVLHFVGEKSSV+L
Subjt:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML

Query:  PRKNYFYEFLDGGDG---TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        PRKNYFYEFLDGGDG    GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  PRKNYFYEFLDGGDG---TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A5D3CP11 Aspartic proteinase nepenthesin-12.1e-24588.45Show/hide
Query:  SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSD
        SPVF +FLLC LLSS VFSS+I LLPLSHSLSSS SDFN+THNLLKSTA RS ARFH     HR +HLSLPLSPGGDYTLSFNLGSE+HKISLYMDTGSD
Subjt:  SPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSS-SDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSD

Query:  LVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP
        LVWFPCSPFECILCEGKPK+QSPLPKIS  KSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL+A+LYRDSLSLP P
Subjt:  LVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAP

Query:  APAPA--ISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYY
        APAP+  I+VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+ GETEF+YTSLLENPKHPY+Y
Subjt:  APAPA--ISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYY

Query:  SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSV
        SVGLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY+SVV +FENRTG+VA RA RIEENTGLSPCYYY NS+ VPRVVLHFVGEKSSV
Subjt:  SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSV

Query:  MLPRKNYFYEFLDGGDG---TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        +LPRKNYFYEFLDGGDG    GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  MLPRKNYFYEFLDGGDG---TGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A6J1EC44 probable aspartyl protease At4g165632.7e-25691.72Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS
        MASPVF LFLLC L SS VFSS++LLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHRRRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGS
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS

Query:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA
        DLVWFPCSPFECILCEGKPK+QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSLI +LYRDSLSLPA
Subjt:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA

Query:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY
        PAPA  PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+
Subjt:  PAPA--PAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYY

Query:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS
        YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+VA+RASRIEENTGLSPCY Y+ S+EVPRVVLHFVGEKSS
Subjt:  YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSS

Query:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        V LPRKNYFYEFLDGGDG GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  VMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A6J1L3Z9 probable aspartyl protease At4g165632.7e-25691.48Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS
        MASPVF LFLLC LL S VFSS+ILLLPLS+SLSSSSDFNNTHNLLKSTAARS ARFHHRRRTH RSHLSLPLSPGGDYTLSFNLGSE+ KISLYMDTGS
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGS

Query:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA
        DLVWFPCSPFECILCEGKPK+QSPLPKIS QKSVSCSAAACSAAHGGSLS+SHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSLI +LYRDSLSLPA
Subjt:  DLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPA

Query:  PAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYS
        PAP+PAI+VRNFTFGCAH+ALGEP+GVAGFGRG LSMP QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG ETEF+YTS+LENPKHPY+YS
Subjt:  PAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYS

Query:  VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVM
        VGLAGISVGSV IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVV QFENRTG+VA+RAS+IEENTGLSPCYYY+ S+EVPRVVLHFVGEKSSVM
Subjt:  VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVM

Query:  LPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        LPRKNYFYEFLDGGDG GRK KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  LPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

SwissProt top hitse value%identityAlignment
O04496 Aspartyl protease AED31.3e-3429.16Show/hide
Query:  SLPLSPG-----GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPL--PKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPL
        S+P++ G     G+Y +   LG+    + + +DT +D VW PCS      C G     +       S   +VSCS A C+ A G             CP 
Subjt:  SLPLSPG-----GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPL--PKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPL

Query:  ESIEISECSSFSCPPFYYAY-GDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGE---PVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVS
         S + S CS      F  +Y GD S  A L +D+L+L     AP + + NF+FGC ++A G    P G+ G GRG +S+ SQ  +        FSYCL S
Subjt:  ESIEISECSSFSCPPFYYAY-GDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGE---PVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVS

Query:  -HSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRT
          SF          L LG   G      YT LL NP+ P  Y V L G+SVGSV++P        D     G ++DSGT  T     +Y+++  +F  + 
Subjt:  -HSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRT

Query:  GQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVY
               S          C+  DN    P++ LH       + LP +N           T      G L  ++     + A      + N QQQ   +++
Subjt:  GQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVY

Query:  DLEKNRVGFARRQCS
        D+  +R+G A   C+
Subjt:  DLEKNRVGFARRQCS

Q766C3 Aspartic proteinase nepenthesin-12.7e-3529.88Show/hide
Query:  GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCP
        G+Y ++ ++G+ A   S  MDTGSDL+W  C P  C  C          P  + Q S S S   C         SS LC       +++    CS+  C 
Subjt:  GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCP

Query:  PFYYAYGDGS-LIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHT----ALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSP
         + Y YGDGS     +  ++L+        ++S+ N TFGC         G   G+ G GRG LS+PSQL         +FSYC+     +      PS 
Subjt:  PFYYAYGDGS-LIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHT----ALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSP

Query:  LILGRYYGGETE-FVYTSLLENPKHPYYYSVGLAGISVGSVRIPA-PEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEE
        L+LG      T     T+L+++ + P +Y + L G+SVGS R+P  P         G+GG+++DSGTT T      Y SV  +F ++        S    
Subjt:  LILGRYYGGETE-FVYTSLLENPKHPYYYSVGLAGISVGSVRIPA-PEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEE

Query:  NTGLSPCYYY---DNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGF
        ++G   C+      +++++P  V+HF G    + LP +NYF                G + L  G     +     +  GN QQQ   VVYD   + V F
Subjt:  NTGLSPCYYY---DNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGF

Query:  ARRQC
        A  QC
Subjt:  ARRQC

Q940R4 Probable aspartyl protease At4g165631.1e-16661.43Show/hide
Query:  LLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSP
        LLL LSHSLS+S   ++  +LLKS+++RS ARF       ++  LSLP+S G DY +S ++GS +  +SLY+DTGSDLVWFPC PF CILCE KP   SP
Subjt:  LLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSP

Query:  LPKISKQ-KSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTAL
           +S    +VSCS+ +CSAAH  SL SS LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSL      P++SV NFTFGCAHT L
Subjt:  LPKISKQ-KSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTAL

Query:  GEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG--------------------GETEFVYTSLLENPKHPYYYSV
         EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF +DRVRRPSPLILGR+                       + EFV+T +LENPKHPY+YSV
Subjt:  GEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG--------------------GETEFVYTSLLENPKHPYYYSV

Query:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML
         L GIS+G   IPAP  L+R+D+ G GGVVVDSGTTFTMLPA  Y+SVV +F++R G+V  RA R+E ++G+SPCYY + +++VP +VLHF G +SSV L
Subjt:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML

Query:  PRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL
        PR+NYFYEF+DGGDG   KRK+GCLMLMNGGDE EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Subjt:  PRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL

Q9LNJ3 Aspartyl protease family protein 21.0e-3432.44Show/hide
Query:  LSPG-GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECS
        LS G G+Y     +G+ A  + + +DTGSD+VW  C+P  C  C  +       P    +KS + +   CS+ H   L S+  C   R          C 
Subjt:  LSPG-GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECS

Query:  SFSCPPFYYAYGDGSL-IAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR
              +  +YGDGS  +     ++L+           V+    GC H   G  VG A   G G+G LS P Q      +   +FSYCLV  S ++    
Subjt:  SFSCPPFYYAYGDGSL-IAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR

Query:  RPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASR
        +PS ++ G          +T LL NPK   +Y VGL GISVG  R+P     L ++D+ G+GGV++DSGT+ T L    Y ++   F  R G  A    R
Subjt:  RPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASR

Query:  IEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPG--ATLGNYQQQGFEVVYDLEKN
          + +    C+     N ++VP VVLHF G  + V LP  NY                    +  NG      AG  G  + +GN QQQGF VVYDL  +
Subjt:  IEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPG--ATLGNYQQQGFEVVYDLEKN

Query:  RVGFARRQCS
        RVGFA   C+
Subjt:  RVGFARRQCS

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 12.9e-3429.93Show/hide
Query:  GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCP
        G+Y     +G+ A ++ L +DTGSD+ W  C P  C  C  +          S  KS++CSA  CS                      +E S C S  C 
Subjt:  GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCP

Query:  PFYYAYGDGSL-IAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALG---EPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPL
         +  +YGDGS  + +L  D+++        +  + N   GC H   G      G+ G G G LS+ +Q+   S      FSYCLV            + +
Subjt:  PFYYAYGDGSL-IAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALG---EPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPL

Query:  ILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTG
         LG   GG+       LL N K   +Y VGL+G SVG  ++  P+ +  VD  GSGGV++D GT  T L    Y+S+   F   T  +   +S I   + 
Subjt:  ILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTG

Query:  LSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ
           CY +   ++++VP V  HF G K S+ LP KNY     D G          C           +       +GN QQQG  + YDL KN +G +  +
Subjt:  LSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQ

Query:  C
        C
Subjt:  C

Arabidopsis top hitse value%identityAlignment
AT1G01300.1 Eukaryotic aspartyl protease family protein7.2e-3632.44Show/hide
Query:  LSPG-GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECS
        LS G G+Y     +G+ A  + + +DTGSD+VW  C+P  C  C  +       P    +KS + +   CS+ H   L S+  C   R          C 
Subjt:  LSPG-GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECS

Query:  SFSCPPFYYAYGDGSL-IAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR
              +  +YGDGS  +     ++L+           V+    GC H   G  VG A   G G+G LS P Q      +   +FSYCLV  S ++    
Subjt:  SFSCPPFYYAYGDGSL-IAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR

Query:  RPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASR
        +PS ++ G          +T LL NPK   +Y VGL GISVG  R+P     L ++D+ G+GGV++DSGT+ T L    Y ++   F  R G  A    R
Subjt:  RPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASR

Query:  IEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPG--ATLGNYQQQGFEVVYDLEKN
          + +    C+     N ++VP VVLHF G  + V LP  NY                    +  NG      AG  G  + +GN QQQGF VVYDL  +
Subjt:  IEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPG--ATLGNYQQQGFEVVYDLEKN

Query:  RVGFARRQCS
        RVGFA   C+
Subjt:  RVGFARRQCS

AT1G25510.1 Eukaryotic aspartyl protease family protein4.2e-3629.93Show/hide
Query:  THRRSHLSLPLSPG-----GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAI
        T     +  PL  G     G+Y     +G  A ++ + +DTGSD+ W  C+P  C  C  + +        S  + +SC    C+A              
Subjt:  THRRSHLSLPLSPG-----GDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAI

Query:  SRCPLESIEISECSSFSCPPFYYAYGDGS-LIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRFS
               +E+SEC + +C  +  +YGDGS  +     ++L++ +        V+N   GC H+  G  VG A   G G G L++PSQL T S      FS
Subjt:  SRCPLESIEISECSSFSCPPFYYAYGDGS-LIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRFS

Query:  YCLVSH-SFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQ
        YCLV   S +A  V   + L          + V   LL N +   +Y +GL GISVG   +  P+    +DE GSGG+++DSGT  T L   +Y+S+   
Subjt:  YCLVSH-SFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQ

Query:  FENRTGQVATRASRIEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQ
        F   T  +   A     +T    CY      ++EVP V  HF G K  + LP KNY                VG   L        L     A +GN QQ
Subjt:  FENRTGQVATRASRIEENTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQ

Query:  QGFEVVYDLEKNRVGFARRQC
        QG  V +DL  + +GF+  +C
Subjt:  QGFEVVYDLEKNRVGFARRQC

AT3G52500.1 Eukaryotic aspartyl protease family protein1.1e-4430.04Show/hide
Query:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRR---------------THRRSHLSLPLSPG--GDYTLSF
        MAS +F  FL+ + + S+V   ++ L P SHS  S  D    +  L+  A  S+AR H  +                T   + +  PLS    G Y++S 
Subjt:  MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRR---------------THRRSHLSLPLSPG--GDYTLSF

Query:  NLGSEAHKISLYMDTGSDLVWFPC-SPFECILCEGKPKVQSPLPKI-----SKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPP
        + G+ +  I    DTGS LVW PC S + C  C+      + +P+      S  K + C +  C   +G ++         +C         C +  CPP
Subjt:  NLGSEAHKISLYMDTGSDLVWFPC-SPFECILCEGKPKVQSPLPKI-----SKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPP

Query:  FYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY
        +   YG GS    L  + L        P ++V +F  GC+  +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  
Subjt:  FYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY

Query:  Y--GGETE-FVYTSLLENPK-----HPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEE
        +  G +T    YT   +NP         YY + L  I VG   +  P         G GG +VDSG+TFT +   +++ V  +F ++     TR   +E+
Subjt:  Y--GGETE-FVYTSLLENPK-----HPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEE

Query:  NTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAG-GPGATLGNYQQQGFEVVYDLEKNRVGF
         TGL PC+       + VP ++  F G  + + LP  NYF  F+   D         CL +++    +   G GP   LG++QQQ + V YDLE +R GF
Subjt:  NTGLSPCYYYD--NSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAG-GPGATLGNYQQQGFEVVYDLEKNRVGF

Query:  ARRQCS
        A+++CS
Subjt:  ARRQCS

AT4G16563.1 Eukaryotic aspartyl protease family protein7.9e-16861.43Show/hide
Query:  LLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSP
        LLL LSHSLS+S   ++  +LLKS+++RS ARF       ++  LSLP+S G DY +S ++GS +  +SLY+DTGSDLVWFPC PF CILCE KP   SP
Subjt:  LLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPFECILCEGKPKVQSP

Query:  LPKISKQ-KSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTAL
           +S    +VSCS+ +CSAAH  SL SS LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSL      P++SV NFTFGCAHT L
Subjt:  LPKISKQ-KSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTAL

Query:  GEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG--------------------GETEFVYTSLLENPKHPYYYSV
         EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF +DRVRRPSPLILGR+                       + EFV+T +LENPKHPY+YSV
Subjt:  GEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYG--------------------GETEFVYTSLLENPKHPYYYSV

Query:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML
         L GIS+G   IPAP  L+R+D+ G GGVVVDSGTTFTMLPA  Y+SVV +F++R G+V  RA R+E ++G+SPCYY + +++VP +VLHF G +SSV L
Subjt:  GLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVML

Query:  PRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL
        PR+NYFYEF+DGGDG   KRK+GCLMLMNGGDE EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Subjt:  PRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL

AT5G45120.1 Eukaryotic aspartyl protease family protein1.8e-5032.26Show/hide
Query:  VFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHR-RSHLSLPLSP----GGDYTLSFNLGSEAHKISLYMDTG
        V  LFLL  LL ++   ++        + SSSS       L KS+ +    +   + R  +  S + + + P       Y ++ N+G+    + +Y+DTG
Subjt:  VFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHR-RSHLSLPLSP----GGDYTLSFNLGSEAHKISLYMDTG

Query:  SDLVWFPCS--PFECILCEG-------KPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQ
        SDL W PC    F+CI C          P V SPL   +  +  SC+++ C   H  S +    CA++ C +  +  S C    CP F Y YG+G LI+ 
Subjt:  SDLVWFPCS--PFECILCEG-------KPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQ

Query:  -LYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGR---YYGGETEFVY
         L RD L       A    V  F+FGC  +   EP+G+AGFGRG LS+PSQL      L   FS+C +   F  +     SPLILG             +
Subjt:  -LYRDSLSLPAPAPAPAISVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGR---YYGGETEFVY

Query:  TSLLENPKHPYYYSVGLAGISVGSVRIP--APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYY-----
        T +L  P +P  Y +GL  I++G+   P   P  L++ D  G+GG++VDSGTT+T LP   Y  ++   ++       RA+  E  TG   CY       
Subjt:  TSLLENPKHPYYYSVGLAGISVGSVRIP--APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYY-----

Query:  -------DNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC
               D  +  P +  HF+   ++++LP+ N FY      DG+     V CL+  N  D D    GP    G++QQQ  +VVYDLEK R+GF    C
Subjt:  -------DNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCCTGTTTTTCTCCTCTTCCTCCTCTGTATTCTCCTTTCTTCCTCTGTTTTCTCTTCAGAAATTCTCCTTCTACCTCTCTCCCACTCCTTATCATCCTCATC
AGATTTCAACAACACCCACAACCTCCTCAAATCTACTGCTGCCCGCTCCGTCGCCCGCTTCCACCACCGCCGCCGTACCCACCGCCGCAGCCACCTCTCTCTCCCACTCT
CTCCAGGTGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGGCTCACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCATTT
GAATGTATTCTCTGCGAAGGCAAACCAAAAGTTCAATCCCCTTTGCCCAAAATCTCAAAACAGAAATCAGTTTCTTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGCGG
CTCCCTCTCCTCCTCCCACCTCTGTGCAATTTCCCGATGCCCACTTGAATCCATTGAAATTTCTGAGTGTTCTTCCTTTTCTTGTCCGCCTTTCTATTATGCTTATGGCG
ATGGGAGTTTAATTGCTCAGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCCGCACCGGCACCGGCGATTAGTGTTCGGAATTTTACTTTTGGATGTGCCCACACGGCG
CTCGGTGAGCCGGTCGGGGTCGCCGGGTTCGGTCGGGGAACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAATTGGGGAACCGGTTTTCTTATTGTTTGGT
TTCTCATTCGTTTGCGGCGGACCGGGTTCGCCGCCCGAGTCCGCTGATTCTGGGGCGGTACTACGGCGGCGAGACGGAGTTCGTTTACACTTCCTTGCTTGAGAATCCGA
AGCATCCTTACTATTACTCGGTTGGGTTGGCGGGAATTTCGGTCGGGTCGGTGAGAATTCCGGCGCCGGAGTTTTTGAAACGGGTGGATGAGGGGGGCAGCGGCGGCGTT
GTAGTGGATTCCGGTACTACTTTCACTATGCTGCCGGCCGGTTTGTATGACTCGGTGGTGGGTCAGTTTGAGAATCGGACCGGGCAAGTTGCGACCCGGGCGAGCCGTAT
TGAAGAAAATACCGGGTTGAGCCCTTGTTATTACTATGATAACTCAATTGAAGTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAAATCCAGTGTGATGCTTCCTAGGA
AGAATTATTTTTACGAGTTTTTGGATGGTGGAGATGGGACGGGGAGGAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGATGAGCTGGCAGGTGGG
CCCGGAGCCACGCTGGGGAACTATCAACAACAGGGTTTTGAAGTGGTCTATGATTTAGAGAAGAACCGGGTCGGTTTTGCCCGGCGGCAGTGTTCGACGCTTTGGGACAG
CTTGAACCGGAGTTAG
mRNA sequenceShow/hide mRNA sequence
TTATTGTACAAGGAATCACACAACCAAAGTTGTAATTGAACCAAAGAAGAAAGAGGAAGAAAGAAGCCCCTAGTTCCCATGGCCACCACTCCTCAAACCCATACAAAAAC
CCATTTCCCCCTTCTAACCAGAACCTTCTTCTTCTTCTTCCTCAATCCTCACCTCTTTATATTCCCAACTTTCTCTCTCTAACTTCACTCCAAAAACCCCATCTCTTTCT
GAACAGCCCTTCAATGGCTTCCCCTGTTTTTCTCCTCTTCCTCCTCTGTATTCTCCTTTCTTCCTCTGTTTTCTCTTCAGAAATTCTCCTTCTACCTCTCTCCCACTCCT
TATCATCCTCATCAGATTTCAACAACACCCACAACCTCCTCAAATCTACTGCTGCCCGCTCCGTCGCCCGCTTCCACCACCGCCGCCGTACCCACCGCCGCAGCCACCTC
TCTCTCCCACTCTCTCCAGGTGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGGCTCACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCC
CTGTTCCCCATTTGAATGTATTCTCTGCGAAGGCAAACCAAAAGTTCAATCCCCTTTGCCCAAAATCTCAAAACAGAAATCAGTTTCTTGCAGCGCCGCCGCATGCTCCG
CCGCCCACGGCGGCTCCCTCTCCTCCTCCCACCTCTGTGCAATTTCCCGATGCCCACTTGAATCCATTGAAATTTCTGAGTGTTCTTCCTTTTCTTGTCCGCCTTTCTAT
TATGCTTATGGCGATGGGAGTTTAATTGCTCAGCTTTATAGAGATAGCCTCAGTTTGCCGGCGCCCGCACCGGCACCGGCGATTAGTGTTCGGAATTTTACTTTTGGATG
TGCCCACACGGCGCTCGGTGAGCCGGTCGGGGTCGCCGGGTTCGGTCGGGGAACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAATTGGGGAACCGGTTTT
CTTATTGTTTGGTTTCTCATTCGTTTGCGGCGGACCGGGTTCGCCGCCCGAGTCCGCTGATTCTGGGGCGGTACTACGGCGGCGAGACGGAGTTCGTTTACACTTCCTTG
CTTGAGAATCCGAAGCATCCTTACTATTACTCGGTTGGGTTGGCGGGAATTTCGGTCGGGTCGGTGAGAATTCCGGCGCCGGAGTTTTTGAAACGGGTGGATGAGGGGGG
CAGCGGCGGCGTTGTAGTGGATTCCGGTACTACTTTCACTATGCTGCCGGCCGGTTTGTATGACTCGGTGGTGGGTCAGTTTGAGAATCGGACCGGGCAAGTTGCGACCC
GGGCGAGCCGTATTGAAGAAAATACCGGGTTGAGCCCTTGTTATTACTATGATAACTCAATTGAAGTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAAATCCAGTGTG
ATGCTTCCTAGGAAGAATTATTTTTACGAGTTTTTGGATGGTGGAGATGGGACGGGGAGGAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGATGA
GCTGGCAGGTGGGCCCGGAGCCACGCTGGGGAACTATCAACAACAGGGTTTTGAAGTGGTCTATGATTTAGAGAAGAACCGGGTCGGTTTTGCCCGGCGGCAGTGTTCGA
CGCTTTGGGACAGCTTGAACCGGAGTTAGTATGAACCGTGGGCCCGGTCGAGGACGTGAAGGTTGACAATTGAATGGTTTTGACTTGGGACTGTGCCAATGGTCAACGCT
TTTGTGGTAAATAAGTTATTTTGACATTTGATGGGGTCTTTTTTGTAAATTCTTGTGAGCACTTCACTTCTTGCTTCACTGCTATTTCTAATAGTTAAAATTTGTATATG
AAAAGTGTTCAAAATATATAATAAACAAAAAAGGGAAAAGATTATTTAATGGCTTATTGATTTATTAGTTCTGTTTTTAATGCATGAAGCAAGAAATGAAGTGCTTGCTA
CATTGGATGAATTTTTTTAAACTTTTTTTTTTTTTTTTTTTAAGGTT
Protein sequenceShow/hide protein sequence
MASPVFLLFLLCILLSSSVFSSEILLLPLSHSLSSSSDFNNTHNLLKSTAARSVARFHHRRRTHRRSHLSLPLSPGGDYTLSFNLGSEAHKISLYMDTGSDLVWFPCSPF
ECILCEGKPKVQSPLPKISKQKSVSCSAAACSAAHGGSLSSSHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLIAQLYRDSLSLPAPAPAPAISVRNFTFGCAHTA
LGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGGETEFVYTSLLENPKHPYYYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGV
VVDSGTTFTMLPAGLYDSVVGQFENRTGQVATRASRIEENTGLSPCYYYDNSIEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGTGRKRKVGCLMLMNGGDEDELAGG
PGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS