; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg04941 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg04941
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionaspartyl protease family protein 2
Genome locationCarg_Chr08:568777..570183
RNA-Seq ExpressionCarg04941
SyntenyCarg04941
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR033873 - CND41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592908.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. sororia]1.9e-24595.98Show/hide
Query:  LRRLPIPSPL----------PFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVISGLAQ
        LRR+ +P+P             PQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTA RSRTPLRRAGFSSSVISGLAQ
Subjt:  LRRLPIPSPL----------PFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVISGLAQ

Query:  GSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATET
        GSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATET
Subjt:  GSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATET

Query:  LTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGI
        LTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGI
Subjt:  LTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGI

Query:  SVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIP
        SVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIP
Subjt:  SVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIP

Query:  VDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

KAG7025313.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-269100Show/hide
Query:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
        MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
Subjt:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR

Query:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
        SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
Subjt:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC

Query:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
Subjt:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
        TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT

Query:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

XP_022959948.1 aspartyl protease family protein 2 [Cucurbita moschata]8.8e-26799.15Show/hide
Query:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
        MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS NKTPSDLFNLRLHRDALRVDSLTSLTA R
Subjt:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR

Query:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
        SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTP RYI+MVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
Subjt:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC

Query:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
Subjt:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
        TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT

Query:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

XP_023005015.1 aspartyl protease family protein 2 [Cucurbita maxima]5.0e-26298.08Show/hide
Query:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
        MESPPR LL FFFFF AAVASAASEFQTLTLRRLPIPSPL FPQ QFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTA R
Subjt:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR

Query:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
        SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRY++MVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
Subjt:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC

Query:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
Subjt:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
        TPLILNPKLETFYYVELIG SVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT

Query:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

XP_023514169.1 aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo]2.8e-26598.93Show/hide
Query:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
        MESPPRYLL FFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS NKTPSDLFNLRLHRDALRVDSLTSLTA R
Subjt:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR

Query:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
        SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYI+MVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
Subjt:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC

Query:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
Subjt:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
        TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFR GASHLKRGPEFSLFDTCYDLSGQSAVKVPT
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT

Query:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

TrEMBL top hitse value%identityAlignment
A0A0A0K4G2 Peptidase A1 domain-containing protein2.8e-22685.81Show/hide
Query:  RYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLEST--AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRT
        +YLL FFF     +++AASEFQTLTLR LP PSPLP     F   ++L+S+  A LT++LHHLDSLS NKTP+DLFNLRLHRD LRV +L S        
Subjt:  RYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLEST--AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRT

Query:  PLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQ
          R AGFSSSV+SGL+QGSGEYFTRLGVGTPPRY++MVLDTGSDVVWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRLDSSGC+TRRHTCLYQ
Subjt:  PLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQ

Query:  VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL
        VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL
Subjt:  VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL

Query:  ILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVL
        I NPKL+TFYYV LIGISVGGVRVRG+S SLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFR GA HLKRGPEFSLFDTCYDLSGQS+VKVPTVVL
Subjt:  ILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVL

Query:  HFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        HFRGADMALPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  HFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A1S3CHC4 aspartyl protease family protein 23.7e-22686.45Show/hide
Query:  RYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLEST--AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRT
        +YLL F+F     ++SAASEFQTLTLR LP PSPL      F   E+L+S+  AALT++LHHLDSLS NKTP+DLFNLRLHRDALRV +LTS  A     
Subjt:  RYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLEST--AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRT

Query:  PLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQ
             GFSSSVISGLAQGSGEYFTRLGVGTPPRY++MVLDTGSDVVWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRLDSSGC+TRRHTCLYQ
Subjt:  PLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQ

Query:  VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL
        VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGR SFPSQTG+RFN KFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL
Subjt:  VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL

Query:  ILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVL
        I NPKL+TFYYVELIGISVGGVRVRG+  SLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA HLKRGPEFSLFDTCYDLSGQS+VKVPTVVL
Subjt:  ILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVL

Query:  HFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        HFRGADM LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  HFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A5A7U8Z2 Aspartyl protease family protein 24.3e-22786.67Show/hide
Query:  RYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLEST--AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRT
        +YLL F+F     ++SAASEFQTLTLR LP PSPLP     F   E+L+S+  AALT++LHHLDSLS NKTP+DLFNLRLHRDALRV +LTS  A     
Subjt:  RYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLEST--AALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRT

Query:  PLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQ
             GFSSSVISGLAQGSGEYFTRLGVGTPPRY++MVLDTGSDVVWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRLDSSGC+TRRHTCLYQ
Subjt:  PLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQ

Query:  VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL
        VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGR SFPSQTG+RFN KFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL
Subjt:  VSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPL

Query:  ILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVL
        I NPKL+TFYYVELIGISVGGVRVRG+  SLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA HLKRGPEFSLFDTCYDLSGQS+VKVPTVVL
Subjt:  ILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVL

Query:  HFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        HFRGADM LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  HFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A6J1H7E0 aspartyl protease family protein 24.3e-26799.15Show/hide
Query:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
        MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS NKTPSDLFNLRLHRDALRVDSLTSLTA R
Subjt:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR

Query:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
        SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTP RYI+MVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
Subjt:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC

Query:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
Subjt:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
        TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT

Query:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A6J1KW81 aspartyl protease family protein 22.4e-26298.08Show/hide
Query:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR
        MESPPR LL FFFFF AAVASAASEFQTLTLRRLPIPSPL FPQ QFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTA R
Subjt:  MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGR

Query:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
        SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRY++MVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC
Subjt:  SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTC

Query:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
Subjt:  LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
        TPLILNPKLETFYYVELIG SVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPT

Query:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  VVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

SwissProt top hitse value%identityAlignment
Q766C3 Aspartic proteinase nepenthesin-11.4e-7341.02Show/hide
Query:  VELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCY
        + L H+DS   N T   L    + R + R+  L ++  G S       G  +SV +    G GEY   L +GTP +    ++DTGSD++W QC PC +C+
Subjt:  VELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCY

Query:  SQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVG-AAGLLGLGRGRFSF
        +QS PIFNP  S SF+ +PCSS LC+ L S  C+   + C Y   YGDGS T G   TETLTF    I  +  GCG +N+G   G  AGL+G+GRG  S 
Subjt:  SQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVG-AAGLLGLGRGRFSF

Query:  PSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF--TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDS-AGNGGVIIDSGTSVTRL
        PSQ  +    KFSYC+     SS PS+++ G  A S  A    T LI + ++ TFYY+ L G+SVG  R+  I  S F L+S  G GG+IIDSGT++T  
Subjt:  PSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF--TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDS-AGNGGVIIDSGTSVTRL

Query:  TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDL-SGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDL
           AY ++R  F +  +        S FD C+   S  S +++PT V+HF G D+ LP+ NY I    +G  C A   +  G+SI GNIQQQ   VVYD 
Subjt:  TRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDL-SGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDL

Query:  AGSRIGFAPRGC
          S + FA   C
Subjt:  AGSRIGFAPRGC

Q8S9J6 Aspartyl protease family protein At5g107701.8e-7340.76Show/hide
Query:  LHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVIS--GLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPC-RKCYSQSDPIFNPFKSKSFAGIP
        L  D  RV+S+ S  + +  T       S+ + +  G   GSG Y   +G+GTP   + ++ DTGSD+ W QC PC R CY Q +PIFNP KS S+  + 
Subjt:  LHRDALRVDSLTSLTAGRSRTPLRRAGFSSSVIS--GLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPC-RKCYSQSDPIFNPFKSKSFAGIP

Query:  CSSPLCRRLDSSGCNT---RRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAK-VALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCL
        CSS  C  L S+  N        C+Y + YGD SF+ G  A E  T   + +   V  GCG +N+GLF G AGLLGLGR + SFPSQT   +N  FSYCL
Subjt:  CSSPLCRRLDSSGCNT---RRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAK-VALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCL

Query:  VDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHL
           S++S    + FG A ISR  +FTP+       +FY + ++ I+VGG ++  I +++F        G +IDSGT +TRL   AY ALR +F+A  S  
Subjt:  VDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHL

Query:  KRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
              S+ DTC+DLSG   V +P V   F G  +    +  +  V      C AFAG    S  +I GN+QQQ   VVYD AG R+GFAP GC+
Subjt:  KRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 25.1e-10844.8Show/hide
Query:  LLFFFFFFFAAVASAAS----EFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS--TNKTPSDLFNLRLHRDALRVDSLTSLTAGR-
        L FFF      ++S++S    +FQ + + + P+      P    +T  + ES++  T+ L H D     T +      + R+ RD  RV ++    +G+ 
Subjt:  LLFFFFFFFAAVASAAS----EFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS--TNKTPSDLFNLRLHRDALRVDSLTSLTAGR-

Query:  ---SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRR
           S +      F S ++SG+ QGSGEYF R+GVG+PPR  +MV+D+GSD+VW+QC PC+ CY QSDP+F+P KS S+ G+ C S +C R+++SGC++  
Subjt:  ---SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRR

Query:  HTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRL
          C Y+V YGDGS+T G  A ETLTF    +  VA+GCGH N G+F+GAAGLLG+G G  SF  Q   +    F YCLV R   S   S+VFG  A+   
Subjt:  HTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRL

Query:  ARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVK
        A + PL+ NP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+VTRL   AY A RD F++  ++L R    S+FDTCYDLSG  +V+
Subjt:  ARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVK

Query:  VPTVVLHF-RGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        VPTV  +F  G  + LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +GF P  C
Subjt:  VPTVVLHF-RGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Q9LNJ3 Aspartyl protease family protein 25.7e-19271.34Show/hide
Query:  LLFFFFFFFAAVASAAS--EFQTL--TLRRLPIPSPLPFPQPQFDTQETL----------ESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSL
        LLF   FFF ++ S +S   FQTL      LP  SP+ F QP  D++  L          ES++++T+ L H+D+LS+NKTP +LF+ RL RD+ RV S+
Subjt:  LLFFFFFFFAAVASAAS--EFQTL--TLRRLPIPSPLPFPQPQFDTQETL----------ESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSL

Query:  TSLTA---GRSRTPL-RRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD
         +L A   GR+ T   R  GFSSSV+SGL+QGSGEYFTRLGVGTP RY++MVLDTGSD+VWLQC+PCR+CYSQSDPIF+P KSK++A IPCSSP CRRLD
Subjt:  TSLTA---GRSRTPL-RRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD

Query:  SSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVF
        S+GCNTRR TCLYQVSYGDGSFT GDF+TETLTFR N++  VALGCGHDNEGLFVGAAGLLGLG+G+ SFP QTG RFN KFSYCLVDRSASSKPSS+VF
Subjt:  SSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVF

Query:  GDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYD
        G+AA+SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV G++ASLFKLD  GNGGVIIDSGTSVTRL RPAY A+RDAFR GA  LKR P+FSLFDTC+D
Subjt:  GDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYD

Query:  LSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        LS  + VKVPTVVLHFRGAD++LPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP GC
Subjt:  LSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 13.2e-11049.04Show/hide
Query:  VASAASEFQTLTLRRLPIPSPLPFPQPQ-FDTQETLESTAALTVELHHLDSL--STNKTPSDLFNLRLHRDALRVDSLTS----LTAGRSRTPLR-----
        V S+  + QT+ L   P  S L   +P+         S++ L++ELH  D+   S +K    L   RL RD+ RV  + +       G  R+ L+     
Subjt:  VASAASEFQTLTLRRLPIPSPLPFPQPQ-FDTQETLESTAALTVELHHLDSL--STNKTPSDLFNLRLHRDALRVDSLTS----LTAGRSRTPLR-----

Query:  -----RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCL
                 ++ V+SG +QGSGEYF+R+GVGTP + +++VLDTGSDV W+QC PC  CY QSDP+FNP  S ++  + CS+P C  L++S C  R + CL
Subjt:  -----RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCL

Query:  YQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        YQVSYGDGSFT G+ AT+T+TF    KI  VALGCGHDNEGLF GAAGLLGLG G  S  +Q        FSYCLVDR  S K SS+ F    +      
Subjt:  YQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRG-PEFSLFDTCYDLSGQSAVKVP
         PL+ N K++TFYYV L G SVGG +V  +  ++F +D++G+GGVI+D GT+VTRL   AY +LRDAF     +LK+G    SLFDTCYD S  S VKVP
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRG-PEFSLFDTCYDLSGQSAVKVP

Query:  TVVLHFRGA-DMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        TV  HF G   + LPA NYLIPVDDSG+FCFAFA T S LSIIGN+QQQG R+ YDL+ + IG +   C
Subjt:  TVVLHFRGA-DMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Arabidopsis top hitse value%identityAlignment
AT1G01300.1 Eukaryotic aspartyl protease family protein4.0e-19371.34Show/hide
Query:  LLFFFFFFFAAVASAAS--EFQTL--TLRRLPIPSPLPFPQPQFDTQETL----------ESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSL
        LLF   FFF ++ S +S   FQTL      LP  SP+ F QP  D++  L          ES++++T+ L H+D+LS+NKTP +LF+ RL RD+ RV S+
Subjt:  LLFFFFFFFAAVASAAS--EFQTL--TLRRLPIPSPLPFPQPQFDTQETL----------ESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSL

Query:  TSLTA---GRSRTPL-RRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD
         +L A   GR+ T   R  GFSSSV+SGL+QGSGEYFTRLGVGTP RY++MVLDTGSD+VWLQC+PCR+CYSQSDPIF+P KSK++A IPCSSP CRRLD
Subjt:  TSLTA---GRSRTPL-RRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD

Query:  SSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVF
        S+GCNTRR TCLYQVSYGDGSFT GDF+TETLTFR N++  VALGCGHDNEGLFVGAAGLLGLG+G+ SFP QTG RFN KFSYCLVDRSASSKPSS+VF
Subjt:  SSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVF

Query:  GDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYD
        G+AA+SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV G++ASLFKLD  GNGGVIIDSGTSVTRL RPAY A+RDAFR GA  LKR P+FSLFDTC+D
Subjt:  GDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYD

Query:  LSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        LS  + VKVPTVVLHFRGAD++LPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP GC
Subjt:  LSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT1G25510.1 Eukaryotic aspartyl protease family protein4.7e-10946.34Show/hide
Query:  PRYLLFFFFFFFAAVASAAS----EFQTLT---------LRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSL--STNKTPSDLFNLRLHRDALR
        P Y  FFF FF  + +S  S    E  T T         + R    S     Q +   ++T  ++++ +++LH   S+  + +     L   RL+RD  R
Subjt:  PRYLLFFFFFFFAAVASAAS----EFQTLT---------LRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSL--STNKTPSDLFNLRLHRDALR

Query:  VDSL-------------TSLTAGRSRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA
        V SL               L    +          + +ISG  QGSGEYFTR+G+G P R ++MVLDTGSDV WLQC+PC  CY Q++PIF P  S S+ 
Subjt:  VDSL-------------TSLTAGRSRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFA

Query:  GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLV
         + C +P C  L+ S C  R  TCLY+VSYGDGS+T GDFATETLT     +  VA+GCGH NEGLFVGAAGLLGLG G  + PSQ        FSYCLV
Subjt:  GIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLV

Query:  DRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLK
        DR + S  S++ FG  ++S  A   PL+ N +L+TFYY+ L GISVGG  ++ I  S F++D +G+GG+IIDSGT+VTRL    Y +LRD+F  G   L+
Subjt:  DRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLK

Query:  RGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM-ALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        +    ++FDTCY+LS ++ V+VPTV  HF G  M ALPA NY+IPVD  G+FC AFA T S L+IIGN+QQQG RV +DLA S IGF+   C
Subjt:  RGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM-ALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT3G18490.1 Eukaryotic aspartyl protease family protein2.3e-11149.04Show/hide
Query:  VASAASEFQTLTLRRLPIPSPLPFPQPQ-FDTQETLESTAALTVELHHLDSL--STNKTPSDLFNLRLHRDALRVDSLTS----LTAGRSRTPLR-----
        V S+  + QT+ L   P  S L   +P+         S++ L++ELH  D+   S +K    L   RL RD+ RV  + +       G  R+ L+     
Subjt:  VASAASEFQTLTLRRLPIPSPLPFPQPQ-FDTQETLESTAALTVELHHLDSL--STNKTPSDLFNLRLHRDALRVDSLTS----LTAGRSRTPLR-----

Query:  -----RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCL
                 ++ V+SG +QGSGEYF+R+GVGTP + +++VLDTGSDV W+QC PC  CY QSDP+FNP  S ++  + CS+P C  L++S C  R + CL
Subjt:  -----RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCL

Query:  YQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF
        YQVSYGDGSFT G+ AT+T+TF    KI  VALGCGHDNEGLF GAAGLLGLG G  S  +Q        FSYCLVDR  S K SS+ F    +      
Subjt:  YQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF

Query:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRG-PEFSLFDTCYDLSGQSAVKVP
         PL+ N K++TFYYV L G SVGG +V  +  ++F +D++G+GGVI+D GT+VTRL   AY +LRDAF     +LK+G    SLFDTCYD S  S VKVP
Subjt:  TPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRG-PEFSLFDTCYDLSGQSAVKVP

Query:  TVVLHFRGA-DMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        TV  HF G   + LPA NYLIPVDDSG+FCFAFA T S LSIIGN+QQQG R+ YDL+ + IG +   C
Subjt:  TVVLHFRGA-DMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT3G20015.1 Eukaryotic aspartyl protease family protein3.6e-10944.8Show/hide
Query:  LLFFFFFFFAAVASAAS----EFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS--TNKTPSDLFNLRLHRDALRVDSLTSLTAGR-
        L FFF      ++S++S    +FQ + + + P+      P    +T  + ES++  T+ L H D     T +      + R+ RD  RV ++    +G+ 
Subjt:  LLFFFFFFFAAVASAAS----EFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLS--TNKTPSDLFNLRLHRDALRVDSLTSLTAGR-

Query:  ---SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRR
           S +      F S ++SG+ QGSGEYF R+GVG+PPR  +MV+D+GSD+VW+QC PC+ CY QSDP+F+P KS S+ G+ C S +C R+++SGC++  
Subjt:  ---SRTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRR

Query:  HTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRL
          C Y+V YGDGS+T G  A ETLTF    +  VA+GCGH N G+F+GAAGLLG+G G  SF  Q   +    F YCLV R   S   S+VFG  A+   
Subjt:  HTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRL

Query:  ARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVK
        A + PL+ NP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+VTRL   AY A RD F++  ++L R    S+FDTCYDLSG  +V+
Subjt:  ARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVK

Query:  VPTVVLHF-RGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        VPTV  +F  G  + LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +GF P  C
Subjt:  VPTVVLHF-RGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT3G61820.1 Eukaryotic aspartyl protease family protein1.4e-17265.89Show/hide
Query:  LLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETL-ESTAALTVELHHLDSLS--TNKTPSDLFNLRLHRDALRVDSLTSLTA---GRS
        L F  F      +SA+S++QTL +  LP  + L +P+ +  T E+L EST +L+V L H+D+LS  ++ +P+DLFNLRL RD+LRV S+TSL A   GR+
Subjt:  LLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETL-ESTAALTVELHHLDSLS--TNKTPSDLFNLRLHRDALRVDSLTSLTA---GRS

Query:  ---RTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL-DSSGCNTRR
           RTP    GFS +VISGL+QGSGEYF RLGVGTP   ++MVLDTGSDVVWLQCSPC+ CY+Q+D IF+P KSK+FA +PC S LCRRL DSS C TRR
Subjt:  ---RTPLRRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL-DSSGCNTRR

Query:  -HTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDR----SASSKPSSMVFGDA
          TCLYQVSYGDGSFT GDF+TETLTF G ++  V LGCGHDNEGLFVGAAGLLGLGRG  SFPSQT  R+N KFSYCLVDR    S+S  PS++VFG+A
Subjt:  -HTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDR----SASSKPSSMVFGDA

Query:  AISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSG
        A+ + + FTPL+ NPKL+TFYY++L+GISVGG RV G+S S FKLD+ GNGGVIIDSGTSVTRLT+PAY ALRDAFR GA+ LKR P +SLFDTC+DLSG
Subjt:  AISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGISASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSG

Query:  QSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
         + VKVPTVV HF G +++LPA+NYLIPV+  G FCFAFAGTM  LSIIGNIQQQGFRV YDL GSR+GF  R C
Subjt:  QSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTCCTCCAAGATATCTCCTCTTCTTCTTCTTCTTCTTCTTCGCCGCCGTGGCCTCCGCCGCGTCGGAGTTCCAAACCCTTACTCTCCGCCGTCTTCCAATTCC
CTCTCCCCTTCCCTTTCCACAACCCCAATTCGACACCCAAGAAACGCTCGAATCCACCGCCGCCCTCACGGTTGAGCTCCACCATTTGGATTCACTCTCTACCAACAAAA
CCCCCTCCGATCTCTTCAACCTCCGGCTCCACCGCGACGCCCTCCGTGTTGACTCGTTGACCTCCCTGACTGCTGGCCGGAGCCGGACTCCTCTCCGGCGAGCCGGGTTC
AGTAGCTCTGTTATCTCCGGCCTTGCTCAAGGTAGCGGTGAGTACTTCACCCGCCTTGGCGTCGGAACGCCTCCTAGATATATCTTTATGGTCCTCGACACCGGAAGCGA
CGTCGTTTGGCTTCAATGCTCCCCTTGCCGGAAATGCTACTCCCAATCTGATCCCATTTTTAACCCCTTCAAATCCAAATCCTTCGCCGGAATCCCCTGTTCTTCACCTC
TCTGCCGCCGCCTTGACTCCTCCGGCTGCAACACTCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGATGGGTCCTTTACCACCGGCGATTTCGCCACCGAAACG
CTCACGTTTCGTGGGAATAAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATGAAGGCCTCTTCGTCGGTGCCGCTGGATTATTGGGTCTTGGCCGTGGTCGGTT
TTCTTTCCCTTCTCAAACCGGACTCCGGTTCAACCATAAATTTTCTTATTGTTTGGTGGACCGGTCCGCTTCCTCCAAACCCTCCTCTATGGTTTTCGGTGATGCGGCAA
TTTCCCGGCTCGCCCGGTTCACTCCTCTGATTTTGAACCCGAAATTGGAAACGTTTTATTATGTCGAACTTATCGGAATCAGCGTCGGCGGAGTCCGAGTCCGCGGCATC
TCCGCCTCCCTCTTCAAGCTCGATTCCGCCGGCAACGGCGGCGTCATAATCGATTCGGGTACATCGGTAACCCGGCTGACCCGACCCGCGTACACTGCTCTTCGCGACGC
GTTCCGGGCTGGAGCGTCCCATTTAAAAAGAGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACGACTTGTCGGGACAGTCGGCGGTGAAGGTCCCGACAGTGGTGCTGC
ATTTCCGGGGAGCTGACATGGCATTGCCGGCGACAAATTATTTGATACCAGTAGACGACAGTGGAAGCTTTTGCTTTGCGTTTGCGGGTACCATGTCCGGATTGTCGATA
ATTGGGAATATTCAACAGCAAGGGTTTCGGGTCGTGTACGATTTGGCGGGTTCTCGGATCGGGTTCGCTCCACGTGGGTGCACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTCCTCCAAGATATCTCCTCTTCTTCTTCTTCTTCTTCTTCGCCGCCGTGGCCTCCGCCGCGTCGGAGTTCCAAACCCTTACTCTCCGCCGTCTTCCAATTCC
CTCTCCCCTTCCCTTTCCACAACCCCAATTCGACACCCAAGAAACGCTCGAATCCACCGCCGCCCTCACGGTTGAGCTCCACCATTTGGATTCACTCTCTACCAACAAAA
CCCCCTCCGATCTCTTCAACCTCCGGCTCCACCGCGACGCCCTCCGTGTTGACTCGTTGACCTCCCTGACTGCTGGCCGGAGCCGGACTCCTCTCCGGCGAGCCGGGTTC
AGTAGCTCTGTTATCTCCGGCCTTGCTCAAGGTAGCGGTGAGTACTTCACCCGCCTTGGCGTCGGAACGCCTCCTAGATATATCTTTATGGTCCTCGACACCGGAAGCGA
CGTCGTTTGGCTTCAATGCTCCCCTTGCCGGAAATGCTACTCCCAATCTGATCCCATTTTTAACCCCTTCAAATCCAAATCCTTCGCCGGAATCCCCTGTTCTTCACCTC
TCTGCCGCCGCCTTGACTCCTCCGGCTGCAACACTCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGATGGGTCCTTTACCACCGGCGATTTCGCCACCGAAACG
CTCACGTTTCGTGGGAATAAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATGAAGGCCTCTTCGTCGGTGCCGCTGGATTATTGGGTCTTGGCCGTGGTCGGTT
TTCTTTCCCTTCTCAAACCGGACTCCGGTTCAACCATAAATTTTCTTATTGTTTGGTGGACCGGTCCGCTTCCTCCAAACCCTCCTCTATGGTTTTCGGTGATGCGGCAA
TTTCCCGGCTCGCCCGGTTCACTCCTCTGATTTTGAACCCGAAATTGGAAACGTTTTATTATGTCGAACTTATCGGAATCAGCGTCGGCGGAGTCCGAGTCCGCGGCATC
TCCGCCTCCCTCTTCAAGCTCGATTCCGCCGGCAACGGCGGCGTCATAATCGATTCGGGTACATCGGTAACCCGGCTGACCCGACCCGCGTACACTGCTCTTCGCGACGC
GTTCCGGGCTGGAGCGTCCCATTTAAAAAGAGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACGACTTGTCGGGACAGTCGGCGGTGAAGGTCCCGACAGTGGTGCTGC
ATTTCCGGGGAGCTGACATGGCATTGCCGGCGACAAATTATTTGATACCAGTAGACGACAGTGGAAGCTTTTGCTTTGCGTTTGCGGGTACCATGTCCGGATTGTCGATA
ATTGGGAATATTCAACAGCAAGGGTTTCGGGTCGTGTACGATTTGGCGGGTTCTCGGATCGGGTTCGCTCCACGTGGGTGCACGTGA
Protein sequenceShow/hide protein sequence
MESPPRYLLFFFFFFFAAVASAASEFQTLTLRRLPIPSPLPFPQPQFDTQETLESTAALTVELHHLDSLSTNKTPSDLFNLRLHRDALRVDSLTSLTAGRSRTPLRRAGF
SSSVISGLAQGSGEYFTRLGVGTPPRYIFMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCNTRRHTCLYQVSYGDGSFTTGDFATET
LTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRFSFPSQTGLRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLILNPKLETFYYVELIGISVGGVRVRGI
SASLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGASHLKRGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMALPATNYLIPVDDSGSFCFAFAGTMSGLSI
IGNIQQQGFRVVYDLAGSRIGFAPRGCT