CuGenDBv2

Tan0003138 (gene) of Snake gourd v1 genome

Gene ID: Tan0003138
Organism: Trichosanthes anguina (Snake gourd v1)
Description: aspartyl protease family protein 2
Genome location: LG06:35073820..35075600
RNA-Seq Expression: Tan0003138
Synteny: Tan0003138
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001461 - Aspartic peptidase A1 family
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR033121 - Peptidase family A1 domain
  IPR033873 - CND41-like
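
The genome location field packs the linkage group and the start and end coordinates into one string. As a minimal sketch of how that field could be split and the gene span computed, the snippet below uses a hypothetical helper, `parse_location`, that is not part of CuGenDB. It assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end).

    Hypothetical helper for illustration; not a CuGenDB API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG06:35073820..35075600")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1781 bp on LG06. With 0-based, half-open coordinates the span would be `end - start` instead.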


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAG6592908.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.3e-230 | % identity: 90.09
Query:  LPNPSRPP--------LLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGE
        +PNP  PP        L +PQF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL  ARSR PL RAGFSSSVISGLAQGSGE
Subjt:  LPNPSRPP--------LLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVISGLAQGSGE

Query:  YFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFR
        YFTRLGVGTPPRY++MVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TRRHTCLYQVSYGDGSFTTGDFATETLTFR
Subjt:  YFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFR

Query:  GNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGG
        GN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLI NPKLETFYYVELIGISVGG
Subjt:  GNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGG

Query:  VRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDS
        VRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM+LPATNYLIPVDDS
Subjt:  VRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDS

Query:  GSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        GSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  GSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

KAG7025313.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 7.1e-240 | % identity: 89.19
Query:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL
        MESPP  +LFFFFFFF  +    +AASEFQ LTLR LP PS  P  +PQF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL
Subjt:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR
           RSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTPPRY++MVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TR
Subjt:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR

Query:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR
        RHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Subjt:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR

Query:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV
        LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAV
Subjt:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV

Query:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        KVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

XP_022959948.1 aspartyl protease family protein 2 [Cucurbita moschata] | e-value: 1.2e-239 | % identity: 89.41
Query:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL
        MESPP  +LFFFFFFF  +    +AASEFQ LTLR LP PS  P  +PQF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL
Subjt:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR
          ARSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTP RY+YMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TR
Subjt:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR

Query:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR
        RHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Subjt:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR

Query:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV
        LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAV
Subjt:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV

Query:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        KVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

XP_023005015.1 aspartyl protease family protein 2 [Cucurbita maxima] | e-value: 3.9e-238 | % identity: 88.98
Query:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL
        MESPP N+LFFFFF         +AASEFQ LTLR LP PS     + QF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL
Subjt:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR
          ARSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TR
Subjt:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR

Query:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR
        RHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Subjt:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR

Query:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV
        LARFTPLI NPKLETFYYVELIG SVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAV
Subjt:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV

Query:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        KVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

XP_023514169.1 aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo] | e-value: 2.1e-239 | % identity: 89.19
Query:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL
        MESPP  +LFFFFFF        +AASEFQ LTLR LP PS  P  +PQF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL
Subjt:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR
          ARSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTPPRY+YMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TR
Subjt:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR

Query:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR
        RHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Subjt:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR

Query:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV
        LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFR GA+HLK+GPEFSLFDTCYDLSGQSAV
Subjt:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV

Query:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        KVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0K4G2 Peptidase A1 domain-containing protein | e-value: 1.9e-222 | % identity: 85.65
Query:  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRA
        +    FF   AASEFQ LTLR LP PS  PL       + ++L+S+  A +TL+LHHLDSLSLNKTPTDLFNLRLHRD LRV +L S A          A
Subjt:  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRA

Query:  GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD
        GFSSSV+SGL+QGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD
Subjt:  GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD

Query:  GSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
        GSFTTGDFATETLTFRGN+IAKVALGCGH N+GLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
Subjt:  GSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK

Query:  LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGA
        L+TFYYV LIGISVGGVRVRGVS +LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFR GA HLK+GPEFSLFDTCYDLSGQS+VKVPTVVLHFRGA
Subjt:  LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGA

Query:  DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        DM+LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A1S3CHC4 aspartyl protease family protein 2 | e-value: 1.9e-222 | % identity: 86.52
Query:  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRA
        +    +F  +AASEFQ LTLR LP PS P  L P    + E+L+S+  A +TL+LHHLDSLSLNKTPTDLFNLRLHRDALRV +LTS A           
Subjt:  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRA

Query:  GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD
        GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD
Subjt:  GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD

Query:  GSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
        GSFTTGDFATETLTFRGN+IAKVALGCGH N+GLFVGAAGLLGLGRGRLSFPSQTGIRFN KFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
Subjt:  GSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK

Query:  LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGA
        L+TFYYVELIGISVGGVRVRGV  +LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA HLK+GPEFSLFDTCYDLSGQS+VKVPTVVLHFRGA
Subjt:  LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGA

Query:  DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        DM LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A5A7U8Z2 Aspartyl protease family protein 2 | e-value: 2.9e-223 | % identity: 86.3
Query:  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRA
        +    +F  +AASEFQ LTLR LP PS  PL       + E+L+S+  A +TL+LHHLDSLSLNKTPTDLFNLRLHRDALRV +LTS A           
Subjt:  FFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKST--AGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRA

Query:  GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD
        GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNP+KSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD
Subjt:  GFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGD

Query:  GSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
        GSFTTGDFATETLTFRGN+IAKVALGCGH N+GLFVGAAGLLGLGRGRLSFPSQTGIRFN KFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK
Subjt:  GSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPK

Query:  LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGA
        L+TFYYVELIGISVGGVRVRGV  +LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA HLK+GPEFSLFDTCYDLSGQS+VKVPTVVLHFRGA
Subjt:  LETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGA

Query:  DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        DM LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A6J1H7E0 aspartyl protease family protein 2 | e-value: 5.9e-240 | % identity: 89.41
Query:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL
        MESPP  +LFFFFFFF  +    +AASEFQ LTLR LP PS  P  +PQF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL
Subjt:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR
          ARSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTP RY+YMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TR
Subjt:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR

Query:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR
        RHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Subjt:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR

Query:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV
        LARFTPLI NPKLETFYYVELIGISVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAV
Subjt:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV

Query:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        KVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

A0A6J1KW81 aspartyl protease family protein 2 | e-value: 1.9e-238 | % identity: 88.98
Query:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL
        MESPP N+LFFFFF         +AASEFQ LTLR LP PS     + QF + +E L+STA +T+ELHHLDSLS NKTP+DLFNLRLHRDALRV SLTSL
Subjt:  MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR
          ARSR PL RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSD+VWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGC+TR
Subjt:  AGARSRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTR

Query:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR
        RHTCLYQVSYGDGSFTTGDFATETLTFRGN+IAKVALGCGHDN+GLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASSKPSSMVFGDAAISR
Subjt:  RHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISR

Query:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV
        LARFTPLI NPKLETFYYVELIG SVGGVRVRG+SA+LFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLK+GPEFSLFDTCYDLSGQSAV
Subjt:  LARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAV

Query:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
        KVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT
Subjt:  KVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT

SwissProt top hits (columns: hit, e-value, % identity, alignment)
Q766C3 Aspartic proteinase nepenthesin-1 | e-value: 4.4e-75 | % identity: 39.12
Query:  FSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS
        +SF  A     + + P  + SR  L     +   EA     G  + L H+DS   N T   L    + R + R+Q L ++    S       G  +SV +
Subjt:  FSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLPRAGFSSSVIS

Query:  GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDF
            G GEY   L +GTP +    ++DTGSD++W QC PC +C++QS PIFNP  S SF+ +PCSS LC+ L S  CS   + C Y   YGDGS T G  
Subjt:  GLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDF

Query:  ATETLTFRGNQIAKVALGCGHDNQGLFVG-AAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF--TPLIRNPKLETFY
         TETLTF    I  +  GCG +NQG   G  AGL+G+GRG LS PSQ  +    KFSYC+     SS PS+++ G  A S  A    T LI++ ++ TFY
Subjt:  ATETLTFRGNQIAKVALGCGHDNQGLFVG-AAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARF--TPLIRNPKLETFY

Query:  YVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDL-SGQSAVKVPTVVLHFRGADMSL
        Y+ L G+SVG  R+    +A       G GG+IIDSGT++T     AY ++R  F +           S FD C+   S  S +++PT V+HF G D+ L
Subjt:  YVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDL-SGQSAVKVPTVVLHFRGADMSL

Query:  PATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        P+ NY I    +G  C A   +  G+SI GNIQQQ   VVYD   S + FA   C
Subjt:  PATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Q9LEW3 Aspartyl protease AED1 | e-value: 4.4e-75 | % identity: 41.39
Query:  LHRDALRVQSLTSLAGARSRNPLPRAGFSS-SVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPC-RKCYSQSDPIFNPFKSKSFAGIPC
        + RD  RV+S+ S     S N +  A  +     SG+  GSG Y   +G+GTP   L +V DTGSD+ W QC PC   CYSQ +P FNP  S ++  + C
Subjt:  LHRDALRVQSLTSLAGARSRNPLPRAGFSS-SVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPC-RKCYSQSDPIFNPFKSKSFAGIPC

Query:  SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAK-VALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRS
        SSP+C   D+  CS     C+Y + YGD SFT G  A E  T   + + + V  GCG +NQGLF G AGLLGLG G+LS P+QT   +N+ FSYCL   +
Subjt:  SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAK-VALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRS

Query:  ASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGP
        ++S    + FG A IS   +FTP+   P     Y +++IGISVG   +  ++   F  +     G IIDSGT  TRL    Y  LR  F+   +  K   
Subjt:  ASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGP

Query:  EFSLFDTCYDLSGQSAVKVPTVVLHFRGAD-MSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
         + LFDTCYD +G   V  PT+   F G+  + L  +   +P+  S   C AFAG     +I GN+QQ    VVYD+AG R+GFAP GC
Subjt:  EFSLFDTCYDLSGQSAVKVPTVVLHFRGAD-MSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2 | e-value: 5.7e-107 | % identity: 44.96
Query:  VLFFFFFFFFFLFFSFAAAS-----EFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSL
        +L   FFFF  L    +++S     +FQI+ +  L  P       P F++   + +S++  TL L H D       +      + R+ RD  RV ++   
Subjt:  VLFFFFFFFFFLFFSFAAAS-----EFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGAR----SRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSG
           +    S +      F S ++SG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PC+ CY QSDP+F+P KS S+ G+ C S +C R+++SG
Subjt:  AGAR----SRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSG

Query:  CSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDA
        C +    C Y+V YGDGS+T G  A ETLTF    +  VA+GCGH N+G+F+GAAGLLG+G G +SF  Q   +    F YCLV R   S   S+VFG  
Subjt:  CSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDA

Query:  AISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSG
        A+   A + PL+RNP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+VTRL   AY A RD F++  A+L +    S+FDTCYDLSG
Subjt:  AISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSG

Query:  QSAVKVPTVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
          +V+VPTV  +F  G  ++LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +GF P  C
Subjt:  QSAVKVPTVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Q9LNJ3 Aspartyl protease family protein 2 | e-value: 5.0e-188 | % identity: 70.5
Query:  LFFFFFFFFFLFFSFAAASEFQIL--------TLRPL---PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSL
        L F   FFF    SF++   FQ L           P+   P+     LLE +F S  ++ +S++ +TL L H+D+LS NKTP +LF+ RL RD+ RV+S+
Subjt:  LFFFFFFFFFLFFSFAAASEFQIL--------TLRPL---PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSL

Query:  TSLAG---ARSRNPLPR-AGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD
         +LA     R+    PR  GFSSSV+SGL+QGSGEYFTRLGVGTP RY+YMVLDTGSDIVWLQC+PCR+CYSQSDPIF+P KSK++A IPCSSP CRRLD
Subjt:  TSLAG---ARSRNPLPR-AGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD

Query:  SSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVF
        S+GC+TRR TCLYQVSYGDGSFT GDF+TETLTFR N++  VALGCGHDN+GLFVGAAGLLGLG+G+LSFP QTG RFN KFSYCLVDRSASSKPSS+VF
Subjt:  SSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVF

Query:  GDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYD
        G+AA+SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV GV+A+LFKLD  GNGGVIIDSGTSVTRL RPAY A+RDAFR GA  LK+ P+FSLFDTC+D
Subjt:  GDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYD

Query:  LSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        LS  + VKVPTVVLHFRGAD+SLPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP GC
Subjt:  LSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 | e-value: 1.3e-111 | % identity: 49.89
Query:  PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQSLT-----SLAGARSRNPLP---------RAGFSSSVISG
        P  S     +P+  S+     S++ ++LELH  D+   S +K    L   RL RD+ RV  +      ++ G    +  P             ++ V+SG
Subjt:  PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQSLT-----SLAGARSRNPLP---------RAGFSSSVISG

Query:  LAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFA
         +QGSGEYF+R+GVGTP + +Y+VLDTGSD+ W+QC PC  CY QSDP+FNP  S ++  + CS+P C  L++S C  R + CLYQVSYGDGSFT G+ A
Subjt:  LAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFA

Query:  TETLTF-RGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVE
        T+T+TF    +I  VALGCGHDN+GLF GAAGLLGLG G LS  +Q        FSYCLVDR  S K SS+ F    +       PL+RN K++TFYYV 
Subjt:  TETLTF-RGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVE

Query:  LIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKG-PEFSLFDTCYDLSGQSAVKVPTVVLHFRGA-DMSLPA
        L G SVGG +V  +  A+F +D +G+GGVI+D GT+VTRL   AY +LRDAF     +LKKG    SLFDTCYD S  S VKVPTV  HF G   + LPA
Subjt:  LIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKG-PEFSLFDTCYDLSGQSAVKVPTVVLHFRGA-DMSLPA

Query:  TNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
         NYLIPVDDSG+FCFAFA T S LSIIGN+QQQG R+ YDL+ + IG +   C
Subjt:  TNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein | e-value: 3.6e-189 | % identity: 70.5
Query:  LFFFFFFFFFLFFSFAAASEFQIL--------TLRPL---PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSL
        L F   FFF    SF++   FQ L           P+   P+     LLE +F S  ++ +S++ +TL L H+D+LS NKTP +LF+ RL RD+ RV+S+
Subjt:  LFFFFFFFFFLFFSFAAASEFQIL--------TLRPL---PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSL

Query:  TSLAG---ARSRNPLPR-AGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD
         +LA     R+    PR  GFSSSV+SGL+QGSGEYFTRLGVGTP RY+YMVLDTGSDIVWLQC+PCR+CYSQSDPIF+P KSK++A IPCSSP CRRLD
Subjt:  TSLAG---ARSRNPLPR-AGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLD

Query:  SSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVF
        S+GC+TRR TCLYQVSYGDGSFT GDF+TETLTFR N++  VALGCGHDN+GLFVGAAGLLGLG+G+LSFP QTG RFN KFSYCLVDRSASSKPSS+VF
Subjt:  SSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVF

Query:  GDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYD
        G+AA+SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV GV+A+LFKLD  GNGGVIIDSGTSVTRL RPAY A+RDAFR GA  LK+ P+FSLFDTC+D
Subjt:  GDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYD

Query:  LSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
        LS  + VKVPTVVLHFRGAD+SLPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+GFAP GC
Subjt:  LSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT1G25510.1 Eukaryotic aspartyl protease family protein | e-value: 1.8e-108 | % identity: 46.23
Query:  FFFFFFLFFSFAAASEFQILTLRPLPNPSRPPL----LEPQFH------------SEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRV
        + FFFF+FF  + +S F     R LP  S        +    H             EE+   +++  +L+LH   S+  + +     L   RL+RD  RV
Subjt:  FFFFFFLFFSFAAASEFQILTLRPLPNPSRPPL----LEPQFH------------SEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRV

Query:  QSLTSLAGARSRN-------------PLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAG
        +SL +       N                     + +ISG  QGSGEYFTR+G+G P R +YMVLDTGSD+ WLQC+PC  CY Q++PIF P  S S+  
Subjt:  QSLTSLAGARSRN-------------PLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAG

Query:  IPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD
        + C +P C  L+ S C  R  TCLY+VSYGDGS+T GDFATETLT     +  VA+GCGH N+GLFVGAAGLLGLG G L+ PSQ        FSYCLVD
Subjt:  IPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVD

Query:  RSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKK
        R + S  S++ FG  ++S  A   PL+RN +L+TFYY+ L GISVGG  ++ +  + F++D +G+GG+IIDSGT+VTRL    Y +LRD+F  G   L+K
Subjt:  RSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKK

Query:  GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM-SLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
            ++FDTCY+LS ++ V+VPTV  HF G  M +LPA NY+IPVD  G+FC AFA T S L+IIGN+QQQG RV +DLA S IGF+   C
Subjt:  GPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADM-SLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT3G18490.1 Eukaryotic aspartyl protease family protein | e-value: 9.3e-113 | % identity: 49.89
Query:  PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQSLT-----SLAGARSRNPLP---------RAGFSSSVISG
        P  S     +P+  S+     S++ ++LELH  D+   S +K    L   RL RD+ RV  +      ++ G    +  P             ++ V+SG
Subjt:  PNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSL--SLNKTPTDLFNLRLHRDALRVQSLT-----SLAGARSRNPLP---------RAGFSSSVISG

Query:  LAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFA
         +QGSGEYF+R+GVGTP + +Y+VLDTGSD+ W+QC PC  CY QSDP+FNP  S ++  + CS+P C  L++S C  R + CLYQVSYGDGSFT G+ A
Subjt:  LAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFA

Query:  TETLTF-RGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVE
        T+T+TF    +I  VALGCGHDN+GLF GAAGLLGLG G LS  +Q        FSYCLVDR  S K SS+ F    +       PL+RN K++TFYYV 
Subjt:  TETLTF-RGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVE

Query:  LIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKG-PEFSLFDTCYDLSGQSAVKVPTVVLHFRGA-DMSLPA
        L G SVGG +V  +  A+F +D +G+GGVI+D GT+VTRL   AY +LRDAF     +LKKG    SLFDTCYD S  S VKVPTV  HF G   + LPA
Subjt:  LIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKG-PEFSLFDTCYDLSGQSAVKVPTVVLHFRGA-DMSLPA

Query:  TNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
         NYLIPVDDSG+FCFAFA T S LSIIGN+QQQG R+ YDL+ + IG +   C
Subjt:  TNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT3G20015.1 Eukaryotic aspartyl protease family protein | e-value: 4.0e-108 | % identity: 44.96
Query:  VLFFFFFFFFFLFFSFAAAS-----EFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSL
        +L   FFFF  L    +++S     +FQI+ +  L  P       P F++   + +S++  TL L H D       +      + R+ RD  RV ++   
Subjt:  VLFFFFFFFFFLFFSFAAAS-----EFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSL

Query:  AGAR----SRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSG
           +    S +      F S ++SG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PC+ CY QSDP+F+P KS S+ G+ C S +C R+++SG
Subjt:  AGAR----SRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSG

Query:  CSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDA
        C +    C Y+V YGDGS+T G  A ETLTF    +  VA+GCGH N+G+F+GAAGLLG+G G +SF  Q   +    F YCLV R   S   S+VFG  
Subjt:  CSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDA

Query:  AISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSG
        A+   A + PL+RNP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+VTRL   AY A RD F++  A+L +    S+FDTCYDLSG
Subjt:  AISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSG

Query:  QSAVKVPTVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
          +V+VPTV  +F  G  ++LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +GF P  C
Subjt:  QSAVKVPTVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC

AT3G61820.1 Eukaryotic aspartyl protease family protein | e-value: 4.1e-169 | % identity: 64.48
Query:  FFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSLA------GAR
        F  F  LFF+ +A+S++Q L +  LP+ +     E +  ++E   +ST  +++ L H+D+LS   + +P DLFNLRL RD+LRV+S+TSLA       A 
Subjt:  FFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLS--LNKTPTDLFNLRLHRDALRVQSLTSLA------GAR

Query:  SRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL-DSSGCSTRR-H
         R P    GFS +VISGL+QGSGEYF RLGVGTP   +YMVLDTGSD+VWLQCSPC+ CY+Q+D IF+P KSK+FA +PC S LCRRL DSS C TRR  
Subjt:  SRNPLPRAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRL-DSSGCSTRR-H

Query:  TCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDR----SASSKPSSMVFGDAAI
        TCLYQVSYGDGSFT GDF+TETLTF G ++  V LGCGHDN+GLFVGAAGLLGLGRG LSFPSQT  R+N KFSYCLVDR    S+S  PS++VFG+AA+
Subjt:  TCLYQVSYGDGSFTTGDFATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDR----SASSKPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQS
         + + FTPL+ NPKL+TFYY++L+GISVGG RV GVS + FKLD  GNGGVIIDSGTSVTRLT+PAY ALRDAFR GA  LK+ P +SLFDTC+DLSG +
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC
         VKVPTVV HF G ++SLPA+NYLIPV+  G FCFAFAGTM  LSIIGNIQQQGFRV YDL GSR+GF  R C
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC


Sequences
CDS sequence
ATGGAATCACCACCAACAAATGTCCTATTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTTTTCTTTCGCCGCCGCATCGGAGTTCCAAATCCTAACTCTCCGCCCTCT
TCCGAATCCCTCTCGCCCTCCCTTACTAGAACCCCAGTTCCACTCCGAGGAAGAAGCCCTCAAATCCACCGCCGGCGTCACCCTTGAGCTCCATCATTTGGACTCACTCT
CCCTCAACAAAACCCCCACCGATCTCTTCAACCTCCGGCTCCACCGTGACGCCCTCCGCGTCCAGTCTCTGACCTCTCTGGCCGGCGCAAGGAGCCGGAACCCACTCCCA
CGCGCCGGTTTCAGCAGCTCCGTCATCTCCGGCCTCGCCCAAGGCAGCGGCGAGTACTTCACACGCCTCGGCGTTGGAACCCCTCCTAGATACCTCTACATGGTCCTCGA
CACTGGAAGCGACATTGTTTGGCTCCAATGCTCCCCTTGCCGCAAATGCTACTCCCAATCCGATCCCATTTTCAACCCCTTTAAATCCAAATCCTTCGCCGGAATCCCCT
GCTCTTCCCCTCTCTGCCGCCGTCTCGACTCCTCCGGCTGCTCCACCCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGACGGCTCCTTCACCACCGGCGACTTC
GCCACCGAAACCCTCACCTTTCGTGGCAATCAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATCAAGGCCTCTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGG
CCGTGGCCGCTTGTCTTTCCCTTCCCAAACCGGAATCCGGTTCAACCACAAATTCTCTTATTGTTTGGTGGACCGGTCCGCTTCCTCCAAACCCTCCTCCATGGTTTTCG
GCGATGCGGCGATTTCCCGGCTCGCCCGGTTCACTCCTCTGATTCGGAACCCCAAATTGGAAACGTTTTATTACGTCGAACTCATCGGAATCAGCGTCGGCGGAGTCCGA
GTCCGCGGCGTCTCCGCCGCTCTCTTCAAGCTCGATCCGGCCGGCAACGGCGGCGTCATCATCGACTCGGGTACCTCGGTAACCCGATTGACCCGACCCGCTTACACGGC
TCTTCGCGACGCGTTCCGGGCCGGAGCGGCCCATTTGAAAAAGGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACGACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGA
CGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCCGGCGACGAATTATTTGATTCCGGTGGACGACAGTGGGAGCTTTTGCTTTGCGTTTGCGGGTACTATGTCC
GGGTTGTCGATTATTGGGAACATTCAACAGCAGGGGTTCCGGGTTGTGTACGACTTGGCGGGTTCTCGGATCGGGTTTGCTCCACGTGGGTGCACGTGA
mRNA sequence
CCCATTCCCATTTCCCACTCCTAAAAGTACCGCATTGTCCGCTCAATGCGCTCATCATCTCTTTGTCTCTTTTTTCTTTTTCATTATAATTAACTCTCTTCTCTCCCAAA
AACCACAGTCAATCTCTCTCTCTCTCTCTCTCAATGGAATCACCACCAACAAATGTCCTATTCTTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTTTTCTTTCGCCGCCGC
ATCGGAGTTCCAAATCCTAACTCTCCGCCCTCTTCCGAATCCCTCTCGCCCTCCCTTACTAGAACCCCAGTTCCACTCCGAGGAAGAAGCCCTCAAATCCACCGCCGGCG
TCACCCTTGAGCTCCATCATTTGGACTCACTCTCCCTCAACAAAACCCCCACCGATCTCTTCAACCTCCGGCTCCACCGTGACGCCCTCCGCGTCCAGTCTCTGACCTCT
CTGGCCGGCGCAAGGAGCCGGAACCCACTCCCACGCGCCGGTTTCAGCAGCTCCGTCATCTCCGGCCTCGCCCAAGGCAGCGGCGAGTACTTCACACGCCTCGGCGTTGG
AACCCCTCCTAGATACCTCTACATGGTCCTCGACACTGGAAGCGACATTGTTTGGCTCCAATGCTCCCCTTGCCGCAAATGCTACTCCCAATCCGATCCCATTTTCAACC
CCTTTAAATCCAAATCCTTCGCCGGAATCCCCTGCTCTTCCCCTCTCTGCCGCCGTCTCGACTCCTCCGGCTGCTCCACCCGCCGCCACACCTGCCTCTACCAAGTCTCC
TACGGCGACGGCTCCTTCACCACCGGCGACTTCGCCACCGAAACCCTCACCTTTCGTGGCAATCAAATCGCCAAAGTCGCCCTCGGCTGCGGCCACGACAATCAAGGCCT
CTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGGCCGTGGCCGCTTGTCTTTCCCTTCCCAAACCGGAATCCGGTTCAACCACAAATTCTCTTATTGTTTGGTGGACCGGT
CCGCTTCCTCCAAACCCTCCTCCATGGTTTTCGGCGATGCGGCGATTTCCCGGCTCGCCCGGTTCACTCCTCTGATTCGGAACCCCAAATTGGAAACGTTTTATTACGTC
GAACTCATCGGAATCAGCGTCGGCGGAGTCCGAGTCCGCGGCGTCTCCGCCGCTCTCTTCAAGCTCGATCCGGCCGGCAACGGCGGCGTCATCATCGACTCGGGTACCTC
GGTAACCCGATTGACCCGACCCGCTTACACGGCTCTTCGCGACGCGTTCCGGGCCGGAGCGGCCCATTTGAAAAAGGGTCCCGAGTTTTCGCTGTTCGATACGTGTTACG
ACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGACGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCCGGCGACGAATTATTTGATTCCGGTGGACGACAGTGGG
AGCTTTTGCTTTGCGTTTGCGGGTACTATGTCCGGGTTGTCGATTATTGGGAACATTCAACAGCAGGGGTTCCGGGTTGTGTACGACTTGGCGGGTTCTCGGATCGGGTT
TGCTCCACGTGGGTGCACGTGATCTCTGACCCAGTGATGGGATTTCTGTTTGTTTAGGGACAAAGAAGGAATAACAAAGAAATGGAAAATTAAAAGAAAAATAGGATTTT
TCGTGATGACATTGTCTTCTAGTTCTATTTAAGGTTTGTTTTTTGGTGTATTGTATTTATTATCTATTGAAAGTCATTAAAGCCTCTAACTTGGAGGTGATTTGGTTTGT
TTCAAAGGTAAAGATTTGGGC
Protein sequence
MESPPTNVLFFFFFFFFFLFFSFAAASEFQILTLRPLPNPSRPPLLEPQFHSEEEALKSTAGVTLELHHLDSLSLNKTPTDLFNLRLHRDALRVQSLTSLAGARSRNPLP
RAGFSSSVISGLAQGSGEYFTRLGVGTPPRYLYMVLDTGSDIVWLQCSPCRKCYSQSDPIFNPFKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDF
ATETLTFRGNQIAKVALGCGHDNQGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVR
VRGVSAALFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKKGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMS
GLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT