CuGenDBv2

Spg022554 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022554
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: protein ASPARTIC PROTEASE IN GUARD CELL 1-like
Genome location: scaffold2:3731372..3732829
RNA-Seq Expression: Spg022554
Synteny: Spg022554
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
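The genome location above follows a `seqid:start..end` pattern. A minimal sketch of how such a string could be split into its parts is given below. It assumes the coordinates are 1-based and inclusive, which this record does not state; under that assumption the span length is `end - start + 1`. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a location such as 'scaffold2:3731372..3732829' into its parts."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end, whichever order the input used.
    if start > end:
        start, end = end, start
    return GenomeLocation(seqid, start, end)


loc = parse_location("scaffold2:3731372..3732829")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Keeping the parsed value as a small immutable tuple makes it easy to compare or sort locations without carrying the original string around.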


Homology
GenBank top hits (e value, % identity, alignment)
KAG7021604.1 Protein ASPARTIC PROTEASE IN GUARD CELL 1, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 2.5e-203; identity 78.51%
Query:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        NN   FS  LF+ +LNS  FSSSL+R+  E   +TT+ DVSASS RAQNALSI P Q  HSH       NSSLSLPLH RLAIHK   SYKDY SLVRAR
Subjt:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP
        LARDAARV+SLNRNL LAL G A VRP+S+TAPVVSGQSQGSGEYFARI VGQP QSFY VPDTGSD+TWLQCLPCS  N CY+QTDPIF+P SSSSY P
Subjt:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP

Query:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD
        LSCDSQQCQSL+R  CQSGTC YQV YGDGSFT G+F TETL+F NSKS+PNLPIGCGHDNEGLFVGAAGLIGLGGG LSLSSQLKASSFSYCLVDRDSD
Subjt:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD

Query:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF
        SSSTLEF+S  PSDS+T+PLLKN+R  +YRYV+VTGMSVGGK L ISSTRF+IDGSG+GGIIVDSGTFITRLP+DVYESLR+AFV    +LT  G +SPF
Subjt:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF

Query:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        DTCY+ +GQS+VQVPTVAFELSKG  L+LPANNYLIRMD+AG++CLAFL TTSSLSIIGSFQQQG+RVSYDLVNSLVGFSSNKC
Subjt:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
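The alignment blocks in this section use the usual BLAST-style pairwise layout, in which the unlabeled middle row carries a letter where the two residues are identical, a `+` for a conservative substitution, and a space otherwise. A rough sketch of how a percent identity could be tallied from the Query and middle rows is given below. The function names and the toy rows are hypothetical, and the result is not guaranteed to reproduce the % identity reported for a hit, since that value comes from the database's own computation.

```python
from typing import Dict, Iterable, Tuple


def alignment_block_stats(query: str, midline: str) -> Dict[str, int]:
    """Count columns, identities, positives and query gaps in one block."""
    # Scraped pages often lose trailing spaces on the middle row, so pad it.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle row is longer than the query row")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "columns": len(query),
        "identities": identities,
        "positives": positives,
        "query_gaps": query.count("-"),
    }


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over all (query, midline) blocks of one alignment."""
    stats = [alignment_block_stats(q, m) for q, m in blocks]
    columns = sum(s["columns"] for s in stats)
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    identities = sum(s["identities"] for s in stats)
    return round(100.0 * identities / columns, 2)


# Toy rows for illustration only; they are not taken from this record.
toy_blocks = [("ACDEFGHIK", "AC+EF HIK"), ("LMNP", "LM P")]
print(alignment_block_stats(*toy_blocks[0]))
print(percent_identity(toy_blocks))
```

Summing counts across blocks before dividing matters because an alignment is wrapped over several blocks of unequal width, so averaging per-block percentages would weight the short final block too heavily.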

XP_022933467.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita moschata]; e value 1.6e-202; identity 78.1%
Query:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        NN   FS  LF+ +LNS  FSSSL+R+  E   +TT+ DVSASS RAQ+ALS+ P Q  HSH       NSSLSLPLH RLAIHK   SYKDY+SLVRAR
Subjt:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP
        LARDAARV+SLNRNL LAL G A VRP+S+TAPVVSGQSQGSGEYFARI VGQP QSFY VPDTGSD+TWLQCLPCS  N CY+QTDPIF+P SSSSY P
Subjt:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP

Query:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD
        LSCDSQQCQSL+R  CQSGTC YQV YGDGSFT G+F TETL+F NSKS+PNLPIGCGHDNEGLFVGAAGLIGLGGG LSLSSQLKASSFSYCLVDRDSD
Subjt:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD

Query:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF
        SSSTLEF+S  PSDS+T+PLLKN+R  +YRYV+VTGMSVGGK L ISSTRF+IDGSG+GGIIVDSGTFITRLP+DVYESLR+AFV    +LT  G +SPF
Subjt:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF

Query:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        DTCY+ +GQS+VQVPTVAFELSKG  L+LPANNYLIRMD+AG++CLAFL TTSSLSIIGSFQQQG+RVSYDLVNSLVGFSSNKC
Subjt:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

XP_023007215.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita maxima]; e value 3.2e-203; identity 78.97%
Query:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        NN   FS FLF  +LNS  FSSSL+R+ TE   +TT+ DVSASS RAQNALSI P Q  HSH       NSSLSL LH RLAIHK   +YKDY+SLVRAR
Subjt:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP
        LARDAARV+SLNRNL LAL G A VRP+S+TAPVVSGQSQGSGEYFARI VGQPAQSFY VPDTGSD+TWLQCLPCS  N CY QTDPIF+P SSSSY P
Subjt:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP

Query:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD
        LSCDSQQCQSL+R  CQSGTC YQV YGDGSFTTG+FATETL+F NSKS+PNLPIGCGHDN+GLFVGAAGLIGLGGG LSLSSQLKASSFSYCLVDRDSD
Subjt:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD

Query:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF
        SSSTLEF+S RPSDS+T+PLLKN+R  +YRYV+VTGMSVGGK L ISSTRF+IDGSG+GGIIVDSGTFITRLP+DVYESLR+AFV    +LT  G +SPF
Subjt:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF

Query:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFL-ATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        DTCY+ +GQS+VQVPTVAFELSKG+ L+LPA NYLIRMD+AGT+CLAFL  TTSSLSIIGSFQQQG+RVSYDLVNSLVGFSSNKC
Subjt:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFL-ATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

XP_023531915.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita pepo subsp. pepo]; e value 6.5e-204; identity 78.72%
Query:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        NN   FS FL + +LNS  FSSSL+R+ T+   +T + DVSASS RAQNALSI P Q  HSH       NSSLSLPLH RLAIHK   +YKDYDSLVRAR
Subjt:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP
        LARDAARV+SLNRNL LAL   A VRP+S+TAPVVSGQSQGSGEYFARI VGQPAQSFY VPDTGSD+TWLQCLPCS  N CY+QTDPIF+P SSSSY P
Subjt:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP

Query:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD
        LSCDSQQCQSLDR  CQSGTC YQV YGDGSFTTG+FATETL+F NSKS+PNLPIGCGHDNEGLFVGAAGLIGLGGG LSLSSQLKASSFSYCLVDRDSD
Subjt:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD

Query:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF
        SSSTLEF+S RPSDS+T+PLLKN+R  +YRYV+VTGMSVGGK L ISSTRF+IDGSG+GGIIVDSGTFITRLP+DVYESLR+AFV    +LT    +SPF
Subjt:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF

Query:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        DTCY+ +GQ++VQVPTVAFELSKG  L+LPA NYLIRMD+AGT+CLAFL TTSSLSIIGSFQQQG+RVSYDLVNSLVGFSSNKC
Subjt:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

XP_038878113.1 protein ASPARTIC PROTEASE IN GUARD CELL 1 [Benincasa hispida]; e value 6.3e-191; identity 73.98%
Query:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQI-LHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDA
        S F+FLT+L S  F S  SR LT+SP+ST + DVSAS+K+AQNALSI P     HSH     +PNS LSLPLHPRL ++  +PSYKDY SLVRARLARDA
Subjt:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQI-LHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDA

Query:  ARVRSLNRNLQLALTGAAVV--------RPDSITAPVVSGQSQGS-GEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSS
         RV+SLNRNL+L+L G   +          DSITAPVVSGQS G+ GEYFARIGVGQP QSFY VPDTGSDVTWLQC PC+   ACYKQ DPIFDPKSSS
Subjt:  ARVRSLNRNLQLALTGAAVV--------RPDSITAPVVSGQSQGS-GEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSS

Query:  SYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVD
        SY+ LSC+SQQCQ LD+ +C S  C YQV+YGDGSFTTGE ATETLSF NS S+PNLPIGCGHDNEGLFVGAAGLIGLGGG +SLSSQLKASSFSYCLVD
Subjt:  SYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVD

Query:  RDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGG
         DSDSSSTLEFN+ RPSDSLTSPL+KNDRF +YRYV+V GMSVGG PLPISSTRF+ID SGLGGIIVDSGT IT+LPSDVYESLR+AFV  T NL    G
Subjt:  RDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGG

Query:  VSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        +S FDTCYD SGQSSV+VP +AF L   +SLRLPA NYLI +DS GT+CLAF  T SSLSIIGSFQQQGIRVSYDL NSLVGFS+NKC
Subjt:  VSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPJ3 Aspartic proteinase nepenthesin-1; e value 1.3e-186; identity 71.4%
Query:  NNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQIL-HSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        N    S+FLFLT+  S  F S LSR LT S +ST++ DVSAS+ +A +ALSI P  +  HSH      PNS  SLPL+PRLA+H  +PSYKDY++LVRAR
Subjt:  NNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQIL-HSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAA--------VVRPDSITAPVVSGQSQGSG-EYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFD
        L RDAARV+ LNRNL+ +L G           +  DSITAPVVSGQS+GSG EY A+IGVGQP + FY VPDTGSDVTWLQC PC+  N CYKQ DPIFD
Subjt:  LARDAARVRSLNRNLQLALTGAA--------VVRPDSITAPVVSGQSQGSG-EYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFD

Query:  PKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFS
        PKSSSSYSPLSC+SQQC+ LD+ +C S TC YQV YGDGSFTTGE ATETLSF NS S+PNLPIGCGHDNEGLF G AGLIGLGGG +SLSSQLKASSFS
Subjt:  PKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFS

Query:  YCLVDRDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNL
        YCLV+ DSDSSSTLEFNS  PSDSLTSPL+KNDRF +YRYV+V G+SVGGK LPIS TRF+ID SGLGGIIVDSGT I+RLPSDVYESLR+AFV LT +L
Subjt:  YCLVDRDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNL

Query:  TMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        +   G+S FDTCY+FSGQS+V+VPT+AF LS+G+SLRLPA NYLI +D+AGT+CLAF+ T SSLSIIGSFQQQGIRVSYDL NSLVGFS+NKC
Subjt:  TMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

A0A5A7UQC2 Protein ASPARTIC PROTEASE IN GUARD CELL 1-like; e value 1.2e-190; identity 73.36%
Query:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQI-LHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDA
        S+FLFLT+  S  FSS LSR LT+SP+ST++ DV AS+ +A NALSI P  +  HSH      PNSSLSLPL+PRL++H  +PSYKDYDSLVRARLARDA
Subjt:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQI-LHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDA

Query:  ARVRSLNRNLQLALTGA--------AVVRPDSITAPVVSGQSQGSG-EYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSS
        ARV+ LNRNL+ +L G           +  DSITAPVVSGQS+GSG EY A++GVGQP + FY VPDTGSDVTWLQC PC+  NACYKQ DPIFDPKSSS
Subjt:  ARVRSLNRNLQLALTGA--------AVVRPDSITAPVVSGQSQGSG-EYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSS

Query:  SYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVD
        SY+PLSC+SQQC  LDR +C SGTC YQV YGDGSFTTGE ATETLSF NS S+PNLPIGCGHDNEGLF G AGLIGLGGG +SLSSQLKASSFSYCLV+
Subjt:  SYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVD

Query:  RDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGG
         DSDSSSTLEFNS  PSDSLTSPL+KNDRF +YRYV+V G+SVGGK LPISSTRF+ID SGLGGIIVDSGT I+RLPSDVYESLR+AFV LT +L+   G
Subjt:  RDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGG

Query:  VSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        +S FDTCY+ S QS+V+VPT+AF LS G+SLRLPA NYLIR+D+AGT+CLAF+ T SSLSIIGSFQQQGIRVSYDL NSLVGFS+NKC
Subjt:  VSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

A0A5D3DYG2 Protein ASPARTIC PROTEASE IN GUARD CELL 1-like; e value 3.4e-190; identity 73.16%
Query:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQI-LHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDA
        S+FLFLT+  S  FSS LSR LT+SP+ST++ DV AS+ +A NALSI P  +  HSH      PNSSLSLPL+PRL++H  +PSYKDYDSLVRARLARDA
Subjt:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQI-LHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDA

Query:  ARVRSLNRNLQLALTGA--------AVVRPDSITAPVVSGQSQGSG-EYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSS
        ARV+ LNRNL+ +L G           +  DSITAPVVSGQS+GSG EY A++GVGQP + FY VPDTGSDVTWLQC PC+  NACYKQ DPIFDPKSSS
Subjt:  ARVRSLNRNLQLALTGA--------AVVRPDSITAPVVSGQSQGSG-EYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSS

Query:  SYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVD
        SY+PLSC+SQQC  LDR +C SGTC YQV YGDGSFTTGE ATE LSF NS S+PNLPIGCGHDNEGLF G AGLIGLGGG +SLSSQLKASSFSYCLV+
Subjt:  SYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVD

Query:  RDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGG
         DSDSSSTLEFNS  PSDSLTSPL+KNDRF +YRYV+V G+SVGG  LPISSTRF+ID SGLGGIIVDSGT I+RLPSDVYESLR+AFV LT +L+   G
Subjt:  RDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGG

Query:  VSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        +S FDTCY+ SGQS+V+VPT+AF LS G+SLRLPA NYLIR+D+AGT+CLAF+ T SSLSIIGSFQQQGIRVSYDL NSLVGFS+NKC
Subjt:  VSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

A0A6J1EZU8 protein ASPARTIC PROTEASE IN GUARD CELL 1-like; e value 7.7e-203; identity 78.1%
Query:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        NN   FS  LF+ +LNS  FSSSL+R+  E   +TT+ DVSASS RAQ+ALS+ P Q  HSH       NSSLSLPLH RLAIHK   SYKDY+SLVRAR
Subjt:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP
        LARDAARV+SLNRNL LAL G A VRP+S+TAPVVSGQSQGSGEYFARI VGQP QSFY VPDTGSD+TWLQCLPCS  N CY+QTDPIF+P SSSSY P
Subjt:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP

Query:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD
        LSCDSQQCQSL+R  CQSGTC YQV YGDGSFT G+F TETL+F NSKS+PNLPIGCGHDNEGLFVGAAGLIGLGGG LSLSSQLKASSFSYCLVDRDSD
Subjt:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD

Query:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF
        SSSTLEF+S  PSDS+T+PLLKN+R  +YRYV+VTGMSVGGK L ISSTRF+IDGSG+GGIIVDSGTFITRLP+DVYESLR+AFV    +LT  G +SPF
Subjt:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF

Query:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        DTCY+ +GQS+VQVPTVAFELSKG  L+LPANNYLIRMD+AG++CLAFL TTSSLSIIGSFQQQG+RVSYDLVNSLVGFSSNKC
Subjt:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

A0A6J1L2D2 protein ASPARTIC PROTEASE IN GUARD CELL 1-like; e value 1.6e-203; identity 78.97%
Query:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR
        NN   FS FLF  +LNS  FSSSL+R+ TE   +TT+ DVSASS RAQNALSI P Q  HSH       NSSLSL LH RLAIHK   +YKDY+SLVRAR
Subjt:  NNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRAR

Query:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP
        LARDAARV+SLNRNL LAL G A VRP+S+TAPVVSGQSQGSGEYFARI VGQPAQSFY VPDTGSD+TWLQCLPCS  N CY QTDPIF+P SSSSY P
Subjt:  LARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSP

Query:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD
        LSCDSQQCQSL+R  CQSGTC YQV YGDGSFTTG+FATETL+F NSKS+PNLPIGCGHDN+GLFVGAAGLIGLGGG LSLSSQLKASSFSYCLVDRDSD
Subjt:  LSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSD

Query:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF
        SSSTLEF+S RPSDS+T+PLLKN+R  +YRYV+VTGMSVGGK L ISSTRF+IDGSG+GGIIVDSGTFITRLP+DVYESLR+AFV    +LT  G +SPF
Subjt:  SSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPF

Query:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFL-ATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        DTCY+ +GQS+VQVPTVAFELSKG+ L+LPA NYLIRMD+AGT+CLAFL  TTSSLSIIGSFQQQG+RVSYDLVNSLVGFSSNKC
Subjt:  DTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFL-ATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2; e value 4.5e-67; identity 40.51%
Query:  LVRARLARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSS
        L++  + R   R+RS+N  LQ +           I  PV +    G GEY   + +G P  SF  + DTGSD+ W QC PC+    C+ Q  PIF+P+ S
Subjt:  LVRARLARDAARVRSLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSS

Query:  SSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVG-AAGLIGLGGGDLSLSSQLKASSFSYCL
        SS+S L C+SQ CQ L   +C +  C Y   YGDGS T G  ATET +F  S SVPN+  GCG DN+G   G  AGLIG+G G LSL SQL    FSYC+
Subjt:  SSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVG-AAGLIGLGGGDLSLSSQLKASSFSYCL

Query:  VDRDSDSSSTLEFN---SGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNL
            S S STL      SG P  S ++ L+ +    TY Y+ + G++VGG  L I S+ F +   G GG+I+DSGT +T LP D Y ++  AF +     
Subjt:  VDRDSDSSSTLEFN---SGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNL

Query:  TMTGGVSPFDTCYDF-SGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTS-SLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        T+    S   TC+   S  S+VQVP ++ +   G  L L   N LI   + G  CLA  +++   +SI G+ QQQ  +V YDL N  V F   +C
Subjt:  TMTGGVSPFDTCYDF-SGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTS-SLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

Q766C3 Aspartic proteinase nepenthesin-1; e value 7.0e-68; identity 41.71%
Query:  GSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATE
        G GEY   + +G PAQ F  + DTGSD+ W QC PC+    C+ Q+ PIF+P+ SSS+S L C SQ CQ+L   +C +  C Y   YGDGS T G   TE
Subjt:  GSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATE

Query:  TLSFANSKSVPNLPIGCGHDNEGLFVG-AAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSDSSSTLEFNSGRPSDSLTSP---LLKNDRFGTYRYVEVTG
        TL+F  S S+PN+  GCG +N+G   G  AGL+G+G G LSL SQL  + FSYC+    S + S L   S   S +  SP   L+++ +  T+ Y+ + G
Subjt:  TLSFANSKSVPNLPIGCGHDNEGLFVG-AAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSDSSSTLEFNSGRPSDSLTSP---LLKNDRFGTYRYVEVTG

Query:  MSVGGKPLPISSTRFDID-GSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPFDTCYDF-SGQSSVQVPTVAFELSKGSSLRLPANNY
        +SVG   LPI  + F ++  +G GGII+DSGT +T   ++ Y+S+R  F++      + G  S FD C+   S  S++Q+PT       G  L LP+ NY
Subjt:  MSVGGKPLPISSTRFDID-GSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPFDTCYDF-SGQSSVQVPTVAFELSKGSSLRLPANNY

Query:  LIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
         I   S G  CLA  +++  +SI G+ QQQ + V YD  NS+V F+S +C
Subjt:  LIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2; e value 3.8e-98; identity 45.72%
Query:  PSPSYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVVRPDS------ITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDA
        PS +Y+++   + AR+ RD  RV ++ R     ++G  +   DS        + +VSG  QGSGEYF RIGVG P +  Y V D+GSD+ W+QC PC   
Subjt:  PSPSYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVVRPDS------ITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDA

Query:  NACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDL
          CYKQ+DP+FDP  S SY+ +SC S  C  ++   C SG C Y+V YGDGS+T G  A ETL+FA +  V N+ +GCGH N G+F+GAAGL+G+GGG +
Subjt:  NACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDL

Query:  SLSSQLK---ASSFSYCLVDRDSDSSSTLEF-NSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSD
        S   QL      +F YCLV R +DS+ +L F     P  +   PL++N R  ++ YV + G+ VGG  +P+    FD+  +G GG+++D+GT +TRLP+ 
Subjt:  SLSSQLK---ASSFSYCLVDRDSDSSSTLEF-NSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSD

Query:  VYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNS
         Y + RD F + T NL    GVS FDTCYD SG  SV+VPTV+F  ++G  L LPA N+L+ +D +GT+C AF A+ + LSIIG+ QQ+GI+VS+D  N 
Subjt:  VYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNS

Query:  LVGFSSNKC
         VGF  N C
Subjt:  LVGFSSNKC

Q9LNJ3 Aspartyl protease family protein 2; e value 1.5e-94; identity 48.65%
Query:  SYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVV---RPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQ
        S K  D L  +RL RD+ RV+S+   L   + G  V    RP   ++ VVSG SQGSGEYF R+GVG PA+  Y V DTGSD+ WLQC PC     CY Q
Subjt:  SYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVV---RPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQ

Query:  TDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQS--GTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSS
        +DPIFDP+ S +Y+ + C S  C+ LD   C +   TC YQVSYGDGSFT G+F+TETL+F  ++ V  + +GCGHDNEGLFVGAAGL+GLG G LS   
Subjt:  TDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQS--GTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSS

Query:  QLK---ASSFSYCLVDRDSDS--SSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLP-ISSTRFDIDGSGLGGIIVDSGTFITRLPSDVY
        Q        FSYCLVDR + S  SS +  N+     +  +PLL N +  T+ YV + G+SVGG  +P ++++ F +D  G GG+I+DSGT +TRL    Y
Subjt:  QLK---ASSFSYCLVDRDSDS--SSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLP-ISSTRFDIDGSGLGGIIVDSGTFITRLPSDVY

Query:  ESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLV
         ++RDAF      L      S FDTC+D S  + V+VPTV     +G+ + LPA NYLI +D+ G FC AF  T   LSIIG+ QQQG RV YDL +S V
Subjt:  ESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLV

Query:  GFSSNKC
        GF+   C
Subjt:  GFSSNKC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1; e value 1.5e-126; identity 50.3%
Query:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFP-------ILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRA
        ++ LFLT  ++S  S SLS     +P  T +LDV +S ++ Q  LS++P +   + + P         N +S LSL LH R      +  +KDY SL  +
Subjt:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFP-------ILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRA

Query:  RLARDAARVRSLNRNLQLALTGA--AVVRP----------DSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTD
        RL RD++RV  +   ++ A+ G   + ++P          + +T PVVSG SQGSGEYF+RIGVG PA+  Y V DTGSDV W+QC PC+D   CY+Q+D
Subjt:  RLARDAARVRSLNRNLQLALTGA--AVVRP----------DSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTD

Query:  PIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKA
        P+F+P SSS+Y  L+C + QC  L+  +C+S  C YQVSYGDGSFT GE AT+T++F NS  + N+ +GCGHDNEGLF GAAGL+GLGGG LS+++Q+KA
Subjt:  PIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKA

Query:  SSFSYCLVDRDSDSSSTLEFNSGR-PSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVN
        +SFSYCLVDRDS  SS+L+FNS +      T+PLL+N +  T+ YV ++G SVGG+ + +    FD+D SG GG+I+D GT +TRL +  Y SLRDAF+ 
Subjt:  SSFSYCLVDRDSDSSSTLEFNSGR-PSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVN

Query:  LTGNLTM-TGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        LT NL   +  +S FDTCYDFS  S+V+VPTVAF  + G SL LPA NYLI +D +GTFC AF  T+SSLSIIG+ QQQG R++YDL  +++G S NKC
Subjt:  LTGNLTM-TGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

Arabidopsis top hits (e value, % identity, alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein; e value 1.1e-95; identity 48.65%
Query:  SYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVV---RPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQ
        S K  D L  +RL RD+ RV+S+   L   + G  V    RP   ++ VVSG SQGSGEYF R+GVG PA+  Y V DTGSD+ WLQC PC     CY Q
Subjt:  SYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVV---RPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQ

Query:  TDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQS--GTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSS
        +DPIFDP+ S +Y+ + C S  C+ LD   C +   TC YQVSYGDGSFT G+F+TETL+F  ++ V  + +GCGHDNEGLFVGAAGL+GLG G LS   
Subjt:  TDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQS--GTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSS

Query:  QLK---ASSFSYCLVDRDSDS--SSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLP-ISSTRFDIDGSGLGGIIVDSGTFITRLPSDVY
        Q        FSYCLVDR + S  SS +  N+     +  +PLL N +  T+ YV + G+SVGG  +P ++++ F +D  G GG+I+DSGT +TRL    Y
Subjt:  QLK---ASSFSYCLVDRDSDS--SSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLP-ISSTRFDIDGSGLGGIIVDSGTFITRLPSDVY

Query:  ESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLV
         ++RDAF      L      S FDTC+D S  + V+VPTV     +G+ + LPA NYLI +D+ G FC AF  T   LSIIG+ QQQG RV YDL +S V
Subjt:  ESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLV

Query:  GFSSNKC
        GF+   C
Subjt:  GFSSNKC

AT1G25510.1 Eukaryotic aspartyl protease family protein; e value 9.2e-132; identity 52.55%
Query:  FSIFLFLTLLNSSFFSSSLSRLLTESPHSTT-LLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARD
        +S F F+  L S   SS  SR+L E+  +TT +L+V+ S  R +   S      L+       + +SS SL LH R+++      + DY SL  ARL RD
Subjt:  FSIFLFLTLLNSSFFSSSLSRLLTESPHSTT-LLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARD

Query:  AARVRSLNRNLQLALT--GAAVVRPDS---------ITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPK
         ARV+SL   L LA+     A ++P S         I AP++SG +QGSGEYF R+G+G+PA+  Y V DTGSDV WLQC PC+D   CY QT+PIF+P 
Subjt:  AARVRSLNRNLQLALT--GAAVVRPDS---------ITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPK

Query:  SSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYC
        SSSSY PLSCD+ QC +L+   C++ TC Y+VSYGDGS+T G+FATETL+   S  V N+ +GCGH NEGLFVGAAGL+GLGGG L+L SQL  +SFSYC
Subjt:  SSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYC

Query:  LVDRDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTM
        LVDRDSDS+ST++F +    D++ +PLL+N +  T+ Y+ +TG+SVGG+ L I  + F++D SG GGII+DSGT +TRL +++Y SLRD+FV  T +L  
Subjt:  LVDRDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTM

Query:  TGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
          GV+ FDTCY+ S +++V+VPTVAF    G  L LPA NY+I +DS GTFCLAF  T SSL+IIG+ QQQG RV++DL NSL+GFSSNKC
Subjt:  TGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

AT3G18490.1 Eukaryotic aspartyl protease family protein; e value 1.1e-127; identity 50.3%
Query:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFP-------ILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRA
        ++ LFLT  ++S  S SLS     +P  T +LDV +S ++ Q  LS++P +   + + P         N +S LSL LH R      +  +KDY SL  +
Subjt:  SIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFP-------ILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRA

Query:  RLARDAARVRSLNRNLQLALTGA--AVVRP----------DSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTD
        RL RD++RV  +   ++ A+ G   + ++P          + +T PVVSG SQGSGEYF+RIGVG PA+  Y V DTGSDV W+QC PC+D   CY+Q+D
Subjt:  RLARDAARVRSLNRNLQLALTGA--AVVRP----------DSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTD

Query:  PIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKA
        P+F+P SSS+Y  L+C + QC  L+  +C+S  C YQVSYGDGSFT GE AT+T++F NS  + N+ +GCGHDNEGLF GAAGL+GLGGG LS+++Q+KA
Subjt:  PIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKA

Query:  SSFSYCLVDRDSDSSSTLEFNSGR-PSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVN
        +SFSYCLVDRDS  SS+L+FNS +      T+PLL+N +  T+ YV ++G SVGG+ + +    FD+D SG GG+I+D GT +TRL +  Y SLRDAF+ 
Subjt:  SSFSYCLVDRDSDSSSTLEFNSGR-PSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVN

Query:  LTGNLTM-TGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
        LT NL   +  +S FDTCYDFS  S+V+VPTVAF  + G SL LPA NYLI +D +GTFC AF  T+SSLSIIG+ QQQG R++YDL  +++G S NKC
Subjt:  LTGNLTM-TGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC

AT3G20015.1 Eukaryotic aspartyl protease family protein; e value 2.7e-99; identity 45.72%
Query:  PSPSYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVVRPDS------ITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDA
        PS +Y+++   + AR+ RD  RV ++ R     ++G  +   DS        + +VSG  QGSGEYF RIGVG P +  Y V D+GSD+ W+QC PC   
Subjt:  PSPSYKDYDSLVRARLARDAARVRSLNRNLQLALTGAAVVRPDS------ITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDA

Query:  NACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDL
          CYKQ+DP+FDP  S SY+ +SC S  C  ++   C SG C Y+V YGDGS+T G  A ETL+FA +  V N+ +GCGH N G+F+GAAGL+G+GGG +
Subjt:  NACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDL

Query:  SLSSQLK---ASSFSYCLVDRDSDSSSTLEF-NSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSD
        S   QL      +F YCLV R +DS+ +L F     P  +   PL++N R  ++ YV + G+ VGG  +P+    FD+  +G GG+++D+GT +TRLP+ 
Subjt:  SLSSQLK---ASSFSYCLVDRDSDSSSTLEF-NSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSD

Query:  VYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNS
         Y + RD F + T NL    GVS FDTCYD SG  SV+VPTV+F  ++G  L LPA N+L+ +D +GT+C AF A+ + LSIIG+ QQ+GI+VS+D  N 
Subjt:  VYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNS

Query:  LVGFSSNKC
         VGF  N C
Subjt:  LVGFSSNKC

AT3G61820.1 Eukaryotic aspartyl protease family protein; e value 3.5e-99; identity 45.78%
Query:  NNCFFSIF--LFLTLLNSSFFSSSLSRLLTES-----PHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYD
        N   FS+F  LF T   SS + + +   L  S     P S +L D S S      ++ ++ V  L S S                       SP+     
Subjt:  NNCFFSIF--LFLTLLNSSFFSSSLSRLLTES-----PHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYD

Query:  SLVRARLARDAARVRSLNRNLQLALTGAAVVRPDSITA-----PVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPI
         L   RL RD+ RV+S+  +L    TG    +    TA      V+SG SQGSGEYF R+GVG PA + Y V DTGSDV WLQC PC    ACY QTD I
Subjt:  SLVRARLARDAARVRSLNRNLQLALTGAAVVRPDSITA-----PVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPI

Query:  FDPKSSSSYSPLSCDSQQCQSLDRGS-C---QSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQL
        FDPK S +++ + C S+ C+ LD  S C   +S TC YQVSYGDGSFT G+F+TETL+F  ++ V ++P+GCGHDNEGLFVGAAGL+GLG G LS  SQ 
Subjt:  FDPKSSSSYSPLSCDSQQCQSLDRGS-C---QSGTCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQL

Query:  K---ASSFSYCLVDRDSDSS-----STLEF-NSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLP-ISSTRFDIDGSGLGGIIVDSGTFITRLPSD
        K      FSYCLVDR S  S     ST+ F N+  P  S+ +PLL N +  T+ Y+++ G+SVGG  +P +S ++F +D +G GG+I+DSGT +TRL   
Subjt:  K---ASSFSYCLVDRDSDSS-----STLEF-NSGRPSDSLTSPLLKNDRFGTYRYVEVTGMSVGGKPLP-ISSTRFDIDGSGLGGIIVDSGTFITRLPSD

Query:  VYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNS
         Y +LRDAF      L      S FDTC+D SG ++V+VPTV F    G  + LPA+NYLI +++ G FC AF  T  SLSIIG+ QQQG RV+YDLV S
Subjt:  VYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMDSAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNS

Query:  LVGFSSNKC
         VGF S  C
Subjt:  LVGFSSNKC


Sequences
CDS sequence
ATGAACAACAACTGCTTTTTTTCCATCTTCCTCTTCCTAACACTCCTCAATTCCTCTTTCTTTTCTTCCTCTCTCTCTCGCTTGCTCACAGAATCGCCTCATTCCACCAC
ACTCCTCGATGTCTCTGCCTCTTCCAAGCGAGCCCAAAACGCCCTCTCCATAAACCCTGTCCAAATCCTTCACTCCCATTCATTTCCTATTCTCAATCCAAATTCCTCTC
TCTCTCTGCCATTGCACCCCAGATTGGCCATTCATAAGCCTTCGCCTTCTTACAAGGACTACGACAGCCTAGTCCGAGCCCGACTCGCCCGCGATGCCGCCCGGGTTCGC
TCCCTCAACCGAAATCTCCAACTCGCTTTGACTGGGGCTGCAGTGGTCCGACCCGATTCCATAACCGCCCCTGTTGTTTCTGGCCAGAGCCAGGGGAGTGGGGAGTATTT
TGCACGGATTGGCGTCGGGCAGCCTGCCCAGTCGTTCTACTTCGTGCCCGACACTGGCAGCGATGTCACGTGGCTTCAGTGCCTGCCCTGCAGTGATGCGAACGCCTGTT
ACAAACAAACCGACCCGATATTCGACCCCAAATCGTCGTCGTCTTACAGTCCTCTCTCCTGCGATTCGCAGCAATGTCAATCACTCGACAGAGGCAGTTGTCAATCCGGC
ACGTGTAATTACCAAGTCTCGTACGGCGACGGCTCATTCACGACCGGCGAATTCGCCACCGAAACGCTGTCGTTCGCGAATTCGAAATCCGTCCCCAATCTCCCCATCGG
CTGCGGCCACGACAACGAAGGCCTCTTCGTCGGAGCCGCCGGTTTGATTGGCCTCGGCGGTGGGGATCTCTCCCTCTCCTCCCAGCTCAAAGCGTCGTCGTTTTCATACT
GCCTCGTCGACCGCGACTCGGACTCGTCCTCGACTCTCGAGTTCAACTCCGGGCGACCCAGTGACTCCCTCACCTCTCCGCTCCTCAAAAACGACCGATTCGGCACGTAC
CGGTACGTGGAGGTCACCGGAATGAGCGTCGGCGGGAAGCCGCTGCCGATTTCGTCAACGAGATTCGATATCGATGGGTCGGGACTCGGGGGAATAATCGTGGACTCGGG
GACGTTTATAACTCGGCTACCGAGTGACGTGTACGAATCGCTGAGAGACGCGTTTGTGAATCTGACGGGGAACTTGACGATGACGGGAGGGGTGTCGCCGTTCGACACGT
GTTACGATTTTTCCGGTCAGTCGAGCGTGCAAGTGCCGACGGTGGCGTTTGAGTTGTCGAAGGGGAGCTCGCTGCGGCTGCCGGCGAATAACTACTTGATACGGATGGAC
TCGGCTGGAACTTTTTGCTTGGCGTTTCTTGCAACGACGTCGTCGCTTTCCATAATTGGGAGCTTCCAACAGCAGGGAATACGTGTCAGCTATGACCTGGTCAACTCTCT
CGTCGGATTCTCATCCAACAAGTGTTAA
mRNA sequence
ATGAACAACAACTGCTTTTTTTCCATCTTCCTCTTCCTAACACTCCTCAATTCCTCTTTCTTTTCTTCCTCTCTCTCTCGCTTGCTCACAGAATCGCCTCATTCCACCAC
ACTCCTCGATGTCTCTGCCTCTTCCAAGCGAGCCCAAAACGCCCTCTCCATAAACCCTGTCCAAATCCTTCACTCCCATTCATTTCCTATTCTCAATCCAAATTCCTCTC
TCTCTCTGCCATTGCACCCCAGATTGGCCATTCATAAGCCTTCGCCTTCTTACAAGGACTACGACAGCCTAGTCCGAGCCCGACTCGCCCGCGATGCCGCCCGGGTTCGC
TCCCTCAACCGAAATCTCCAACTCGCTTTGACTGGGGCTGCAGTGGTCCGACCCGATTCCATAACCGCCCCTGTTGTTTCTGGCCAGAGCCAGGGGAGTGGGGAGTATTT
TGCACGGATTGGCGTCGGGCAGCCTGCCCAGTCGTTCTACTTCGTGCCCGACACTGGCAGCGATGTCACGTGGCTTCAGTGCCTGCCCTGCAGTGATGCGAACGCCTGTT
ACAAACAAACCGACCCGATATTCGACCCCAAATCGTCGTCGTCTTACAGTCCTCTCTCCTGCGATTCGCAGCAATGTCAATCACTCGACAGAGGCAGTTGTCAATCCGGC
ACGTGTAATTACCAAGTCTCGTACGGCGACGGCTCATTCACGACCGGCGAATTCGCCACCGAAACGCTGTCGTTCGCGAATTCGAAATCCGTCCCCAATCTCCCCATCGG
CTGCGGCCACGACAACGAAGGCCTCTTCGTCGGAGCCGCCGGTTTGATTGGCCTCGGCGGTGGGGATCTCTCCCTCTCCTCCCAGCTCAAAGCGTCGTCGTTTTCATACT
GCCTCGTCGACCGCGACTCGGACTCGTCCTCGACTCTCGAGTTCAACTCCGGGCGACCCAGTGACTCCCTCACCTCTCCGCTCCTCAAAAACGACCGATTCGGCACGTAC
CGGTACGTGGAGGTCACCGGAATGAGCGTCGGCGGGAAGCCGCTGCCGATTTCGTCAACGAGATTCGATATCGATGGGTCGGGACTCGGGGGAATAATCGTGGACTCGGG
GACGTTTATAACTCGGCTACCGAGTGACGTGTACGAATCGCTGAGAGACGCGTTTGTGAATCTGACGGGGAACTTGACGATGACGGGAGGGGTGTCGCCGTTCGACACGT
GTTACGATTTTTCCGGTCAGTCGAGCGTGCAAGTGCCGACGGTGGCGTTTGAGTTGTCGAAGGGGAGCTCGCTGCGGCTGCCGGCGAATAACTACTTGATACGGATGGAC
TCGGCTGGAACTTTTTGCTTGGCGTTTCTTGCAACGACGTCGTCGCTTTCCATAATTGGGAGCTTCCAACAGCAGGGAATACGTGTCAGCTATGACCTGGTCAACTCTCT
CGTCGGATTCTCATCCAACAAGTGTTAA
Protein sequence
MNNNCFFSIFLFLTLLNSSFFSSSLSRLLTESPHSTTLLDVSASSKRAQNALSINPVQILHSHSFPILNPNSSLSLPLHPRLAIHKPSPSYKDYDSLVRARLARDAARVR
SLNRNLQLALTGAAVVRPDSITAPVVSGQSQGSGEYFARIGVGQPAQSFYFVPDTGSDVTWLQCLPCSDANACYKQTDPIFDPKSSSSYSPLSCDSQQCQSLDRGSCQSG
TCNYQVSYGDGSFTTGEFATETLSFANSKSVPNLPIGCGHDNEGLFVGAAGLIGLGGGDLSLSSQLKASSFSYCLVDRDSDSSSTLEFNSGRPSDSLTSPLLKNDRFGTY
RYVEVTGMSVGGKPLPISSTRFDIDGSGLGGIIVDSGTFITRLPSDVYESLRDAFVNLTGNLTMTGGVSPFDTCYDFSGQSSVQVPTVAFELSKGSSLRLPANNYLIRMD
SAGTFCLAFLATTSSLSIIGSFQQQGIRVSYDLVNSLVGFSSNKC
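
The CDS and protein listings above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and builds the codon table in plain Python; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    # Drop line breaks and spaces so a wrapped listing can be pasted as is.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First ten codons and last six codons of the CDS listed above.
print(translate("ATGAACAACAACTGCTTTTTTTCCATCTTC"))
print(translate("TCATCCAACAAGTGTTAA"))
```

Translating the full CDS and removing the trailing "*" should give the protein sequence shown above; the two fragments correspond to its first ten residues and its last five residues plus the stop codon.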