; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G013410 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G013410
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein ASPARTIC PROTEASE IN GUARD CELL 1-like
Genome locationchr06:23987178..23988638
RNA-Seq ExpressionLsi06G013410
SyntenyLsi06G013410
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058063.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo var. makuwa]7.4e-22483.78Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        M TSLSS FLFLTI TSLQF S LSRKL+QS  ST+IFDV ASTNQ  NALS+KPK  +THSH PNS LSLPL+PRL+LHN SYKDYDSLVRARLARDAA
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQ LNRNLE SLNGG+  GE  NGS   DSITAPVVSGQS+GSGAEY A++GVGQP + F+LVPDTGSDVTWLQCQPCA ENACYKQIDPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        Y+PL CNSQQC LLD   CNS  C+YQV YGDGSFTTGELATETL+FGNSNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQLKASSFSYCLV+ 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFNS+ PSDSLTSPLVKN+RF SYRYVKV G+SVGG  LPISSTRFEI+ESGLGGIIVDSGTII+RLPSDVYESLREAFVKLTS+L PA GI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        S+FDTCY+ S QSNVEVPTIAFVL GG SLRLPA+NYLI VD+AGTYCLAFIKT SSLSIIGSFQQQGIRVSYDL NSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

TYK28409.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo var. makuwa]1.5e-22483.98Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        M TSLSS FLFLTI TSLQF S LSRKL+QS  ST+IFDV ASTNQ  NALS+KPK  +THSH PNS LSLPL+PRL+LHN SYKDYDSLVRARLARDAA
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQ LNRNLE SLNGG+  GE  NGS   DSITAPVVSGQS+GSGAEY A++GVGQP + F+LVPDTGSDVTWLQCQPCA ENACYKQIDPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        Y+PL CNSQQC LLD   CNS  C+YQV YGDGSFTTGELATE L+FGNSNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQLKASSFSYCLV+ 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFNS+ PSDSLTSPLVKN+RF SYRYVKV G+SVGGN LPISSTRFEI+ESGLGGIIVDSGTII+RLPSDVYESLREAFVKLTS+L PA GI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        S+FDTCY+ SGQSNVEVPTIAFVL GG SLRLPA+NYLI VD+AGTYCLAFIKT SSLSIIGSFQQQGIRVSYDL NSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

XP_004138238.1 protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis sativus]6.7e-22583.57Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        MNTSLSS FLFLTI TSLQFPS LSRKL+ S  ST+IFDVSASTNQ  +ALS+KPKP + HSH PNSP SLPL+PRL LHN SYKDY++LVRARL RDAA
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQ LNRNLE SLNGG   GE IN S   DSITAPVVSGQS+GSGAEY A+IGVGQP + F+LVPDTGSDVTWLQCQPCA+EN CYKQ DPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        YSPL CNSQQC+LLD A CNSD C+YQV YGDGSFTTGELATETL+FGNSNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQLKASSFSYCLV+ 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFNS+ PSDSLTSPLVKN+RF SYRYVKV G+SVGG  LPIS TRFEI+ESGLGGIIVDSGTII+RLPSDVYESLREAFVKLTS+L PA GI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        S+FDTCY+FSGQSNVEVPTIAFVL  G SLRLPA+NYLI +D+AGTYCLAFIKT SSLSIIGSFQQQGIRVSYDL NSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

XP_008453383.1 PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo]1.1e-21178.69Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD--STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDA
        MNTSLS A LFLTI T LQFPS LSRKL+     STT FDVSAS NQ  NALS+KPKPF+THS+H NSPLSL LHPRLT+HN SYKDY +LVRARLAR A
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD--STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDA

Query:  ARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSS
         RVQSLNR LELSLNG +  G+RINGS S +S+TAPV SG S G G EYFARIGVGQP QSFFLVPDTGSDVTWLQC+PCA ENAC+KQ+DPIFDPKSSS
Subjt:  ARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSS

Query:  SYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD
        SYS L CNS+QCQLLD AGC+S+ C+Y+V+YGDGSFT GELATETL+FGNSNSIPNLPIGCGHDNEGLF  AAGLIGLGGGAISLSSQL+ASSFSYCLVD
Subjt:  SYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD

Query:  QDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQG
         DSDSSSTL+FN+D+PSDSLTSPLVKNNRF S+RYVKV GMSVGG  LPISS+RFEI+ESG GGIIVDSGT IT+LPSDVY+ LR+AFV LT+NLP A G
Subjt:  QDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQG

Query:  ISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        +S FDTCYD S QS+VEVP IAF+LPGG SL+LPAKN LI VDSAGT+CLAF+  T  LSIIG+ QQQGIRVSYDL NS+VGF+TNKC
Subjt:  ISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

XP_038878113.1 protein ASPARTIC PROTEASE IN GUARD CELL 1 [Benincasa hispida]2.1e-24289.32Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        MN S+SSAF+FLTILTSLQFPS  SRKL+QS  ST IFDVSAST Q QNALS+KPKPFETHSHHPNSPLSLPLHPRLTL+N SYKDY SLVRARLARDA 
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQSLNRNLELSLNGGQ +G  INGS+S DSITAPVVSGQS G+G EYFARIGVGQP QSF+LVPDTGSDVTWLQCQPCAAE ACYKQIDPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        Y+ L CNSQQCQLLD A CNSDVC YQV YGDGSFTTGELATETL+FGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFN+DRPSDSLTSPLVKN+RF SYRYVKV GMSVGGNPLPISSTRFEI+ESGLGGIIVDSGT IT+LPSDVYESLREAFVK TSNLPPAQGI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        SLFDTCYD SGQS+VEVP IAFVLPG NSLRLPAKNYLIPVDS GTYCLAF KT SSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

TrEMBL top hitse value%identityAlignment
A0A0A0LPJ3 Aspartic proteinase nepenthesin-13.2e-22583.57Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        MNTSLSS FLFLTI TSLQFPS LSRKL+ S  ST+IFDVSASTNQ  +ALS+KPKP + HSH PNSP SLPL+PRL LHN SYKDY++LVRARL RDAA
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQ LNRNLE SLNGG   GE IN S   DSITAPVVSGQS+GSGAEY A+IGVGQP + F+LVPDTGSDVTWLQCQPCA+EN CYKQ DPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        YSPL CNSQQC+LLD A CNSD C+YQV YGDGSFTTGELATETL+FGNSNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQLKASSFSYCLV+ 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFNS+ PSDSLTSPLVKN+RF SYRYVKV G+SVGG  LPIS TRFEI+ESGLGGIIVDSGTII+RLPSDVYESLREAFVKLTS+L PA GI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        S+FDTCY+FSGQSNVEVPTIAFVL  G SLRLPA+NYLI +D+AGTYCLAFIKT SSLSIIGSFQQQGIRVSYDL NSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

A0A1S3BW42 protein ASPARTIC PROTEASE IN GUARD CELL 15.3e-21278.69Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD--STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDA
        MNTSLS A LFLTI T LQFPS LSRKL+     STT FDVSAS NQ  NALS+KPKPF+THS+H NSPLSL LHPRLT+HN SYKDY +LVRARLAR A
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD--STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDA

Query:  ARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSS
         RVQSLNR LELSLNG +  G+RINGS S +S+TAPV SG S G G EYFARIGVGQP QSFFLVPDTGSDVTWLQC+PCA ENAC+KQ+DPIFDPKSSS
Subjt:  ARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSS

Query:  SYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD
        SYS L CNS+QCQLLD AGC+S+ C+Y+V+YGDGSFT GELATETL+FGNSNSIPNLPIGCGHDNEGLF  AAGLIGLGGGAISLSSQL+ASSFSYCLVD
Subjt:  SYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD

Query:  QDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQG
         DSDSSSTL+FN+D+PSDSLTSPLVKNNRF S+RYVKV GMSVGG  LPISS+RFEI+ESG GGIIVDSGT IT+LPSDVY+ LR+AFV LT+NLP A G
Subjt:  QDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQG

Query:  ISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        +S FDTCYD S QS+VEVP IAF+LPGG SL+LPAKN LI VDSAGT+CLAF+  T  LSIIG+ QQQGIRVSYDL NS+VGF+TNKC
Subjt:  ISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

A0A5A7UQC2 Protein ASPARTIC PROTEASE IN GUARD CELL 1-like3.6e-22483.78Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        M TSLSS FLFLTI TSLQF S LSRKL+QS  ST+IFDV ASTNQ  NALS+KPK  +THSH PNS LSLPL+PRL+LHN SYKDYDSLVRARLARDAA
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQ LNRNLE SLNGG+  GE  NGS   DSITAPVVSGQS+GSGAEY A++GVGQP + F+LVPDTGSDVTWLQCQPCA ENACYKQIDPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        Y+PL CNSQQC LLD   CNS  C+YQV YGDGSFTTGELATETL+FGNSNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQLKASSFSYCLV+ 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFNS+ PSDSLTSPLVKN+RF SYRYVKV G+SVGG  LPISSTRFEI+ESGLGGIIVDSGTII+RLPSDVYESLREAFVKLTS+L PA GI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        S+FDTCY+ S QSNVEVPTIAFVL GG SLRLPA+NYLI VD+AGTYCLAFIKT SSLSIIGSFQQQGIRVSYDL NSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

A0A5A7UQC3 Protein ASPARTIC PROTEASE IN GUARD CELL 15.3e-21278.69Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD--STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDA
        MNTSLS A LFLTI T LQFPS LSRKL+     STT FDVSAS NQ  NALS+KPKPF+THS+H NSPLSL LHPRLT+HN SYKDY +LVRARLAR A
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD--STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDA

Query:  ARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSS
         RVQSLNR LELSLNG +  G+RINGS S +S+TAPV SG S G G EYFARIGVGQP QSFFLVPDTGSDVTWLQC+PCA ENAC+KQ+DPIFDPKSSS
Subjt:  ARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSS

Query:  SYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD
        SYS L CNS+QCQLLD AGC+S+ C+Y+V+YGDGSFT GELATETL+FGNSNSIPNLPIGCGHDNEGLF  AAGLIGLGGGAISLSSQL+ASSFSYCLVD
Subjt:  SYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVD

Query:  QDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQG
         DSDSSSTL+FN+D+PSDSLTSPLVKNNRF S+RYVKV GMSVGG  LPISS+RFEI+ESG GGIIVDSGT IT+LPSDVY+ LR+AFV LT+NLP A G
Subjt:  QDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQG

Query:  ISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        +S FDTCYD S QS+VEVP IAF+LPGG SL+LPAKN LI VDSAGT+CLAF+  T  LSIIG+ QQQGIRVSYDL NS+VGF+TNKC
Subjt:  ISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

A0A5D3DYG2 Protein ASPARTIC PROTEASE IN GUARD CELL 1-like7.2e-22583.98Show/hide
Query:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA
        M TSLSS FLFLTI TSLQF S LSRKL+QS  ST+IFDV ASTNQ  NALS+KPK  +THSH PNS LSLPL+PRL+LHN SYKDYDSLVRARLARDAA
Subjt:  MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSD-STTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAA

Query:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS
        RVQ LNRNLE SLNGG+  GE  NGS   DSITAPVVSGQS+GSGAEY A++GVGQP + F+LVPDTGSDVTWLQCQPCA ENACYKQIDPIFDPKSSSS
Subjt:  RVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSS

Query:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ
        Y+PL CNSQQC LLD   CNS  C+YQV YGDGSFTTGELATE L+FGNSNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQLKASSFSYCLV+ 
Subjt:  YSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQ

Query:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI
        DSDSSSTLEFNS+ PSDSLTSPLVKN+RF SYRYVKV G+SVGGN LPISSTRFEI+ESGLGGIIVDSGTII+RLPSDVYESLREAFVKLTS+L PA GI
Subjt:  DSDSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGI

Query:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        S+FDTCY+ SGQSNVEVPTIAFVL GG SLRLPA+NYLI VD+AGTYCLAFIKT SSLSIIGSFQQQGIRVSYDL NSLVGFSTNKC
Subjt:  SLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-22.0e-6739.26Show/hide
Query:  LVRARLARDAARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQI
        L++  + R   R++S+N  L+                 S   I  PV +G       EY   + +G P  SF  + DTGSD+ W QC+PC     C+ Q 
Subjt:  LVRARLARDAARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQI

Query:  DPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVG-AAGLIGLGGGAISLSSQL
         PIF+P+ SSS+S LPC SQ CQ L    CN++ C Y   YGDGS T G +ATET TF  ++S+PN+  GCG DN+G   G  AGLIG+G G +SL SQL
Subjt:  DPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVG-AAGLIGLGGGAISLSSQL

Query:  KASSFSYCLVDQDSDSSSTLEFNSDR---PSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLRE
            FSYC+    S S STL   S     P  S ++ L+ ++   +Y Y+ + G++VGG+ L I S+ F++++ G GG+I+DSGT +T LP D Y ++ +
Subjt:  KASSFSYCLVDQDSDSSSTLEFNSDR---PSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLRE

Query:  AFVKLTSNLPPA-QGISLFDTCYDF-SGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTS-SLSIIGSFQQQGIRVSYDLANSLVGF
        AF     NLP   +  S   TC+   S  S V+VP I+    GG  L L  +N LI   + G  CLA   ++   +SI G+ QQQ  +V YDL N  V F
Subjt:  AFVKLTSNLPPA-QGISLFDTCYDF-SGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTS-SLSIIGSFQQQGIRVSYDLANSLVGF

Query:  STNKC
           +C
Subjt:  STNKC

Q766C3 Aspartic proteinase nepenthesin-11.8e-7140.45Show/hide
Query:  RDAARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPK
        ++  + Q L R +E      Q L   +NG   V++    V +G       EY   + +G PAQ F  + DTGSD+ W QCQPC     C+ Q  PIF+P+
Subjt:  RDAARVQSLNRNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPK

Query:  SSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVG-AAGLIGLGGGAISLSSQLKASSFSY
         SSS+S LPC+SQ CQ L    C+++ C Y   YGDGS T G + TETLTFG S SIPN+  GCG +N+G   G  AGL+G+G G +SL SQL  + FSY
Subjt:  SSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVG-AAGLIGLGGGAISLSSQLKASSFSY

Query:  CLVDQDSDSSSTLEFNSDRPSDSLTSP---LVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEI-EESGLGGIIVDSGTIITRLPSDVYESLREAFVKLT
        C+    S + S L   S   S +  SP   L+++++  ++ Y+ ++G+SVG   LPI  + F +   +G GGII+DSGT +T   ++ Y+S+R+ F+   
Subjt:  CLVDQDSDSSSTLEFNSDRPSDSLTSP---LVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEI-EESGLGGIIVDSGTIITRLPSDVYESLREAFVKLT

Query:  SNLPPAQGISL-FDTCYDF-SGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
         NLP   G S  FD C+   S  SN+++PT      GG+ L LP++NY I   S G  CLA   ++  +SI G+ QQQ + V YD  NS+V F++ +C
Subjt:  SNLPPAQGISL-FDTCYDF-SGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 22.6e-9942.68Show/hide
Query:  LFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETH-SHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLNRNL
        L L+  +S+ FP F    + Q   T    V+A+     N          TH S   +S  +L L  R    + +Y+++   + AR+ RD  RV ++ R  
Subjt:  LFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETH-SHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLNRNL

Query:  ELSLNGGQILGERINGSKS---VDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC
               +I G+ I  S S   V+   + +VSG  QGSG EYF RIGVG P +  ++V D+GSD+ W+QCQPC     CYKQ DP+FDP  S SY+ + C
Subjt:  ELSLNGGQILGERINGSKS---VDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC

Query:  NSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQDSD
         S  C  ++ +GC+S  C Y+V YGDGS+T G LA ETLTF  +  + N+ +GCGH N G+F+GAAGL+G+GGG++S   QL      +F YCLV + +D
Subjt:  NSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQDSD

Query:  SSSTLEFNSDR-PSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISL
        S+ +L F  +  P  +   PLV+N R  S+ YV + G+ VGG  +P+    F++ E+G GG+++D+GT +TRLP+  Y + R+ F   T+NLP A G+S+
Subjt:  SSSTLEFNSDR-PSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISL

Query:  FDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        FDTCYD SG  +V VPT++F    G  L LPA+N+L+PVD +GTYC AF  + + LSIIG+ QQ+GI+VS+D AN  VGF  N C
Subjt:  FDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

Q9LNJ3 Aspartyl protease family protein 23.3e-9443.15Show/hide
Query:  SSAFLFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLN
        S  F FL++ +    PSF  + L  +  +       S     ++ S+    FE+ S   +S  S+ L+       +S K  D L  +RL RD+ RV+S  
Subjt:  SSAFLFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLN

Query:  RNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC
            ++    QI G  +  +      ++ VVSG SQGSG EYF R+GVG PA+  ++V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+ +PC
Subjt:  RNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC

Query:  NSQQCQLLDIAGCNS--DVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQD
        +S  C+ LD AGCN+    CLYQV YGDGSFT G+ +TETLTF   N +  + +GCGHDNEGLFVGAAGL+GLG G +S   Q        FSYCLVD+ 
Subjt:  NSQQCQLLDIAGCNS--DVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQD

Query:  SDS--SSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLP-ISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQ
        + S  SS +  N+     +  +PL+ N +  ++ YV + G+SVGG  +P ++++ F++++ G GG+I+DSGT +TRL    Y ++R+AF      L  A 
Subjt:  SDS--SSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLP-ISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQ

Query:  GISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
          SLFDTC+D S  + V+VPT+     G + + LPA NYLIPVD+ G +C AF  T   LSIIG+ QQQG RV YDLA+S VGF+   C
Subjt:  GISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.5e-13150.5Show/hide
Query:  FLTILTSLQFPSFL------SRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHP-----------NSPLSLPLHPRLTLHNTSYKDYDSLVRAR
        FL++L  +    FL      SR LS    T + DV +S  Q Q  LS+ P      +  P           +SPLSL LH R T   + +KDY SL  +R
Subjt:  FLTILTSLQFPSFL------SRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHP-----------NSPLSLPLHPRLTLHNTSYKDYDSLVRAR

Query:  LARDAARVQSLNRNLELSLNG------GQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQ
        L RD++RV  +   +  ++ G        +  E  +     + +T PVVSG SQGSG EYF+RIGVG PA+  +LV DTGSDV W+QC+PCA    CY+Q
Subjt:  LARDAARVQSLNRNLELSLNG------GQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQ

Query:  IDPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQL
         DP+F+P SSS+Y  L C++ QC LL+ + C S+ CLYQV YGDGSFT GELAT+T+TFGNS  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q+
Subjt:  IDPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQL

Query:  KASSFSYCLVDQDSDSSSTLEFNS-DRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAF
        KA+SFSYCLVD+DS  SS+L+FNS        T+PL++N +  ++ YV + G SVGG  + +    F+++ SG GG+I+D GT +TRL +  Y SLR+AF
Subjt:  KASSFSYCLVDQDSDSSSTLEFNS-DRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAF

Query:  VKLTSNLPP-AQGISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNK
        +KLT NL   +  ISLFDTCYDFS  S V+VPT+AF   GG SL LPAKNYLIPVD +GT+C AF  T+SSLSIIG+ QQQG R++YDL+ +++G S NK
Subjt:  VKLTSNLPP-AQGISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNK

Query:  C
        C
Subjt:  C

Arabidopsis top hitse value%identityAlignment
AT1G01300.1 Eukaryotic aspartyl protease family protein2.4e-9543.15Show/hide
Query:  SSAFLFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLN
        S  F FL++ +    PSF  + L  +  +       S     ++ S+    FE+ S   +S  S+ L+       +S K  D L  +RL RD+ RV+S  
Subjt:  SSAFLFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLN

Query:  RNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC
            ++    QI G  +  +      ++ VVSG SQGSG EYF R+GVG PA+  ++V DTGSD+ WLQC PC     CY Q DPIFDP+ S +Y+ +PC
Subjt:  RNLELSLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC

Query:  NSQQCQLLDIAGCNS--DVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQD
        +S  C+ LD AGCN+    CLYQV YGDGSFT G+ +TETLTF   N +  + +GCGHDNEGLFVGAAGL+GLG G +S   Q        FSYCLVD+ 
Subjt:  NSQQCQLLDIAGCNS--DVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQD

Query:  SDS--SSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLP-ISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQ
        + S  SS +  N+     +  +PL+ N +  ++ YV + G+SVGG  +P ++++ F++++ G GG+I+DSGT +TRL    Y ++R+AF      L  A 
Subjt:  SDS--SSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLP-ISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQ

Query:  GISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
          SLFDTC+D S  + V+VPT+     G + + LPA NYLIPVD+ G +C AF  T   LSIIG+ QQQG RV YDLA+S VGF+   C
Subjt:  GISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

AT1G25510.1 Eukaryotic aspartyl protease family protein4.7e-13651.34Show/hide
Query:  SAFLFLTILTSLQFPSFLSRKLSQSDSTT--IFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSL
        S F F+  LTS    S  SR L ++ +TT  I +V+ S ++ +   S +    E  +H  +S  SL LH R+++  T + DY SL  ARL RD ARV+SL
Subjt:  SAFLFLTILTSLQFPSFLSRKLSQSDSTT--IFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSL

Query:  NRNLELSLNG---GQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYS
           L+L++N      +       +     I AP++SG +QGSG EYF R+G+G+PA+  ++V DTGSDV WLQC PCA    CY Q +PIF+P SSSSY 
Subjt:  NRNLELSLNG---GQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYS

Query:  PLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQDS
        PL C++ QC  L+++ C +  CLY+V YGDGS+T G+ ATETLT G S  + N+ +GCGH NEGLFVGAAGL+GLGGG ++L SQL  +SFSYCLVD+DS
Subjt:  PLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQDS

Query:  DSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISL
        DS+ST++F +    D++ +PL++N++  ++ Y+ + G+SVGG  L I  + FE++ESG GGII+DSGT +TRL +++Y SLR++FVK T +L  A G+++
Subjt:  DSSSTLEFNSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISL

Query:  FDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        FDTCY+ S ++ VEVPT+AF  PGG  L LPAKNY+IPVDS GT+CLAF  T SSL+IIG+ QQQG RV++DLANSL+GFS+NKC
Subjt:  FDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

AT3G18490.1 Eukaryotic aspartyl protease family protein1.1e-13250.5Show/hide
Query:  FLTILTSLQFPSFL------SRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHP-----------NSPLSLPLHPRLTLHNTSYKDYDSLVRAR
        FL++L  +    FL      SR LS    T + DV +S  Q Q  LS+ P      +  P           +SPLSL LH R T   + +KDY SL  +R
Subjt:  FLTILTSLQFPSFL------SRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHP-----------NSPLSLPLHPRLTLHNTSYKDYDSLVRAR

Query:  LARDAARVQSLNRNLELSLNG------GQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQ
        L RD++RV  +   +  ++ G        +  E  +     + +T PVVSG SQGSG EYF+RIGVG PA+  +LV DTGSDV W+QC+PCA    CY+Q
Subjt:  LARDAARVQSLNRNLELSLNG------GQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQ

Query:  IDPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQL
         DP+F+P SSS+Y  L C++ QC LL+ + C S+ CLYQV YGDGSFT GELAT+T+TFGNS  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q+
Subjt:  IDPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQL

Query:  KASSFSYCLVDQDSDSSSTLEFNS-DRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAF
        KA+SFSYCLVD+DS  SS+L+FNS        T+PL++N +  ++ YV + G SVGG  + +    F+++ SG GG+I+D GT +TRL +  Y SLR+AF
Subjt:  KASSFSYCLVDQDSDSSSTLEFNS-DRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAF

Query:  VKLTSNLPP-AQGISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNK
        +KLT NL   +  ISLFDTCYDFS  S V+VPT+AF   GG SL LPAKNYLIPVD +GT+C AF  T+SSLSIIG+ QQQG R++YDL+ +++G S NK
Subjt:  VKLTSNLPP-AQGISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNK

Query:  C
        C
Subjt:  C

AT3G20015.1 Eukaryotic aspartyl protease family protein1.9e-10042.68Show/hide
Query:  LFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETH-SHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLNRNL
        L L+  +S+ FP F    + Q   T    V+A+     N          TH S   +S  +L L  R    + +Y+++   + AR+ RD  RV ++ R  
Subjt:  LFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETH-SHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLNRNL

Query:  ELSLNGGQILGERINGSKS---VDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC
               +I G+ I  S S   V+   + +VSG  QGSG EYF RIGVG P +  ++V D+GSD+ W+QCQPC     CYKQ DP+FDP  S SY+ + C
Subjt:  ELSLNGGQILGERINGSKS---VDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPC

Query:  NSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQDSD
         S  C  ++ +GC+S  C Y+V YGDGS+T G LA ETLTF  +  + N+ +GCGH N G+F+GAAGL+G+GGG++S   QL      +F YCLV + +D
Subjt:  NSQQCQLLDIAGCNSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQDSD

Query:  SSSTLEFNSDR-PSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISL
        S+ +L F  +  P  +   PLV+N R  S+ YV + G+ VGG  +P+    F++ E+G GG+++D+GT +TRLP+  Y + R+ F   T+NLP A G+S+
Subjt:  SSSTLEFNSDR-PSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISL

Query:  FDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
        FDTCYD SG  +V VPT++F    G  L LPA+N+L+PVD +GTYC AF  + + LSIIG+ QQ+GI+VS+D AN  VGF  N C
Subjt:  FDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC

AT3G61820.1 Eukaryotic aspartyl protease family protein1.5e-9743.97Show/hide
Query:  ILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYD--SLVRARLARDAARVQSLNRNLELS
        +L +L F  F     + S S+    +  +T      LS       T      S  SL +H       +S+ D     L   RL RD+ RV+S+     +S
Subjt:  ILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYD--SLVRARLARDAARVQSLNRNLELS

Query:  LNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPCNSQQCQ
           G+   +R    ++    +  V+SG SQGSG EYF R+GVG PA + ++V DTGSDV WLQC PC    ACY Q D IFDPK S +++ +PC S+ C+
Subjt:  LNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPCNSQQCQ

Query:  LLDIAG-C---NSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQDSDSS
         LD +  C    S  CLYQV YGDGSFT G+ +TETLTF  +  + ++P+GCGHDNEGLFVGAAGL+GLG G +S  SQ K      FSYCLVD+ S  S
Subjt:  LLDIAG-C---NSDVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLK---ASSFSYCLVDQDSDSS

Query:  -----STLEF-NSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLP-ISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQ
             ST+ F N+  P  S+ +PL+ N +  ++ Y+++ G+SVGG+ +P +S ++F+++ +G GG+I+DSGT +TRL    Y +LR+AF    + L  A 
Subjt:  -----STLEF-NSDRPSDSLTSPLVKNNRFQSYRYVKVDGMSVGGNPLP-ISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQ

Query:  GISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC
          SLFDTC+D SG + V+VPT+ F   GG  + LPA NYLIPV++ G +C AF  T  SLSIIG+ QQQG RV+YDL  S VGF +  C
Subjt:  GISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPVDSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACACTTCACTTTCCTCTGCTTTTCTCTTCCTTACAATCCTCACTTCCCTTCAATTCCCTTCATTTCTCTCTCGCAAATTATCACAATCGGATTCCACTACCATCTT
CGATGTCTCTGCCTCCACAAACCAAGTCCAGAACGCCCTCTCCATGAAACCCAAACCTTTTGAAACTCACTCTCACCATCCAAATTCCCCTTTATCTCTACCATTGCACC
CCAGATTGACCCTTCATAACACTTCTTACAAGGACTACGATAGCCTCGTCAGGGCCCGACTCGCCCGTGATGCTGCCCGAGTTCAATCCCTTAACCGAAATCTTGAGCTC
TCTTTAAATGGGGGCCAAATTTTAGGTGAAAGAATTAACGGGTCGAAGTCTGTAGATTCAATTACTGCTCCGGTTGTTTCAGGGCAAAGTCAGGGGAGTGGCGCGGAGTA
TTTTGCCCGGATTGGCGTCGGTCAGCCGGCGCAATCGTTTTTTTTGGTGCCCGATACTGGCAGCGATGTCACGTGGCTTCAATGCCAACCCTGTGCTGCTGAGAACGCTT
GTTATAAACAAATCGACCCGATATTTGACCCGAAATCGTCGTCTTCTTACTCTCCCCTGCCTTGCAATTCACAACAATGTCAATTGCTGGACATAGCCGGTTGTAACTCT
GATGTATGTCTGTACCAAGTCAAGTACGGCGACGGTTCATTCACAACCGGCGAACTCGCCACTGAAACGTTGACGTTTGGGAATTCTAATTCCATCCCCAATTTACCAAT
CGGCTGCGGCCACGACAACGAAGGCCTCTTCGTTGGAGCTGCCGGTTTAATCGGCCTTGGCGGTGGGGCCATTTCCCTTTCTTCCCAACTAAAAGCGTCGTCGTTTTCAT
ACTGTCTCGTCGACCAAGACTCAGACTCCTCCTCCACTCTCGAGTTCAACTCAGACCGACCCAGTGACTCACTCACCTCTCCACTCGTGAAAAACAACCGATTCCAGTCG
TACCGCTACGTGAAAGTCGACGGAATGAGCGTCGGCGGGAACCCTTTACCGATTTCCTCAACGAGATTTGAAATCGAAGAGTCGGGATTAGGAGGAATAATCGTGGATTC
TGGGACGATTATAACTCGGCTACCGAGTGATGTGTATGAATCGTTAAGAGAGGCGTTTGTGAAGCTGACGAGTAACCTCCCGCCAGCACAAGGAATATCGTTGTTCGATA
CATGTTACGATTTTTCAGGTCAGTCGAATGTGGAGGTGCCAACGATAGCATTTGTGTTGCCGGGAGGAAACTCGCTGCGGCTGCCGGCGAAAAATTACTTGATCCCGGTG
GACTCGGCCGGAACTTATTGCTTGGCATTTATTAAAACGACGTCGTCGCTTTCAATAATTGGGAGCTTTCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTC
CCTCGTCGGATTCTCAACTAATAAATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACACTTCACTTTCCTCTGCTTTTCTCTTCCTTACAATCCTCACTTCCCTTCAATTCCCTTCATTTCTCTCTCGCAAATTATCACAATCGGATTCCACTACCATCTT
CGATGTCTCTGCCTCCACAAACCAAGTCCAGAACGCCCTCTCCATGAAACCCAAACCTTTTGAAACTCACTCTCACCATCCAAATTCCCCTTTATCTCTACCATTGCACC
CCAGATTGACCCTTCATAACACTTCTTACAAGGACTACGATAGCCTCGTCAGGGCCCGACTCGCCCGTGATGCTGCCCGAGTTCAATCCCTTAACCGAAATCTTGAGCTC
TCTTTAAATGGGGGCCAAATTTTAGGTGAAAGAATTAACGGGTCGAAGTCTGTAGATTCAATTACTGCTCCGGTTGTTTCAGGGCAAAGTCAGGGGAGTGGCGCGGAGTA
TTTTGCCCGGATTGGCGTCGGTCAGCCGGCGCAATCGTTTTTTTTGGTGCCCGATACTGGCAGCGATGTCACGTGGCTTCAATGCCAACCCTGTGCTGCTGAGAACGCTT
GTTATAAACAAATCGACCCGATATTTGACCCGAAATCGTCGTCTTCTTACTCTCCCCTGCCTTGCAATTCACAACAATGTCAATTGCTGGACATAGCCGGTTGTAACTCT
GATGTATGTCTGTACCAAGTCAAGTACGGCGACGGTTCATTCACAACCGGCGAACTCGCCACTGAAACGTTGACGTTTGGGAATTCTAATTCCATCCCCAATTTACCAAT
CGGCTGCGGCCACGACAACGAAGGCCTCTTCGTTGGAGCTGCCGGTTTAATCGGCCTTGGCGGTGGGGCCATTTCCCTTTCTTCCCAACTAAAAGCGTCGTCGTTTTCAT
ACTGTCTCGTCGACCAAGACTCAGACTCCTCCTCCACTCTCGAGTTCAACTCAGACCGACCCAGTGACTCACTCACCTCTCCACTCGTGAAAAACAACCGATTCCAGTCG
TACCGCTACGTGAAAGTCGACGGAATGAGCGTCGGCGGGAACCCTTTACCGATTTCCTCAACGAGATTTGAAATCGAAGAGTCGGGATTAGGAGGAATAATCGTGGATTC
TGGGACGATTATAACTCGGCTACCGAGTGATGTGTATGAATCGTTAAGAGAGGCGTTTGTGAAGCTGACGAGTAACCTCCCGCCAGCACAAGGAATATCGTTGTTCGATA
CATGTTACGATTTTTCAGGTCAGTCGAATGTGGAGGTGCCAACGATAGCATTTGTGTTGCCGGGAGGAAACTCGCTGCGGCTGCCGGCGAAAAATTACTTGATCCCGGTG
GACTCGGCCGGAACTTATTGCTTGGCATTTATTAAAACGACGTCGTCGCTTTCAATAATTGGGAGCTTTCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTC
CCTCGTCGGATTCTCAACTAATAAATGTTAG
Protein sequenceShow/hide protein sequence
MNTSLSSAFLFLTILTSLQFPSFLSRKLSQSDSTTIFDVSASTNQVQNALSMKPKPFETHSHHPNSPLSLPLHPRLTLHNTSYKDYDSLVRARLARDAARVQSLNRNLEL
SLNGGQILGERINGSKSVDSITAPVVSGQSQGSGAEYFARIGVGQPAQSFFLVPDTGSDVTWLQCQPCAAENACYKQIDPIFDPKSSSSYSPLPCNSQQCQLLDIAGCNS
DVCLYQVKYGDGSFTTGELATETLTFGNSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLKASSFSYCLVDQDSDSSSTLEFNSDRPSDSLTSPLVKNNRFQS
YRYVKVDGMSVGGNPLPISSTRFEIEESGLGGIIVDSGTIITRLPSDVYESLREAFVKLTSNLPPAQGISLFDTCYDFSGQSNVEVPTIAFVLPGGNSLRLPAKNYLIPV
DSAGTYCLAFIKTTSSLSIIGSFQQQGIRVSYDLANSLVGFSTNKC