; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G006780 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G006780
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionaspartic proteinase NANA, chloroplast-like
Genome locationCmo_Chr02:4302239..4303967
RNA-Seq ExpressionCmoCh02G006780
SyntenyCmoCh02G006780
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605363.1 Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. sororia]2.2e-26686.45Show/hide
Query:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
        MSPISHLLILFFVFFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAIS H+NWTKVVEN
Subjt:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN

Query:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
        AEEKE KE S SNL   SQTPI LK YPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+HKMRN+MR RF YALYANQS
Subjt:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS

Query:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCI DF +LGGQPDCPTPN+PCSYTYSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV

Query:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
        YKAAENN+GGGFSYCLADH RNITAISYFVFGTPSPKTF+A+TSSPIGPP+TT+L TGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTILDTGT
Subjt:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
        SLTMLTAPAHDAVIEAMAPKI KFGRM      E+++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
Subjt:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI

Query:  IQQTYLWQFDLLKGSVTFAPSDCA
        IQQTY WQFDLLKGSVTFAPSDCA
Subjt:  IQQTYLWQFDLLKGSVTFAPSDCA

KAG6605377.1 Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. sororia]1.7e-30397.71Show/hide
Query:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
        MSPISHLLILFFVF SPLTVAVA+ SNANNPKQESDANNEE+EFVRLDL+HRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVH+NWTKVVEN
Subjt:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN

Query:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
        AEEKEKKEASSSNLPP SQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
Subjt:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS

Query:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCIQDFSELGGQPDCPTPN PCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV

Query:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
        YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTF ASTS+PIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
Subjt:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
        SLTMLTAPAHDAVIEAMAPKIEKFGRMERD +GEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
Subjt:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI

Query:  IQQTYLWQFDLLKGSVTFAPSDCA
        IQQTYLWQFDLLKGSVTFAPSDCA
Subjt:  IQQTYLWQFDLLKGSVTFAPSDCA

XP_022947824.1 aspartic proteinase NANA, chloroplast-like [Cucurbita moschata]8.0e-310100Show/hide
Query:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
        MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
Subjt:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN

Query:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
        AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
Subjt:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS

Query:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV

Query:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
        YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
Subjt:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
        SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
Subjt:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI

Query:  IQQTYLWQFDLLKGSVTFAPSDCA
        IQQTYLWQFDLLKGSVTFAPSDCA
Subjt:  IQQTYLWQFDLLKGSVTFAPSDCA

XP_023007158.1 aspartic proteinase NANA, chloroplast-like [Cucurbita maxima]1.2e-29394.68Show/hide
Query:  MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVV
        MS ISHLLILFFV  FFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS H+NWTKVV
Subjt:  MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVV

Query:  ENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYAN
        ENAEEK  KEAS SN PP SQTPIALKTYPGADFGSSEFFVQLK+GTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRN+MRERF YALYAN
Subjt:  ENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYAN

Query:  QSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS
        QSSSFSPIPCSSKQCIQDFSELGGQPDCPTPN+PCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEE+TDSQFLDGADGLIGLGSSIYS
Subjt:  QSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS

Query:  FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDT
        FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTF+ASTSSPIGPPATT+L TGGRYSCYYGVQL+GISVDGQILNIPPHVWNIKSGCGTILDT
Subjt:  FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDT

Query:  GTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG
        GTSLTMLTAPAHDAVIEAMAPKIEKFGRME+DVKGEREKNFKLCFNDT+WNFGMLPKLGFHFE GAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG
Subjt:  GTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG

Query:  NIIQQTYLWQFDLLKGSVTFAPSDCA
        NIIQQT++W++DLLKGSVTFAPSDCA
Subjt:  NIIQQTYLWQFDLLKGSVTFAPSDCA

XP_023533886.1 aspartic proteinase NANA, chloroplast-like [Cucurbita pepo subsp. pepo]7.0e-29795.04Show/hide
Query:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
        MSPISHLLILFFVFFSPLTVA ADQSNANNPKQESDANNEE+E VRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS H+NWTKVVEN
Subjt:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN

Query:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
        AEEKEKKEAS SNLPPQSQ+PIALKTYPGAD+GSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR+RFIYALYANQS
Subjt:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS

Query:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
        SSFSPIPCSS+QCIQDFSELGGQPDCPTPNSPCSYTYSYLSGD AMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDS+FLDGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV

Query:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
        YKAAENN+GGGFSYCLADH+R+ITAISYFVFGTPSPKTFAASTS+PIGPPATT+LITGGRYSCYYGVQLAGISVDGQILNIPPHVWNI SGCGTILDTGT
Subjt:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
        SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDT+W FGM PKLGFHFEGG VFEPPDRSY+V AS QCSCIAITSLPFPSINILGNI
Subjt:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI

Query:  IQQTYLWQFDLLKGSVTFAPSDCA
        IQQTYLWQFDL KGSVTFAPSDCA
Subjt:  IQQTYLWQFDLLKGSVTFAPSDCA

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein9.7e-14350.09Show/hide
Query:  MSPISH--------LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHM
        MSPIS+        LL  F  F S    A+ D+ N  N     + + +E+E ++ DL+HRHHP+V +++H ++K+  + +R+KDI  HD +R R+IS  M
Subjt:  MSPISH--------LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHM

Query:  NWTKVVENAEEK-------EKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRN
        N  K VE+A  +       E++ A S+ LPP + TPI ++   GADFGSSE+FV+LK+GTP Q F +IADTGSDL W +CRYRRC G+CS+ +  HK +N
Subjt:  NWTKVVENAEEK-------EKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRN

Query:  RMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDG
          ++RF +A  AN SSSF  + CSS  C  D ++L    +C  P SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL + + GCTE +  S F  G
Subjt:  RMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDG

Query:  ADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPAT-TRLITGGRYSCYYGVQLAGISVDGQILNIPPH
        ADG++GLG+S YS  YKAAEN  GGGFSYCL DHL +  AISYFV G P+P T A+++S+ +    T T+L  G  YS +YGV L GIS +G +LNIP  
Subjt:  ADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPAT-TRLITGGRYSCYYGVQLAGISVDGQILNIPPH

Query:  VWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCI
        VW+I SG GTI+D+GTSLT+L APA D V+EA+ P+++KF ++E +        F  CFN++Q+   M PKL FHF  G VFEPP +SYIVS     SCI
Subjt:  VWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCI

Query:  AITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC
           S+PFP+ NI+GNI+QQ +LWQFD  K  V FAPS+C
Subjt:  AITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC

A0A1S3C2F3 aspartic proteinase CDR14.6e-14550.93Show/hide
Query:  MSPISH-----LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWT
        MSPIS+     LL+ F  F S    A+ D++N  N   + D    E++ +R DL+HRHHP+V ++L+ ++K+  + +R+KDI  HD++R R+IS  MN  
Subjt:  MSPISH-----LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWT

Query:  KVVENAEEKEKKEAS-------SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR
        K +E+A  + + EA+       S+ LPP + TPI +K   GADFGSSE+FVQLK+GTP Q F +IADTGSDL W +CRYRRC G+CS  +  HK +N  +
Subjt:  KVVENAEEKEKKEAS-------SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR

Query:  ERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADG
        +RF +AL ANQSS+F  + CSS  C  + +EL    +C TP SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL++ + GCT EI      DGADG
Subjt:  ERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADG

Query:  LIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPAT---TRLITGGRYSCYYGVQLAGISVDGQILNIPPHV
        ++GLG+S YS  YKAAEN  GGGFSYCL DHL +  A+SYFV G P+P T A+++S+   PPA    T+L  G  YS +YGV L GIS DGQ+LNIPP V
Subjt:  LIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPAT---TRLITGGRYSCYYGVQLAGISVDGQILNIPPHV

Query:  WNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIA
        W+   GCGTI+D+GTSLT+L  PA D V+E +  ++++F ++E +        F  CFN++Q+   M PKL FHF  G VFEPP +SYIVS     SCI 
Subjt:  WNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIA

Query:  ITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC
        I S+PFPS+NI+GNI+QQ +LWQFD  K  V FA S+C
Subjt:  ITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC

A0A6J1G5P4 aspartic proteinase NANA, chloroplast-like1.0e-26485.8Show/hide
Query:  MSPISHLLIL----FFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTK
        MSPISHLLIL     FVFFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAIS H+NWTK
Subjt:  MSPISHLLIL----FFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTK

Query:  VVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALY
        VVENAEEKE KE S SNL   SQTPI LKTYPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+HKMRN+MR RF YALY
Subjt:  VVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALY

Query:  ANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSI
        ANQSSSFSPIPCSSKQCI DF +LGGQPDCPTPN+PCSYTYSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSI
Subjt:  ANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSI

Query:  YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTIL
        YSFVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTF+A+TSSPIGPPATT+L TGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTIL
Subjt:  YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTIL

Query:  DTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
        DTGTSLTMLTAPAHDAVIEAMAPKI KFGRM      E+++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
Subjt:  DTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI

Query:  LGNIIQQTYLWQFDLLKGSVTFAPSDCA
        LGNIIQQTY WQFDLLKGSVTFAPSDCA
Subjt:  LGNIIQQTYLWQFDLLKGSVTFAPSDCA

A0A6J1G810 aspartic proteinase NANA, chloroplast-like3.9e-310100Show/hide
Query:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
        MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN
Subjt:  MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVEN

Query:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
        AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS
Subjt:  AEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS

Query:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFV

Query:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
        YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT
Subjt:  YKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
        SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI
Subjt:  SLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNI

Query:  IQQTYLWQFDLLKGSVTFAPSDCA
        IQQTYLWQFDLLKGSVTFAPSDCA
Subjt:  IQQTYLWQFDLLKGSVTFAPSDCA

A0A6J1L6Y2 aspartic proteinase NANA, chloroplast-like6.0e-29494.68Show/hide
Query:  MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVV
        MS ISHLLILFFV  FFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS H+NWTKVV
Subjt:  MSPISHLLILFFV--FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVV

Query:  ENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYAN
        ENAEEK  KEAS SN PP SQTPIALKTYPGADFGSSEFFVQLK+GTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRN+MRERF YALYAN
Subjt:  ENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYAN

Query:  QSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS
        QSSSFSPIPCSSKQCIQDFSELGGQPDCPTPN+PCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEE+TDSQFLDGADGLIGLGSSIYS
Subjt:  QSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYS

Query:  FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDT
        FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTF+ASTSSPIGPPATT+L TGGRYSCYYGVQL+GISVDGQILNIPPHVWNIKSGCGTILDT
Subjt:  FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDT

Query:  GTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG
        GTSLTMLTAPAHDAVIEAMAPKIEKFGRME+DVKGEREKNFKLCFNDT+WNFGMLPKLGFHFE GAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG
Subjt:  GTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG

Query:  NIIQQTYLWQFDLLKGSVTFAPSDCA
        NIIQQT++W++DLLKGSVTFAPSDCA
Subjt:  NIIQQTYLWQFDLLKGSVTFAPSDCA

SwissProt top hitse value%identityAlignment
Q6XBF8 Aspartic proteinase CDR11.2e-3328.13Show/hide
Query:  SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPC
        S E+ + + +GTPP     IADTGSDLLWT+C    C    +   P+   +               SS++  + CSS QC    + L  Q  C T ++ C
Subjt:  SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPC

Query:  SYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGT
        SY+ SY       G  A +T+T+  ++ +  QLK+I+ GC        F     G++GLG    S + K   +++ G FSYCL          S   FGT
Subjt:  SYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGT

Query:  PSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG
         +  + +   S+P        LI       +Y + L  ISV  + +           G   I+D+GT+LT+L    +  + +A+A  I      + + K 
Subjt:  PSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG

Query:  EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
        + +    LC++ T      +P +  HF+G  V      ++ V  S    C A      PS +I GN+ Q  +L  +D +  +V+F P+DCA
Subjt:  EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA

Q766C2 Aspartic proteinase nepenthesin-24.8e-3830.36Show/hide
Query:  LRAISVHMNWTK--VVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHK
        L  +    N TK  +++ A ++ ++   S N   QS + I    Y     G  E+ + + +GTP   F+ I DTGSDL+WT+C    C    S P+PI  
Subjt:  LRAISVHMNWTK--VVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHK

Query:  MRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQF
         ++              SSSFS +PC S+ C QD       P     N+ C YTY Y  G    G  ATET T   ++     + +I +GC E+      
Subjt:  MRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQF

Query:  LDGADGLIGLGSSIYSFVYKAAENNVG-GGFSYCLADHLRNITAISYFVFGTPSPKTFA---ASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQI
         +GA GLIG+G    S       + +G G FSYC+              +G+ SP T A   A++  P G P+TT LI       YY + L GI+V G  
Subjt:  LDGADGLIGLGSSIYSFVYKAAENNVG-GGFSYCLADHLRNITAISYFVFGTPSPKTFA---ASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQI

Query:  LNIPPHVWNIKSG--CGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFND-TQWNFGMLPKLGFHFEGGAVFEPPDRSYIV
        L IP   + ++     G I+D+GT+LT L   A++AV +A   +I            E       CF   +  +   +P++   F+GG V    +++ ++
Subjt:  LNIPPHVWNIKSG--CGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFND-TQWNFGMLPKLGFHFEGGAVFEPPDRSYIV

Query:  SASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC
        S +    C+A+ S     I+I GNI QQ     +DL   +V+F P+ C
Subjt:  SASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC

Q766C3 Aspartic proteinase nepenthesin-11.3e-3230.13Show/hide
Query:  GSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSP
        G  E+ + L +GTP Q F+ I DTGSDL+WT+C  + C    +  +PI   +               SSSFS +PCSS+ C     +    P C   N+ 
Subjt:  GSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSP

Query:  CSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFG
        C YTY Y  G    G   TET+T    +     + +I +GC E        +GA GL+G+G    S   +         FSYC+       T I      
Subjt:  CSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFG

Query:  TPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMER
        TPS     +  +S       T LI   +   +Y + L G+SV    L I P  + + S  GT   I+D+GT+LT     A+ +V      + E   ++  
Subjt:  TPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMER

Query:  DVKGEREKNFKLCFNDTQWNFGM-LPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC
         V       F LCF        + +P    HF+GG + E P  +Y +S S    C+A+ S     ++I GNI QQ  L  +D     V+FA + C
Subjt:  DVKGEREKNFKLCFNDTQWNFGM-LPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC

Q9LNJ3 Aspartyl protease family protein 21.5e-3129.57Show/hide
Query:  GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPT
        G   GS E+F +L +GTP +   M+ DTGSD++W +C    CR   S   PI   R              +S +++ IPCSS  C +  S       C T
Subjt:  GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPT

Query:  PNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISY
            C Y  SY  G   +G F+TET+T R       ++K +  GC  +  +     GA GL+GLG    SF  +   +     FSYCL D   +      
Subjt:  PNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISY

Query:  FVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDG-QILNIPPHVWNIK--SGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFG
             PS   F  +  S I     T L++  +   +Y V L GISV G ++  +   ++ +      G I+D+GTS+T L  PA+ A+ +A     +   
Subjt:  FVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDG-QILNIPPHVWNIK--SGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFG

Query:  RMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
        R            F  CF+ +  N   +P +  HF G  V  P     I   +    C A        ++I+GNI QQ +   +DL    V FAP  CA
Subjt:  RMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA

Q9LTW4 Aspartic proteinase NANA, chloroplast2.7e-7336.53Show/hide
Query:  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYA
        +E+    ++K  S  +    S   + +    G D+G++++F ++++GTP +KF ++ DTGS+L W  CRYR    D           NR   R      A
Subjt:  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYA

Query:  NQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIY
        ++S SF  + C ++ C  D   L     CPTP++PCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   T   F  GADG++GL  S +
Subjt:  NQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIY

Query:  SFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD
        SF    A +  G  FSYCL DHL N    +Y +FG+      A   ++P+        +T  R   +Y + + GIS+   +L+IP  VW+  SG GTILD
Subjt:  SFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD

Query:  TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
        +GTSLT+L   A+  V+  +A  + +  R++ +         + CF+ T  +N   LP+L FH +GGA FEP  +SY+V A+    C+   S   P+ N+
Subjt:  TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI

Query:  LGNIIQQTYLWQFDLLKGSVTFAPSDC
        +GNI+QQ YLW+FDL+  +++FAPS C
Subjt:  LGNIIQQTYLWQFDLLKGSVTFAPSDC

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein5.6e-4228.19Show/hide
Query:  ADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEK--KEASSSNLPPQSQT
        A  S +N+    S  ++  KE  R  +      +   R+  E K  +    + D++  D +R++ +    N +K  +N + ++K   + S    P  S  
Subjt:  ADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEK--KEASSSNLPPQSQT

Query:  PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSEL
         +      G   GS E+F+ + +GTPP+ F++I DTGSDL W +C    C  DC + + +                   S+SF  I C+  +C      L
Subjt:  PIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSEL

Query:  GGQPD----CPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLT----NGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF
           PD    C + N  C Y Y Y       G FA ET TV LT       E ++ ++++GC     +     GA GL+GLG    SF     ++  G  F
Subjt:  GGQPD----CPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLT----NGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF

Query:  SYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYS--CYYGVQLAGISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAP
        SYCL D   N    S  +FG    K     T+        T  + G   S   +Y +Q+  I V G+ L+IP   WNI S    GTI+D+GT+L+    P
Subjt:  SYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYS--CYYGVQLAGISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAP

Query:  AHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYL
        A++ +    A K+++   + RD           CFN    + N   LP+LG  F  G V+  P  +  +  S    C+AI   P  + +I+GN  QQ + 
Subjt:  AHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYL

Query:  WQFDLLKGSVTFAPSDCA
          +D  +  + F P+ CA
Subjt:  WQFDLLKGSVTFAPSDCA

AT3G12700.1 Eukaryotic aspartyl protease family protein1.9e-7436.53Show/hide
Query:  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYA
        +E+    ++K  S  +    S   + +    G D+G++++F ++++GTP +KF ++ DTGS+L W  CRYR    D           NR   R      A
Subjt:  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYA

Query:  NQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIY
        ++S SF  + C ++ C  D   L     CPTP++PCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   T   F  GADG++GL  S +
Subjt:  NQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIY

Query:  SFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD
        SF    A +  G  FSYCL DHL N    +Y +FG+      A   ++P+        +T  R   +Y + + GIS+   +L+IP  VW+  SG GTILD
Subjt:  SFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD

Query:  TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
        +GTSLT+L   A+  V+  +A  + +  R++ +         + CF+ T  +N   LP+L FH +GGA FEP  +SY+V A+    C+   S   P+ N+
Subjt:  TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI

Query:  LGNIIQQTYLWQFDLLKGSVTFAPSDC
        +GNI+QQ YLW+FDL+  +++FAPS C
Subjt:  LGNIIQQTYLWQFDLLKGSVTFAPSDC

AT3G25700.1 Eukaryotic aspartyl protease family protein1.4e-5333.33Show/hide
Query:  GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD-CP
        GA  GS ++FV L++G PPQ   +IADTGSDL+W +C    CR +CS+ SP                +   SS+FSP  C    C      L  +PD  P
Subjt:  GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD-CP

Query:  TPN-----SPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQF----LDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLAD
          N     S C Y Y Y  G    G+FA ET +++ ++GKE +LK + +GC   I+         +GA+G++GLG    SF  +      G  FSYCL D
Subjt:  TPN-----SPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQF----LDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLAD

Query:  HLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNI--KSGCGTILDTGTSLTMLTAPAHDAVIEA
        +  +    SY + G         +    I     T L+T      +Y V+L  + V+G  L I P +W I      GT++D+GT+L  L  PA+ +VI A
Subjt:  HLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNI--KSGCGTILDTGTSLTMLTAPAHDAVIEA

Query:  MAPKIEKFGRMERDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSL-PFPSINILGNIIQQTYLWQFDLLK
        +        R++  +       F LC N         +LP+L F F GGAVF PP R+Y +    Q  C+AI S+ P    +++GN++QQ +L++FD  +
Subjt:  MAPKIEKFGRMERDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSL-PFPSINILGNIIQQTYLWQFDLLK

Query:  GSVTFAPSDCA
          + F+   CA
Subjt:  GSVTFAPSDCA

AT3G59080.1 Eukaryotic aspartyl protease family protein6.8e-4027.36Show/hide
Query:  FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS---VHMNWTKVVENAEEKEKKEA-
        F +P+    A  S +N+    S      KE    +   + H   +KR           + + +++  D +R++ +    +  N    V   ++K  KE  
Subjt:  FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAIS---VHMNWTKVVENAEEKEKKEA-

Query:  ----SSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALY-ANQSSSFS
             +S++  Q+   +A     G   GS E+F+ + +G+PP+ F++I DTGSDL W +C    C  DC   +               A Y    S+S+ 
Subjt:  ----SSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALY-ANQSSSFS

Query:  PIPCSSKQCIQDFSELGGQPDCPTP----NSPCSYTYSYLSGDRAMGIFATETVTVRL-TNGKEKQL---KDILYGCTEEITDSQFLDGADGLIGLGSSI
         I C+ ++C      L   PD P P    N  C Y Y Y       G FA ET TV L TNG   +L   +++++GC     +     GA GL+GLG   
Subjt:  PIPCSSKQCIQDFSELGGQPDCPTP----NSPCSYTYSYLSGDRAMGIFATETVTVRL-TNGKEKQL---KDILYGCTEEITDSQFLDGADGLIGLGSSI

Query:  YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKS--GCGT
         SF     ++  G  FSYCL D   +    S  +FG         + +          L+       +Y VQ+  I V G++LNIP   WNI S    GT
Subjt:  YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKS--GCGT

Query:  ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSI
        I+D+GT+L+    PA++ +   +A K +    + RD           CFN +  +   LP+LG  F  GAV+  P  +  +  +    C+A+   P  + 
Subjt:  ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSI

Query:  NILGNIIQQTYLWQFDLLKGSVTFAPSDCA
        +I+GN  QQ +   +D  +  + +AP+ CA
Subjt:  NILGNIIQQTYLWQFDLLKGSVTFAPSDCA

AT5G33340.1 Eukaryotic aspartyl protease family protein8.6e-3528.13Show/hide
Query:  SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPC
        S E+ + + +GTPP     IADTGSDLLWT+C    C    +   P+   +               SS++  + CSS QC    + L  Q  C T ++ C
Subjt:  SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPC

Query:  SYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGT
        SY+ SY       G  A +T+T+  ++ +  QLK+I+ GC        F     G++GLG    S + K   +++ G FSYCL          S   FGT
Subjt:  SYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGT

Query:  PSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG
         +  + +   S+P        LI       +Y + L  ISV  + +           G   I+D+GT+LT+L    +  + +A+A  I      + + K 
Subjt:  PSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG

Query:  EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA
        + +    LC++ T      +P +  HF+G  V      ++ V  S    C A      PS +I GN+ Q  +L  +D +  +V+F P+DCA
Subjt:  EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGC
CAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCA
TCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCG
AGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCC
GCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGC
GTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTC
GGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAG
ACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCT
CTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTC
TTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGT
CCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGA
TGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAA
CTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGC
GTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGG
GATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGC
CAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCA
TCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCG
AGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCC
GCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGC
GTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTC
GGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAG
ACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCT
CTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTC
TTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGT
CCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGA
TGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAA
CTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGC
GTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGG
GATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAGAACTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAAT
Protein sequenceShow/hide protein sequence
MSPISHLLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEAS
SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSEL
GGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFV
FGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFK
LCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA