CuGenDBv2

CmoCh02G006930 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G006930
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: aspartic proteinase NANA, chloroplast-like
Genome location: Cmo_Chr02:4379541..4381434
RNA-Seq Expression: CmoCh02G006930
Synteny: CmoCh02G006930
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
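
The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the span of the gene model can be derived as below. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("Cmo_Chr02:4379541..4381434")
print(seqid, start, end, span_length(start, end))
```

Under that inclusive-coordinate assumption the locus spans 1894 bp on Cmo_Chr02.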


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6605363.1 Aspartic proteinase NANA, chloroplast, partial [Cucurbita argyrosperma subsp. sororia]; e value: 2.6e-299; % identity: 98.07
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MSPISHLLIL     FVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVD+VEDRIKDIRYHDQNRLRAISAHLNWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS
        VVENAEEKEKEVSGSNLSQTPIGLK YPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS
Subjt:  VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS

Query:  SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVT+FMKGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV

Query:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT
        YKAAENNIGGGFSYCLADHHRN TAISYFVFGTPSPKTFSATTSSPIGPP+TTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT
Subjt:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF
        SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF
Subjt:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF

Query:  WQFDLLKGSVTFAPSDCA
        WQFDLLKGSVTFAPSDCA
Subjt:  WQFDLLKGSVTFAPSDCA

XP_022947059.1 aspartic proteinase NANA, chloroplast-like [Cucurbita moschata]; e value: 3.7e-306; % identity: 100
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS
        VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS
Subjt:  VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS

Query:  SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV

Query:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT
        YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT
Subjt:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF
        SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF
Subjt:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF

Query:  WQFDLLKGSVTFAPSDCA
        WQFDLLKGSVTFAPSDCA
Subjt:  WQFDLLKGSVTFAPSDCA

XP_022947824.1 aspartic proteinase NANA, chloroplast-like [Cucurbita moschata]; e value: 1.3e-263; % identity: 85.8
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MSPISHLLIL     FVFFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAIS H+NWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKE-KEVSGSNL---SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALY
        VVENAEEKE KE S SNL   SQTPI LKTYPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+HKMRN+MR RF YALY
Subjt:  VVENAEEKE-KEVSGSNL---SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALY

Query:  ANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSI
        ANQSSSFSPIPCSSKQCI DF +LGGQPDCPTPN+PCSYTYSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSI
Subjt:  ANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSI

Query:  YSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTIL
        YSFVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTF+A+TSSPIGPPATT+L TGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTIL
Subjt:  YSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTIL

Query:  DTGTSLTMLTAPAHDAVIEAMAPKIAKFGRM------EKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
        DTGTSLTMLTAPAHDAVIEAMAPKI KFGRM      E+++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
Subjt:  DTGTSLTMLTAPAHDAVIEAMAPKIAKFGRM------EKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI

Query:  LGNIIQQTYFWQFDLLKGSVTFAPSDCA
        LGNIIQQTY WQFDLLKGSVTFAPSDCA
Subjt:  LGNIIQQTYFWQFDLLKGSVTFAPSDCA

XP_023007158.1 aspartic proteinase NANA, chloroplast-like [Cucurbita maxima]; e value: 4.9e-266; % identity: 87.48
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MS ISHLLIL FV VF FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAISAHLNWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKEKEVSGSN---LSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYA
        VVENAEEK KE SGSN    SQTPI LKTYPGADFGS EFFVQLKVGTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+HKMRNKMR RF YALYA
Subjt:  VVENAEEKEKEVSGSN---LSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYA

Query:  NQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY
        NQSSSFSPIPCSSKQCI DF +LGGQPDCPTPNTPCSYTYSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIY
Subjt:  NQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY

Query:  SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILD
        SFVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTFSA+TSSPIGPPATTKLFTGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTILD
Subjt:  SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILD

Query:  TGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINIL
        TGTSLTMLTAPAHDAVIEAMAPKI KFGRMEK      ++NF+LCFNDTEWNFGM PKLGFHFE GAVFEPPDRSYIVSASYQCSCIAITSLPFPSINIL
Subjt:  TGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINIL

Query:  GNIIQQTYFWQFDLLKGSVTFAPSDCA
        GNIIQQT+ W++DLLKGSVTFAPSDCA
Subjt:  GNIIQQTYFWQFDLLKGSVTFAPSDCA

XP_023532727.1 aspartic proteinase NANA, chloroplast-like [Cucurbita pepo subsp. pepo]; e value: 2.6e-291; % identity: 95.01
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MSPISH LIL   FVFVFFSP+TVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKR+DDEIKVDSVEDRI+DIRYHDQNRLR+ISA LNWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKEKEVSGSNL---SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYA
        VVENAEEKEKEVSGSNL   SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCS+LSPMHKMRNKMRGRFRYALYA
Subjt:  VVENAEEKEKEVSGSNL---SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYA

Query:  NQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY
        NQSSSFSPIPCSS+QCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVE+T+FMKGADGLIGLGSSIY
Subjt:  NQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY

Query:  SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILD
        SFVYKAAENNIGGGFSYCLADH+RNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQ+L IPRHVWNIKSGCGTILD
Subjt:  SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILD

Query:  TGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQ
        TGTSLT+LTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSY+VSA+ QCSC+AI+SLPFPSINILGNIIQQ
Subjt:  TGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQ

Query:  TYFWQFDLLKGSVTFAPSDCA
        TYFWQFDLLK SVTFAPSDCA
Subjt:  TYFWQFDLLKGSVTFAPSDCA
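
The % identity values listed for each hit can be approximated from the middle line of an alignment block. The sketch below assumes the usual BLAST-style convention (a letter marks an identical column, `+` a conservative substitution, a space a mismatch or gap) and assumes identity is reported as identical columns divided by alignment length; the page itself does not state how the figure was computed. The function name `percent_identity` and the toy strings are illustrative only.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters in the midline mark identical columns; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy alignment block, made up for illustration (not copied from the page).
query = "MSPISHLLIL"
midline = "MSPIS+ LIL"
print(percent_identity(midline, len(query)))
```

Counting the first GenBank alignment this way appears to give 508 identical columns out of 518, which is consistent with the listed 98.07.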

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KG92 Peptidase A1 domain-containing protein; e value: 1.1e-143; % identity: 51.41
Query:  MSPISHLLILVFVFVFVFF----SPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHL
        MSPIS+     F F+  FF    S    A+ D+ N  N     + + +EQE ++ DL+HRHHP+V ++I  ++K+  V +R+KDI  HD NR R+IS  +
Subjt:  MSPISHLLILVFVFVFVFF----SPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHL

Query:  NWTKVVENAEEK-------EKEVSGSNL----SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRN
        N  K VE+A  +       E+EV+ S +    + TPIG++   GADFGS E+FV+LKVGTP QTF LIADTGSDL W KCR+RRC G+CS  +  HK +N
Subjt:  NWTKVVENAEEK-------EKEVSGSNL----SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRN

Query:  KMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKG
        + + RFR+A  AN SSSF  + CSS  C +D  DL    +C  P +PC Y YSYTGG  A GIFA ET+TV LTNGKEKQL + + GCTE V+ + F  G
Subjt:  KMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKG

Query:  ADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPAT-TKLFTGGQYSCYYGVQLIGISVDDQILNIPRH
        ADG++GLG+S YS  YKAAEN  GGGFSYCL DH  +  AISYFV G P+P T ++T+S+ +    T TKL+ G  YS +YGV LIGIS +  +LNIP  
Subjt:  ADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPAT-TKLFTGGQYSCYYGVQLIGISVDDQILNIPRH

Query:  VWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLP
        VW+I SG GTI+D+GTSLT+L APA D V+EA+ P++ KF ++E +  F+ CFN++++   M+PKL FHF  G VFEPP +SYIVS     SCI   S+P
Subjt:  VWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLP

Query:  FPSINILGNIIQQTYFWQFDLLKGSVTFAPSDC
        FP+ NI+GNI+QQ + WQFD  K  V FAPS+C
Subjt:  FPSINILGNIIQQTYFWQFDLLKGSVTFAPSDC

A0A1S3C2F3 aspartic proteinase CDR1; e value: 1.5e-140; % identity: 50.56
Query:  MSPISHL-LILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWT
        MSPIS+     + +F   F S    A+ D++N  N   + D    EQ+ +R DL+HRHHP+V ++++ ++K+  + +R+KDI  HD+NR R+IS  +N  
Subjt:  MSPISHL-LILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWT

Query:  KVVENAEEK-------EKEVSGSNL----SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMR
        K +E+A  +       + EV+ S +    + TPIG+K   GADFGS E+FVQLKVGTP QTF LIADTGSDL W KCR+RRC G+CS  +  HK +N+ +
Subjt:  KVVENAEEK-------EKEVSGSNL----SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMR

Query:  GRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADG
         RFR+AL ANQSS+F  + CSS  C ++  +L    +C TP +PC Y YSY GG  A GIFA ET+TV LTNGKEKQL++ + GCTE V+   F  GADG
Subjt:  GRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADG

Query:  LIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPAT---TKLFTGGQYSCYYGVQLIGISVDDQILNIPRHV
        ++GLG+S YS  YKAAEN  GGGFSYCL DH  +  A+SYFV G P+P T ++T+S+   PPA    TKL+ G  YS +YGV LIGIS D Q+LNIP  V
Subjt:  LIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPAT---TKLFTGGQYSCYYGVQLIGISVDDQILNIPRHV

Query:  WNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPF
        W+   GCGTI+D+GTSLT+L  PA D V+E +  ++ +F ++E +  F  CFN++++   M+PKL FHF  G VFEPP +SYIVS     SCI I S+PF
Subjt:  WNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPF

Query:  PSINILGNIIQQTYFWQFDLLKGSVTFAPSDC
        PS+NI+GNI+QQ + WQFD  K  V FA S+C
Subjt:  PSINILGNIIQQTYFWQFDLLKGSVTFAPSDC

A0A6J1G5P4 aspartic proteinase NANA, chloroplast-like; e value: 1.8e-306; % identity: 100
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS
        VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS
Subjt:  VVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQS

Query:  SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV
        SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV
Subjt:  SSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV

Query:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT
        YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT
Subjt:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF
        SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF
Subjt:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYF

Query:  WQFDLLKGSVTFAPSDCA
        WQFDLLKGSVTFAPSDCA
Subjt:  WQFDLLKGSVTFAPSDCA

A0A6J1G810 aspartic proteinase NANA, chloroplast-like; e value: 6.4e-264; % identity: 85.8
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MSPISHLLIL     FVFFSPLTVAVADQSNANN KQESDANNEE+EFVRLDLIHRHHPEVVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAIS H+NWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKE-KEVSGSNL---SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALY
        VVENAEEKE KE S SNL   SQTPI LKTYPGADFGS EFFVQLK+GTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+HKMRN+MR RF YALY
Subjt:  VVENAEEKE-KEVSGSNL---SQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALY

Query:  ANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSI
        ANQSSSFSPIPCSSKQCI DF +LGGQPDCPTPN+PCSYTYSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSI
Subjt:  ANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSI

Query:  YSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTIL
        YSFVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTF+A+TSSPIGPPATT+L TGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTIL
Subjt:  YSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTIL

Query:  DTGTSLTMLTAPAHDAVIEAMAPKIAKFGRM------EKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
        DTGTSLTMLTAPAHDAVIEAMAPKI KFGRM      E+++NF+LCFNDT+WNFGM PKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
Subjt:  DTGTSLTMLTAPAHDAVIEAMAPKIAKFGRM------EKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI

Query:  LGNIIQQTYFWQFDLLKGSVTFAPSDCA
        LGNIIQQTY WQFDLLKGSVTFAPSDCA
Subjt:  LGNIIQQTYFWQFDLLKGSVTFAPSDCA

A0A6J1L6Y2 aspartic proteinase NANA, chloroplast-like; e value: 2.4e-266; % identity: 87.48
Query:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK
        MS ISHLLIL FV VF FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKR+ DEIKVD +EDRIKDIRYHDQ+RLRAISAHLNWTK
Subjt:  MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTK

Query:  VVENAEEKEKEVSGSN---LSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYA
        VVENAEEK KE SGSN    SQTPI LKTYPGADFGS EFFVQLKVGTPPQ FT+IADTGSDLLWT+CR+RRCRGDCS+ SP+HKMRNKMR RF YALYA
Subjt:  VVENAEEKEKEVSGSN---LSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYA

Query:  NQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY
        NQSSSFSPIPCSSKQCI DF +LGGQPDCPTPNTPCSYTYSY  G+RA GIFA ETVTVRLTNGKEKQLKDIL+GCTEE+  + F+ GADGLIGLGSSIY
Subjt:  NQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIY

Query:  SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILD
        SFVYKAAENN+GGGFSYCLADH RN TAISYFVFGTPSPKTFSA+TSSPIGPPATTKLFTGG+YSCYYGVQL GISVD QILNIP HVWNIKSGCGTILD
Subjt:  SFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILD

Query:  TGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINIL
        TGTSLTMLTAPAHDAVIEAMAPKI KFGRMEK      ++NF+LCFNDTEWNFGM PKLGFHFE GAVFEPPDRSYIVSASYQCSCIAITSLPFPSINIL
Subjt:  TGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEK------QRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINIL

Query:  GNIIQQTYFWQFDLLKGSVTFAPSDCA
        GNIIQQT+ W++DLLKGSVTFAPSDCA
Subjt:  GNIIQQTYFWQFDLLKGSVTFAPSDCA

SwissProt top hits (columns: e value, % identity, alignment)
Q6XBF8 Aspartic proteinase CDR1; e value: 1.7e-35; % identity: 29.61
Query:  SGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPC
        SGE+ + + +GTPP     IADTGSDLLWT+C    C    + + P+   +               SS++  + CSS QC      L  Q  C T +  C
Subjt:  SGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPC

Query:  SYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGT
        SY+ SY       G  A +T+T+  ++ +  QLK+I+ GC      T F K   G++GLG    S + K   ++I G FSYCL          S   FGT
Subjt:  SYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGT

Query:  PSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNF
         +  + S   S+P+   A+ + F        Y + L  ISV  + +           G   I+D+GT+LT+L    +  + +A+A  I    + + Q   
Subjt:  PSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNF

Query:  ELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA
         LC++ T       P +  HF+G  V      ++ V  S    C A      PS +I GN+ Q  +   +D +  +V+F P+DCA
Subjt:  ELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA

Q766C2 Aspartic proteinase nepenthesin-2; e value: 4.4e-36; % identity: 29.41
Query:  KVVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQ
        + ++  E + + ++    S + I    Y     G GE+ + + +GTP  +F+ I DTGSDL+WT+C    C    S  +P+   ++              
Subjt:  KVVENAEEKEKEVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQ

Query:  SSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSF
        SSSFS +PC S+ C D        P     N  C YTY Y  G    G  A ET T   ++     + +I FGC E+ +      GA GLIG+G    S 
Subjt:  SSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSF

Query:  VYKAAENNIG-GGFSYCLADHHRNTTAISYFVFGTPSPKTF---SATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSG--CG
              + +G G FSYC+              +G+ SP T    SA +  P G P+TT L        YY + L GI+V    L IP   + ++     G
Subjt:  VYKAAENNIG-GGFSYCLADHHRNTTAISYFVFGTPSPKTF---SATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSG--CG

Query:  TILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFND-TEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG
         I+D+GT+LT L   A++AV +A   +I      E       CF   ++ +    P++   F+GG V    +++ ++S +    C+A+ S     I+I G
Subjt:  TILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFND-TEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG

Query:  NIIQQTYFWQFDLLKGSVTFAPSDC
        NI QQ     +DL   +V+F P+ C
Subjt:  NIIQQTYFWQFDLLKGSVTFAPSDC

Q766C3 Aspartic proteinase nepenthesin-1; e value: 3.3e-31; % identity: 29.93
Query:  PIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDL
        P G++T   A  G GE+ + L +GTP Q F+ I DTGSDL+WT+C  + C    +  +P+   +               SSSFS +PCSS+ C       
Subjt:  PIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDL

Query:  GGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHH
           P C   N  C YTY Y  G    G    ET+T    +     + +I FGC E  +      GA GL+G+G    S   +         FSYC+    
Subjt:  GGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHH

Query:  RNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAM
           T I      TPS     +  +S       T L    Q   +Y + L G+SV    L I    + + S  GT   I+D+GT+LT     A+ +V +  
Subjt:  RNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAM

Query:  APKIAKFGRMEKQRNFELCFNDTEWNFGMS-PKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSD
          +I           F+LCF        +  P    HF+GG + E P  +Y +S S    C+A+ S     ++I GNI QQ     +D     V+FA + 
Subjt:  APKIAKFGRMEKQRNFELCFNDTEWNFGMS-PKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSD

Query:  C
        C
Subjt:  C

Q9LNJ3 Aspartyl protease family protein 2; e value: 3.9e-32; % identity: 28.78
Query:  EVSGSNLSQTP----IGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPI
        ++ G N++  P           G   GSGE+F +L VGTP +   ++ DTGSD++W +C    CR   S   P+   R              +S +++ I
Subjt:  EVSGSNLSQTP----IGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPI

Query:  PCSSKQC--IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAA
        PCSS  C  +D          C T    C Y  SY  G    G F+ ET+T R       ++K +  GC  + E      GA GL+GLG    SF  +  
Subjt:  PCSSKQC--IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAA

Query:  ENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISV-DDQILNIPRHVWNIK--SGCGTILDTGTS
         +     FSYCL D   ++          PS   F     S I     T L +  +   +Y V L+GISV   ++  +   ++ +      G I+D+GTS
Subjt:  ENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISV-DDQILNIPRHVWNIK--SGCGTILDTGTS

Query:  LTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFW
        +T L  PA+ A+ +A         R      F+ CF+ +  N    P +  HF G  V  P     I   +    C A        ++I+GNI QQ +  
Subjt:  LTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFW

Query:  QFDLLKGSVTFAPSDCA
         +DL    V FAP  CA
Subjt:  QFDLLKGSVTFAPSDCA

Q9LTW4 Aspartic proteinase NANA, chloroplast; e value: 2.2e-72; % identity: 36.5
Query:  SVEDRIKD--IRYHDQNRLRAISAHLNWTKVVENAEEKEKE-VSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRR
        +V D +KD  +R    +R   +   L+  + V  A++K    +S    S   + +    G D+G+ ++F +++VGTP + F ++ DTGS+L W  CR+ R
Subjt:  SVEDRIKD--IRYHDQNRLRAISAHLNWTKVVENAEEKEKE-VSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRR

Query:  CRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDI
         RG  +                R    A++S SF  + C ++ C  D  +L     CPTP+TPCSY Y Y  G  A G+FA ET+TV LTNG+  +L   
Subjt:  CRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDI

Query:  LFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTP-SPKT-FSATTSSPIGPPATTKLFTGGQYSCYYGV
        L GC+       F +GADG++GL  S +SF    A +  G  FSYCL DH  N    +Y +FG+  S KT F  TT     P   T++        +Y +
Subjt:  LFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTP-SPKT-FSATTSSPIGPPATTKLFTGGQYSCYYGV

Query:  QLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQR-NFELCFNDTE-WNFGMSPKLGFHFEGGAVFEPPDR
         +IGIS+   +L+IP  VW+  SG GTILD+GTSLT+L   A+  V+  +A  + +  R++ +    E CF+ T  +N    P+L FH +GGA FEP  +
Subjt:  QLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQR-NFELCFNDTE-WNFGMSPKLGFHFEGGAVFEPPDR

Query:  SYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDC
        SY+V A+    C+   S   P+ N++GNI+QQ Y W+FDL+  +++FAPS C
Subjt:  SYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDC

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G42980.1 Eukaryotic aspartyl protease family protein; e value: 7.7e-44; % identity: 30.72
Query:  HHPEVVK---RIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNLSQTP------IGLKTYPGADFGSGEFFVQLKVGTPP
        H  E VK   RI  E K       + D++  D  R++ + A  N +K  +N + ++K  S  +L   P      +      G   GSGE+F+ + VGTPP
Subjt:  HHPEVVK---RIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNLSQTP------IGLKTYPGADFGSGEFFVQLKVGTPP

Query:  QTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQC-IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERAS
        + F+LI DTGSDL W +C    C  DC H + M                   S+SF  I C+  +C +   PD   Q  C + N  C Y Y Y      +
Subjt:  QTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQC-IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERAS

Query:  GIFANETVTVRLT----NGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSAT
        G FA ET TV LT       E ++ +++FGC           GA GL+GLG    SF     ++  G  FSYCL D + NT   S  +FG          
Subjt:  GIFANETVTVRLT----NGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSAT

Query:  TSSPIGPPATTKLFTGGQYS--CYYGVQLIGISVDDQILNIPRHVWNIKS--GCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFEL---
         +        T    G + S   +Y +Q+  I V  + L+IP   WNI S    GTI+D+GT+L+    PA++ +    A K+ +       R+F +   
Subjt:  TSSPIGPPATTKLFTGGQYS--CYYGVQLIGISVDDQILNIPRHVWNIKS--GCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFEL---

Query:  CFN--DTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA
        CFN    E N    P+LG  F  G V+  P  +  +  S    C+AI   P  + +I+GN  QQ +   +D  +  + F P+ CA
Subjt:  CFN--DTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA

AT3G12700.1 Eukaryotic aspartyl protease family protein; e value: 1.6e-73; % identity: 36.5
Query:  SVEDRIKD--IRYHDQNRLRAISAHLNWTKVVENAEEKEKE-VSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRR
        +V D +KD  +R    +R   +   L+  + V  A++K    +S    S   + +    G D+G+ ++F +++VGTP + F ++ DTGS+L W  CR+ R
Subjt:  SVEDRIKD--IRYHDQNRLRAISAHLNWTKVVENAEEKEKE-VSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRR

Query:  CRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDI
         RG  +                R    A++S SF  + C ++ C  D  +L     CPTP+TPCSY Y Y  G  A G+FA ET+TV LTNG+  +L   
Subjt:  CRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDI

Query:  LFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTP-SPKT-FSATTSSPIGPPATTKLFTGGQYSCYYGV
        L GC+       F +GADG++GL  S +SF    A +  G  FSYCL DH  N    +Y +FG+  S KT F  TT     P   T++        +Y +
Subjt:  LFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTP-SPKT-FSATTSSPIGPPATTKLFTGGQYSCYYGV

Query:  QLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQR-NFELCFNDTE-WNFGMSPKLGFHFEGGAVFEPPDR
         +IGIS+   +L+IP  VW+  SG GTILD+GTSLT+L   A+  V+  +A  + +  R++ +    E CF+ T  +N    P+L FH +GGA FEP  +
Subjt:  QLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQR-NFELCFNDTE-WNFGMSPKLGFHFEGGAVFEPPDR

Query:  SYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDC
        SY+V A+    C+   S   P+ N++GNI+QQ Y W+FDL+  +++FAPS C
Subjt:  SYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDC

AT3G25700.1 Eukaryotic aspartyl protease family protein; e value: 1.4e-53; % identity: 33.25
Query:  NLSQTPIGLKTYP---GADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQ
        +L + PI     P   GA  GSG++FV L++G PPQ+  LIADTGSDL+W KC    CR +CSH SP                +   SS+FSP  C    
Subjt:  NLSQTPIGLKTYP---GADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQ

Query:  C-IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGC-----TEEVEVTDFMKGADGLIGLGSSIYSFVYKAAEN
        C +   PD     +    ++ C Y Y Y  G   SG+FA ET +++ ++GKE +LK + FGC      + V  T F  GA+G++GLG    SF  +    
Subjt:  C-IDDFPDLGGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGC-----TEEVEVTDFMKGADGLIGLGSSIYSFVYKAAEN

Query:  NIGGGFSYCLADHHRNTTAISYFVFGTP----SPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNI--KSGCGTILDTGT
          G  FSYCL D+  +    SY + G      S   F+   ++P+ P              +Y V+L  + V+   L I   +W I      GT++D+GT
Subjt:  NIGGGFSYCLADHHRNTTAISYFVFGTP----SPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNI--KSGCGTILDTGT

Query:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFN--DTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSL-PFPSINILGNIIQQ
        +L  L  PA+ +VI A+  ++           F+LC N         + P+L F F GGAVF PP R+Y +    Q  C+AI S+ P    +++GN++QQ
Subjt:  SLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFN--DTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSL-PFPSINILGNIIQQ

Query:  TYFWQFDLLKGSVTFAPSDCA
         + ++FD  +  + F+   CA
Subjt:  TYFWQFDLLKGSVTFAPSDCA

AT3G59080.1 Eukaryotic aspartyl protease family protein; e value: 1.1e-42; % identity: 28.03
Query:  FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL
        F +P+    A  S +N+    S      +E    +   + H   +KR +      +  + + +++  D  R++ +   +          +K+K+     +
Subjt:  FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL

Query:  SQTPIGLKT-----------YPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPI
        + TP+                 G   GSGE+F+ + VG+PP+ F+LI DTGSDL W +C    C  DC           +  G F        S+S+  I
Subjt:  SQTPIGLKT-----------YPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPI

Query:  PCSSKQCIDDFPDLGGQPDCPTP----NTPCSYTYSYTGGERASGIFANETVTVRL-TNGKEKQL---KDILFGCTEEVEVTDFMKGADGLIGLGSSIYS
         C+ ++C     +L   PD P P    N  C Y Y Y      +G FA ET TV L TNG   +L   ++++FGC           GA GL+GLG    S
Subjt:  PCSSKQCIDDFPDLGGQPDCPTP----NTPCSYTYSYTGGERASGIFANETVTVRL-TNGKEKQL---KDILFGCTEEVEVTDFMKGADGLIGLGSSIYS

Query:  FVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQ---YSCYYGVQLIGISVDDQILNIPRHVWNIKS--GCG
        F     ++  G  FSYCL D + +T   S  +FG                P      F  G+      +Y VQ+  I V  ++LNIP   WNI S    G
Subjt:  FVYKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQ---YSCYYGVQLIGISVDDQILNIPRHVWNIKS--GCG

Query:  TILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFEL---CFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI
        TI+D+GT+L+    PA++ +   +A K AK G+    R+F +   CFN +  +    P+LG  F  GAV+  P  +  +  +    C+A+   P  + +I
Subjt:  TILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFEL---CFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINI

Query:  LGNIIQQTYFWQFDLLKGSVTFAPSDCA
        +GN  QQ +   +D  +  + +AP+ CA
Subjt:  LGNIIQQTYFWQFDLLKGSVTFAPSDCA

AT3G59080.2 Eukaryotic aspartyl protease family protein    e value: 5.4e-37    %identity: 26.05
Query:  FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL
        F +P+    A  S +N+    S      +E    +   + H   +KR +      +  + + +++  D  R++ +   +          +K+K+     +
Subjt:  FFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEKEVSGSNL

Query:  SQTPIGLKT-----------YPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPI
        + TP+                 G   GSGE+F+ + VG+PP+ F+LI DTGSDL W +C                                        +
Subjt:  SQTPIGLKT-----------YPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPI

Query:  PCSSKQCIDDFPDLGGQPDC--PTPNTPCSYTYSYTGGERASGIFANETVTVRL-TNGKEKQL---KDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV
        PC                DC     N  C Y Y Y      +G FA ET TV L TNG   +L   ++++FGC           GA GL+GLG    SF 
Subjt:  PCSSKQCIDDFPDLGGQPDC--PTPNTPCSYTYSYTGGERASGIFANETVTVRL-TNGKEKQL---KDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFV

Query:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQ---YSCYYGVQLIGISVDDQILNIPRHVWNIKS--GCGTI
            ++  G  FSYCL D + +T   S  +FG                P      F  G+      +Y VQ+  I V  ++LNIP   WNI S    GTI
Subjt:  YKAAENNIGGGFSYCLADHHRNTTAISYFVFGTPSPKTFSATTSSPIGPPATTKLFTGGQ---YSCYYGVQLIGISVDDQILNIPRHVWNIKS--GCGTI

Query:  LDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFEL---CFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG
        +D+GT+L+    PA++ +   +A K AK G+    R+F +   CFN +  +    P+LG  F  GAV+  P  +  +  +    C+A+   P  + +I+G
Subjt:  LDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFEL---CFNDTEWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILG

Query:  NIIQQTYFWQFDLLKGSVTFAPSDCA
        N  QQ +   +D  +  + +AP+ CA
Subjt:  NIIQQTYFWQFDLLKGSVTFAPSDCA


Sequences
CDS sequence
ATGTCGCCGATTTCTCATCTTTTAATCCTTGTCTTCGTCTTCGTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAACGCCAATAATCTCAAACA
AGAAAGCGATGCCAATAATGAAGAACAAGAATTCGTGAGGCTAGATCTGATACACCGCCACCATCCGGAAGTGGTTAAAAGGATTGATGACGAAATTAAGGTGGATAGTG
TCGAGGATCGCATCAAGGATATTCGCTATCACGATCAAAACCGCCTCCGAGCCATCTCCGCCCACCTGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAG
GAGGTGTCGGGTTCGAATCTGTCGCAGACGCCAATAGGATTGAAAACATACCCCGGCGCTGATTTCGGTAGCGGTGAATTTTTCGTGCAATTGAAAGTCGGAACGCCGCC
GCAGACGTTCACACTGATTGCAGATACCGGAAGTGACCTATTGTGGACGAAATGCAGATTCCGGCGGTGCAGGGGAGATTGCAGCCACCTCTCTCCGATGCATAAGATGC
GTAACAAAATGAGAGGGAGATTCAGATACGCGCTTTATGCGAATCAGTCGTCTTCTTTCTCCCCAATCCCTTGTTCCTCCAAGCAGTGCATCGATGATTTCCCTGATCTC
GGCGGCCAACCCGATTGTCCAACCCCTAACACCCCCTGTTCCTATACCTACAGCTACACAGGTGGGGAGCGTGCGAGTGGAATATTCGCAAACGAGACGGTAACGGTAAG
ACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATTCGGCTGCACAGAAGAAGTTGAAGTCACCGACTTCATGAAGGGAGCCGATGGCCTCATTGGCTTAGGCT
CTAGCATCTACTCCTTCGTCTACAAAGCCGCCGAAAACAACATCGGCGGCGGCTTCTCCTACTGCCTGGCCGACCACCACCGCAACACAACCGCCATTAGCTACTTCGTC
TTCGGCACCCCTTCCCCCAAGACCTTCTCCGCCACCACCTCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCAATACAGCTGCTACTACGGCGT
CCAACTGATCGGAATCTCGGTCGACGACCAGATCCTTAACATCCCCCGTCACGTCTGGAACATCAAGTCCGGGTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGA
TGCTGACGGCACCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCGAAGATCGCGAAATTCGGAAGAATGGAAAAGCAGAGGAACTTCGAACTTTGCTTCAATGACACT
GAGTGGAATTTTGGTATGTCGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTG
TATTGCCATAACTTCTCTGCCATTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACTTTTGGCAATTTGATTTACTTAAGGGATCCGTCACTTTTGCTC
CCTCCGATTGCGCCTAG
mRNA sequence
ACGGTTAGGGTTTATAGATGTTCTTCCTCTTTTTAAGTCCGCCATTCTCATCTTCCTCCTTGCTTCATCTGTTTTCTCTGACAGGAACGGAGCAATGTCGCCGATTTCTC
ATCTTTTAATCCTTGTCTTCGTCTTCGTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAACGCCAATAATCTCAAACAAGAAAGCGATGCCAAT
AATGAAGAACAAGAATTCGTGAGGCTAGATCTGATACACCGCCACCATCCGGAAGTGGTTAAAAGGATTGATGACGAAATTAAGGTGGATAGTGTCGAGGATCGCATCAA
GGATATTCGCTATCACGATCAAAACCGCCTCCGAGCCATCTCCGCCCACCTGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGGAGGTGTCGGGTTCGA
ATCTGTCGCAGACGCCAATAGGATTGAAAACATACCCCGGCGCTGATTTCGGTAGCGGTGAATTTTTCGTGCAATTGAAAGTCGGAACGCCGCCGCAGACGTTCACACTG
ATTGCAGATACCGGAAGTGACCTATTGTGGACGAAATGCAGATTCCGGCGGTGCAGGGGAGATTGCAGCCACCTCTCTCCGATGCATAAGATGCGTAACAAAATGAGAGG
GAGATTCAGATACGCGCTTTATGCGAATCAGTCGTCTTCTTTCTCCCCAATCCCTTGTTCCTCCAAGCAGTGCATCGATGATTTCCCTGATCTCGGCGGCCAACCCGATT
GTCCAACCCCTAACACCCCCTGTTCCTATACCTACAGCTACACAGGTGGGGAGCGTGCGAGTGGAATATTCGCAAACGAGACGGTAACGGTAAGACTAACAAACGGAAAA
GAAAAGCAACTGAAGGACATTCTATTCGGCTGCACAGAAGAAGTTGAAGTCACCGACTTCATGAAGGGAGCCGATGGCCTCATTGGCTTAGGCTCTAGCATCTACTCCTT
CGTCTACAAAGCCGCCGAAAACAACATCGGCGGCGGCTTCTCCTACTGCCTGGCCGACCACCACCGCAACACAACCGCCATTAGCTACTTCGTCTTCGGCACCCCTTCCC
CCAAGACCTTCTCCGCCACCACCTCCTCTCCCATCGGCCCCCCCGCCACCACTAAACTCTTCACCGGCGGCCAATACAGCTGCTACTACGGCGTCCAACTGATCGGAATC
TCGGTCGACGACCAGATCCTTAACATCCCCCGTCACGTCTGGAACATCAAGTCCGGGTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCACCGGC
TCACGATGCGGTGATAGAAGCGATGGCTCCGAAGATCGCGAAATTCGGAAGAATGGAAAAGCAGAGGAACTTCGAACTTTGCTTCAATGACACTGAGTGGAATTTTGGTA
TGTCGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCT
CTGCCATTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACTTTTGGCAATTTGATTTACTTAAGGGATCCGTCACTTTTGCTCCCTCCGATTGCGCCTA
GAACACTCCTCCTCTTTCTTTCATTCCTTTCTTCTTCTTCTTTTATTTTTATTTTTATTTTTTAAATGATTCAATTTCAACAAACATGGAGAGGGGGATTATTACTTTTT
TATTTTTAAACAAAAC
Protein sequence
MSPISHLLILVFVFVFVFFSPLTVAVADQSNANNLKQESDANNEEQEFVRLDLIHRHHPEVVKRIDDEIKVDSVEDRIKDIRYHDQNRLRAISAHLNWTKVVENAEEKEK
EVSGSNLSQTPIGLKTYPGADFGSGEFFVQLKVGTPPQTFTLIADTGSDLLWTKCRFRRCRGDCSHLSPMHKMRNKMRGRFRYALYANQSSSFSPIPCSSKQCIDDFPDL
GGQPDCPTPNTPCSYTYSYTGGERASGIFANETVTVRLTNGKEKQLKDILFGCTEEVEVTDFMKGADGLIGLGSSIYSFVYKAAENNIGGGFSYCLADHHRNTTAISYFV
FGTPSPKTFSATTSSPIGPPATTKLFTGGQYSCYYGVQLIGISVDDQILNIPRHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIAKFGRMEKQRNFELCFNDT
EWNFGMSPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYFWQFDLLKGSVTFAPSDCA
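
The CDS and protein sequences above can be cross-checked with a short translation sketch. It is not part of the database record: it assumes the standard nuclear genetic code, and the helper name `translate` and the two excerpt variables are hypothetical, introduced only for illustration. The excerpts are the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS sequence above (hypothetical variable names).
cds_start = "ATGTCGCCGATTTCTCATCTTTTAATCCTT"  # first 30 nt
cds_end = "CCCTCCGATTGCGCCTAG"                # last 18 nt, including the stop codon

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS to `translate` in the same way should reproduce the complete protein sequence, provided the line breaks are removed first.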