; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G044310 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G044310
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationCiama_Chr02:32134046..32135997
RNA-Seq ExpressionCaUC02G044310
SyntenyCaUC02G044310
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa]2.3e-19467.95Show/hide
Query:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS
        MK++DL++R+KDIH+HD  R+++IS S+N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGS
Subjt:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS

Query:  DLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFR
        DLTW+KCRYRRC GNCS   NHKS+NE+K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                      
Subjt:  DLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFR

Query:  VGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA
                     +Y GGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F G DGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A
Subjt:  VGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA

Query:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF
         SYF+LG P+PS SA+ SS  P   M++TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F
Subjt:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF

Query:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        +QIE+EPF FCFNNSQYTH+MAPK+RFHFGDGT+F+PP KSY+VS GE+ISCIG VS+PFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus]1.3e-22468.36Show/hide
Query:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY
        MLGYRKPMSPIS+FCFFF    LFFFLS  +    ALG  D               QE +K DLLHRHHPQVAEK+HG+MK++D+++R+KDIH+HD  R+
Subjt:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY

Query:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN
        ++IS S+N+KQ+E+ +L+AEAE     E AK  ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCSS  N
Subjt:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN

Query:  HKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASA
        HKS+NE+K RFR+AFLAN+SSSFKT+ CSST CT DLADLF++ EC  PTSPC+YDY                                   +YTGGASA
Subjt:  HKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASA

Query:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV
        KGIFA ETLTV LTNGKEKQLHNSIIGCTESVQG +FGG DGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  
Subjt:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV

Query:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM
            MT+TKL+VGDPY+SFYGVDLI ISA+G+MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK F+Q+E+EPF FCFNNSQYTHEM
Subjt:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM

Query:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        APK+RFHFGDGT+F+PP KSY+VS G++ISCIGFVS+PFPA NIIGNILQQNHLWQFDF +R+VGFAPSEC+
Subjt:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo]2.4e-21566.55Show/hide
Query:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL
        MLGYRKPMSPIS+FCFFF L FFLS  +    ALG             +Q+T++ DLLHRHHPQV+EKL+G+MK++DL++R+KDIH+HD  R+++IS S+
Subjt:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL

Query:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER
        N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCS   NHKS+NE+
Subjt:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER

Query:  KTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIE
        K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                                   +Y GGASAKGIFA E
Subjt:  KTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIE

Query:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF
        TLTV LTNGKEKQL NSIIGCTE VQG +F G DGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  P   M++
Subjt:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF

Query:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH
        TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F+QIE+EPF FCFNNSQYTH+MAPK+RFH
Subjt:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH

Query:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        FGDGT+F+PP KSY+VS GE+ISCIG VS+PFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata]1.3e-16855.23Show/hide
Query:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE
        MLGY  PMSPIS    FF F FFLSVH A  G +Q+          VKLD++HRHHP V EKL+G  +     DR +DIH+HD  R ++ISTS+   + +
Subjt:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE

Query:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLAN
         +              LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LIVDTGSDLTW+KCRYRRC+GNC++ A+HKSR E K +F + FLAN
Subjt:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLAN

Query:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKE
        +SSSFK I C S  C  DL  LF+I +C+ P++PC+YDY Y+                                   GG +A G+FA ET+TV LTNGKE
Subjt:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKE

Query:  KQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS
        KQLH+++IGCTE        GVDG++GLGT  +SF ++AA + NGGGFSYCL+DHLSHH+ATSYFILG P     A   SV P GNMTF  L +G P+NS
Subjt:  KQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS

Query:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV
        +YGV LI IS DGV LNIPPRVWDI  GGGTILDSGTSL+ML APAFD+ MEA+  KLK F+QI  +PF +CFN + Y+HEMAPK+RFHF  G +F+PP 
Subjt:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV

Query:  KSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        KSY+V   + I C+GF S+PFP  NIIGNILQQN LWQFDFF +KVGFAPS+C+
Subjt:  KSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida]3.8e-24577.92Show/hide
Query:  MLGYRKPMSPISHFCFFFLFFFLSVHNAL--GGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAE
        MLGYRKPMSPISHFC FFLFFFLSV  A   G HDQE VKLDLLHRHHPQV+EKLHG++K+E++NDRIKDI +HD KRYQTIS+SLNR +++E+L+ EA 
Subjt:  MLGYRKPMSPISHFCFFFLFFFLSVHNAL--GGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAE

Query:  VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTI
          A KD  LPP S TPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLI DTGSDLTW+KCRYRRCIGNCSS  NHK+RNERK RFRNAFLANYSSSFKTI
Subjt:  VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTI

Query:  HCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSII
         CSS  CT DLADLFSIGEC+TPTSPC+YDY                                   +Y+GGASAKG+FAIETLTV LTNGKEKQLHNSII
Subjt:  HCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSII

Query:  GCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAP--SPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDL
        GCTESVQGRIFGG DGVIGLGTS YSFTYKAAENANGGGF+YCLVDHLS  TATSYFILG P  S  ++AAASSV P+GNM+FTKLF+GDPY+SFYGVDL
Subjt:  GCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAP--SPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDL

Query:  IAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVS
        + ISADGVMLNIPPRVWDINSGGGTI+DSGTSLTMLAAPAFDMVMEAL PKLKHFE IE+EPF FCFNNS+YTHEMAPK+RFHFGDGT+FQPP KSY+VS
Subjt:  IAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVS

Query:  AGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
         GEYISCIGFVS+PFPA NIIGNILQQNHLW+FDF    VGFAPSECV
Subjt:  AGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein6.2e-22568.36Show/hide
Query:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY
        MLGYRKPMSPIS+FCFFF    LFFFLS  +    ALG  D               QE +K DLLHRHHPQVAEK+HG+MK++D+++R+KDIH+HD  R+
Subjt:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY

Query:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN
        ++IS S+N+KQ+E+ +L+AEAE     E AK  ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCSS  N
Subjt:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN

Query:  HKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASA
        HKS+NE+K RFR+AFLAN+SSSFKT+ CSST CT DLADLF++ EC  PTSPC+YDY                                   +YTGGASA
Subjt:  HKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASA

Query:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV
        KGIFA ETLTV LTNGKEKQLHNSIIGCTESVQG +FGG DGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  
Subjt:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV

Query:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM
            MT+TKL+VGDPY+SFYGVDLI ISA+G+MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK F+Q+E+EPF FCFNNSQYTHEM
Subjt:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM

Query:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        APK+RFHFGDGT+F+PP KSY+VS G++ISCIGFVS+PFPA NIIGNILQQNHLWQFDF +R+VGFAPSEC+
Subjt:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A1S3C2F3 aspartic proteinase CDR11.2e-21566.55Show/hide
Query:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL
        MLGYRKPMSPIS+FCFFF L FFLS  +    ALG             +Q+T++ DLLHRHHPQV+EKL+G+MK++DL++R+KDIH+HD  R+++IS S+
Subjt:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL

Query:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER
        N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCS   NHKS+NE+
Subjt:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER

Query:  KTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIE
        K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                                   +Y GGASAKGIFA E
Subjt:  KTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIE

Query:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF
        TLTV LTNGKEKQL NSIIGCTE VQG +F G DGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  P   M++
Subjt:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF

Query:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH
        TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F+QIE+EPF FCFNNSQYTH+MAPK+RFH
Subjt:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH

Query:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        FGDGT+F+PP KSY+VS GE+ISCIG VS+PFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A5D3B701 Aspartic proteinase CDR11.1e-19467.95Show/hide
Query:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS
        MK++DL++R+KDIH+HD  R+++IS S+N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGS
Subjt:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS

Query:  DLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFR
        DLTW+KCRYRRC GNCS   NHKS+NE+K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                      
Subjt:  DLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFR

Query:  VGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA
                     +Y GGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F G DGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A
Subjt:  VGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA

Query:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF
         SYF+LG P+PS SA+ SS  P   M++TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F
Subjt:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF

Query:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        +QIE+EPF FCFNNSQYTH+MAPK+RFHFGDGT+F+PP KSY+VS GE+ISCIG VS+PFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X21.9e-16555.03Show/hide
Query:  MSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEA
        MSPIS    FF F FFLSVH A  G +Q+          VKLD++HRHHP V EKL+G  +     DR +DIH+HD  R ++ISTS+   + + +     
Subjt:  MSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEA

Query:  EVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKT
                 LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LIVDTGSDLTW+KCRYRRC+GNC++ A+HKSR E K +F + FLAN+SSSFK 
Subjt:  EVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKT

Query:  IHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSI
        I C S  C  DL  LF+I +C+ P++PC+YDY Y+                                   GG +A G+FA ET+TV LTNGKEKQLH+++
Subjt:  IHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSI

Query:  IGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLI
        IGCTE        GVDG++GLGT  +SF ++AA + NGGGFSYCL+DHLSHH+ATSYFILG P     A   SV P GNMTF  L +G P+NS+YGV LI
Subjt:  IGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLI

Query:  AISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSA
         IS DGV LNIPPRVWDI  GGGTILDSGTSL+ML APAFD+ MEA+  KLK F+QI  +PF +CFN + Y+HEMAPK+RFHF  G +F+PP KSY+V  
Subjt:  AISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSA

Query:  GEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
         + I C+GF S+PFP  NIIGNILQQN LWQFDFF +KVGFAPS+C+
Subjt:  GEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X16.3e-16955.23Show/hide
Query:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE
        MLGY  PMSPIS    FF F FFLSVH A  G +Q+          VKLD++HRHHP V EKL+G  +     DR +DIH+HD  R ++ISTS+   + +
Subjt:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE

Query:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLAN
         +              LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LIVDTGSDLTW+KCRYRRC+GNC++ A+HKSR E K +F + FLAN
Subjt:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLAN

Query:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKE
        +SSSFK I C S  C  DL  LF+I +C+ P++PC+YDY Y+                                   GG +A G+FA ET+TV LTNGKE
Subjt:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKE

Query:  KQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS
        KQLH+++IGCTE        GVDG++GLGT  +SF ++AA + NGGGFSYCL+DHLSHH+ATSYFILG P     A   SV P GNMTF  L +G P+NS
Subjt:  KQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS

Query:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV
        +YGV LI IS DGV LNIPPRVWDI  GGGTILDSGTSL+ML APAFD+ MEA+  KLK F+QI  +PF +CFN + Y+HEMAPK+RFHF  G +F+PP 
Subjt:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV

Query:  KSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        KSY+V   + I C+GF S+PFP  NIIGNILQQN LWQFDFF +KVGFAPS+C+
Subjt:  KSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

SwissProt top hitse value%identityAlignment
O04496 Aspartyl protease AED35.1e-2725.28Show/hide
Query:  PTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLAD
        P P  + + SG+      Y V+ K+GTPPQ   +++DT +D  W+ C        CS  +N  +          +F  N SS++ T+ CS+  CT     
Subjt:  PTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLAD

Query:  LFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGG
             + +  T P               +S P  S   F              +Y G +S       +TLT+         + N   GC  S  G     
Subjt:  LFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGG

Query:  VDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATS--YFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIP
          G++GLG  P S   +   +   G FSYCL    S + + S    +LG P               ++ +T L       S Y V+L  +S   V + + 
Subjt:  VDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATS--YFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIP

Query:  P--RVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV
        P    +D NSG GTI+DSGT +T  A P ++ + +    ++       +  F  CF  S     +APKI  H     +  P   + + S+   ++C+   
Subjt:  P--RVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV

Query:  SVPFPA---YNIIGNILQQNHLWQFDFFRRKVGFAPSEC
         +   A    N+I N+ QQN    FD    ++G AP  C
Subjt:  SVPFPA---YNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Q6XBF8 Aspartic proteinase CDR11.1e-2626.08Show/hide
Query:  SSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCI
        S EY + + +GTPP   M I DTGSDL W +C        C         ++  T+    F    SS++K + CSS+ CT     L +   C T  + C 
Subjt:  SSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCI

Query:  YDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFT
        Y                                     +Y   +  KG  A++TLT+  ++ +  QL N IIGC  +  G       G++GLG  P S  
Subjt:  YDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFT

Query:  YKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG
         +  ++ + G FSYCLV   S    TS    G          +++V    +  T L       +FY + L +IS     +       + +S G  I+DSG
Subjt:  YKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG

Query:  TSLTMLAAPAFDMVMEALTPKL-KHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNH
        T+LT+L    +  + +A+   +    +Q        C+  S       P I  HF DG   +    +  V   E + C  F     P+++I GN+ Q N 
Subjt:  TSLTMLAAPAFDMVMEALTPKL-KHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNH

Query:  LWQFDFFRRKVGFAPSEC
        L  +D   + V F P++C
Subjt:  LWQFDFFRRKVGFAPSEC

Q9LNJ3 Aspartyl protease family protein 29.4e-2927.54Show/hide
Query:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCR-YRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTT
        +P P G    ++SG   GS EYF +L VGTP +   +++DTGSD+ W++C   RRC                       F    S ++ TI CSS  C  
Subjt:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCR-YRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTT

Query:  DLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGR
            L S G C T    C+Y                                     +Y  G+   G F+ ETLT      +  ++    +GC    +G 
Subjt:  DLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGR

Query:  IFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVML-
        +F G  G++GLG    SF  +     N   FSYCLVD              A S  +S    +   S    FT L      ++FY V L+ IS  G  + 
Subjt:  IFGGVDGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVML-

Query:  NIPPRVWDIN--SGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI-EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVV---SAGEY
         +   ++ ++    GG I+DSGTS+T L  PA+  + +A     K  ++  +   F  CF+ S       P +  HF  G     P  +Y++   + G++
Subjt:  NIPPRVWDIN--SGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI-EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVV---SAGEY

Query:  ISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
          C  F        +IIGNI QQ     +D    +VGFAP  C
Subjt:  ISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 15.5e-2925.46Show/hide
Query:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGEC
        ++SG+  GS EYF ++ VGTP +   L++DTGSD+ WI+C       +C  +++              F    SS++K++ CS+  C+     L     C
Subjt:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGEC

Query:  KTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVD-LTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIG
        +  ++ C+Y   Y                                          G F +  L  D +T G   +++N  +GC    +G +F G  G++G
Subjt:  KTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVD-LTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIG

Query:  LGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINS
        LG    S T           FSYCLVD  S               S+S   +SV   G      L      ++FY V L   S  G  + +P  ++D+++
Subjt:  LGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINS

Query:  --GGGTILDSGTSLTMLAAPAFDMVMEA---LTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEY-ISCIGFVSVPF
           GG ILD GT++T L   A++ + +A   LT  LK      +  F  C++ S  +    P + FHF  G     P K+Y++   +    C  F     
Subjt:  --GGGTILDSGTSLTMLAAPAFDMVMEA---LTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEY-ISCIGFVSVPF

Query:  PAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
         + +IIGN+ QQ     +D  +  +G + ++C
Subjt:  PAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Q9LTW4 Aspartic proteinase NANA, chloroplast2.3e-8340.78Show/hide
Query:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADL
        +G+KM   SG DYG+++YF +++VGTP + F ++VDTGS+LTW+ CRYR        K N           R  F A+ S SFKT+ C + TC  DL +L
Subjt:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADL

Query:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGV
        FS+  C TP++PC YDYR                                   Y  G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F G 
Subjt:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGV

Query:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV
        DGV+GL  S +SFT   A +  G  FSYCLVDHLS+   ++Y I G+ S S   A     P   +  T++        FY +++I IS    ML+IP +V
Subjt:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV

Query:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSV
        WD  SGGGTILDSGTSLT+LA  A+  V+  L   L   ++++ E  P ++CF+  S +     P++ FH   G  F+P  KSY+V A   + C+GFVS 
Subjt:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSV

Query:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
          PA N+IGNI+QQN+LW+FD     + FAPS C
Subjt:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein3.6e-4428.49Show/hide
Query:  RHHPQVAEKLHGNMKVED--LNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPI-LPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPP
        + H + + K    +K E       + D+   DL R +T+    N+ + ++  K   ++ +    +  P  SP  +   + SG   GS EYF+ + VGTPP
Subjt:  RHHPQVAEKLHGNMKVED--LNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPI-LPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPP

Query:  QTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLN
        + F LI+DTGSDL W++C    C  +C  +  +    + KT          S+SFK I C+   C+  ++      +C++    C Y Y Y         
Subjt:  QTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLN

Query:  SLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLT----NGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGG
                     G +             ++  G FA+ET TV+LT       E ++ N + GC    +G +F G  G++GLG  P SF+    ++  G 
Subjt:  SLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLT----NGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAAENANGG

Query:  GFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS---FYGVDLIAISADGVMLNIPPRVWDINS--GGGTILDSGTSLTM
         FSYCLVD  S+   +S  I G            ++   N+ FT  FV    NS   FY + + +I   G  L+IP   W+I+S   GGTI+DSGT+L+ 
Subjt:  GFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS---FYGVDLIAISADGVMLNIPPRVWDINS--GGGTILDSGTSLTM

Query:  LAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNS--QYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLW
         A PA++++      K+K    I  +      CFN S  +  +   P++   F DGT++  P ++  +   E + C+  +  P   ++IIGN  QQN   
Subjt:  LAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNS--QYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLW

Query:  QFDFFRRKVGFAPSEC
         +D  R ++GF P++C
Subjt:  QFDFFRRKVGFAPSEC

AT3G12700.1 Eukaryotic aspartyl protease family protein1.6e-8440.78Show/hide
Query:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADL
        +G+KM   SG DYG+++YF +++VGTP + F ++VDTGS+LTW+ CRYR        K N           R  F A+ S SFKT+ C + TC  DL +L
Subjt:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADL

Query:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGV
        FS+  C TP++PC YDYR                                   Y  G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F G 
Subjt:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGV

Query:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV
        DGV+GL  S +SFT   A +  G  FSYCLVDHLS+   ++Y I G+ S S   A     P   +  T++        FY +++I IS    ML+IP +V
Subjt:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV

Query:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSV
        WD  SGGGTILDSGTSLT+LA  A+  V+  L   L   ++++ E  P ++CF+  S +     P++ FH   G  F+P  KSY+V A   + C+GFVS 
Subjt:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSV

Query:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
          PA N+IGNI+QQN+LW+FD     + FAPS C
Subjt:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

AT3G25700.1 Eukaryotic aspartyl protease family protein3.3e-5332.27Show/hide
Query:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTC-TTDLADLFSIGE
        ++SG+  GS +YFV L++G PPQ+ +LI DTGSDL W+KC   R   NCS    H S           F   +SS+F   HC    C      D   I  
Subjt:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTC-TTDLADLFSIGE

Query:  CKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGC-----TESVQGRIFGGV
             S C Y+Y                                    Y  G+   G+FA ET ++  ++GKE +L +   GC      +SV G  F G 
Subjt:  CKTPTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGC-----TESVQGRIFGGV

Query:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV
        +GV+GLG  P SF  +      G  FSYCL+D+      TSY I+G      S           + FT L       +FY V L ++  +G  L I P +
Subjt:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV

Query:  WDI--NSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEP-FKFCFNNSQYT--HEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV
        W+I  +  GGT++DSGT+L  LA PA+  V+ A+  ++K      + P F  C N S  T   ++ P+++F F  G +F PP ++Y +   E I C+   
Subjt:  WDI--NSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEP-FKFCFNNSQYT--HEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV

Query:  SV-PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
        SV P   +++IGN++QQ  L++FD  R ++GF+   C
Subjt:  SV-PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

AT3G59080.1 Eukaryotic aspartyl protease family protein2.8e-4428.9Show/hide
Query:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS
        + +TVK   L R      EK   N    +++ DL  RI+ +H   L++    + S  +K+ ++++     V ++ +        T     + SG   GS 
Subjt:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS

Query:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFL-ANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIY
        EYF+ + VG+PP+ F LI+DTGSDL WI+C    C  +C  +               AF     S+S+K I C+   C   ++       CK+    C Y
Subjt:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFL-ANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIY

Query:  DYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGVDGVIGLGTSPY
         Y Y                     G S              ++  G FA+ET TV+L TNG   +L+   N + GC    +G +F G  G++GLG  P 
Subjt:  DYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGVDGVIGLGTSPY

Query:  SFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GG
        SF+    ++  G  FSYCLVD  S    +S  I G            ++   N+ FT    G  +  ++FY V + +I   G +LNIP   W+I+S   G
Subjt:  SFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GG

Query:  GTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNII
        GTI+DSGT+L+  A PA++ +   +  K K    +  +      CFN S   +   P++   F DG ++  P ++  +   E + C+  +  P  A++II
Subjt:  GTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNII

Query:  GNILQQNHLWQFDFFRRKVGFAPSEC
        GN  QQN    +D  R ++G+AP++C
Subjt:  GNILQQNHLWQFDFFRRKVGFAPSEC

AT3G59080.2 Eukaryotic aspartyl protease family protein2.7e-3927.24Show/hide
Query:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS
        + +TVK   L R      EK   N    +++ DL  RI+ +H   L++    + S  +K+ ++++     V ++ +        T     + SG   GS 
Subjt:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS

Query:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYD
        EYF+ + VG+PP+ F LI+DTGSDL WI+C    C  +C  + +++S                                                 C Y 
Subjt:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYD

Query:  YRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGVDGVIGLGTSPYS
        Y Y                     G S              ++  G FA+ET TV+L TNG   +L+   N + GC    +G +F G  G++GLG  P S
Subjt:  YRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGVDGVIGLGTSPYS

Query:  FTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GGG
        F+    ++  G  FSYCLVD  S    +S  I G            ++   N+ FT    G  +  ++FY V + +I   G +LNIP   W+I+S   GG
Subjt:  FTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GGG

Query:  TILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIG
        TI+DSGT+L+  A PA++ +   +  K K    +  +      CFN S   +   P++   F DG ++  P ++  +   E + C+  +  P  A++IIG
Subjt:  TILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIG

Query:  NILQQNHLWQFDFFRRKVGFAPSEC
        N  QQN    +D  R ++G+AP++C
Subjt:  NILQQNHLWQFDFFRRKVGFAPSEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAACGCATTGGGCGGCCATGACCAAGAAACTGT
AAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGATCGGATCAAGGATATTCACGATCACGACC
TCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGGCAGCGAAGGATCCGATACTTCCACCGACG
TCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGAACACCGCCTCAGACGTTCATGTTGATCGT
GGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAAGAGCCGAAACGAACGGAAAACGAGATTTA
GAAATGCGTTTTTGGCGAATTATTCGTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGATCTCGCGGATCTGTTCTCAATTGGGGAATGCAAAACC
CCAACTAGCCCTTGTATCTATGATTACAGGTACGTAAGATTGATTCACCAAAAACTAAATTCCTTACCACTCAACTCAACTGAATTATTTAGAGTTGGAGGGAGTAAGGT
GCGGTGCAACCCGACTACACCTAACTACACAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGACCTAACAAACGGAAAAGAAAAACAGCTCC
ACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTCGGCGGAGTCGACGGCGTCATTGGCTTAGGCACTAGCCCCTACTCTTTTACCTACAAAGCCGCC
GAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTCGGCGCCCCTTCCCCCTCCGCTTCCGCTGC
TGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGTCGATCTCATCGCAATCTCCGCCGACGGCG
TCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTTCGATATGGTC
ATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAGTACACCCATGAAATGGCCCCGAAGATCCG
ATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTATTGGGTTCGTTTCTGTGCCTTTTCCGGCCT
ACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCTCTGAATGCGTCTAA
mRNA sequenceShow/hide mRNA sequence
AGTCCCCATTATTAGGCGGCTCTATATTTATCACTTTTCAGTTAAGTTTGTTCTTCCTTCATGCTTTCCCCCCTTTTTAAGTCCGCCATTATCATCTTTCTTCTTGCTCC
ATCTCTCTAACATTACATTCTCTGTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAAC
GCATTGGGCGGCCATGACCAAGAAACTGTAAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGA
TCGGATCAAGGATATTCACGATCACGACCTCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGG
CAGCGAAGGATCCGATACTTCCACCGACGTCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGA
ACACCGCCTCAGACGTTCATGTTGATCGTGGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAA
GAGCCGAAACGAACGGAAAACGAGATTTAGAAATGCGTTTTTGGCGAATTATTCGTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGATCTCGCGG
ATCTGTTCTCAATTGGGGAATGCAAAACCCCAACTAGCCCTTGTATCTATGATTACAGGTACGTAAGATTGATTCACCAAAAACTAAATTCCTTACCACTCAACTCAACT
GAATTATTTAGAGTTGGAGGGAGTAAGGTGCGGTGCAACCCGACTACACCTAACTACACAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGA
CCTAACAAACGGAAAAGAAAAACAGCTCCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTCGGCGGAGTCGACGGCGTCATTGGCTTAGGCACTA
GCCCCTACTCTTTTACCTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTC
GGCGCCCCTTCCCCCTCCGCTTCCGCTGCTGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGT
CGATCTCATCGCAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCA
TGCTGGCGGCGCCGGCGTTCGATATGGTCATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAG
TACACCCATGAAATGGCCCCGAAGATCCGATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTAT
TGGGTTCGTTTCTGTGCCTTTTCCGGCCTACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCT
CTGAATGCGTCTAAAAACTTCTTTCAATTTCTTCATCATCATCTTCTTCTTCTTCTTCTTCTTCTCCTTCTTTTCGTTTCTTCTTCTTCTCC
Protein sequenceShow/hide protein sequence
MLGYRKPMSPISHFCFFFLFFFLSVHNALGGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPT
SPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKTRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKT
PTSPCIYDYRYVRLIHQKLNSLPLNSTELFRVGGSKVRCNPTTPNYTGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGVDGVIGLGTSPYSFTYKAA
ENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMV
MEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSVPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV