; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G19900 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G19900
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionEukaryotic aspartyl protease family protein
Genome locationClcChr02:32524550..32526496
RNA-Seq ExpressionClc02G19900
SyntenyClc02G19900
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa]7.3e-19668.56Show/hide
Query:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS
        MK++DL++R+KDIH+HD  R+++IS S+N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGS
Subjt:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS

Query:  DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFR
        DLTW+KCRYRRC GNCS   NHKS+NE+K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                      
Subjt:  DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFR

Query:  VGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA
                     +YAGGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A
Subjt:  VGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA

Query:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF
         SYF+LG P+PS SA+ SS  P   M++TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F
Subjt:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF

Query:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        +QIE+EPF FCFNNSQYTH+MAPK+RFHFGDGT+F+PP KSY+VS GE+ISCIG VSMPFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus]3.3e-22568.53Show/hide
Query:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY
        MLGYRKPMSPIS+FCFFF    LFFFLS  +    ALG  D               QE +K DLLHRHHPQVAEK+HG+MK++D+++R+KDIH+HD  R+
Subjt:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY

Query:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN
        ++IS S+N+KQ+E+ +L+AEAE     E AK  ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCSS  N
Subjt:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN

Query:  HKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASA
        HKS+NE+K RFR+AFLAN+SSSFKT+ CSST CT DLADLF++ EC  PTSPC+YDY                                   +Y GGASA
Subjt:  HKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASA

Query:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV
        KGIFA ETLTV LTNGKEKQLHNSIIGCTESVQG +FGGADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  
Subjt:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV

Query:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM
            MT+TKL+VGDPY+SFYGVDLI ISA+G+MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK F+Q+E+EPF FCFNNSQYTHEM
Subjt:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM

Query:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        APK+RFHFGDGT+F+PP KSY+VS G++ISCIGFVSMPFPA NIIGNILQQNHLWQFDF +R+VGFAPSEC+
Subjt:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo]5.7e-21767.08Show/hide
Query:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL
        MLGYRKPMSPIS+FCFFF L FFLS  +    ALG             +Q+T++ DLLHRHHPQV+EKL+G+MK++DL++R+KDIH+HD  R+++IS S+
Subjt:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL

Query:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER
        N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCS   NHKS+NE+
Subjt:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER

Query:  KMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIE
        K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                                   +YAGGASAKGIFA E
Subjt:  KMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIE

Query:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF
        TLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  P   M++
Subjt:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF

Query:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH
        TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F+QIE+EPF FCFNNSQYTH+MAPK+RFH
Subjt:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH

Query:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        FGDGT+F+PP KSY+VS GE+ISCIG VSMPFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata]3.8e-16855.05Show/hide
Query:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE
        MLGY  PMSPIS    FF F FFLSVH A  G +Q+          VKLD++HRHHP V EKL+G  +     DR +DIH+HD  R ++ISTS+   + +
Subjt:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE

Query:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLAN
         +              LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LIVDTGSDLTW+KCRYRRC+GNC++ A+HKSR E K++F + FLAN
Subjt:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLAN

Query:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKE
        +SSSFK I C S  C  DL  LF+I +C+ P++PC+YDY Y+                                   GG +A G+FA ET+TV LTNGKE
Subjt:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKE

Query:  KQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS
        KQLH+++IGCTE        G DG++GLGT  +SF ++AA + NGGGFSYCL+DHLSHH+ATSYFILG P     A   SV P GNMTF  L +G P+NS
Subjt:  KQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS

Query:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV
        +YGV LI IS DGV LNIPPRVWDI  GGGTILDSGTSL+ML APAFD+ MEA+  KLK F+QI  +PF +CFN + Y+HEMAPK+RFHF  G +F+PP 
Subjt:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV

Query:  KSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        KSY+V   + I C+GF S+PFP  NIIGNILQQN LWQFDFF +KVGFAPS+C+
Subjt:  KSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida]2.6e-24678.28Show/hide
Query:  MLGYRKPMSPISHFCFFFLFFFLSVHNAL--GGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAE
        MLGYRKPMSPISHFC FFLFFFLSV  A   G HDQE VKLDLLHRHHPQV+EKLHG++K+E++NDRIKDI +HD KRYQTIS+SLNR +++E+L+ EA 
Subjt:  MLGYRKPMSPISHFCFFFLFFFLSVHNAL--GGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAE

Query:  VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTI
          A KD  LPP S TPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLI DTGSDLTW+KCRYRRCIGNCSS  NHK+RNERK+RFRNAFLANYSSSFKTI
Subjt:  VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTI

Query:  HCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSII
         CSS  CT DLADLFSIGEC+TPTSPC+YDY                                   +Y+GGASAKG+FAIETLTV LTNGKEKQLHNSII
Subjt:  HCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSII

Query:  GCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAP--SPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDL
        GCTESVQGRIFGGADGVIGLGTS YSFTYKAAENANGGGF+YCLVDHLS  TATSYFILG P  S  ++AAASSV P+GNM+FTKLF+GDPY+SFYGVDL
Subjt:  GCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAP--SPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDL

Query:  IAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVS
        + ISADGVMLNIPPRVWDINSGGGTI+DSGTSLTMLAAPAFDMVMEAL PKLKHFE IE+EPF FCFNNS+YTHEMAPK+RFHFGDGT+FQPP KSY+VS
Subjt:  IAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVS

Query:  AGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
         GEYISCIGFVSMPFPA NIIGNILQQNHLW+FDF    VGFAPSECV
Subjt:  AGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein1.6e-22568.53Show/hide
Query:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY
        MLGYRKPMSPIS+FCFFF    LFFFLS  +    ALG  D               QE +K DLLHRHHPQVAEK+HG+MK++D+++R+KDIH+HD  R+
Subjt:  MLGYRKPMSPISHFCFFF----LFFFLSVHN----ALGGHD---------------QETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRY

Query:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN
        ++IS S+N+KQ+E+ +L+AEAE     E AK  ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCSS  N
Subjt:  QTISTSLNRKQIEE-KLKAEAEV----EAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKAN

Query:  HKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASA
        HKS+NE+K RFR+AFLAN+SSSFKT+ CSST CT DLADLF++ EC  PTSPC+YDY                                   +Y GGASA
Subjt:  HKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASA

Query:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV
        KGIFA ETLTV LTNGKEKQLHNSIIGCTESVQG +FGGADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  
Subjt:  KGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVV

Query:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM
            MT+TKL+VGDPY+SFYGVDLI ISA+G+MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK F+Q+E+EPF FCFNNSQYTHEM
Subjt:  PSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEM

Query:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        APK+RFHFGDGT+F+PP KSY+VS G++ISCIGFVSMPFPA NIIGNILQQNHLWQFDF +R+VGFAPSEC+
Subjt:  APKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A1S3C2F3 aspartic proteinase CDR12.8e-21767.08Show/hide
Query:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL
        MLGYRKPMSPIS+FCFFF L FFLS  +    ALG             +Q+T++ DLLHRHHPQV+EKL+G+MK++DL++R+KDIH+HD  R+++IS S+
Subjt:  MLGYRKPMSPISHFCFFF-LFFFLSVHN----ALGGH-----------DQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSL

Query:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER
        N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGSDLTW+KCRYRRC GNCS   NHKS+NE+
Subjt:  NRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNER

Query:  KMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIE
        K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                                   +YAGGASAKGIFA E
Subjt:  KMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIE

Query:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF
        TLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A SYF+LG P+PS SA+ SS  P   M++
Subjt:  TLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTF

Query:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH
        TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F+QIE+EPF FCFNNSQYTH+MAPK+RFH
Subjt:  TKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFH

Query:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        FGDGT+F+PP KSY+VS GE+ISCIG VSMPFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  FGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A5D3B701 Aspartic proteinase CDR13.5e-19668.56Show/hide
Query:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS
        MK++DL++R+KDIH+HD  R+++IS S+N+KQIE+ +L+AEAE    VE AK  ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLI DTGS
Subjt:  MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEE-KLKAEAE----VEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGS

Query:  DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFR
        DLTW+KCRYRRC GNCS   NHKS+NE+K RFR+A LAN SS+FKT+ CSST CT +LA+LF++ EC TPTSPC+YDY                      
Subjt:  DLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFR

Query:  VGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA
                     +YAGGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTS YS TYKAAENANGGGFSYCLVDHL+   A
Subjt:  VGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTA

Query:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF
         SYF+LG P+PS SA+ SS  P   M++TKL+VGDPY+SFYGVDLI ISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LK F
Subjt:  TSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHF

Query:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        +QIE+EPF FCFNNSQYTH+MAPK+RFHFGDGT+F+PP KSY+VS GE+ISCIG VSMPFP+ NIIGNILQQNHLWQFDF +R+VGFA SEC+
Subjt:  EQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X25.5e-16554.84Show/hide
Query:  MSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEA
        MSPIS    FF F FFLSVH A  G +Q+          VKLD++HRHHP V EKL+G  +     DR +DIH+HD  R ++ISTS+   + + +     
Subjt:  MSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEA

Query:  EVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKT
                 LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LIVDTGSDLTW+KCRYRRC+GNC++ A+HKSR E K++F + FLAN+SSSFK 
Subjt:  EVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKT

Query:  IHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSI
        I C S  C  DL  LF+I +C+ P++PC+YDY Y+                                   GG +A G+FA ET+TV LTNGKEKQLH+++
Subjt:  IHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSI

Query:  IGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLI
        IGCTE        G DG++GLGT  +SF ++AA + NGGGFSYCL+DHLSHH+ATSYFILG P     A   SV P GNMTF  L +G P+NS+YGV LI
Subjt:  IGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLI

Query:  AISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSA
         IS DGV LNIPPRVWDI  GGGTILDSGTSL+ML APAFD+ MEA+  KLK F+QI  +PF +CFN + Y+HEMAPK+RFHF  G +F+PP KSY+V  
Subjt:  AISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSA

Query:  GEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
         + I C+GF S+PFP  NIIGNILQQN LWQFDFF +KVGFAPS+C+
Subjt:  GEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X11.8e-16855.05Show/hide
Query:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE
        MLGY  PMSPIS    FF F FFLSVH A  G +Q+          VKLD++HRHHP V EKL+G  +     DR +DIH+HD  R ++ISTS+   + +
Subjt:  MLGYRKPMSPISHFCFFFLF-FFLSVHNALGGHDQE---------TVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIE

Query:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLAN
         +              LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LIVDTGSDLTW+KCRYRRC+GNC++ A+HKSR E K++F + FLAN
Subjt:  EKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLAN

Query:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKE
        +SSSFK I C S  C  DL  LF+I +C+ P++PC+YDY Y+                                   GG +A G+FA ET+TV LTNGKE
Subjt:  YSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKE

Query:  KQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS
        KQLH+++IGCTE        G DG++GLGT  +SF ++AA + NGGGFSYCL+DHLSHH+ATSYFILG P     A   SV P GNMTF  L +G P+NS
Subjt:  KQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS

Query:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV
        +YGV LI IS DGV LNIPPRVWDI  GGGTILDSGTSL+ML APAFD+ MEA+  KLK F+QI  +PF +CFN + Y+HEMAPK+RFHF  G +F+PP 
Subjt:  FYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPV

Query:  KSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV
        KSY+V   + I C+GF S+PFP  NIIGNILQQN LWQFDFF +KVGFAPS+C+
Subjt:  KSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV

SwissProt top hitse value%identityAlignment
O04496 Aspartyl protease AED32.3e-2725.28Show/hide
Query:  PTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLAD
        P P  + + SG+      Y V+ K+GTPPQ   +++DT +D  W+ C        CS  +N  +          +F  N SS++ T+ CS+  CT     
Subjt:  PTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLAD

Query:  LFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGG
             + +  T P               +S P  S   F              +Y G +S       +TLT+         + N   GC  S  G     
Subjt:  LFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGG

Query:  ADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATS--YFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIP
          G++GLG  P S   +   +   G FSYCL    S + + S    +LG P               ++ +T L       S Y V+L  +S   V + + 
Subjt:  ADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATS--YFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIP

Query:  P--RVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV
        P    +D NSG GTI+DSGT +T  A P ++ + +    ++       +  F  CF  S     +APKI  H     +  P   + + S+   ++C+   
Subjt:  P--RVWDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV

Query:  SMPFPA---YNIIGNILQQNHLWQFDFFRRKVGFAPSEC
         +   A    N+I N+ QQN    FD    ++G AP  C
Subjt:  SMPFPA---YNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Q6XBF8 Aspartic proteinase CDR13.0e-2725.84Show/hide
Query:  SSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCI
        S EY + + +GTPP   M I DTGSDL W +C       +C ++ +              F    SS++K + CSS+ CT     L +   C T  + C 
Subjt:  SSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCI

Query:  YDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFT
        Y                                     +Y   +  KG  A++TLT+  ++ +  QL N IIGC  +  G       G++GLG  P S  
Subjt:  YDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFT

Query:  YKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG
         +  ++ + G FSYCLV   S    TS    G          +++V    +  T L       +FY + L +IS     +       + +S G  I+DSG
Subjt:  YKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSG

Query:  TSLTMLAAPAFDMVMEALTPKL-KHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNH
        T+LT+L    +  + +A+   +    +Q        C+  S       P I  HF DG   +    +  V   E + C  F     P+++I GN+ Q N 
Subjt:  TSLTMLAAPAFDMVMEALTPKL-KHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNH

Query:  LWQFDFFRRKVGFAPSEC
        L  +D   + V F P++C
Subjt:  LWQFDFFRRKVGFAPSEC

Q9LNJ3 Aspartyl protease family protein 22.5e-2927.77Show/hide
Query:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCR-YRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTT
        +P P G    ++SG   GS EYF +L VGTP +   +++DTGSD+ W++C   RRC                       F    S ++ TI CSS  C  
Subjt:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCR-YRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTT

Query:  DLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGR
            L S G C T    C+Y                                     +Y  G+   G F+ ETLT      +  ++    +GC    +G 
Subjt:  DLADLFSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGR

Query:  IFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVML-
        +F GA G++GLG    SF  +     N   FSYCLVD              A S  +S    +   S    FT L      ++FY V L+ IS  G  + 
Subjt:  IFGGADGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVML-

Query:  NIPPRVWDIN--SGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI-EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVV---SAGEY
         +   ++ ++    GG I+DSGTS+T L  PA+  + +A     K  ++  +   F  CF+ S       P +  HF  G     P  +Y++   + G++
Subjt:  NIPPRVWDIN--SGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI-EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVV---SAGEY

Query:  ISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
          C  F        +IIGNI QQ     +D    +VGFAP  C
Subjt:  ISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.9e-2925.75Show/hide
Query:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGEC
        ++SG+  GS EYF ++ VGTP +   L++DTGSD+ WI+C       +C  +++              F    SS++K++ CS+  C+     L     C
Subjt:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGEC

Query:  KTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGL
        +  ++ C+Y                                     +Y  G+   G  A +T+T     G   +++N  +GC    +G +F GA G++GL
Subjt:  KTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGL

Query:  GTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINS-
        G    S T           FSYCLVD  S               S+S   +SV   G      L      ++FY V L   S  G  + +P  ++D+++ 
Subjt:  GTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINS-

Query:  -GGGTILDSGTSLTMLAAPAFDMVMEA---LTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEY-ISCIGFVSMPFP
          GG ILD GT++T L   A++ + +A   LT  LK      +  F  C++ S  +    P + FHF  G     P K+Y++   +    C  F      
Subjt:  -GGGTILDSGTSLTMLAAPAFDMVMEA---LTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEY-ISCIGFVSMPFP

Query:  AYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
        + +IIGN+ QQ     +D  +  +G + ++C
Subjt:  AYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Q9LTW4 Aspartic proteinase NANA, chloroplast2.7e-8441.24Show/hide
Query:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADL
        +G+KM   SG DYG+++YF +++VGTP + F ++VDTGS+LTW+ CRYR        K N           R  F A+ S SFKT+ C + TC  DL +L
Subjt:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADL

Query:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGA
        FS+  C TP++PC YDYR                                   YA G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F GA
Subjt:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGA

Query:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV
        DGV+GL  S +SFT   A +  G  FSYCLVDHLS+   ++Y I G+ S S   A     P   +  T++        FY +++I IS    ML+IP +V
Subjt:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV

Query:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSM
        WD  SGGGTILDSGTSLT+LA  A+  V+  L   L   ++++ E  P ++CF+  S +     P++ FH   G  F+P  KSY+V A   + C+GFVS 
Subjt:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSM

Query:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
          PA N+IGNI+QQN+LW+FD     + FAPS C
Subjt:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein4.3e-4528.82Show/hide
Query:  RHHPQVAEKLHGNMKVED--LNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPI-LPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPP
        + H + + K    +K E       + D+   DL R +T+    N+ + ++  K   ++ +    +  P  SP  +   + SG   GS EYF+ + VGTPP
Subjt:  RHHPQVAEKLHGNMKVED--LNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPI-LPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPP

Query:  QTFMLIVDTGSDLTWIKC-RYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKL
        + F LI+DTGSDL W++C     C        + K+                S+SFK I C+   C+  ++      +C++    C Y Y Y        
Subjt:  QTFMLIVDTGSDLTWIKC-RYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYDYRYVRLIHQKL

Query:  NSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLT----NGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANG
                       G+ R N T           G FA+ET TV+LT       E ++ N + GC    +G +F GA G++GLG  P SF+    ++  G
Subjt:  NSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLT----NGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAAENANG

Query:  GGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS---FYGVDLIAISADGVMLNIPPRVWDINS--GGGTILDSGTSLT
          FSYCLVD  S+   +S  I G            ++   N+ FT  FV    NS   FY + + +I   G  L+IP   W+I+S   GGTI+DSGT+L+
Subjt:  GGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNS---FYGVDLIAISADGVMLNIPPRVWDINS--GGGTILDSGTSLT

Query:  MLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNS--QYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHL
          A PA++++      K+K    I  +      CFN S  +  +   P++   F DGT++  P ++  +   E + C+  +  P   ++IIGN  QQN  
Subjt:  MLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNS--QYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHL

Query:  WQFDFFRRKVGFAPSEC
          +D  R ++GF P++C
Subjt:  WQFDFFRRKVGFAPSEC

AT3G12700.1 Eukaryotic aspartyl protease family protein1.9e-8541.24Show/hide
Query:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADL
        +G+KM   SG DYG+++YF +++VGTP + F ++VDTGS+LTW+ CRYR        K N           R  F A+ S SFKT+ C + TC  DL +L
Subjt:  IGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADL

Query:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGA
        FS+  C TP++PC YDYR                                   YA G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F GA
Subjt:  FSIGECKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGA

Query:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV
        DGV+GL  S +SFT   A +  G  FSYCLVDHLS+   ++Y I G+ S S   A     P   +  T++        FY +++I IS    ML+IP +V
Subjt:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV

Query:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSM
        WD  SGGGTILDSGTSLT+LA  A+  V+  L   L   ++++ E  P ++CF+  S +     P++ FH   G  F+P  KSY+V A   + C+GFVS 
Subjt:  WDINSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVE--PFKFCFN-NSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSM

Query:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
          PA N+IGNI+QQN+LW+FD     + FAPS C
Subjt:  PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

AT3G25700.1 Eukaryotic aspartyl protease family protein8.6e-5432.49Show/hide
Query:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTC-TTDLADLFSIGE
        ++SG+  GS +YFV L++G PPQ+ +LI DTGSDL W+KC   R   NCS    H S           F   +SS+F   HC    C      D   I  
Subjt:  MISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTC-TTDLADLFSIGE

Query:  CKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGC-----TESVQGRIFGGA
             S C Y+Y                                    YA G+   G+FA ET ++  ++GKE +L +   GC      +SV G  F GA
Subjt:  CKTPTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGC-----TESVQGRIFGGA

Query:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV
        +GV+GLG  P SF  +      G  FSYCL+D+      TSY I+G      S           + FT L       +FY V L ++  +G  L I P +
Subjt:  DGVIGLGTSPYSFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRV

Query:  WDI--NSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEP-FKFCFNNSQYT--HEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV
        W+I  +  GGT++DSGT+L  LA PA+  V+ A+  ++K      + P F  C N S  T   ++ P+++F F  G +F PP ++Y +   E I C+   
Subjt:  WDI--NSGGGTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQIEVEP-FKFCFNNSQYT--HEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFV

Query:  SM-PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC
        S+ P   +++IGN++QQ  L++FD  R ++GF+   C
Subjt:  SM-PFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSEC

AT3G59080.1 Eukaryotic aspartyl protease family protein7.3e-4528.71Show/hide
Query:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS
        + +TVK   L R      EK   N    +++ DL  RI+ +H   L++    + S  +K+ ++++     V ++ +        T     + SG   GS 
Subjt:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS

Query:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFL-ANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIY
        EYF+ + VG+PP+ F LI+DTGSDL WI+C    C  +C  +               AF     S+S+K I C+   C   ++       CK+    C Y
Subjt:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFL-ANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIY

Query:  DYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGADGVIGLGTSPY
         Y                                    Y   ++  G FA+ET TV+L TNG   +L+   N + GC    +G +F GA G++GLG  P 
Subjt:  DYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGADGVIGLGTSPY

Query:  SFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GG
        SF+    ++  G  FSYCLVD  S    +S  I G            ++   N+ FT    G  +  ++FY V + +I   G +LNIP   W+I+S   G
Subjt:  SFTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GG

Query:  GTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNII
        GTI+DSGT+L+  A PA++ +   +  K K    +  +      CFN S   +   P++   F DG ++  P ++  +   E + C+  +  P  A++II
Subjt:  GTILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNII

Query:  GNILQQNHLWQFDFFRRKVGFAPSEC
        GN  QQN    +D  R ++G+AP++C
Subjt:  GNILQQNHLWQFDFFRRKVGFAPSEC

AT3G59080.2 Eukaryotic aspartyl protease family protein7.1e-4027.05Show/hide
Query:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS
        + +TVK   L R      EK   N    +++ DL  RI+ +H   L++    + S  +K+ ++++     V ++ +        T     + SG   GS 
Subjt:  DQETVKLDLLHRHHPQVAEKLHGN----MKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPTSPTPIGLKMISGSDYGSS

Query:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYD
        EYF+ + VG+PP+ F LI+DTGSDL WI+C                      +   + F  N + S                              C Y 
Subjt:  EYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKTPTSPCIYD

Query:  YRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGADGVIGLGTSPYS
        Y                                    Y   ++  G FA+ET TV+L TNG   +L+   N + GC    +G +F GA G++GLG  P S
Subjt:  YRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDL-TNGKEKQLH---NSIIGCTESVQGRIFGGADGVIGLGTSPYS

Query:  FTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GGG
        F+    ++  G  FSYCLVD  S    +S  I G            ++   N+ FT    G  +  ++FY V + +I   G +LNIP   W+I+S   GG
Subjt:  FTYKAAENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVG--DPYNSFYGVDLIAISADGVMLNIPPRVWDINS--GGG

Query:  TILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIG
        TI+DSGT+L+  A PA++ +   +  K K    +  +      CFN S   +   P++   F DG ++  P ++  +   E + C+  +  P  A++IIG
Subjt:  TILDSGTSLTMLAAPAFDMVMEALTPKLKHFEQI--EVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIG

Query:  NILQQNHLWQFDFFRRKVGFAPSEC
        N  QQN    +D  R ++G+AP++C
Subjt:  NILQQNHLWQFDFFRRKVGFAPSEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAACGCATTGGGCGGCCATGACCAAGAAACTGT
AAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGATCGGATCAAGGATATTCACGATCACGACC
TCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGGCAGCGAAGGATCCGATACTTCCACCGACG
TCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGAACACCGCCTCAGACGTTCATGTTGATCGT
GGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAAGAGCCGAAACGAACGGAAAATGAGATTTA
GAAATGCGTTTTTGGCAAATTATTCCTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGACCTCGCAGATCTGTTCTCAATTGGGGAATGCAAAACC
CCAACTAGCCCTTGTATCTATGATTACAGGTACGTAAGATTGATTCACCAAAAACTAAATTCCTTACCCCTCAACTCAACTGACTTATTTAGAGTTGGAGGGGGCGAGGT
GCGGTGCAACCCGACTACACCTAACTACGCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGACCTAACAAACGGAAAAGAAAAACAGCTCC
ACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTTGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCCCCTACTCTTTTACCTACAAAGCCGCC
GAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTCGGCGCCCCTTCCCCCTCCGCTTCCGCTGC
TGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGTCGATCTCATTGCAATCTCCGCCGACGGCG
TCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTTCGATATGGTC
ATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAGTACACCCATGAAATGGCCCCGAAGATCCG
ATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTATTGGGTTCGTTTCTATGCCTTTTCCGGCCT
ACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCTCTGAATGCGTCTAA
mRNA sequenceShow/hide mRNA sequence
AGTCCCCATTATTAGGCGCTCTATATTTATCACTTTTCAGTTAAGTTTGTTCTTCCTTCATGCTTTCCCCCCTTTTTAAGTCCGCCATTATCATCTTTCTTCTTGCTCCA
TCTCTCTAACATTACATTCTCTGTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTCTTCTTCTTCCTCTCTGTTCACAACG
CATTGGGCGGCCATGACCAAGAAACTGTAAAACTCGATCTACTTCACCGTCACCATCCACAAGTCGCCGAGAAGCTTCACGGCAATATGAAAGTTGAAGATCTCAACGAT
CGGATCAAGGATATTCACGATCACGACCTCAAACGATATCAAACCATCTCCACATCGTTGAACCGGAAGCAAATTGAGGAGAAATTGAAGGCGGAAGCGGAGGTTGAGGC
AGCGAAGGATCCGATACTTCCACCGACGTCGCCTACGCCGATAGGGCTGAAAATGATATCAGGTTCAGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTGGGAA
CACCGCCTCAGACGTTCATGTTGATCGTGGATACCGGAAGCGATCTAACGTGGATAAAATGTAGATATCGGAGGTGTATCGGAAATTGTAGCAGCAAGGCGAATCATAAG
AGCCGAAACGAACGGAAAATGAGATTTAGAAATGCGTTTTTGGCAAATTATTCCTCGTCTTTCAAGACCATTCATTGCAGCTCCACGACTTGTACCACTGACCTCGCAGA
TCTGTTCTCAATTGGGGAATGCAAAACCCCAACTAGCCCTTGTATCTATGATTACAGGTACGTAAGATTGATTCACCAAAAACTAAATTCCTTACCCCTCAACTCAACTG
ACTTATTTAGAGTTGGAGGGGGCGAGGTGCGGTGCAACCCGACTACACCTAACTACGCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATCGAGACCCTAACCGTAGAC
CTAACAAACGGAAAAGAAAAACAGCTCCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGAAGGATATTTGGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAG
CCCCTACTCTTTTACCTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCCTACTGCCTTGTCGACCATCTCAGCCACCACACCGCCACCAGCTACTTCATCCTCG
GCGCCCCTTCCCCCTCCGCTTCCGCTGCTGCCTCCTCCGTCGTCCCTTCTGGCAACATGACCTTCACCAAACTCTTCGTCGGCGACCCTTACAACAGCTTCTACGGCGTC
GATCTCATTGCAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCCCGCGTTTGGGACATCAATTCCGGCGGCGGTACCATCCTCGACTCCGGAACTAGCCTCACCAT
GCTGGCGGCGCCGGCGTTCGATATGGTCATGGAAGCTCTGACTCCTAAGCTGAAGCATTTCGAGCAAATTGAAGTCGAACCCTTCAAATTTTGCTTCAATAATAGCCAGT
ACACCCATGAAATGGCCCCGAAGATCCGATTCCATTTCGGCGACGGCACGATGTTCCAGCCGCCGGTGAAAAGCTACGTTGTGTCGGCGGGTGAATATATTAGCTGTATT
GGGTTCGTTTCTATGCCTTTTCCGGCCTACAATATCATCGGGAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCTTTAGGAGAAAAGTCGGTTTTGCCCCCTC
TGAATGCGTCTAAAAACTTCTTTCAATTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCCTCCTTCTTCTCG
Protein sequenceShow/hide protein sequence
MLGYRKPMSPISHFCFFFLFFFLSVHNALGGHDQETVKLDLLHRHHPQVAEKLHGNMKVEDLNDRIKDIHDHDLKRYQTISTSLNRKQIEEKLKAEAEVEAAKDPILPPT
SPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIVDTGSDLTWIKCRYRRCIGNCSSKANHKSRNERKMRFRNAFLANYSSSFKTIHCSSTTCTTDLADLFSIGECKT
PTSPCIYDYRYVRLIHQKLNSLPLNSTDLFRVGGGEVRCNPTTPNYAGGASAKGIFAIETLTVDLTNGKEKQLHNSIIGCTESVQGRIFGGADGVIGLGTSPYSFTYKAA
ENANGGGFSYCLVDHLSHHTATSYFILGAPSPSASAAASSVVPSGNMTFTKLFVGDPYNSFYGVDLIAISADGVMLNIPPRVWDINSGGGTILDSGTSLTMLAAPAFDMV
MEALTPKLKHFEQIEVEPFKFCFNNSQYTHEMAPKIRFHFGDGTMFQPPVKSYVVSAGEYISCIGFVSMPFPAYNIIGNILQQNHLWQFDFFRRKVGFAPSECV