; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi10G015010 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi10G015010
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionaspartic proteinase NANA, chloroplast-like
Genome locationchr10:19211172..19213679
RNA-Seq ExpressionLsi10G015010
SyntenyLsi10G015010
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa]3.1e-19672.44Show/hide
Query:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ
        MK QDL+ R+KDIHEHD  R++SIS S+N+K IE+       A+ +AEAEA  + E    A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP Q
Subjt:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ

Query:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF
        TFMLIADTGSDLTW+KCRYRRC GNCS   +HKS+NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIF
Subjt:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF

Query:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN
        A ETLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   
Subjt:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN

Query:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL
        MS+T+L+VGDPYSSFYGV LIGISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL
Subjt:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL

Query:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
         FHF DGTVF+PP KSYIVSVGEFISCIG VSMPFP+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus]7.7e-22772.21Show/hide
Query:  MLGYRKPMSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRY
        MLGYRKPMSPIS+FCFFF    LFFFLS   +F    GD D               QE +K DLLHRHHPQV+EK+HGDMK QD++ R+KDIHEHDH R+
Subjt:  MLGYRKPMSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRY

Query:  QSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRR
        +SIS S+N+K +E+       A+ +AEAEA  E   E  A+ +ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLIADTGSDLTWMKCRYRR
Subjt:  QSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRR

Query:  CIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSI
        C GNCSS  +HKS+NE K RFR+AFLAN+SSSFKT+ CSST+C  DL+DLF++ EC  PTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQLHNSI
Subjt:  CIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSI

Query:  IGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLI
        IGCTESVQG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LGIP+PS S++ SS      M++T+L+VGDPYSSFYGV LI
Subjt:  IGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLI

Query:  GISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV
        GISA+G+MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK+F+ +EI+PF+FCFNNSQYTHEM PKL FHF DGTVF+PP KSYIVSV
Subjt:  GISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV

Query:  GEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        G+FISCIGFVSMPFPA NIIGNILQQNHLWQFDF   +VGFAPSEC+
Subjt:  GEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo]8.5e-21870.74Show/hide
Query:  MLGYRKPMSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL
        MLGYRKPMSPIS+FCFFF L FFLS   +F    GD            +Q+T++ DLLHRHHPQVSEKL+GDMK QDL+ R+KDIHEHD  R++SIS S+
Subjt:  MLGYRKPMSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL

Query:  NRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS
        N+K IE+       A+ +AEAEA  + E    A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS 
Subjt:  NRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS

Query:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV
          +HKS+NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE V
Subjt:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV

Query:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV
        QG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   MS+T+L+VGDPYSSFYGV LIGISADG 
Subjt:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV

Query:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI
        MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL FHF DGTVF+PP KSYIVSVGEFISCI
Subjt:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI

Query:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        G VSMPFP+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata]7.1e-17257.87Show/hide
Query:  MLGYRKPMSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIE
        MLGY  PMSPIS    FF F FFLSVH+AF GD  Q+         VKLD++HRHHP V EKL+G+ +      R +DIHEHDH R +SISTS+     +
Subjt:  MLGYRKPMSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIE

Query:  ERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKS
         +                             LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC++ A HKS
Subjt:  ERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKS

Query:  RNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFR
        R E K++F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A G+FA ET+TV LTNGKEKQLH+++IGCTE       +
Subjt:  RNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFR

Query:  GADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPP
        G DG++GLGT ++SF ++AA + NGGGFSYCL+DHLS H+ATSYFILG P     +   SV P GNM+F  L +G P++S+YGV LIGIS DGV LNIPP
Subjt:  GADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPP

Query:  RVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMP
        RVWDI  GGGTI+DSGTSL+ML APAFD+ MEA+  KLK+F+ I   PF +CFN + Y+HEM PKL FHF  G VF+PP KSYIV V + I C+GF S+P
Subjt:  RVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMP

Query:  FPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        FP TNIIGNILQQN LWQFDFFN KVGFAPS+C+
Subjt:  FPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida]1.0e-24782.01Show/hide
Query:  MLGYRKPMSPISHFCFFFLFFFLSVHIAFGD--YDQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAK
        MLGYRKPMSPISHFC FFLFFFLSV IAFGD  +DQE VKLDLLHRHHPQVSEKLHGD+K +++N R+KDI EHD KRYQ+IS+SLNR  ++E+   EA 
Subjt:  MLGYRKPMSPISHFCFFFLFFFLSVHIAFGD--YDQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAK

Query:  AKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRF
                      AE A ++  LPP S TPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS  +HK+RNE K+RF
Subjt:  AKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRF

Query:  RNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGL
        RNAFLANYSSSFKTI CSS +C  DL+DLFSIGECQTPTSPCLYDYSYSGGASAKG+FAIETLTV LTNGKEKQLHNSIIGCTESVQGRIF GADGVIGL
Subjt:  RNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGL

Query:  GTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIP--SPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDIN
        GTSSYSFTYKAA+NANGGGF+YCLVDHLSD TATSYFILG P  S  +++AASSV P+GNMSFT+LF+GDPYSSFYGV L+GISADGVMLNIPPRVWDIN
Subjt:  GTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIP--SPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDIN

Query:  SGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNI
        SGGGTIVDSGTSLTMLAAPAFDMVMEAL PKLK FE IEI+PF+FCFNNS+YTHEM PKL FHF DGTVFQPP KSYIVSVGE+ISCIGFVSMPFPATNI
Subjt:  SGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNI

Query:  IGNILQQNHLWQFDFFNGKVGFAPSECV
        IGNILQQNHLW+FDF  G VGFAPSECV
Subjt:  IGNILQQNHLWQFDFFNGKVGFAPSECV

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein3.7e-22772.21Show/hide
Query:  MLGYRKPMSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRY
        MLGYRKPMSPIS+FCFFF    LFFFLS   +F    GD D               QE +K DLLHRHHPQV+EK+HGDMK QD++ R+KDIHEHDH R+
Subjt:  MLGYRKPMSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRY

Query:  QSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRR
        +SIS S+N+K +E+       A+ +AEAEA  E   E  A+ +ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLIADTGSDLTWMKCRYRR
Subjt:  QSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRR

Query:  CIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSI
        C GNCSS  +HKS+NE K RFR+AFLAN+SSSFKT+ CSST+C  DL+DLF++ EC  PTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQLHNSI
Subjt:  CIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSI

Query:  IGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLI
        IGCTESVQG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LGIP+PS S++ SS      M++T+L+VGDPYSSFYGV LI
Subjt:  IGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLI

Query:  GISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV
        GISA+G+MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK+F+ +EI+PF+FCFNNSQYTHEM PKL FHF DGTVF+PP KSYIVSV
Subjt:  GISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV

Query:  GEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        G+FISCIGFVSMPFPA NIIGNILQQNHLWQFDF   +VGFAPSEC+
Subjt:  GEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

A0A1S3C2F3 aspartic proteinase CDR14.1e-21870.74Show/hide
Query:  MLGYRKPMSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL
        MLGYRKPMSPIS+FCFFF L FFLS   +F    GD            +Q+T++ DLLHRHHPQVSEKL+GDMK QDL+ R+KDIHEHD  R++SIS S+
Subjt:  MLGYRKPMSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL

Query:  NRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS
        N+K IE+       A+ +AEAEA  + E    A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS 
Subjt:  NRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS

Query:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV
          +HKS+NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE V
Subjt:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV

Query:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV
        QG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   MS+T+L+VGDPYSSFYGV LIGISADG 
Subjt:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV

Query:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI
        MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL FHF DGTVF+PP KSYIVSVGEFISCI
Subjt:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI

Query:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        G VSMPFP+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

A0A5D3B701 Aspartic proteinase CDR11.5e-19672.44Show/hide
Query:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ
        MK QDL+ R+KDIHEHD  R++SIS S+N+K IE+       A+ +AEAEA  + E    A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP Q
Subjt:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ

Query:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF
        TFMLIADTGSDLTW+KCRYRRC GNCS   +HKS+NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIF
Subjt:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF

Query:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN
        A ETLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   
Subjt:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN

Query:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL
        MS+T+L+VGDPYSSFYGV LIGISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL
Subjt:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL

Query:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
         FHF DGTVF+PP KSYIVSVGEFISCIG VSMPFP+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X21.0e-16857.69Show/hide
Query:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEA
        MSPIS    FF F FFLSVH+AF GD  Q+         VKLD++HRHHP V EKL+G+ +      R +DIHEHDH R +SISTS+     + +     
Subjt:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEA

Query:  KAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR
                                LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC++ A HKSR E K++
Subjt:  KAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR

Query:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG
        F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A G+FA ET+TV LTNGKEKQLH+++IGCTE       +G DG++G
Subjt:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG

Query:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS
        LGT ++SF ++AA + NGGGFSYCL+DHLS H+ATSYFILG P     +   SV P GNM+F  L +G P++S+YGV LIGIS DGV LNIPPRVWDI  
Subjt:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS

Query:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII
        GGGTI+DSGTSL+ML APAFD+ MEA+  KLK+F+ I   PF +CFN + Y+HEM PKL FHF  G VF+PP KSYIV V + I C+GF S+PFP TNII
Subjt:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII

Query:  GNILQQNHLWQFDFFNGKVGFAPSECV
        GNILQQN LWQFDFFN KVGFAPS+C+
Subjt:  GNILQQNHLWQFDFFNGKVGFAPSECV

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X13.4e-17257.87Show/hide
Query:  MLGYRKPMSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIE
        MLGY  PMSPIS    FF F FFLSVH+AF GD  Q+         VKLD++HRHHP V EKL+G+ +      R +DIHEHDH R +SISTS+     +
Subjt:  MLGYRKPMSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIE

Query:  ERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKS
         +                             LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC++ A HKS
Subjt:  ERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKS

Query:  RNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFR
        R E K++F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A G+FA ET+TV LTNGKEKQLH+++IGCTE       +
Subjt:  RNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFR

Query:  GADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPP
        G DG++GLGT ++SF ++AA + NGGGFSYCL+DHLS H+ATSYFILG P     +   SV P GNM+F  L +G P++S+YGV LIGIS DGV LNIPP
Subjt:  GADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPP

Query:  RVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMP
        RVWDI  GGGTI+DSGTSL+ML APAFD+ MEA+  KLK+F+ I   PF +CFN + Y+HEM PKL FHF  G VF+PP KSYIV V + I C+GF S+P
Subjt:  RVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMP

Query:  FPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        FP TNIIGNILQQN LWQFDFFN KVGFAPS+C+
Subjt:  FPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

SwissProt top hitse value%identityAlignment
Q8S9J6 Aspartyl protease family protein At5g107701.0e-3229.41Show/hide
Query:  GSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTP
        GS  GS  Y V + +GTP     LI DTGSDLTW +C  + C+  C  +                F  + S+S+  + CSS  C    S   + G C   
Subjt:  GSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTP

Query:  TSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFI
         S C+Y   Y   + + G  A E  T  LTN           GC E+ QG +F G  G++GLG    SF  + A  A    FSYCL    S     ++  
Subjt:  TSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFI

Query:  LGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIE-
         GI              S ++ FT +      +SFYG++++ I+  G  L IP  V+   S  G ++DSGT +T L   A+  +  +   K+ ++     
Subjt:  LGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIE-

Query:  IQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKS--YIVSVGEFISCIGFVSMPFPA-TNIIGNILQQNHLWQFDFFNGKVGFAPSEC
        +   + CF+ S +    +PK+ F F  G V +  +K   Y+  + +   C+ F      +   I GN+ QQ     +D   G+VGFAP+ C
Subjt:  IQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKS--YIVSVGEFISCIGFVSMPFPA-TNIIGNILQQNHLWQFDFFNGKVGFAPSEC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 21.1e-3930.83Show/hide
Query:  GLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSI
        G  ++SG D GS EYFV++ VG+PP+   ++ D+GSD+ W++C+          K  +K  +         F    S S+  + C S++C +       I
Subjt:  GLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSI

Query:  GECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT
              +  C Y+  Y  G+  KG  A+ETLT   T      + N  +GC    +G +F GA G++G+G  S SF  + +    GG F YCLV   +D T
Subjt:  GECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT

Query:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTML---AAPAFDMVMEALT
         +  F                LP G  S+  L       SFY V L G+   GV + +P  V+D+     GG ++D+GT++T L   A  AF    ++ T
Subjt:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTML---AAPAFDMVMEALT

Query:  PKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
          L +   + I  F+ C++ S +    VP + F+F +G V   PA+++++ V +    C  F + P    +IIGNI Q+     FD  NG VGF P+ C
Subjt:  PKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

Q9LNJ3 Aspartyl protease family protein 21.9e-3932.85Show/hide
Query:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR-YRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKK
        +P P G    ++SG   GS EYF +L VGTP +   ++ DTGSD+ W++C   RRC                       F    S ++ TI CSS  C++
Subjt:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR-YRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKK

Query:  DLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCL
            L S G C T    CLY  SY  G+   G F+ ETLT      +  ++    +GC    +G +F GA G++GLG    SF  +     N   FSYCL
Subjt:  DLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCL

Query:  VDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMS------FTRLFVGDPYSSFYGVHLIGISADGVML-NIPPRVWDIN--SGGGTIVDSGTSLTMLA
        VD                  SASS  SSV+  GN +      FT L       +FY V L+GIS  G  +  +   ++ ++    GG I+DSGTS+T L 
Subjt:  VDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMS------FTRLFVGDPYSSFYGVHLIGISADGVML-NIPPRVWDIN--SGGGTIVDSGTSLTMLA

Query:  APAFDMVMEALTPKLKQFE-IIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV---GEFISCIGFVSMPFPATNIIGNILQQNHLWQF
         PA+  + +A     K  +   +   F+ CF+ S      VP +  HFR G     PA +Y++ V   G+F  C  F        +IIGNI QQ     +
Subjt:  APAFDMVMEALTPKLKQFE-IIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV---GEFISCIGFVSMPFPATNIIGNILQQNHLWQF

Query:  DFFNGKVGFAPSEC
        D  + +VGFAP  C
Subjt:  DFFNGKVGFAPSEC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.0e-3227.72Show/hide
Query:  EKLHGDMKFQDLNHRLKDIHEHD------HKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTS--PTPIGLKMISGSDYGS
        E L   + F   +    ++H  D      HK Y+S+  +L+R   +    A   AK +   E    ++ +    E     T    TP+    +SG+  GS
Subjt:  EKLHGDMKFQDLNHRLKDIHEHD------HKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAEAEAEAAARESILPPTS--PTPIGLKMISGSDYGS

Query:  SEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLY
         EYF ++ VGTP +   L+ DTGSD+ W++C       +C  ++               F    SS++K++ CS+  C      L     C+  ++ CLY
Subjt:  SEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLY

Query:  DYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSP
          SY  G+   G  A +T+T     G   +++N  +GC    +G +F GA G++GLG    S T           FSYCLVD  S               
Subjt:  DYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSP

Query:  SASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEA---LTPKLKQFEIIEIQ
        S+S   +SV   G  +   L       +FY V L G S  G  + +P  ++D+++   GG I+D GT++T L   A++ + +A   LT  LK+     I 
Subjt:  SASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEA---LTPKLKQFEIIEIQ

Query:  PFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
         F+ C++ S  +   VP + FHF  G     PAK+Y++ V +    C  F      + +IIGN+ QQ     +D     +G + ++C
Subjt:  PFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

Q9LTW4 Aspartic proteinase NANA, chloroplast1.1e-9043.31Show/hide
Query:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC
        R S++     + +G+KM   SG DYG+++YF +++VGTP + F ++ DTGS+LTW+ CRYR                 GK   R  F A+ S SFKT+ C
Subjt:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC

Query:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG
         +  CK DL +LFS+  C TP++PC YDY Y+ G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F+GADGV+GL  S +SFT   A +  G
Subjt:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG

Query:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP
          FSYCLVDHLS+   ++Y I G    S+ S  ++   +  +  TR+        FY +++IGIS    ML+IP +VWD  SGGGTI+DSGTSLT+LA  
Subjt:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP

Query:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF
        A+  V+  L   L + + ++ +  P E+CF+  S +    +P+L FH + G  F+P  KSY+V     + C+GFVS   PATN+IGNI+QQN+LW+FD  
Subjt:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF

Query:  NGKVGFAPSEC
           + FAPS C
Subjt:  NGKVGFAPSEC

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein6.6e-5133.57Show/hide
Query:  PPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKC-RYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCK
        P  SP  +   + SG   GS EYF+ + VGTPP+ F LI DTGSDL W++C     C          K+                S+SFK I C+   C 
Subjt:  PPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKC-RYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCK

Query:  KDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLT----NGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGG
          +S      +C++    C Y Y Y   ++  G FA+ET TVNLT       E ++ N + GC    +G +F GA G++GLG    SF+    Q+  G  
Subjt:  KDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLT----NGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGG

Query:  FSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYS--SFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLA
        FSYCLVD  S+   +S  I G            +L   N++FT    G   S  +FY + +  I   G  L+IP   W+I+S   GGTI+DSGT+L+  A
Subjt:  FSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYS--SFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLA

Query:  APAFDMVMEALTPKLKQ-FEIIEIQP-FEFCFNNS--QYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQF
         PA++++      K+K+ + I    P  + CFN S  +  +  +P+LG  F DGTV+  PA++  + + E + C+  +  P    +IIGN  QQN    +
Subjt:  APAFDMVMEALTPKLKQ-FEIIEIQP-FEFCFNNS--QYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQF

Query:  DFFNGKVGFAPSEC
        D    ++GF P++C
Subjt:  DFFNGKVGFAPSEC

AT3G12700.1 Eukaryotic aspartyl protease family protein7.7e-9243.31Show/hide
Query:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC
        R S++     + +G+KM   SG DYG+++YF +++VGTP + F ++ DTGS+LTW+ CRYR                 GK   R  F A+ S SFKT+ C
Subjt:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC

Query:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG
         +  CK DL +LFS+  C TP++PC YDY Y+ G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F+GADGV+GL  S +SFT   A +  G
Subjt:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG

Query:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP
          FSYCLVDHLS+   ++Y I G    S+ S  ++   +  +  TR+        FY +++IGIS    ML+IP +VWD  SGGGTI+DSGTSLT+LA  
Subjt:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP

Query:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF
        A+  V+  L   L + + ++ +  P E+CF+  S +    +P+L FH + G  F+P  KSY+V     + C+GFVS   PATN+IGNI+QQN+LW+FD  
Subjt:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF

Query:  NGKVGFAPSEC
           + FAPS C
Subjt:  NGKVGFAPSEC

AT3G25700.1 Eukaryotic aspartyl protease family protein6.6e-5935.57Show/hide
Query:  MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCK-KDLSDLFSIGE
        ++SG+  GS +YFV L++G PPQ+ +LIADTGSDL W+KC   R   NC    SH S           F   +SS+F   HC   +C+     D   I  
Subjt:  MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCK-KDLSDLFSIGE

Query:  CQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGC-----TESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLS
             S C Y+Y Y+ G+   G+FA ET ++  ++GKE +L +   GC      +SV G  F GA+GV+GLG    SF  +  +   G  FSYCL+D+  
Subjt:  CQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGC-----TESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLS

Query:  DHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTMLAAPAFDMVMEALT
            TSY I+G      S           + FT L       +FY V L  +  +G  L I P +W+I  +  GGT+VDSGT+L  LA PA+  V+ A+ 
Subjt:  DHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTMLAAPAFDMVMEALT

Query:  PKLKQFEIIEIQP-FEFCFNNSQYT--HEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSM-PFPATNIIGNILQQNHLWQFDFFNGKVGFAPS
         ++K      + P F+ C N S  T   +++P+L F F  G VF PP ++Y +   E I C+   S+ P    ++IGN++QQ  L++FD    ++GF+  
Subjt:  PKLKQFEIIEIQP-FEFCFNNSQYT--HEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSM-PFPATNIIGNILQQNHLWQFDFFNGKVGFAPS

Query:  EC
         C
Subjt:  EC

AT3G59080.1 Eukaryotic aspartyl protease family protein1.7e-5134.59Show/hide
Query:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFL-ANYSSSFKTIHCSSTLCKKDLSDLFSIGECQ
        SG   GS EYF+ + VG+PP+ F LI DTGSDL W++C    C  +C  +               AF     S+S+K I C+   C   +S       C+
Subjt:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFL-ANYSSSFKTIHCSSTLCKKDLSDLFSIGECQ

Query:  TPTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT
        +    C Y Y Y   ++  G FA+ET TVNL TNG   +L+   N + GC    +G +F GA G++GLG    SF+    Q+  G  FSYCLVD  SD  
Subjt:  TPTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT

Query:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTP
         +S  I G            +L   N++FT    G  +   +FY V +  I   G +LNIP   W+I+S   GGTI+DSGT+L+  A PA++ +   +  
Subjt:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTP

Query:  KLK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
        K K ++ +    P  + CFN S   +  +P+LG  F DG V+  P ++  + + E + C+  +  P  A +IIGN  QQN    +D    ++G+AP++C
Subjt:  KLK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

AT3G59080.2 Eukaryotic aspartyl protease family protein4.4e-4732.66Show/hide
Query:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQT
        SG   GS EYF+ + VG+PP+ F LI DTGSDL W++C                                       + C          D F     Q 
Subjt:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQT

Query:  PTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTA
            C Y Y Y   ++  G FA+ET TVNL TNG   +L+   N + GC    +G +F GA G++GLG    SF+    Q+  G  FSYCLVD  SD   
Subjt:  PTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTA

Query:  TSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTPK
        +S  I G            +L   N++FT    G  +   +FY V +  I   G +LNIP   W+I+S   GGTI+DSGT+L+  A PA++ +   +  K
Subjt:  TSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTPK

Query:  LK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
         K ++ +    P  + CFN S   +  +P+LG  F DG V+  P ++  + + E + C+  +  P  A +IIGN  QQN    +D    ++G+AP++C
Subjt:  LK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTTTTCTTCTTCCTCTCTGTTCACATCGCATTCGGCGACTATGATCAAGAAACTGT
AAAACTCGATCTACTTCACCGTCACCATCCACAAGTCTCCGAGAAGCTTCACGGTGATATGAAATTTCAAGATCTAAATCATCGCCTCAAGGATATTCACGAACACGACC
ACAAACGTTATCAATCGATCTCCACGTCGTTGAATCGGAAGCCAATTGAGGAGAGGGCTGAGGCTGAGGCTAAGGCTAAGGCTAAGGCGGAGGCTGAGGCGGAGGCGGAG
GCTGAGGCTGAGGCTGCAGCGAGGGAATCGATACTTCCACCGACATCACCAACGCCGATAGGGCTGAAAATGATATCAGGTTCTGATTATGGAAGTAGTGAGTATTTTGT
TCAATTGAAAGTCGGAACGCCGCCGCAGACGTTTATGTTGATCGCCGATACTGGAAGTGATCTAACGTGGATGAAATGTAGATATCGGAGATGTATCGGAAATTGTAGCA
GCAAAGCGAGTCATAAGAGCCGAAACGAAGGAAAAATTAGATTTAGAAATGCATTTTTGGCGAATTATTCATCATCTTTTAAGACGATTCATTGCAGCTCGACGTTGTGT
AAGAAGGATCTTTCGGATCTGTTCTCAATTGGAGAATGCCAAACCCCAACCAGCCCTTGTCTATATGATTACAGCTACTCAGGAGGAGCAAGTGCAAAGGGAATATTCGC
AATTGAGACCCTAACCGTAAACCTAACAAATGGAAAAGAAAAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGCAGGATCTTCCGCGGAGCCGACG
GCGTCATTGGCTTAGGCACTAGCTCCTACTCTTTCACCTACAAAGCCGCCCAAAACGCCAACGGCGGCGGCTTCTCTTACTGCCTTGTCGACCATCTCAGCGACCACACC
GCCACCAGCTACTTCATCCTCGGCATCCCTTCCCCTTCTGCTTCCTCTGCCGCCTCCTCCGTCCTCCCTTCCGGCAACATGTCCTTCACCAGACTCTTCGTCGGCGACCC
TTACAGCAGCTTCTATGGCGTCCATCTCATCGGAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCTCGCGTTTGGGACATCAATTCTGGCGGCGGAACCATCGTCG
ACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTTTGATATGGTCATGGAAGCTCTGACTCCGAAGCTGAAGCAATTCGAGATAATTGAAATCCAACCCTTCGAA
TTTTGCTTCAATAACAGCCAGTACACTCATGAAATGGTCCCGAAGCTCGGATTCCATTTCCGCGACGGCACGGTGTTTCAGCCGCCGGCAAAAAGCTACATTGTTTCGGT
GGGTGAATTCATTAGCTGTATTGGGTTCGTTTCTATGCCTTTCCCGGCCACCAATATCATTGGGAATATTCTTCAGCAGAATCACCTTTGGCAATTTGATTTCTTTAACG
GAAAAGTCGGTTTTGCCCCCTCTGAATGCGTTTAA
mRNA sequenceShow/hide mRNA sequence
TGAGTTTGTTATATAAAACAGGAAAAAAAAAAAAAGAGGAAAATCCCCAAAATTAACTAATCTATTTTTATCAATTTTCAGTTAAGTTTCTTCTTCCTTCTTCCCTTCTC
CCCTTTTTAAGTCCGCCATTATCATCTTTCTTCTTGCTCCATCTCTCTAACATTACATTATCTGTTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCACATT
TTTGTTTCTTCTTCCTTTTCTTCTTCCTCTCTGTTCACATCGCATTCGGCGACTATGATCAAGAAACTGTAAAACTCGATCTACTTCACCGTCACCATCCACAAGTCTCC
GAGAAGCTTCACGGTGATATGAAATTTCAAGATCTAAATCATCGCCTCAAGGATATTCACGAACACGACCACAAACGTTATCAATCGATCTCCACGTCGTTGAATCGGAA
GCCAATTGAGGAGAGGGCTGAGGCTGAGGCTAAGGCTAAGGCTAAGGCGGAGGCTGAGGCGGAGGCGGAGGCTGAGGCTGAGGCTGCAGCGAGGGAATCGATACTTCCAC
CGACATCACCAACGCCGATAGGGCTGAAAATGATATCAGGTTCTGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTCGGAACGCCGCCGCAGACGTTTATGTTG
ATCGCCGATACTGGAAGTGATCTAACGTGGATGAAATGTAGATATCGGAGATGTATCGGAAATTGTAGCAGCAAAGCGAGTCATAAGAGCCGAAACGAAGGAAAAATTAG
ATTTAGAAATGCATTTTTGGCGAATTATTCATCATCTTTTAAGACGATTCATTGCAGCTCGACGTTGTGTAAGAAGGATCTTTCGGATCTGTTCTCAATTGGAGAATGCC
AAACCCCAACCAGCCCTTGTCTATATGATTACAGCTACTCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATTGAGACCCTAACCGTAAACCTAACAAATGGAAAAGAA
AAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGCAGGATCTTCCGCGGAGCCGACGGCGTCATTGGCTTAGGCACTAGCTCCTACTCTTTCACCTA
CAAAGCCGCCCAAAACGCCAACGGCGGCGGCTTCTCTTACTGCCTTGTCGACCATCTCAGCGACCACACCGCCACCAGCTACTTCATCCTCGGCATCCCTTCCCCTTCTG
CTTCCTCTGCCGCCTCCTCCGTCCTCCCTTCCGGCAACATGTCCTTCACCAGACTCTTCGTCGGCGACCCTTACAGCAGCTTCTATGGCGTCCATCTCATCGGAATCTCC
GCCGACGGCGTCATGCTCAACATTCCTCCTCGCGTTTGGGACATCAATTCTGGCGGCGGAACCATCGTCGACTCCGGAACTAGCCTCACCATGCTGGCGGCGCCGGCGTT
TGATATGGTCATGGAAGCTCTGACTCCGAAGCTGAAGCAATTCGAGATAATTGAAATCCAACCCTTCGAATTTTGCTTCAATAACAGCCAGTACACTCATGAAATGGTCC
CGAAGCTCGGATTCCATTTCCGCGACGGCACGGTGTTTCAGCCGCCGGCAAAAAGCTACATTGTTTCGGTGGGTGAATTCATTAGCTGTATTGGGTTCGTTTCTATGCCT
TTCCCGGCCACCAATATCATTGGGAATATTCTTCAGCAGAATCACCTTTGGCAATTTGATTTCTTTAACGGAAAAGTCGGTTTTGCCCCCTCTGAATGCGTTTAAAAACT
TCCTTCAATTTCTTCATCATCCTCTTCCTCCTCAATCATCTTAATTTTATATATAATAATTATTATTTTTCTTTTCTGTTTCTGTAACACCTGTTATTATTAATTATATA
TAATGTGATATTCTTTTTTTATCTAAGTGGGGTATTTTTGGTTTTTCCTCGAGTCTTGGTATGTTAAAAAAATTTGATGGATGTACAACTTTGTAATGGTTGAGTTCAAT
AATAATAATAAGCTCTATATATATATATATATATATAGATAATTT
Protein sequenceShow/hide protein sequence
MLGYRKPMSPISHFCFFFLFFFLSVHIAFGDYDQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAEAKAKAKAEAEAEAE
AEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLC
KKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT
ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFE
FCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV