CuGenDBv2

Sgr027844 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027844
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Aspartyl protease family protein
Genome location: tig00153055:3185856..3190003
RNA-Seq Expression: Sgr027844
Synteny: Sgr027844
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR033873 - CND41-like


Homology
GenBank top hits    e value    % identity    Alignment
KAG7024399.1 Aspartyl protease family protein, partial [Cucurbita argyrosperma subsp. argyrosperma]    5.2e-216    73.32
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S LFFA CFLFF+S AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                     AQ SG R Q SLE++HRHGPC G    ++ APT  E+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

XP_022936652.1 aspartyl protease family protein At5g10770-like isoform X1 [Cucurbita moschata]    1.4e-216    73.51
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S LFFA CFLFF+S AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                     AQ SG R Q SLE++HRHGPC G    ++ APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

XP_022936653.1 aspartyl protease family protein At5g10770-like isoform X2 [Cucurbita moschata]    1.3e-214    72.95
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S LFFA CFLFF+S AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                         G R Q SLE++HRHGPC G    ++ APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

XP_022977010.1 aspartyl protease family protein At5g10770-like isoform X1 [Cucurbita maxima]    8.3e-214    72.95
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S L FA CF FFHS AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                     AQ SG R Q SLE++HRHGPC       + APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQC PCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

XP_023535774.1 aspartyl protease family protein At5g10770-like [Cucurbita pepo subsp. pepo]    4.7e-217    73.69
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S LFFA CFLFFHS AGKV+P   H LTVE+A LLPSATC RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                     AQ SG R Q SLE++HRHGPC G    ++ APT AE+F++DQ RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCS AKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPSTVAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

TrEMBL top hits    e value    % identity    Alignment
A0A1S3BVJ7 aspartyl protease family protein At5g10770-like    1.5e-213    72.01
Query:  MAAL-SLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASV
        MA+L S+   FFA  FLFF SFAGK+   D H+LTVE+A L PSA+CTRRS                                                 
Subjt:  MAAL-SLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASV

Query:  WPIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVN
                      A +S + +Q SLE++HRHGPC       ++APT AE+F++DQ+RVDFI++ FAG   S  RLRPSKATK+PAKSGATIGSGNY VN
Subjt:  WPIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVN

Query:  VALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTL
        V LGTPKKYLSLIFDTGSDLTWTQC+PCARYCYNQKDPVFAPSQSTTYSNISCSS  CS+LESGTGN PGCSAA++CIYGIQYGDQSFSVGY AKETLTL
Subjt:  VALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTL

Query:  SPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        + +DVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYLTFGGGGGG  L+YTPITKAHGVANFYG+DIVGIKV G
Subjt:  SPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
        TQ+PISSSVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGMA YPKAPELSILDTCYDLSKY+++Q PKV  +FKG  ELDLDGTGI+YGASTSQ+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGNQDPSTVAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

A0A6J1F908 aspartyl protease family protein At5g10770-like isoform X1    6.6e-217    73.51
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S LFFA CFLFF+S AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                     AQ SG R Q SLE++HRHGPC G    ++ APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

A0A6J1FDU3 aspartyl protease family protein At5g10770-like isoform X2    6.2e-215    72.95
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S LFFA CFLFF+S AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                         G R Q SLE++HRHGPC G    ++ APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

A0A6J1III3 aspartyl protease family protein At5g10770-like isoform X2    2.2e-212    72.39
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S L FA CF FFHS AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                         G R Q SLE++HRHGPC       + APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQC PCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

A0A6J1IL31 aspartyl protease family protein At5g10770-like isoform X1    4.0e-214    72.95
Query:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW
        MA LS S L FA CF FFHS AGKV+P D H LTVE+A LLPSA C RRS                                                  
Subjt:  MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVW

Query:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV
                     AQ SG R Q SLE++HRHGPC       + APT AE+F++DQ+RVDFI++ F+G+F S  RLRPSKATK+PAKSGATIGSGNY VNV
Subjt:  PIKYELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNV

Query:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS
        ALGTPKKYLSLIFDTGSDLTWTQC PCARYCY+QKDPVFAPSQSTTYSNI+CSSP+CS+LESGTGN PGCSAAKSCIYGIQYGDQSFSVGY AKETLTL+
Subjt:  ALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLS

Query:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG
        P+DVI NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLP T SSTGYL FGG GGGG  L+YTPITKAHGVANFYGVDIVG+KV G
Subjt:  PSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGG-GGGGSRLQYTPITKAHGVANFYGVDIVGIKVSG

Query:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA
         Q+PIS+SVFSTSGAIIDSGTVITRLPP AY ALKSAFQKGM  YPKAPELSILDTCYDLSKYTSV+ PKV FLFKGG+ L+LDGTGILYGAST+Q+CLA
Subjt:  TQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLA

Query:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY
        FAGN DPS+VAIIGNVQQKT+QVVYDVGGGKIGFGY
Subjt:  FAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGY

SwissProt top hits    e value    % identity    Alignment
Q8S9J6 Aspartyl protease family protein At5g10770    1.1e-133    57.56
Query:  SLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQ
        SL + HRHG C+     KA++P   EI   DQ+RV+ I++  +    +T  +  SK+T LPAK G+T+GSGNY V V LGTPK  LSLIFDTGSDLTWTQ
Subjt:  SLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQ

Query:  CEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSA
        C+PC R CY+QK+P+F PS+ST+Y N+SCSS  C  L S TGNA  CSA+ +CIYGIQYGDQSFSVG+LAKE  TL+ SDV +   FGCG+NN+GLF   
Subjt:  CEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSA

Query:  AGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVIT
        AGL+GLG+DK+S   QTA  Y +IFSYCLP + S TG+LTFG  G    +++TPI+      +FYG++IV I V G +LPI S+VFST GA+IDSGTVIT
Subjt:  AGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVIT

Query:  RLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVV
        RLPP AY AL+S+F+  M+ YP    +SILDTC+DLS + +V  PKVAF F GG+ ++L   GI Y    SQ+CLAFAGN D S  AI GNVQQ+T++VV
Subjt:  RLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVV

Query:  YDVGGGKIGF
        YD  GG++GF
Subjt:  YDVGGGKIGF

Q9LEW3 Aspartyl protease AED1    1.5e-112    51.46
Query:  SLELVHRHGPCAGAPKAKASAPTD-AEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWT
        SL +VH HG C+      + A  D  EI  RDQ+RV+ I +  + N  S   +  +K+T+LPAKSG T+GSGNY V + +GTPK  LSL+FDTGSDLTWT
Subjt:  SLELVHRHGPCAGAPKAKASAPTD-AEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWT

Query:  QCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGS
        QCEPC   CY+QK+P F PS S+TY N+SCSSP+C + ES       CSA+ +C+Y I YGD+SF+ G+LAKE  TL+ SDV+E+  FGCG+NN+GLF  
Subjt:  QCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGS

Query:  AAGLIGLGQDKISIVKQTAQKYGQIFSYCLPP-TYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTV
         AGL+GLG  K+S+  QT   Y  IFSYCLP  T +STG+LTFG  G    +++TPI+      N YG+DI+GI V   +L I+ + FST GAIIDSGTV
Subjt:  AAGLIGLGQDKISIVKQTAQKYGQIFSYCLPP-TYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTV

Query:  ITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQ
         TRLP   Y  L+S F++ M++Y       + DTCYD +   +V +P +AF F G + ++LDG+GI      SQ+CLAFAGN D    AI GNVQQ T+ 
Subjt:  ITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQ

Query:  VVYDVGGGKIGF
        VVYDV GG++GF
Subjt:  VVYDVGGGKIGF

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2    8.4e-68    40.28
Query:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQS
        SG   GSG YFV + +G+P +   ++ D+GSD+ W QC+PC + CY Q DPVF P++S +Y+ +SC S VC  +E+      GC +   C Y + YGD S
Subjt:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQS

Query:  FSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPT-YSSTGYLTFGGGGGGSRLQYTPITKAHGVA
        ++ G LA ETLT + + V+ N   GCG  NRG+F  AAGL+G+G   +S V Q + + G  F YCL      STG L FG         + P+ +     
Subjt:  FSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPT-YSSTGYLTFGGGGGGSRLQYTPITKAHGVA

Query:  NFYGVDIVGIKVSGTQLPISSSVFSTS-----GAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSEL
        +FY V + G+ V G ++P+   VF  +     G ++D+GT +TRLP AAY A +  F+   A  P+A  +SI DTCYDLS + SV+ P V+F F  G  L
Subjt:  NFYGVDIVGIKVSGTQLPISSSVFSTS-----GAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSEL

Query:  DLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFG
         L     L     S   C AFA +  P+ ++IIGN+QQ+ +QV +D   G +GFG
Subjt:  DLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFG

Q9LNJ3 Aspartyl protease family protein 2    8.1e-71    41.46
Query:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSA-AKSCIYGIQYGDQ
        SG + GSG YF  + +GTP +Y+ ++ DTGSD+ W QC PC R CY+Q DP+F P +S TY+ I CSSP C  L+S      GC+   K+C+Y + YGD 
Subjt:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSA-AKSCIYGIQYGDQ

Query:  SFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSST--GYLTFGGGGGGSRLQYTPITKAHG
        SF+VG  + ETLT   + V +    GCG +N GLF  AAGL+GLG+ K+S   QT  ++ Q FSYCL    +S+    + FG        ++TP+     
Subjt:  SFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSST--GYLTFGGGGGGSRLQYTPITKAHG

Query:  VANFYGVDIVGIKVSGTQLP-ISSSVF-----STSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGG
        +  FY V ++GI V GT++P +++S+F        G IIDSGT +TRL   AY A++ AF+ G     +AP+ S+ DTC+DLS    V+ P V   F+ G
Subjt:  VANFYGVDIVGIKVSGTQLP-ISSSVF-----STSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGG

Query:  SELDLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGF
        +++ L  T  L    T+ + C AFAG      ++IIGN+QQ+  +VVYD+   ++GF
Subjt:  SELDLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGF

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1    5.1e-65    41.39
Query:  PAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYG
        P  SGA+ GSG YF  + +GTP K + L+ DTGSD+ W QCEPCA  CY Q DPVF P+ S+TY +++CS+P CS LE+       C + K C+Y + YG
Subjt:  PAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYG

Query:  DQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCL----PPTYSSTGYLTFGGGGGGSRLQYTPIT
        D SF+VG LA +T+T   S  I N   GCG +N GLF  AAGL+GLG   +SI   T Q     FSYCL        SS  + +   GGG +     P+ 
Subjt:  DQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCL----PPTYSSTGYLTFGGGGGGSRLQYTPIT

Query:  KAHGVANFYGVDIVGIKVSGTQLPISSSVF-----STSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPK-APELSILDTCYDLSKYTSVQFPKVAFL
        +   +  FY V + G  V G ++ +  ++F      + G I+D GT +TRL   AY++L+ AF K      K +  +S+ DTCYD S  ++V+ P VAF 
Subjt:  KAHGVANFYGVDIVGIKVSGTQLPISSSVF-----STSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPK-APELSILDTCYDLSKYTSVQFPKVAFL

Query:  FKGGSELDLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIG
        F GG  LDL     L     S   C AFA     S+++IIGNVQQ+  ++ YD+    IG
Subjt:  FKGGSELDLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIG

Arabidopsis top hits    e value    % identity    Alignment
AT1G01300.1 Eukaryotic aspartyl protease family protein    5.7e-72    41.46
Query:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSA-AKSCIYGIQYGDQ
        SG + GSG YF  + +GTP +Y+ ++ DTGSD+ W QC PC R CY+Q DP+F P +S TY+ I CSSP C  L+S      GC+   K+C+Y + YGD 
Subjt:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSA-AKSCIYGIQYGDQ

Query:  SFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSST--GYLTFGGGGGGSRLQYTPITKAHG
        SF+VG  + ETLT   + V +    GCG +N GLF  AAGL+GLG+ K+S   QT  ++ Q FSYCL    +S+    + FG        ++TP+     
Subjt:  SFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSST--GYLTFGGGGGGSRLQYTPITKAHG

Query:  VANFYGVDIVGIKVSGTQLP-ISSSVF-----STSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGG
        +  FY V ++GI V GT++P +++S+F        G IIDSGT +TRL   AY A++ AF+ G     +AP+ S+ DTC+DLS    V+ P V   F+ G
Subjt:  VANFYGVDIVGIKVSGTQLP-ISSSVF-----STSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGG

Query:  SELDLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGF
        +++ L  T  L    T+ + C AFAG      ++IIGN+QQ+  +VVYD+   ++GF
Subjt:  SELDLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGF

AT1G79720.1 Eukaryotic aspartyl protease family protein    1.3e-79    38.53
Query:  SVWPIK--YELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGN
        ++W  K  YE S+  +  +   G R+  +LE+ HR   C+G  K            + D  RV  +        SST     S+ T++P  SG  + S N
Subjt:  SVWPIK--YELSSFLWGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGN

Query:  YFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKS-----CIYGIQYGDQSFSVG
        Y V V LG   K +SLI DTGSDLTW QC+PC R CYNQ+ P++ PS S++Y  + C+S  C +L + T N+  C          C Y + YGD S++ G
Subjt:  YFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKS-----CIYGIQYGDQSFSVG

Query:  YLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTY-SSTGYLTFGGGG----GGSRLQYTPITKAHGVA
         LA E++ L  +  +ENF+FGCG+NN+GLFG ++GL+GLG+  +S+V QT + +  +FSYCLP     ++G L+FG         + + YTP+ +   + 
Subjt:  YLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTY-SSTGYLTFGGGG----GGSRLQYTPITKAHGVA

Query:  NFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGT
        +FY +++ G  + G +L  SS      G +IDSGTVITRLPP+ Y A+K  F K  + +P AP  SILDTC++L+ Y  +  P +  +F+G +EL++D T
Subjt:  NFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGT

Query:  GILY--GASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIG
        G+ Y      S +CLA A     + V IIGN QQK  +V+YD    ++G
Subjt:  GILY--GASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIG

AT3G20015.1 Eukaryotic aspartyl protease family protein    6.0e-69    40.28
Query:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQS
        SG   GSG YFV + +G+P +   ++ D+GSD+ W QC+PC + CY Q DPVF P++S +Y+ +SC S VC  +E+      GC +   C Y + YGD S
Subjt:  SGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQS

Query:  FSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPT-YSSTGYLTFGGGGGGSRLQYTPITKAHGVA
        ++ G LA ETLT + + V+ N   GCG  NRG+F  AAGL+G+G   +S V Q + + G  F YCL      STG L FG         + P+ +     
Subjt:  FSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPPT-YSSTGYLTFGGGGGGSRLQYTPITKAHGVA

Query:  NFYGVDIVGIKVSGTQLPISSSVFSTS-----GAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSEL
        +FY V + G+ V G ++P+   VF  +     G ++D+GT +TRLP AAY A +  F+   A  P+A  +SI DTCYDLS + SV+ P V+F F  G  L
Subjt:  NFYGVDIVGIKVSGTQLPISSSVFSTS-----GAIIDSGTVITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSEL

Query:  DLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFG
         L     L     S   C AFA +  P+ ++IIGN+QQ+ +QV +D   G +GFG
Subjt:  DLDGTGILYGASTS-QICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFG

AT5G10760.1 Eukaryotic aspartyl protease family protein    1.0e-113    51.46
Query:  SLELVHRHGPCAGAPKAKASAPTD-AEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWT
        SL +VH HG C+      + A  D  EI  RDQ+RV+ I +  + N  S   +  +K+T+LPAKSG T+GSGNY V + +GTPK  LSL+FDTGSDLTWT
Subjt:  SLELVHRHGPCAGAPKAKASAPTD-AEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWT

Query:  QCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGS
        QCEPC   CY+QK+P F PS S+TY N+SCSSP+C + ES       CSA+ +C+Y I YGD+SF+ G+LAKE  TL+ SDV+E+  FGCG+NN+GLF  
Subjt:  QCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGS

Query:  AAGLIGLGQDKISIVKQTAQKYGQIFSYCLPP-TYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTV
         AGL+GLG  K+S+  QT   Y  IFSYCLP  T +STG+LTFG  G    +++TPI+      N YG+DI+GI V   +L I+ + FST GAIIDSGTV
Subjt:  AAGLIGLGQDKISIVKQTAQKYGQIFSYCLPP-TYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTV

Query:  ITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQ
         TRLP   Y  L+S F++ M++Y       + DTCYD +   +V +P +AF F G + ++LDG+GI      SQ+CLAFAGN D    AI GNVQQ T+ 
Subjt:  ITRLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQ

Query:  VVYDVGGGKIGF
        VVYDV GG++GF
Subjt:  VVYDVGGGKIGF

AT5G10770.1 Eukaryotic aspartyl protease family protein    8.2e-135    57.56
Query:  SLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQ
        SL + HRHG C+     KA++P   EI   DQ+RV+ I++  +    +T  +  SK+T LPAK G+T+GSGNY V V LGTPK  LSLIFDTGSDLTWTQ
Subjt:  SLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLTWTQ

Query:  CEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSA
        C+PC R CY+QK+P+F PS+ST+Y N+SCSS  C  L S TGNA  CSA+ +CIYGIQYGDQSFSVG+LAKE  TL+ SDV +   FGCG+NN+GLF   
Subjt:  CEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSA

Query:  AGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVIT
        AGL+GLG+DK+S   QTA  Y +IFSYCLP + S TG+LTFG  G    +++TPI+      +FYG++IV I V G +LPI S+VFST GA+IDSGTVIT
Subjt:  AGLIGLGQDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVIT

Query:  RLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVV
        RLPP AY AL+S+F+  M+ YP    +SILDTC+DLS + +V  PKVAF F GG+ ++L   GI Y    SQ+CLAFAGN D S  AI GNVQQ+T++VV
Subjt:  RLPPAAYDALKSAFQKGMAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVV

Query:  YDVGGGKIGF
        YD  GG++GF
Subjt:  YDVGGGKIGF
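
The % identity reported for each hit can be related to the alignment blocks. In BLAST-style output, the unlabeled middle line of each block carries a letter where the two residues are identical, a "+" where the substitution is conservative, and a space otherwise. The following is a minimal sketch under that assumption; the helper name midline_identity is hypothetical and not part of this database. The reported value is computed over the whole alignment, so a single block will generally give a different number.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity for one alignment block.

    midline        -- the unlabeled middle line of a BLAST-style block
    aligned_length -- number of alignment columns (gaps included)
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical residues; '+' and spaces are not identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Last block of the AT5G10760.1 alignment shown above.
query = "VVYDVGGGKIGF"
midline = "VVYDV GG++GF"
print(f"{midline_identity(midline, len(query)):.1f}% identity in this block")
```

To approximate the reported figure, sum the identities and the column counts over all blocks of a hit before dividing.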


Sequences
CDS sequence
ATGGCGGCTCTCTCTCTCTCCAAACTCTTCTTCGCACTCTGCTTCCTTTTCTTCCACTCCTTCGCCGGAAAAGTCACTCCGGACGACGGCCACCACCTCACCGTCGAAAT
CGCCGCCCTGCTGCCGTCCGCCACCTGCACCCGTCGCTCCACCCTAGGTCCCTCTCTCTCTCTCTCTCACTCGAATTTTGCAGAGCTCCCTGAAGCCGCCGCTATAAATG
CGTTTTCAATTTTCAAGACCATCAACTCTTCCACGTGGGCCTCCCGGAGGCCCACTATGAATACAACCGCTTCGGTTTGGCCCATAAAATACGAATTATCGTCGTTCTTG
TGGGGGGCAGCCCAAAGTTCAGGTCTTAGAAAGCAGTTATCTCTCGAGCTGGTTCACCGGCATGGCCCGTGCGCTGGAGCCCCGAAAGCCAAAGCAAGTGCTCCGACCGA
CGCCGAAATCTTTCTCCGTGATCAGTCTCGGGTTGATTTCATCAATGCGCATTTCGCCGGGAACTTCAGTTCCACTCAGCGTCTACGGCCCTCAAAGGCCACCAAACTTC
CGGCCAAGTCCGGCGCCACCATCGGCTCCGGCAATTACTTCGTGAACGTCGCCCTCGGCACGCCGAAGAAGTACCTCTCGCTCATATTTGACACCGGCAGCGATCTGACT
TGGACGCAGTGTGAGCCCTGCGCCAGATATTGCTACAACCAAAAGGATCCGGTGTTCGCTCCGTCGCAATCCACCACATATTCCAACATCTCTTGTTCCTCGCCGGTCTG
CTCTGAGCTTGAATCCGGCACTGGGAACGCGCCTGGTTGCTCCGCCGCGAAGTCGTGCATTTATGGAATACAGTATGGCGATCAATCTTTCTCCGTCGGATATCTTGCCA
AAGAAACGCTAACCTTGTCGCCGTCTGACGTGATCGAGAACTTTCTGTTTGGTTGCGGCCAAAACAACCGTGGACTCTTCGGTAGCGCCGCCGGTCTCATTGGTCTCGGC
CAGGACAAAATCTCAATCGTTAAACAGACGGCACAAAAGTACGGCCAGATCTTCTCTTACTGTCTGCCTCCGACGTACAGTTCAACCGGCTACCTGACGTTTGGCGGCGG
CGGCGGCGGCAGCAGACTGCAGTACACACCAATTACAAAAGCACACGGTGTGGCCAATTTCTACGGCGTTGATATTGTCGGCATAAAGGTCAGCGGAACTCAGTTACCGA
TTTCGTCCTCAGTCTTTTCAACCTCCGGCGCGATCATTGATTCTGGCACGGTAATTACACGGCTGCCGCCGGCGGCGTACGACGCCTTGAAATCGGCGTTTCAGAAAGGA
ATGGCGGCGTATCCAAAAGCGCCGGAGCTGTCGATCCTCGATACGTGTTACGATCTGAGCAAGTACACCTCCGTACAGTTCCCGAAAGTGGCCTTTCTTTTCAAAGGTGG
ATCGGAGCTTGATCTCGACGGCACGGGGATATTGTACGGAGCATCGACGTCGCAAATTTGTTTGGCGTTCGCCGGAAATCAGGATCCGAGCACCGTCGCCATTATAGGGA
ATGTGCAGCAGAAGACTGTGCAGGTGGTCTACGATGTCGGTGGAGGGAAGATTGGGTTTGGCTACAAAGATAATTTCAATGATAACTTGAAAGATTATGAACTAGAATCA
AACTATAGTAGTGCTCTTCATGTTGAAGTTGAAGCTGTGTGGGAAGGTATCAAGTTCGCATTGGAGTTAGATTTCGAAGAGTTGGTTCTACACTAA
mRNA sequence
ATGGCGGCTCTCTCTCTCTCCAAACTCTTCTTCGCACTCTGCTTCCTTTTCTTCCACTCCTTCGCCGGAAAAGTCACTCCGGACGACGGCCACCACCTCACCGTCGAAAT
CGCCGCCCTGCTGCCGTCCGCCACCTGCACCCGTCGCTCCACCCTAGGTCCCTCTCTCTCTCTCTCTCACTCGAATTTTGCAGAGCTCCCTGAAGCCGCCGCTATAAATG
CGTTTTCAATTTTCAAGACCATCAACTCTTCCACGTGGGCCTCCCGGAGGCCCACTATGAATACAACCGCTTCGGTTTGGCCCATAAAATACGAATTATCGTCGTTCTTG
TGGGGGGCAGCCCAAAGTTCAGGTCTTAGAAAGCAGTTATCTCTCGAGCTGGTTCACCGGCATGGCCCGTGCGCTGGAGCCCCGAAAGCCAAAGCAAGTGCTCCGACCGA
CGCCGAAATCTTTCTCCGTGATCAGTCTCGGGTTGATTTCATCAATGCGCATTTCGCCGGGAACTTCAGTTCCACTCAGCGTCTACGGCCCTCAAAGGCCACCAAACTTC
CGGCCAAGTCCGGCGCCACCATCGGCTCCGGCAATTACTTCGTGAACGTCGCCCTCGGCACGCCGAAGAAGTACCTCTCGCTCATATTTGACACCGGCAGCGATCTGACT
TGGACGCAGTGTGAGCCCTGCGCCAGATATTGCTACAACCAAAAGGATCCGGTGTTCGCTCCGTCGCAATCCACCACATATTCCAACATCTCTTGTTCCTCGCCGGTCTG
CTCTGAGCTTGAATCCGGCACTGGGAACGCGCCTGGTTGCTCCGCCGCGAAGTCGTGCATTTATGGAATACAGTATGGCGATCAATCTTTCTCCGTCGGATATCTTGCCA
AAGAAACGCTAACCTTGTCGCCGTCTGACGTGATCGAGAACTTTCTGTTTGGTTGCGGCCAAAACAACCGTGGACTCTTCGGTAGCGCCGCCGGTCTCATTGGTCTCGGC
CAGGACAAAATCTCAATCGTTAAACAGACGGCACAAAAGTACGGCCAGATCTTCTCTTACTGTCTGCCTCCGACGTACAGTTCAACCGGCTACCTGACGTTTGGCGGCGG
CGGCGGCGGCAGCAGACTGCAGTACACACCAATTACAAAAGCACACGGTGTGGCCAATTTCTACGGCGTTGATATTGTCGGCATAAAGGTCAGCGGAACTCAGTTACCGA
TTTCGTCCTCAGTCTTTTCAACCTCCGGCGCGATCATTGATTCTGGCACGGTAATTACACGGCTGCCGCCGGCGGCGTACGACGCCTTGAAATCGGCGTTTCAGAAAGGA
ATGGCGGCGTATCCAAAAGCGCCGGAGCTGTCGATCCTCGATACGTGTTACGATCTGAGCAAGTACACCTCCGTACAGTTCCCGAAAGTGGCCTTTCTTTTCAAAGGTGG
ATCGGAGCTTGATCTCGACGGCACGGGGATATTGTACGGAGCATCGACGTCGCAAATTTGTTTGGCGTTCGCCGGAAATCAGGATCCGAGCACCGTCGCCATTATAGGGA
ATGTGCAGCAGAAGACTGTGCAGGTGGTCTACGATGTCGGTGGAGGGAAGATTGGGTTTGGCTACAAAGATAATTTCAATGATAACTTGAAAGATTATGAACTAGAATCA
AACTATAGTAGTGCTCTTCATGTTGAAGTTGAAGCTGTGTGGGAAGGTATCAAGTTCGCATTGGAGTTAGATTTCGAAGAGTTGGTTCTACACTAA
Protein sequence
MAALSLSKLFFALCFLFFHSFAGKVTPDDGHHLTVEIAALLPSATCTRRSTLGPSLSLSHSNFAELPEAAAINAFSIFKTINSSTWASRRPTMNTTASVWPIKYELSSFL
WGAAQSSGLRKQLSLELVHRHGPCAGAPKAKASAPTDAEIFLRDQSRVDFINAHFAGNFSSTQRLRPSKATKLPAKSGATIGSGNYFVNVALGTPKKYLSLIFDTGSDLT
WTQCEPCARYCYNQKDPVFAPSQSTTYSNISCSSPVCSELESGTGNAPGCSAAKSCIYGIQYGDQSFSVGYLAKETLTLSPSDVIENFLFGCGQNNRGLFGSAAGLIGLG
QDKISIVKQTAQKYGQIFSYCLPPTYSSTGYLTFGGGGGGSRLQYTPITKAHGVANFYGVDIVGIKVSGTQLPISSSVFSTSGAIIDSGTVITRLPPAAYDALKSAFQKG
MAAYPKAPELSILDTCYDLSKYTSVQFPKVAFLFKGGSELDLDGTGILYGASTSQICLAFAGNQDPSTVAIIGNVQQKTVQVVYDVGGGKIGFGYKDNFNDNLKDYELES
NYSSALHVEVEAVWEGIKFALELDFEELVLH
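
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene, the function below translates a DNA string into one-letter amino acid codes. The name translate and the handling of unknown codons are illustrative choices, not the database's own pipeline.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First 27 and last 9 nucleotides of the CDS listed above.
print(translate("ATGGCGGCTCTCTCTCTCTCCAAACTC"))  # start of the protein
print(translate("CTACACTAA"))                    # final residues plus stop
```

Run on the full CDS, the output should match the listed protein followed by a single "*" for the terminal TAA stop codon.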