CuGenDBv2

Sed0014241 (gene) of Chayote v1 genome

Gene ID: Sed0014241
Organism: Sechium edule (Chayote v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: LG01:12443507..12445677
RNA-Seq Expression: Sed0014241
Synteny: Sed0014241
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001461 - Aspartic peptidase A1 family
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR033121 - Peptidase family A1 domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584369.1 Protein ASPARTIC PROTEASE IN GUARD CELL 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-234; % identity: 83.33)
Query:  MASANLLPLFAFAVFFF----FSLCRSSPELSR--------SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKD
        MA+ANLLPLF+FA+FFF    F+LCR+SPELS         S +LDVSASLKQA ++LKFDP+   S+QQQ+ ++P NSSSSFSLQLH RD+L+NAGHKD
Subjt:  MASANLLPLFAFAVFFF----FSLCRSSPELSR--------SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKD

Query:  YKSLVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT
        YKSLVLSRLDRDSSRVKSLNDRL F+LS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT
Subjt:  YKSLVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT

Query:  DPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR
        DPIFDPR SSSF+SLPC+S QCQ LE SGCRA KCLYQV+YGDGSFTVGEFVTE+L+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+
Subjt:  DPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR

Query:  ASSFSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVR
        ASSFSYCLVDRDS SSS LEFNSA PSDSV A LLRSGRV+TFYYV L G+SVGG+ LSIPP LFQMDD+G GGIIVDSGTAITRLQT  YN+LRDAFV 
Subjt:  ASSFSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVR

Query:  LTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
         TPYL++TNGFALFDTCYDLSSQSRVTIPT+SFQF+GG+SL LPPKNYLIPVDS GTFC AFAPTTSSLSIIGNVQQQGTRV+FDLANS+VGFSPNKC
Subjt:  LTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

XP_022137431.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Momordica charantia] (e value: 1.8e-238; % identity: 87.92)
Query:  LFAFAVFFFFSLCRSSPELSR--SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKS
        +F F++F  F+ CRSSPELSR  S  LDVSASLKQA ++LKFDP+   S+QQQE L+PAN SSSFSLQLHPRD+LRNAGHKDYKSLVLSRLDRDSSRVKS
Subjt:  LFAFAVFFFFSLCRSSPELSR--SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKS

Query:  LNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCD
        LNDRL+FALS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSR+GVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQ+DPIFDPR SSSF+SLPC+
Subjt:  LNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCD

Query:  SQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSST
        SQQCQ LEMSGCRA KCLYQVSYGDGSFTVGEFVTETL+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASSFSYCLVDRDSGSSS+
Subjt:  SQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSST

Query:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCY
        L+FNS  PSDSVTA LLRSGRV+TFYYV LTG+SVGG+ LSIPP LFQMDDSG GGIIVDSGTAITRLQT  YNSLRDAFVRLTPYL+KTNGFALFDTCY
Subjt:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCY

Query:  DLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        DLSSQSRVTIPTVSF FSGG+SL+LPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANS+VGFSPNKC
Subjt:  DLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

XP_022924034.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita moschata] (e value: 1.3e-233; % identity: 83.6)
Query:  MASANLLPLFAFAVFFF----FSLCRSSPELSR----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSL
        MA+ANLLPLF+FA FFF    F+LCR+SPELS     S +LDVSASLKQA ++LKFDP+   S+QQQ+ ++P NSSSSFSLQLH RD+L+NAGH+DYKSL
Subjt:  MASANLLPLFAFAVFFF----FSLCRSSPELSR----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSL

Query:  VLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIF
        VLSRLDRDSSRVKSLNDRL F+LS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSRIGVG+PAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIF
Subjt:  VLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIF

Query:  DPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSF
        DPR SSSF+SLPC+S QCQ LE SGCRA KCLYQV+YGDGSFTVGEFVTE+L+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASSF
Subjt:  DPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSF

Query:  SYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPY
        SYCLVDRDS SSS LEFNSA PSDSV A LLRSGRV+TFYYV L G+SVGG+ LSIPP LFQMDD+G GGIIVDSGTAITRLQT  YN+LRDAFV  TPY
Subjt:  SYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPY

Query:  LKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        L++TNGFALFDTCYDLSSQSRVTIPT+SFQF+GG+SL LPPKNYLIPVDS GTFC AFAPTTSSLSIIGNVQQQGTRV+FDLANS+VGFSPNKC
Subjt:  LKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

XP_023000890.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita maxima] (e value: 2.1e-231; % identity: 82.83)
Query:  MASANLLPLFAFAVFFF----FSLCRSSPELSR-----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKS
        MA+ANL PLF+FA+FFF    F+LCR+SPELS      S +LDVSASLKQA ++LKFDP+   S+QQQ  ++PANSSSSFSLQLH RD+L+NAGHKDYKS
Subjt:  MASANLLPLFAFAVFFF----FSLCRSSPELSR-----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKS

Query:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI
        LVLSRLDRDSSRVKSLNDRL F+LS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI
Subjt:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI

Query:  FDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASS
        FDPR SSSF+SL C+S QCQ LE SGCRA KCLYQV+YGDGSFTVGEFVTE+L+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSL SQM+ASS
Subjt:  FDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASS

Query:  FSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTP
        FSYCLVDRDS SSS LEFNSA PSD+V A LLRSGRV+TFYYV L G+SVGG+ LSIPP LFQMDD+G GGIIVDSGTAITRLQT  YN+LRDAFV  TP
Subjt:  FSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTP

Query:  YLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        YL++TNG ALFDTCYDLSSQSRVTIPT+SFQF+GG+SL LPPKNYLIPVDS GTFC AFAPTTSSLSIIGNVQQQGTRV+FDLANS++GFSPNKC
Subjt:  YLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

XP_023519500.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita pepo subsp. pepo] (e value: 4.6e-234; % identity: 83.43)
Query:  MASANLLPLFAFAVFFF----FSLCRSSPELSR-----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKS
        MA+ANLLPLF+FA+FFF    F+LCR+SPELS      S +LDVSASLKQA ++LKFDP+   S+QQQ+ ++P NSSSSFSLQLH RD+L+NAGHKDYKS
Subjt:  MASANLLPLFAFAVFFF----FSLCRSSPELSR-----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKS

Query:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI
        LVLSRLDRDSSRVKSLNDRL F+LS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSRIGVG+PAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI
Subjt:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI

Query:  FDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASS
        FDPR SSSF+SLPC+S QCQ LE SGCRA KCLYQV+YGDGSFTVGEFVTE+L+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASS
Subjt:  FDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASS

Query:  FSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTP
        FSYCLVDRDS SSS LEFNSA PSDSV A LL+SGRV+TFYYV L G+SVGG+ LSIPP LFQMDD+G GGIIVDSGTAITRLQT  YN+LRDAFV  TP
Subjt:  FSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTP

Query:  YLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        YL++TNGFALFDTCYDLSSQSRVTIPT+SFQF+GG SL LPPKNYLIPVDS GTFC AFAPTTSSLSIIGNVQQQGTRV+FDLANS+VGFSPNKC
Subjt:  YLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C5F6 protein ASPARTIC PROTEASE IN GUARD CELL 1-like (e value: 4.3e-230; % identity: 84.25)
Query:  MASANLLPLFAFAVFFF-FSLCRSSPELS--RSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRL
        MA++NLL LF F    F F+L RSS  LS   S  LDVSASL+QA ++LKFDP  S S+QQQ  L+P+NSS SFSLQLHPRDSL NAGHKDYKSLVLSRL
Subjt:  MASANLLPLFAFAVFFF-FSLCRSSPELS--RSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRL

Query:  DRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRAS
         RDSSRVKS+ DRL FALSELKRSDL+PL+TEILPEDLSTPIVSG SQGSGEYFSR+GVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPR+S
Subjt:  DRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRAS

Query:  SSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLV
        SSF+SLPC+SQQCQ LE SGCRA KCLYQVSYGDGSFTVGEFVTETL FGNSG I+NVA+GCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASSFSYCLV
Subjt:  SSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLV

Query:  DRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTN
        DRDS SSS LEFNSA P+DSV A LL+SGRV+TFYYV LTGMSVGG+ LSIPP LFQMDDSG GGIIVDSGTAITRLQT  YN+LRDAFV  TPYLKKTN
Subjt:  DRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTN

Query:  GFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        GFALFDTCYDLSSQSRVTIPTVSF+F+GG+SL LPPKNYLIPVDS GTFCFAFAPTTSSLSIIGNVQQQGTRV++DLANSVVGFSP+KC
Subjt:  GFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

A0A5D3BI72 Protein ASPARTIC PROTEASE IN GUARD CELL 1-like (e value: 4.3e-230; % identity: 84.25)
Query:  MASANLLPLFAFAVFFF-FSLCRSSPELS--RSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRL
        MA++NLL LF F    F F+L RSS  LS   S  LDVSASL+QA ++LKFDP  S S+QQQ  L+P+NSS SFSLQLHPRDSL NAGHKDYKSLVLSRL
Subjt:  MASANLLPLFAFAVFFF-FSLCRSSPELS--RSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRL

Query:  DRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRAS
         RDSSRVKS+ DRL FALSELKRSDL+PL+TEILPEDLSTPIVSG SQGSGEYFSR+GVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPR+S
Subjt:  DRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRAS

Query:  SSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLV
        SSF+SLPC+SQQCQ LE SGCRA KCLYQVSYGDGSFTVGEFVTETL FGNSG I+NVA+GCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASSFSYCLV
Subjt:  SSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLV

Query:  DRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTN
        DRDS SSS LEFNSA P+DSV A LL+SGRV+TFYYV LTGMSVGG+ LSIPP LFQMDDSG GGIIVDSGTAITRLQT  YN+LRDAFV  TPYLKKTN
Subjt:  DRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTN

Query:  GFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        GFALFDTCYDLSSQSRVTIPTVSF+F+GG+SL LPPKNYLIPVDS GTFCFAFAPTTSSLSIIGNVQQQGTRV++DLANSVVGFSP+KC
Subjt:  GFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

A0A6J1C6K9 protein ASPARTIC PROTEASE IN GUARD CELL 1-like (e value: 8.7e-239; % identity: 87.92)
Query:  LFAFAVFFFFSLCRSSPELSR--SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKS
        +F F++F  F+ CRSSPELSR  S  LDVSASLKQA ++LKFDP+   S+QQQE L+PAN SSSFSLQLHPRD+LRNAGHKDYKSLVLSRLDRDSSRVKS
Subjt:  LFAFAVFFFFSLCRSSPELSR--SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKS

Query:  LNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCD
        LNDRL+FALS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSR+GVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQ+DPIFDPR SSSF+SLPC+
Subjt:  LNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCD

Query:  SQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSST
        SQQCQ LEMSGCRA KCLYQVSYGDGSFTVGEFVTETL+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASSFSYCLVDRDSGSSS+
Subjt:  SQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSST

Query:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCY
        L+FNS  PSDSVTA LLRSGRV+TFYYV LTG+SVGG+ LSIPP LFQMDDSG GGIIVDSGTAITRLQT  YNSLRDAFVRLTPYL+KTNGFALFDTCY
Subjt:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCY

Query:  DLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        DLSSQSRVTIPTVSF FSGG+SL+LPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANS+VGFSPNKC
Subjt:  DLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

A0A6J1E815 protein ASPARTIC PROTEASE IN GUARD CELL 1-like (e value: 6.5e-234; % identity: 83.6)
Query:  MASANLLPLFAFAVFFF----FSLCRSSPELSR----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSL
        MA+ANLLPLF+FA FFF    F+LCR+SPELS     S +LDVSASLKQA ++LKFDP+   S+QQQ+ ++P NSSSSFSLQLH RD+L+NAGH+DYKSL
Subjt:  MASANLLPLFAFAVFFF----FSLCRSSPELSR----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSL

Query:  VLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIF
        VLSRLDRDSSRVKSLNDRL F+LS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSRIGVG+PAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIF
Subjt:  VLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIF

Query:  DPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSF
        DPR SSSF+SLPC+S QCQ LE SGCRA KCLYQV+YGDGSFTVGEFVTE+L+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM+ASSF
Subjt:  DPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSF

Query:  SYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPY
        SYCLVDRDS SSS LEFNSA PSDSV A LLRSGRV+TFYYV L G+SVGG+ LSIPP LFQMDD+G GGIIVDSGTAITRLQT  YN+LRDAFV  TPY
Subjt:  SYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPY

Query:  LKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        L++TNGFALFDTCYDLSSQSRVTIPT+SFQF+GG+SL LPPKNYLIPVDS GTFC AFAPTTSSLSIIGNVQQQGTRV+FDLANS+VGFSPNKC
Subjt:  LKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

A0A6J1KJJ8 protein ASPARTIC PROTEASE IN GUARD CELL 1-like (e value: 1.0e-231; % identity: 82.83)
Query:  MASANLLPLFAFAVFFF----FSLCRSSPELSR-----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKS
        MA+ANL PLF+FA+FFF    F+LCR+SPELS      S +LDVSASLKQA ++LKFDP+   S+QQQ  ++PANSSSSFSLQLH RD+L+NAGHKDYKS
Subjt:  MASANLLPLFAFAVFFF----FSLCRSSPELSR-----SGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKS

Query:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI
        LVLSRLDRDSSRVKSLNDRL F+LS+LKRSDLQPL+TEILPEDLSTPIVSG SQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI
Subjt:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPI

Query:  FDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASS
        FDPR SSSF+SL C+S QCQ LE SGCRA KCLYQV+YGDGSFTVGEFVTE+L+FGNSG+I NVALGCGHDNEGLFVGSAGLLGLGGGSLSL SQM+ASS
Subjt:  FDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASS

Query:  FSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTP
        FSYCLVDRDS SSS LEFNSA PSD+V A LLRSGRV+TFYYV L G+SVGG+ LSIPP LFQMDD+G GGIIVDSGTAITRLQT  YN+LRDAFV  TP
Subjt:  FSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTP

Query:  YLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        YL++TNG ALFDTCYDLSSQSRVTIPT+SFQF+GG+SL LPPKNYLIPVDS GTFC AFAPTTSSLSIIGNVQQQGTRV+FDLANS++GFSPNKC
Subjt:  YLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2 (e value: 8.5e-74; % identity: 42.94)
Query:  LSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSF
        + TP+ +    G GEY   + +G P   F  ++DTGSD+ W QC+PCT C+ Q  PIF+P+ SSSFS+LPC+SQ CQ L    C  ++C Y   YGDGS 
Subjt:  LSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSF

Query:  TVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGS-AGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSSTLEFNSAE---PSDSVTAQLLRSGRVNT
        T G   TET  F  S S+ N+A GCG DN+G   G+ AGL+G+G G LSL SQ+    FSYC+    S S STL   SA    P  S +  L+ S    T
Subjt:  TVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGS-AGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSSTLEFNSAE---PSDSVTAQLLRSGRVNT

Query:  FYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAF---VRLTPYLKKTNGFALFDTCYDLSSQ-SRVTIPTVSFQFSGG
        +YY+ L G++VGG +L IP   FQ+ D G GG+I+DSGT +T L    YN++  AF   + L    + ++G +   TC+   S  S V +P +S QF GG
Subjt:  FYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAF---VRLTPYLKKTNGFALFDTCYDLSSQ-SRVTIPTVSFQFSGG

Query:  ESLLLPPKNYLIPVDSAGTFCFAFAPTTS-SLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
          L L  +N LI   + G  C A   ++   +SI GN+QQQ T+V +DL N  V F P +C
Subjt:  ESLLLPPKNYLIPVDSAGTFCFAFAPTTS-SLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

Q766C3 Aspartic proteinase nepenthesin-1 (e value: 4.2e-73; % identity: 44.13)
Query:  GSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLA
        G GEY   + +G PA+PF  ++DTGSD+ W QCQPCT C+ Q+ PIF+P+ SSSFS+LPC SQ CQ L    C  + C Y   YGDGS T G   TETL 
Subjt:  GSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLA

Query:  FGNSGSIRNVALGCGHDNEGLFVGS-AGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSSTLEFNSAEPSDSVTA-----QLLRSGRVNTFYYVALTGM
        FG S SI N+  GCG +N+G   G+ AGL+G+G G LSL SQ+  + FSYC+    S + S L   S   ++SVTA      L++S ++ TFYY+ L G+
Subjt:  FGNSGSIRNVALGCGHDNEGLFVGS-AGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSSTLEFNSAEPSDSVTA-----QLLRSGRVNTFYYVALTGM

Query:  SVGGRSLSIPPYLFQMD-DSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCYDL-SSQSRVTIPTVSFQFSGGESLLLPPKNYL
        SVG   L I P  F ++ ++G GGII+DSGT +T      Y S+R  F+            + FD C+   S  S + IPT    F GG+ L LP +NY 
Subjt:  SVGGRSLSIPPYLFQMD-DSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCYDL-SSQSRVTIPTVSFQFSGGESLLLPPKNYL

Query:  IPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        I   S G  C A   ++  +SI GN+QQQ   V +D  NSVV F+  +C
Subjt:  IPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2 (e value: 5.5e-105; % identity: 42.27)
Query:  LLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVK
        LLPLF F +     L  SS  +S   +  +   L+    +    P+ + ++   E      SSS ++L+L  RD   +  ++++   + +R+ RD+ RV 
Subjt:  LLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVK

Query:  SLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPC
        ++       L  +    +   ++     D  + IVSG  QGSGEYF RIGVG P +  YMV+D+GSD+ W+QCQPC  CY+Q+DP+FDP  S S++ + C
Subjt:  SLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPC

Query:  DSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM---RASSFSYCLVDRDSG
         S  C  +E SGC +  C Y+V YGDGS+T G    ETL F  +  +RNVA+GCGH N G+F+G+AGLLG+GGGS+S   Q+      +F YCLV R + 
Subjt:  DSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM---RASSFSYCLVDRDSG

Query:  SSSTLEF-NSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFAL
        S+ +L F   A P  +    L+R+ R  +FYYV L G+ VGG  + +P  +F + ++G+GG+++D+GTA+TRL T  Y + RD F   T  L + +G ++
Subjt:  SSSTLEF-NSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFAL

Query:  FDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        FDTCYDLS    V +PTVSF F+ G  L LP +N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +V+FD AN  VGF PN C
Subjt:  FDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

Q9LNJ3 Aspartyl protease family protein 2 (e value: 5.7e-110; % identity: 49.27)
Query:  FFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPET-SKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKSLNDRLRF
        FFF SL   S   S       S SL  A  +  F P++ S+S  + E    ++S SS S+ L+       + +K    L  SRL RDS RVKS    +  
Subjt:  FFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPET-SKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKSLNDRLRF

Query:  ALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTL
          +++   ++        P   S+ +VSG SQGSGEYF+R+GVG PA+  YMVLDTGSDI WLQC PC  CY Q+DPIFDPR S +++++PC S  C+ L
Subjt:  ALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTL

Query:  EMSGC--RADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSL---TSQMRASSFSYCLVDRDSGS--SST
        + +GC  R   CLYQVSYGDGSFTVG+F TETL F     ++ VALGCGHDNEGLFVG+AGLLGLG G LS    T       FSYCLVDR + S  SS 
Subjt:  EMSGC--RADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSL---TSQMRASSFSYCLVDRDSGS--SST

Query:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGG-RSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTC
        +  N+A    +    LL + +++TFYYV L G+SVGG R   +   LF++D  GNGG+I+DSGT++TRL  P Y ++RDAF      LK+   F+LFDTC
Subjt:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGG-RSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTC

Query:  YDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        +DLS+ + V +PTV   F G + + LP  NYLIPVD+ G FCFAFA T   LSIIGN+QQQG RV +DLA+S VGF+P  C
Subjt:  YDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 (e value: 3.6e-165; % identity: 60)
Query:  MASANLLPLFAFAVFFFF-----SLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSK--SYQQQELLIPA--NSSSSFSLQLHPRDSLRNAGHKDYKS
        MA    L L A      F     +  RS     ++  LDV +SL+Q Q IL  DP  S   + + + L  P   NSSS  SL+LH RD+   + HKDYKS
Subjt:  MASANLLPLFAFAVFFFF-----SLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSK--SYQQQELLIPA--NSSSSFSLQLHPRDSLRNAGHKDYKS

Query:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPL---ETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT
        L LSRL+RDSSRV  +  ++RFA+  + RSDL+P+   +T    EDL+TP+VSG SQGSGEYFSRIGVG PAK  Y+VLDTGSD+NW+QC+PC DCYQQ+
Subjt:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPL---ETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT

Query:  DPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR
        DP+F+P +SS++ SL C + QC  LE S CR++KCLYQVSYGDGSFTVGE  T+T+ FGNSG I NVALGCGHDNEGLF G+AGLLGLGGG LS+T+QM+
Subjt:  DPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR

Query:  ASSFSYCLVDRDSGSSSTLEFNSAE-PSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFV
        A+SFSYCLVDRDSG SS+L+FNS +      TA LLR+ +++TFYYV L+G SVGG  + +P  +F +D SG+GG+I+D GTA+TRLQT  YNSLRDAF+
Subjt:  ASSFSYCLVDRDSGSSSTLEFNSAE-PSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFV

Query:  RLTPYLKK-TNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        +LT  LKK ++  +LFDTCYD SS S V +PTV+F F+GG+SL LP KNYLIPVD +GTFCFAFAPT+SSLSIIGNVQQQGTR+ +DL+ +V+G S NKC
Subjt:  RLTPYLKK-TNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

Arabidopsis top hits (e value, % identity, alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein (e value: 4.0e-111; % identity: 49.27)
Query:  FFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPET-SKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKSLNDRLRF
        FFF SL   S   S       S SL  A  +  F P++ S+S  + E    ++S SS S+ L+       + +K    L  SRL RDS RVKS    +  
Subjt:  FFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPET-SKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKSLNDRLRF

Query:  ALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTL
          +++   ++        P   S+ +VSG SQGSGEYF+R+GVG PA+  YMVLDTGSDI WLQC PC  CY Q+DPIFDPR S +++++PC S  C+ L
Subjt:  ALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTL

Query:  EMSGC--RADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSL---TSQMRASSFSYCLVDRDSGS--SST
        + +GC  R   CLYQVSYGDGSFTVG+F TETL F     ++ VALGCGHDNEGLFVG+AGLLGLG G LS    T       FSYCLVDR + S  SS 
Subjt:  EMSGC--RADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSL---TSQMRASSFSYCLVDRDSGS--SST

Query:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGG-RSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTC
        +  N+A    +    LL + +++TFYYV L G+SVGG R   +   LF++D  GNGG+I+DSGT++TRL  P Y ++RDAF      LK+   F+LFDTC
Subjt:  LEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGG-RSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTC

Query:  YDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        +DLS+ + V +PTV   F G + + LP  NYLIPVD+ G FCFAFA T   LSIIGN+QQQG RV +DLA+S VGF+P  C
Subjt:  YDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

AT1G25510.1 Eukaryotic aspartyl protease family protein (e value: 2.8e-157; % identity: 57.87)
Query:  PLFAFAVFFFFSLCRSS------PELS--RSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDR
        P ++F  F FF    SS      PE S   +  L+V+ S+ + +    F     +  QQ+E     ++SSSFSLQLH R S+R   H DYKSL L+RL+R
Subjt:  PLFAFAVFFFFSLCRSS------PELS--RSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDR

Query:  DSSRVKSLNDRLRFALSELKRSDLQPLETEILPE--DLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRAS
        D++RVKSL  RL  A++ + ++DL+P+ T    E  D+  P++SG +QGSGEYF+R+G+G+PA+  YMVLDTGSD+NWLQC PC DCY QT+PIF+P +S
Subjt:  DSSRVKSLNDRLRFALSELKRSDLQPLETEILPE--DLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRAS

Query:  SSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLV
        SS+  L CD+ QC  LE+S CR   CLY+VSYGDGS+TVG+F TETL  G S  ++NVA+GCGH NEGLFVG+AGLLGLGGG L+L SQ+  +SFSYCLV
Subjt:  SSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLV

Query:  DRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTN
        DRDS S+ST++F ++   D+V A LLR+ +++TFYY+ LTG+SVGG  L IP   F+MD+SG+GGII+DSGTA+TRLQT +YNSLRD+FV+ T  L+K  
Subjt:  DRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTN

Query:  GFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        G A+FDTCY+LS+++ V +PTV+F F GG+ L LP KNY+IPVDS GTFC AFAPT SSL+IIGNVQQQGTRV FDLANS++GFS NKC
Subjt:  GFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

AT3G18490.1 Eukaryotic aspartyl protease family protein (e value: 2.6e-166; % identity: 60)
Query:  MASANLLPLFAFAVFFFF-----SLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSK--SYQQQELLIPA--NSSSSFSLQLHPRDSLRNAGHKDYKS
        MA    L L A      F     +  RS     ++  LDV +SL+Q Q IL  DP  S   + + + L  P   NSSS  SL+LH RD+   + HKDYKS
Subjt:  MASANLLPLFAFAVFFFF-----SLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSK--SYQQQELLIPA--NSSSSFSLQLHPRDSLRNAGHKDYKS

Query:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPL---ETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT
        L LSRL+RDSSRV  +  ++RFA+  + RSDL+P+   +T    EDL+TP+VSG SQGSGEYFSRIGVG PAK  Y+VLDTGSD+NW+QC+PC DCYQQ+
Subjt:  LVLSRLDRDSSRVKSLNDRLRFALSELKRSDLQPL---ETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQT

Query:  DPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR
        DP+F+P +SS++ SL C + QC  LE S CR++KCLYQVSYGDGSFTVGE  T+T+ FGNSG I NVALGCGHDNEGLF G+AGLLGLGGG LS+T+QM+
Subjt:  DPIFDPRASSSFSSLPCDSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR

Query:  ASSFSYCLVDRDSGSSSTLEFNSAE-PSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFV
        A+SFSYCLVDRDSG SS+L+FNS +      TA LLR+ +++TFYYV L+G SVGG  + +P  +F +D SG+GG+I+D GTA+TRLQT  YNSLRDAF+
Subjt:  ASSFSYCLVDRDSGSSSTLEFNSAE-PSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFV

Query:  RLTPYLKK-TNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        +LT  LKK ++  +LFDTCYD SS S V +PTV+F F+GG+SL LP KNYLIPVD +GTFCFAFAPT+SSLSIIGNVQQQGTR+ +DL+ +V+G S NKC
Subjt:  RLTPYLKK-TNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

AT3G20015.1 Eukaryotic aspartyl protease family protein (e value: 3.9e-106; % identity: 42.27)
Query:  LLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVK
        LLPLF F +     L  SS  +S   +  +   L+    +    P+ + ++   E      SSS ++L+L  RD   +  ++++   + +R+ RD+ RV 
Subjt:  LLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVK

Query:  SLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPC
        ++       L  +    +   ++     D  + IVSG  QGSGEYF RIGVG P +  YMV+D+GSD+ W+QCQPC  CY+Q+DP+FDP  S S++ + C
Subjt:  SLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPC

Query:  DSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM---RASSFSYCLVDRDSG
         S  C  +E SGC +  C Y+V YGDGS+T G    ETL F  +  +RNVA+GCGH N G+F+G+AGLLG+GGGS+S   Q+      +F YCLV R + 
Subjt:  DSQQCQTLEMSGCRADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQM---RASSFSYCLVDRDSG

Query:  SSSTLEF-NSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFAL
        S+ +L F   A P  +    L+R+ R  +FYYV L G+ VGG  + +P  +F + ++G+GG+++D+GTA+TRL T  Y + RD F   T  L + +G ++
Subjt:  SSSTLEF-NSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFAL

Query:  FDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
        FDTCYDLS    V +PTVSF F+ G  L LP +N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +V+FD AN  VGF PN C
Subjt:  FDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC

AT3G61820.1 Eukaryotic aspartyl protease family protein; e value 2.9e-109; %identity 48.79
Query:  NLLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRV
        N L    FAV FF S   S  +      L  SA+L          PE S+S   + L   + S++S S+ L   D+L +        L   RL RDS RV
Subjt:  NLLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRV

Query:  KSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLP
        KS+      +L+ +         T       S  ++SG SQGSGEYF R+GVG PA   YMVLDTGSD+ WLQC PC  CY QTD IFDP+ S +F+++P
Subjt:  KSLNDRLRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLP

Query:  CDSQQCQTL-EMSGC---RADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR---ASSFSYCLV
        C S+ C+ L + S C   R+  CLYQVSYGDGSFT G+F TETL F +   + +V LGCGHDNEGLFVG+AGLLGLG G LS  SQ +      FSYCLV
Subjt:  CDSQQCQTL-EMSGC---RADKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMR---ASSFSYCLV

Query:  DR-DSGSS----STLEF-NSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGG-RSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLT
        DR  SGSS    ST+ F N+A P  SV   LL + +++TFYY+ L G+SVGG R   +    F++D +GNGG+I+DSGT++TRL  P Y +LRDAF    
Subjt:  DR-DSGSS----STLEF-NSAEPSDSVTAQLLRSGRVNTFYYVALTGMSVGG-RSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLT

Query:  PYLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
          LK+   ++LFDTC+DLS  + V +PTV F F GGE + LP  NYLIPV++ G FCFAFA T  SLSIIGN+QQQG RV +DL  S VGF    C
Subjt:  PYLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPVDSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
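
The %identity column can be approximated from the alignment blocks. In BLAST-style pairwise output, the unlabeled line between Query and Subjt is the midline: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal Python sketch under that assumption; the function name `percent_identity` and its input layout are hypothetical, and the database may compute its reported value over a different alignment length.

```python
def percent_identity(blocks):
    """Approximate percent identity from BLAST-style alignment blocks.

    blocks: iterable of (query_segment, midline_segment) string pairs with the
    'Query:' label and leading padding already removed, so that column i of
    the midline lines up with column i of the query segment.
    """
    identical = 0
    aligned = 0
    for query, midline in blocks:
        # Alignment length counts every column, including gap columns ('-').
        aligned += len(query)
        # A letter on the midline marks an identical residue pair.
        identical += sum(ch.isalpha() for ch in midline[:len(query)])
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return 100.0 * identical / aligned


# Toy segments for illustration only (not full alignment rows).
example = [("MASANLLP", "MA+ANLLP"), ("LLPLF-AF", "LLPLF  F")]
print(round(percent_identity(example), 2))
```

Trailing spaces on a midline are often lost when a page is copied as text, which is why the sketch takes the block width from the query segment rather than from the midline.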


Sequences
CDS sequence
ATGGCGTCTGCAAATCTGCTTCCTCTGTTTGCTTTCGCCGTCTTCTTCTTCTTCTCGCTCTGTCGCAGTTCGCCGGAGCTTTCTCGCTCCGGTTATCTCGATGTTTCTGC
TTCGCTGAAACAAGCTCAACGGATCCTCAAATTCGACCCGGAGACTTCGAAATCTTACCAGCAGCAGGAACTGCTCATTCCGGCGAACTCTTCTTCCTCCTTCTCTCTGC
AACTCCACCCTCGCGATTCTCTTCGCAATGCCGGCCACAAGGACTACAAGAGCTTGGTTCTCTCCAGGCTCGATCGGGATTCGTCCAGAGTTAAATCGCTCAACGATCGG
CTTCGATTTGCTCTCAGTGAGTTGAAGAGGTCGGATCTTCAGCCCTTGGAGACGGAGATTCTGCCGGAGGATCTTAGCACTCCGATTGTGTCTGGATTCAGTCAGGGGAG
CGGCGAGTATTTCTCCCGTATCGGCGTTGGACAACCGGCGAAGCCGTTCTATATGGTTCTTGATACCGGCAGTGATATCAATTGGCTTCAATGTCAGCCCTGTACCGATT
GCTACCAACAAACCGATCCGATTTTCGATCCGAGAGCTTCGTCTTCGTTTTCTTCTCTTCCCTGCGATTCTCAGCAATGTCAAACCCTAGAAATGTCCGGTTGTCGTGCT
GATAAATGTCTTTACCAGGTCTCGTACGGCGACGGCTCCTTCACCGTCGGCGAATTCGTCACGGAAACGCTGGCGTTTGGAAACTCCGGTTCGATTCGCAACGTCGCTCT
CGGATGCGGCCACGATAACGAAGGATTGTTCGTCGGATCCGCCGGATTGCTCGGACTCGGCGGCGGATCCCTATCCCTAACCTCTCAGATGAGAGCGTCGTCGTTTTCGT
ACTGTCTCGTCGATCGCGATTCCGGCTCTTCCTCAACTCTCGAGTTCAACTCCGCCGAGCCGAGCGACTCGGTGACGGCGCAATTGCTCAGAAGCGGAAGAGTGAATACA
TTCTACTACGTTGCACTCACCGGAATGAGCGTCGGCGGGCGGTCACTGTCGATTCCGCCGTATCTGTTTCAGATGGACGATTCCGGCAACGGCGGCATCATCGTCGATTC
AGGAACCGCCATAACTCGGCTGCAGACTCCGGTCTATAACTCGCTTCGCGACGCGTTCGTGAGGCTCACGCCGTACCTGAAGAAGACGAATGGCTTTGCGCTATTCGACA
CATGTTATGATCTGTCGTCGCAGTCTCGAGTCACCATTCCGACGGTGTCGTTTCAGTTCTCCGGCGGAGAGTCTCTGCTGCTGCCGCCGAAGAACTACCTGATTCCGGTG
GACTCCGCCGGGACTTTCTGTTTCGCGTTCGCTCCGACGACGTCGTCTTTGTCCATCATAGGGAACGTTCAGCAGCAGGGGACACGTGTCAACTTCGATTTGGCGAATTC
AGTCGTAGGGTTTTCGCCCAATAAATGCTAG
mRNA sequence
TTTAAAAATCAAATCTGTTGCCATTGCAAATCATGTATTTAAAATATAAATAAAATTAAAATTTGAATAATTTTGGGGAAAAAAGAGAGCAGATAATGGCAACTAGGAAA
CAGTGCCTTGCCTCCAACCCCAAACCAAACTCAACCATCTTATTCATTCAATTAAAAACCTCACTTCCTTTTTAAACCCCAACTTTCCACCCTTCTTCTTCACTGCTTAA
CAATGGCGTCTGCAAATCTGCTTCCTCTGTTTGCTTTCGCCGTCTTCTTCTTCTTCTCGCTCTGTCGCAGTTCGCCGGAGCTTTCTCGCTCCGGTTATCTCGATGTTTCT
GCTTCGCTGAAACAAGCTCAACGGATCCTCAAATTCGACCCGGAGACTTCGAAATCTTACCAGCAGCAGGAACTGCTCATTCCGGCGAACTCTTCTTCCTCCTTCTCTCT
GCAACTCCACCCTCGCGATTCTCTTCGCAATGCCGGCCACAAGGACTACAAGAGCTTGGTTCTCTCCAGGCTCGATCGGGATTCGTCCAGAGTTAAATCGCTCAACGATC
GGCTTCGATTTGCTCTCAGTGAGTTGAAGAGGTCGGATCTTCAGCCCTTGGAGACGGAGATTCTGCCGGAGGATCTTAGCACTCCGATTGTGTCTGGATTCAGTCAGGGG
AGCGGCGAGTATTTCTCCCGTATCGGCGTTGGACAACCGGCGAAGCCGTTCTATATGGTTCTTGATACCGGCAGTGATATCAATTGGCTTCAATGTCAGCCCTGTACCGA
TTGCTACCAACAAACCGATCCGATTTTCGATCCGAGAGCTTCGTCTTCGTTTTCTTCTCTTCCCTGCGATTCTCAGCAATGTCAAACCCTAGAAATGTCCGGTTGTCGTG
CTGATAAATGTCTTTACCAGGTCTCGTACGGCGACGGCTCCTTCACCGTCGGCGAATTCGTCACGGAAACGCTGGCGTTTGGAAACTCCGGTTCGATTCGCAACGTCGCT
CTCGGATGCGGCCACGATAACGAAGGATTGTTCGTCGGATCCGCCGGATTGCTCGGACTCGGCGGCGGATCCCTATCCCTAACCTCTCAGATGAGAGCGTCGTCGTTTTC
GTACTGTCTCGTCGATCGCGATTCCGGCTCTTCCTCAACTCTCGAGTTCAACTCCGCCGAGCCGAGCGACTCGGTGACGGCGCAATTGCTCAGAAGCGGAAGAGTGAATA
CATTCTACTACGTTGCACTCACCGGAATGAGCGTCGGCGGGCGGTCACTGTCGATTCCGCCGTATCTGTTTCAGATGGACGATTCCGGCAACGGCGGCATCATCGTCGAT
TCAGGAACCGCCATAACTCGGCTGCAGACTCCGGTCTATAACTCGCTTCGCGACGCGTTCGTGAGGCTCACGCCGTACCTGAAGAAGACGAATGGCTTTGCGCTATTCGA
CACATGTTATGATCTGTCGTCGCAGTCTCGAGTCACCATTCCGACGGTGTCGTTTCAGTTCTCCGGCGGAGAGTCTCTGCTGCTGCCGCCGAAGAACTACCTGATTCCGG
TGGACTCCGCCGGGACTTTCTGTTTCGCGTTCGCTCCGACGACGTCGTCTTTGTCCATCATAGGGAACGTTCAGCAGCAGGGGACACGTGTCAACTTCGATTTGGCGAAT
TCAGTCGTAGGGTTTTCGCCCAATAAATGCTAGAAAATGGAGTTATAAAATTATAAATTTAGGGGGTTGTTTTTTTTTTTGGGATTTGCCTTATTTTTGTTTTGTTGTGT
CCGTGTGGGGTAATTTACGAAATTAGGGTTATGGGGAATTATAAATTAGGGGAAATTAAAAGCGGGTGGGGCAGGTGTGGATAATGGCCGAAGGAAATGAACAAAGAGAA
GCATGTGCAAGTCATAAATCAATCATAAATTGTACCGTAAGATTCAGTTGAGTGGCATGTATAGTAAATAAACTGCAAACCTGTGGCCAGATTTGGGGTTTTCACCATTT
TCTTACCTTCTTCCTTCCCCCACAATATCTAATAAAAAAATATAAAAATTTGAGTAAAAAACCATTTCAGGGGTGGATAATATTGAGTGGTTAAAACTCCAAAGTTTGTA
ACCAAAACTAGTTTGGATATATATAATTTTTGGGTAGAAGAGTTAAAGTAAAAAAAAATTGTTTGAATGTAGATTTAAAAT
Protein sequence
MASANLLPLFAFAVFFFFSLCRSSPELSRSGYLDVSASLKQAQRILKFDPETSKSYQQQELLIPANSSSSFSLQLHPRDSLRNAGHKDYKSLVLSRLDRDSSRVKSLNDR
LRFALSELKRSDLQPLETEILPEDLSTPIVSGFSQGSGEYFSRIGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRASSSFSSLPCDSQQCQTLEMSGCRA
DKCLYQVSYGDGSFTVGEFVTETLAFGNSGSIRNVALGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMRASSFSYCLVDRDSGSSSTLEFNSAEPSDSVTAQLLRSGRVNT
FYYVALTGMSVGGRSLSIPPYLFQMDDSGNGGIIVDSGTAITRLQTPVYNSLRDAFVRLTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFQFSGGESLLLPPKNYLIPV
DSAGTFCFAFAPTTSSLSIIGNVQQQGTRVNFDLANSVVGFSPNKC
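
The CDS and protein sequences listed for this gene can be cross-checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted in as wrapped lines.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed for Sed0014241.
print(translate("ATGGCGTCTGCAAATCTG"))
print(translate("CCCAATAAATGCTAG"))
```

Running `translate` on the full CDS and comparing the result with the listed protein sequence is a quick consistency check on a copied record.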