CuGenDBv2

CSPI06G10890 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G10890
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: aspartic proteinase CDR1
Genome location: Chr6:9499375..9501684
RNA-Seq Expression: CSPI06G10890
Synteny: CSPI06G10890
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0005576 - extracellular region (cellular component)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001461 - Aspartic peptidase A1 family
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR033121 - Peptidase family A1 domain
  IPR034161 - Pepsin-like domain, plant


Homology
GenBank top hits | e value | % identity | Alignment
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa] | 8.1e-240 | 90.39
Query:  MKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGS
        MKIQD+ ERMKDIHEHD NRHRSISKSMNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFV+LKVGTPAQTFMLIADTGS
Subjt:  MKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGS

Query:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLT
        DLTW+KCRYRRCFGNCS NVNHKSKNEKKQRFRHA LAN SS+FKTVSCSSTMCTN+LA+LFAVAEC TPTSPCVYDYSY GGASAKGIFAWETLTVGLT
Subjt:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLT

Query:  NGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGD
        NGKEKQL NSIIGCTE VQG+VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLG+PTPSTSASTSSAK PAKM+YTKLYVGD
Subjt:  NGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGD

Query:  PYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVF
        PYSSFYGVDLIGISA+G MLNIP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQ+EIEPF+FCFNNSQYTH+MAPKLRFHFGDGTVF
Subjt:  PYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        EPPTKSYIVSVG+FISCIG VSMPFP+ NIIGNILQQNHLWQFDFQKRRVGFA SECI
Subjt:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus] | 9.1e-308 | 99.26
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSISKSMNQKQVEDARLRAEAEAATE EVAKSAILPPATSTPIGM+MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAV ECH PTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
        VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN

Query:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
        IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
Subjt:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV

Query:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
Subjt:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo] | 1.5e-273 | 88.83
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGYRKPMSPISNFC   FFFLL FFLSFSSSFLFALGDE NN+NN    NDDEDEQ+ I+FDLLHRHHPQV+EK++GDMKIQD+ ERMKDIHEHD NRH
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSISKSMNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFV+LKVGTPAQTFMLIADTGSDLTW+KCRYRRCFGNCS NVN
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HKSKNEKKQRFRHA LAN SS+FKTVSCSSTMCTN+LA+LFAVAEC TPTSPCVYDYSY GGASAKGIFAWETLTVGLTNGKEKQL NSIIGCTE VQG+
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
        VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLG+PTPSTSASTSSAK PAKM+YTKLYVGDPYSSFYGVDLIGISA+G MLN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN

Query:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
        IP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQ+EIEPF+FCFNNSQYTH+MAPKLRFHFGDGTVFEPPTKSYIVSVG+FISCIG V
Subjt:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV

Query:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        SMPFP+ NIIGNILQQNHLWQFDFQKRRVGFA SECI
Subjt:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata] | 5.8e-161 | 54.75
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGY  PMSPIS    FFFF    FFLS   +F    GDE               E  ++K D++HRHHP V EK++G+ +    ++R +DIHEHDHNR 
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSIS SM   + +                     LP  +S PI +K+ SG DFG++EYFV+ +VGTP Q F+LI DTGSDLTW+KCRYRRC GNC+++ +
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HKS+ E K +F H FLANHSSSFK ++C S  C  DL  LFA+ +C  P++PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQLH+++IGCTE     
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
           G DG++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S      MT+  L++G P++S+YGV LIGIS +G+ LN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN

Query:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
        IP RVWDI  GGGTI+DSGTSL++L APAFD+ MEA+  +LKKFQQ+  +PF +CFN + Y+HEMAPKLRFHF  G VFEPP KSYIV V   I C+GF 
Subjt:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV

Query:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        S+PFP  NIIGNILQQN LWQFDF  ++VGFAPS+CI
Subjt:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida] | 3.1e-223 | 72.17
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGYRKPMSPIS+FC FF    LFFFLS   +F                  D   +QE +K DLLHRHHPQV+EK+HGD+K++++++R+KDI EHD  R+
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        ++IS S+N+ ++ D +LR EA    E    K   LPP +STPIG+KMISG+D+GSSEYFV+LKVGTP QTFMLIADTGSDLTWMKCRYRRC GNCSSN N
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HK++NE+K RFR+AFLAN+SSSFKT+ CSS MCTNDLADLF++ EC TPTSPC+YDYSY+GGASAKG+FA ETLTVGLTNGKEKQLHNSIIGCTESVQG 
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPST--SASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIM
        +FGGADGV+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +A+ SS      M++TKL++GDPYSSFYGVDL+GISA+G+M
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPST--SASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIM

Query:  LNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIG
        LNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEAL P+LK F+ +EIEPFDFCFNNS+YTHEMAPKLRFHFGDGTVF+PP KSYIVSVG++ISCIG
Subjt:  LNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIG

Query:  FVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        FVSMPFPA NIIGNILQQNHLW+FDF    VGFAPSEC+
Subjt:  FVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KG92 Peptidase A1 domain-containing protein | 4.4e-308 | 99.26
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSISKSMNQKQVEDARLRAEAEAATE EVAKSAILPPATSTPIGM+MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAV ECH PTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
        VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN

Query:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
        IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
Subjt:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV

Query:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
Subjt:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

A0A1S3C2F3 aspartic proteinase CDR1 | 7.1e-274 | 88.83
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGYRKPMSPISNFC   FFFLL FFLSFSSSFLFALGDE NN+NN    NDDEDEQ+ I+FDLLHRHHPQV+EK++GDMKIQD+ ERMKDIHEHD NRH
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSISKSMNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFV+LKVGTPAQTFMLIADTGSDLTW+KCRYRRCFGNCS NVN
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HKSKNEKKQRFRHA LAN SS+FKTVSCSSTMCTN+LA+LFAVAEC TPTSPCVYDYSY GGASAKGIFAWETLTVGLTNGKEKQL NSIIGCTE VQG+
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
        VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLG+PTPSTSASTSSAK PAKM+YTKLYVGDPYSSFYGVDLIGISA+G MLN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN

Query:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
        IP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQ+EIEPF+FCFNNSQYTH+MAPKLRFHFGDGTVFEPPTKSYIVSVG+FISCIG V
Subjt:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV

Query:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        SMPFP+ NIIGNILQQNHLWQFDFQKRRVGFA SECI
Subjt:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

A0A5D3B701 Aspartic proteinase CDR1 | 3.9e-240 | 90.39
Query:  MKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGS
        MKIQD+ ERMKDIHEHD NRHRSISKSMNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFV+LKVGTPAQTFMLIADTGS
Subjt:  MKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGS

Query:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLT
        DLTW+KCRYRRCFGNCS NVNHKSKNEKKQRFRHA LAN SS+FKTVSCSSTMCTN+LA+LFAVAEC TPTSPCVYDYSY GGASAKGIFAWETLTVGLT
Subjt:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLT

Query:  NGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGD
        NGKEKQL NSIIGCTE VQG+VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLG+PTPSTSASTSSAK PAKM+YTKLYVGD
Subjt:  NGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGD

Query:  PYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVF
        PYSSFYGVDLIGISA+G MLNIP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQ+EIEPF+FCFNNSQYTH+MAPKLRFHFGDGTVF
Subjt:  PYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        EPPTKSYIVSVG+FISCIG VSMPFP+ NIIGNILQQNHLWQFDFQKRRVGFA SECI
Subjt:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X2 | 8.4e-158 | 54.53
Query:  MSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSM
        MSPIS    FFFF    FFLS   +F    GDE               E  ++K D++HRHHP V EK++G+ +    ++R +DIHEHDHNR RSIS SM
Subjt:  MSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSM

Query:  NQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEK
           + +                     LP  +S PI +K+ SG DFG++EYFV+ +VGTP Q F+LI DTGSDLTW+KCRYRRC GNC+++ +HKS+ E 
Subjt:  NQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEK

Query:  KQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADG
        K +F H FLANHSSSFK ++C S  C  DL  LFA+ +C  P++PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQLH+++IGCTE        G DG
Subjt:  KQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADG

Query:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWD
        ++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S      MT+  L++G P++S+YGV LIGIS +G+ LNIP RVWD
Subjt:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWD

Query:  INSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPAN
        I  GGGTI+DSGTSL++L APAFD+ MEA+  +LKKFQQ+  +PF +CFN + Y+HEMAPKLRFHF  G VFEPP KSYIV V   I C+GF S+PFP  
Subjt:  INSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPAN

Query:  NIIGNILQQNHLWQFDFQKRRVGFAPSECI
        NIIGNILQQN LWQFDF  ++VGFAPS+CI
Subjt:  NIIGNILQQNHLWQFDFQKRRVGFAPSECI

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X1 | 2.8e-161 | 54.75
Query:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH
        MLGY  PMSPIS    FFFF    FFLS   +F    GDE               E  ++K D++HRHHP V EK++G+ +    ++R +DIHEHDHNR 
Subjt:  MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRH

Query:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSIS SM   + +                     LP  +S PI +K+ SG DFG++EYFV+ +VGTP Q F+LI DTGSDLTW+KCRYRRC GNC+++ +
Subjt:  RSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS
        HKS+ E K +F H FLANHSSSFK ++C S  C  DL  LFA+ +C  P++PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQLH+++IGCTE     
Subjt:  HKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN
           G DG++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S      MT+  L++G P++S+YGV LIGIS +G+ LN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLN

Query:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV
        IP RVWDI  GGGTI+DSGTSL++L APAFD+ MEA+  +LKKFQQ+  +PF +CFN + Y+HEMAPKLRFHF  G VFEPP KSYIV V   I C+GF 
Subjt:  IPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFV

Query:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI
        S+PFP  NIIGNILQQN LWQFDF  ++VGFAPS+CI
Subjt:  SMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI

SwissProt top hits | e value | % identity | Alignment
O04496 Aspartyl protease AED3 | 3.1e-32 | 28.82
Query:  PPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTN
        P  TS P+     SG       Y V  K+GTP Q   ++ DT +D  W+ C        CS   N  +          +F  N SS++ TVSCS+  CT 
Subjt:  PPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTN

Query:  DLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCL
          A            S C ++ SY G +S       +TLT+         + N   GC  S  G+      G+MGLG    SL  +   +   G FSYCL
Subjt:  DLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCL

Query:  VDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPS--RVWDINSGGGTIIDSGTSLTILAAPAFDMV
            +      YF       S S        P  + YT L       S Y V+L G+S   + + +      +D NSG GTIIDSGT +T  A P ++ +
Subjt:  VDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPS--RVWDINSGGGTIIDSGTSLTILAAPAFDMV

Query:  MEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPAN---NIIGNILQQNHLWQFDFQKRRVG
         +    ++       +  FD CF  S     +APK+  H     +  P   + I S    ++C+    +   AN   N+I N+ QQN    FD    R+G
Subjt:  MEALTPRLKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPAN---NIIGNILQQNHLWQFDFQKRRVG

Query:  FAPSEC
         AP  C
Subjt:  FAPSEC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2 | 1.7e-30 | 26.57
Query:  FFFFFLLFFFLSFSSSFLF-----------------ALGDEDNNFNNNNNINDDEDEQEIIKFDLLHR-HHPQVAEKIHGDMKIQDVSERMKDIHEHDHN
        FFFF  L   LS SSS  F                  L D      NN + +D+   +  ++  LLHR   P V  + H          R+      D +
Subjt:  FFFFFLLFFFLSFSSSFLF-----------------ALGDEDNNFNNNNNINDDEDEQEIIKFDLLHR-HHPQVAEKIHGDMKIQDVSERMKDIHEHDHN

Query:  RHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSN
        R  +I + ++ K +  +  R E                       G  ++SG D GS EYFV + VG+P +   ++ D+GSD+ W++C+  +     S  
Subjt:  RHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSN

Query:  VNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQ
        V   +K               S S+  VSC S++C     D    + CH  +  C Y+  Y  G+  KG  A ETLT   T      + N  +GC    +
Subjt:  VNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQ

Query:  GSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIM
        G +F GA G++G+G  S S   + +    GG F YCLV   TD      F                 LP   ++  L       SFY V L G+   G+ 
Subjt:  GSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIM

Query:  LNIPSRVWDI--NSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLE-IEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV-GKFI
        + +P  V+D+     GG ++D+GT++T L   A+    +    +     +   +  FD C++ S +     P + F+F +G V   P +++++ V     
Subjt:  LNIPSRVWDI--NSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLE-IEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV-GKFI

Query:  SCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
         C  F + P    +IIGNI Q+     FD     VGF P+ C
Subjt:  SCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

Q9LNJ3 Aspartyl protease family protein 2 | 4.2e-37 | 30.9
Query:  MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAE
        ++SG   GS EYF  L VGTPA+   ++ DTGSD+ W++C   RRC+                      F    S ++ T+ CSS  C          A 
Subjt:  MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAE

Query:  CHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAI
        C+T    C+Y  SY  G+   G F+ ETLT      +  ++    +GC    +G +F GA G++GLG    S   +     N   FSYCLVD     +  
Subjt:  CHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAI

Query:  SYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIML-NIPSRVWDIN--SGGGTIIDSGTSLTILAAPAFDMVMEALTPRLK
                   +S    +A +     +T L       +FY V L+GIS  G  +  + + ++ ++    GG IIDSGTS+T L  PA+  + +A     K
Subjt:  SYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIML-NIPSRVWDIN--SGGGTIIDSGTSLTILAAPAFDMVMEALTPRLK

Query:  KFQQL-EIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV---GKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
          ++  +   FD CF+ S       P +  HF    V  P T +Y++ V   GKF  C  F        +IIGNI QQ     +D    RVGFAP  C
Subjt:  KFQQL-EIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV---GKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 | 3.2e-37 | 30.4
Query:  MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAEC
        ++SGA  GS EYF  + VGTPA+   L+ DTGSD+ W++C        C+         +  Q+    F    SS++K+++CS+  C+     L   + C
Subjt:  MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAEC

Query:  HTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAIS
           ++ C+Y  SY  G+   G  A +T+T G  +GK   ++N  +GC    +G +F GA G++GLG    S+T           FSYCLVD  + +    
Subjt:  HTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAIS

Query:  YFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEA---LTPRL
                 S+S   +S +L        L       +FY V L G S  G  + +P  ++D+++   GG I+D GT++T L   A++ + +A   LT  L
Subjt:  YFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEA---LTPRL

Query:  KKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV---GKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
        KK     I  FD C++ S  +    P + FHF  G   + P K+Y++ V   G F  C  F      + +IIGN+ QQ     +D  K  +G + ++C
Subjt:  KKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV---GKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

Q9LTW4 Aspartic proteinase NANA, chloroplast | 1.3e-86 | 39.39
Query:  VSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVELKVGTPAQTFMLIADTGSDLT
        V++ MKD        HR         ++ED             +  + +++    ++ +G+KM   SG D+G+++YF E++VGTPA+ F ++ DTGS+LT
Subjt:  VSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVELKVGTPAQTFMLIADTGSDLT

Query:  WMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGK
        W+ CRYR            + K+      R  F A+ S SFKTV C +  C  DL +LF++  C TP++PC YDY Y  G++A+G+FA ET+TVGLTNG+
Subjt:  WMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGK

Query:  EKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYS
          +L   +IGC+ S  G  F GADGV+GL  S +S T   A +  G  FSYCLVDHL+++   +Y + G    S+ ++ ++ +    +  T++       
Subjt:  EKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYS

Query:  SFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIE--PFDFCFN-NSQYTHEMAPKLRFHFGDGTVF
         FY +++IGIS    ML+IPS+VWD  SGGGTI+DSGTSLT+LA  A+  V+  L   L + ++++ E  P ++CF+  S +     P+L FH   G  F
Subjt:  SFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIE--PFDFCFN-NSQYTHEMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
        EP  KSY+V     + C+GFVS   PA N+IGNI+QQN+LW+FD     + FAPS C
Subjt:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

Arabidopsis top hits | e value | % identity | Alignment
AT2G42980.1 Eukaryotic aspartyl protease family protein | 8.0e-44 | 28.66
Query:  DMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTG
        D++IQD++ R+K +H   +   +  ++ + +K   D  L    E +    +A                + SG   GS EYF+++ VGTP + F LI DTG
Subjt:  DMKIQDVSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTG

Query:  SDLTWMKC-RYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVG
        SDL W++C     CF       + K+                S+SFK ++C+   C+  ++      +C +    C Y Y Y   ++  G FA ET TV 
Subjt:  SDLTWMKC-RYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVG

Query:  LT----NGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYT
        LT       E ++ N + GC    +G +F GA G++GLG    S +    ++  G  FSYCLVD  ++    S  + G                  + +T
Subjt:  LT----NGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYT

Query:  KLYVGDPYS--SFYGVDLIGISANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQL--EIEPFDFCFNNS--QYTHEM
            G   S  +FY + +  I   G  L+IP   W+I+S   GGTIIDSGT+L+  A PA++++      ++K+   +  +    D CFN S  +  +  
Subjt:  KLYVGDPYS--SFYGVDLIGISANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQL--EIEPFDFCFNNS--QYTHEM

Query:  APKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
         P+L   F DGTV+  P ++  + + + + C+  +  P    +IIGN  QQN    +D ++ R+GF P++C
Subjt:  APKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

AT3G12700.1 Eukaryotic aspartyl protease family protein | 9.0e-88 | 39.39
Query:  VSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVELKVGTPAQTFMLIADTGSDLT
        V++ MKD        HR         ++ED             +  + +++    ++ +G+KM   SG D+G+++YF E++VGTPA+ F ++ DTGS+LT
Subjt:  VSERMKDIHEHDHNRHRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVELKVGTPAQTFMLIADTGSDLT

Query:  WMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGK
        W+ CRYR            + K+      R  F A+ S SFKTV C +  C  DL +LF++  C TP++PC YDY Y  G++A+G+FA ET+TVGLTNG+
Subjt:  WMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGK

Query:  EKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYS
          +L   +IGC+ S  G  F GADGV+GL  S +S T   A +  G  FSYCLVDHL+++   +Y + G    S+ ++ ++ +    +  T++       
Subjt:  EKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYS

Query:  SFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIE--PFDFCFN-NSQYTHEMAPKLRFHFGDGTVF
         FY +++IGIS    ML+IPS+VWD  SGGGTI+DSGTSLT+LA  A+  V+  L   L + ++++ E  P ++CF+  S +     P+L FH   G  F
Subjt:  SFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPRLKKFQQLEIE--PFDFCFN-NSQYTHEMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
        EP  KSY+V     + C+GFVS   PA N+IGNI+QQN+LW+FD     + FAPS C
Subjt:  EPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

AT3G25700.1 Eukaryotic aspartyl protease family protein | 1.7e-57 | 35.31
Query:  MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMC----TNDLADLFA
        ++SGA  GS +YFV+L++G P Q+ +LIADTGSDL W+KC   R   NCS    H S           F   HSS+F    C   +C      D A +  
Subjt:  MISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMC----TNDLADLFA

Query:  VAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGC-----TESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVD
            H   S C Y+Y Y  G+   G+FA ET ++  ++GKE +L +   GC      +SV G+ F GA+GVMGLG    S   +      G  FSYCL+D
Subjt:  VAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGC-----TESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVD

Query:  HLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDI--NSGGGTIIDSGTSLTILAAPAFDMVME
        +       SY ++G      S          K+ +T L       +FY V L  +  NG  L I   +W+I  +  GGT++DSGT+L  LA PA+  V+ 
Subjt:  HLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDI--NSGGGTIIDSGTSLTILAAPAFDMVME

Query:  ALTPRLKKFQQLEIEP-FDFCFNNSQYT--HEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSM-PFPANNIIGNILQQNHLWQFDFQKRRVGF
        A+  R+K      + P FD C N S  T   ++ P+L+F F  G VF PP ++Y +   + I C+   S+ P    ++IGN++QQ  L++FD  + R+GF
Subjt:  ALTPRLKKFQQLEIEP-FDFCFNNSQYT--HEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSM-PFPANNIIGNILQQNHLWQFDFQKRRVGF

Query:  APSEC
        +   C
Subjt:  APSEC

AT3G59080.1 Eukaryotic aspartyl protease family protein | 2.0e-47 | 31.03
Query:  HRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKC-RYRRCFGNCSSN
        H+ + +  NQ  V   + + + E  T   VA S       +  +   + SG   GS EYF+++ VG+P + F LI DTGSDL W++C     CF    + 
Subjt:  HRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKC-RYRRCFGNCSSN

Query:  VNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGL-TNGKEKQLH---NSIIGCT
         + K+                S+S+K ++C+   C N ++       C +    C Y Y Y   ++  G FA ET TV L TNG   +L+   N + GC 
Subjt:  VNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGL-TNGKEKQLH---NSIIGCT

Query:  ESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVG--DPYSSFYGVDLIGI
           +G +F GA G++GLG    S +    ++  G  FSYCLVD  +D    S  + G      S           + +T    G  +   +FY V +  I
Subjt:  ESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVG--DPYSSFYGVDLIGI

Query:  SANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEALTPRLK-KFQQLEIEP-FDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIV
           G +LNIP   W+I+S   GGTIIDSGT+L+  A PA++ +   +  + K K+      P  D CFN S   +   P+L   F DG V+  PT++  +
Subjt:  SANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEALTPRLK-KFQQLEIEP-FDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIV

Query:  SVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
         + + + C+  +  P  A +IIGN  QQN    +D ++ R+G+AP++C
Subjt:  SVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC

AT3G59080.2 Eukaryotic aspartyl protease family protein (e value: 5.7e-42, %identity: 29.53)
Query:  HRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNV
        H+ + +  NQ  V   + + + E  T   VA S       +  +   + SG   GS EYF+++ VG+P + F LI DTGSDL W++C             
Subjt:  HRSISKSMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNV

Query:  NHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGL-TNGKEKQLH---NSIIGCTE
                                  + C      ND                C Y Y Y   ++  G FA ET TV L TNG   +L+   N + GC  
Subjt:  NHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGL-TNGKEKQLH---NSIIGCTE

Query:  SVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVG--DPYSSFYGVDLIGIS
          +G +F GA G++GLG    S +    ++  G  FSYCLVD  +D    S  + G      S           + +T    G  +   +FY V +  I 
Subjt:  SVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVG--DPYSSFYGVDLIGIS

Query:  ANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEALTPRLK-KFQQLEIEP-FDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVS
          G +LNIP   W+I+S   GGTIIDSGT+L+  A PA++ +   +  + K K+      P  D CFN S   +   P+L   F DG V+  PT++  + 
Subjt:  ANGIMLNIPSRVWDINS--GGGTIIDSGTSLTILAAPAFDMVMEALTPRLK-KFQQLEIEP-FDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVS

Query:  VGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC
        + + + C+  +  P  A +IIGN  QQN    +D ++ R+G+AP++C
Subjt:  VGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC


Sequences
CDS sequence
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCAAATTTTTGTTTCTTCTTCTTCTTCTTCCTCCTTTTCTTCTTCTTATCCTTCTCCTCCTCTTTCCTCTTTGCATT
GGGCGACGAAGACAACAACTTCAACAACAATAATAATATTAATGACGACGAAGATGAGCAAGAGATAATCAAATTCGATCTTCTTCACCGTCACCATCCACAAGTCGCGG
AAAAGATTCACGGTGATATGAAAATCCAAGATGTAAGCGAAAGAATGAAAGACATTCACGAACACGACCACAATCGTCACCGCTCCATCTCCAAATCGATGAACCAGAAG
CAAGTTGAGGATGCAAGATTGAGGGCAGAGGCAGAGGCGGCGACAGAGGTAGAGGTAGCAAAGAGTGCAATACTTCCACCGGCAACGTCGACGCCGATAGGAATGAAGAT
GATTTCGGGTGCAGATTTCGGGAGTAGTGAGTATTTCGTAGAGTTGAAGGTGGGAACGCCGGCGCAAACGTTTATGTTGATTGCGGATACAGGGAGTGATTTAACGTGGA
TGAAATGTAGATATCGGAGGTGTTTTGGGAATTGTAGCAGTAACGTGAATCATAAAAGCAAAAATGAAAAGAAACAGAGATTTAGACATGCTTTCTTGGCGAATCATTCG
TCTTCTTTTAAGACGGTTTCTTGCAGCTCAACGATGTGCACCAATGATCTTGCGGATTTGTTTGCTGTTGCGGAATGCCACACCCCAACTAGTCCTTGTGTCTATGATTA
CAGCTACACTGGAGGAGCAAGTGCAAAGGGGATATTCGCATGGGAGACTCTAACCGTAGGCTTAACAAACGGAAAAGAAAAACAACTCCATAATTCCATAATTGGATGTA
CGGAATCCGTCCAAGGCAGCGTTTTCGGTGGTGCCGACGGCGTCATGGGCTTAGGCACTAGCTCCTATTCTTTAACCTACAAGGCCGCCGAAAACGCCAACGGCGGTGGC
TTCTCTTACTGCCTTGTCGATCATCTCACCGATCAAAGAGCCATCAGTTACTTCGTCCTCGGCATTCCTACCCCTTCCACTTCCGCCTCCACCTCCTCCGCCAAACTTCC
CGCCAAAATGACCTACACCAAACTCTACGTCGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTGATCGGTATCTCCGCCAACGGCATCATGCTCAACATCCCTTCCC
GTGTTTGGGACATTAATTCCGGCGGCGGTACCATCATCGATTCCGGTACCAGTCTCACTATACTTGCGGCTCCTGCTTTCGACATGGTAATGGAAGCTCTAACTCCGAGA
TTGAAGAAATTTCAGCAACTTGAAATCGAACCTTTCGATTTTTGCTTCAATAACAGCCAGTACACTCACGAAATGGCGCCGAAGCTCCGATTCCATTTCGGTGACGGGAC
GGTGTTTGAGCCGCCGACGAAAAGCTACATTGTGTCGGTGGGGAAATTCATTAGCTGTATTGGGTTCGTTTCAATGCCTTTTCCGGCGAACAATATCATTGGAAATATTC
TTCAGCAAAATCACCTTTGGCAATTTGATTTCCAAAAGAGAAGAGTCGGTTTTGCCCCTTCCGAATGCATCTAA
mRNA sequence
ATTATATTTATGTTACAGAAATATCCCTAAATTAAGATTAAGGGGGAATTGGATGATTGAATTGTGATAATTAATTTGAGGGTATAATGGGGGATGGGATATTTGTAATG
AATTATTGACCCAGAAAATGGATGTATTTTAAAAAGTCAAATAAAGAGAATTAGAAAAAAGAAAGAAAGGAAAAAAACAAAAAACAGAAAGAGTTGGTTAAGAGAGAAGT
AGTTTGTTATAAAAAACAGGAAGAAAAAGAAGAAGAAAAGCGATTAATAGGGAATCTAAATATAACCATTTTCGGTTAAATTTCTTAGTTTTTAAGTCCGCCATTATCAT
CGTTCTTGTTGCTCCATCTCTCTAACATTACATTCTCTGTTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCAAATTTTTGTTTCTTCTTCTTCTTCTTCCT
CCTTTTCTTCTTCTTATCCTTCTCCTCCTCTTTCCTCTTTGCATTGGGCGACGAAGACAACAACTTCAACAACAATAATAATATTAATGACGACGAAGATGAGCAAGAGA
TAATCAAATTCGATCTTCTTCACCGTCACCATCCACAAGTCGCGGAAAAGATTCACGGTGATATGAAAATCCAAGATGTAAGCGAAAGAATGAAAGACATTCACGAACAC
GACCACAATCGTCACCGCTCCATCTCCAAATCGATGAACCAGAAGCAAGTTGAGGATGCAAGATTGAGGGCAGAGGCAGAGGCGGCGACAGAGGTAGAGGTAGCAAAGAG
TGCAATACTTCCACCGGCAACGTCGACGCCGATAGGAATGAAGATGATTTCGGGTGCAGATTTCGGGAGTAGTGAGTATTTCGTAGAGTTGAAGGTGGGAACGCCGGCGC
AAACGTTTATGTTGATTGCGGATACAGGGAGTGATTTAACGTGGATGAAATGTAGATATCGGAGGTGTTTTGGGAATTGTAGCAGTAACGTGAATCATAAAAGCAAAAAT
GAAAAGAAACAGAGATTTAGACATGCTTTCTTGGCGAATCATTCGTCTTCTTTTAAGACGGTTTCTTGCAGCTCAACGATGTGCACCAATGATCTTGCGGATTTGTTTGC
TGTTGCGGAATGCCACACCCCAACTAGTCCTTGTGTCTATGATTACAGCTACACTGGAGGAGCAAGTGCAAAGGGGATATTCGCATGGGAGACTCTAACCGTAGGCTTAA
CAAACGGAAAAGAAAAACAACTCCATAATTCCATAATTGGATGTACGGAATCCGTCCAAGGCAGCGTTTTCGGTGGTGCCGACGGCGTCATGGGCTTAGGCACTAGCTCC
TATTCTTTAACCTACAAGGCCGCCGAAAACGCCAACGGCGGTGGCTTCTCTTACTGCCTTGTCGATCATCTCACCGATCAAAGAGCCATCAGTTACTTCGTCCTCGGCAT
TCCTACCCCTTCCACTTCCGCCTCCACCTCCTCCGCCAAACTTCCCGCCAAAATGACCTACACCAAACTCTACGTCGGCGACCCTTACAGCAGCTTCTACGGCGTCGATC
TGATCGGTATCTCCGCCAACGGCATCATGCTCAACATCCCTTCCCGTGTTTGGGACATTAATTCCGGCGGCGGTACCATCATCGATTCCGGTACCAGTCTCACTATACTT
GCGGCTCCTGCTTTCGACATGGTAATGGAAGCTCTAACTCCGAGATTGAAGAAATTTCAGCAACTTGAAATCGAACCTTTCGATTTTTGCTTCAATAACAGCCAGTACAC
TCACGAAATGGCGCCGAAGCTCCGATTCCATTTCGGTGACGGGACGGTGTTTGAGCCGCCGACGAAAAGCTACATTGTGTCGGTGGGGAAATTCATTAGCTGTATTGGGT
TCGTTTCAATGCCTTTTCCGGCGAACAATATCATTGGAAATATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCCAAAAGAGAAGAGTCGGTTTTGCCCCTTCCGAA
TGCATCTAATAAACAAAAAAAAGAAATACAAACTTCATCTAATTGTTTCCATTTTCAAATTTTATTATAATTATAATTATTTTTCCTTTTCTTTTATATATGTTTCTATT
ATTAACTAATACCTCTATCTTATTATTGAAATACATACCATCTCACTTTTCTCTTTT
Protein sequence
MLGYRKPMSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHRHHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMNQK
QVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKQRFRHAFLANHS
SSFKTVSCSSTMCTNDLADLFAVAECHTPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGG
FSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTILAAPAFDMVMEALTPR
LKKFQQLEIEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSECI