CuGenDBv2

IVF0000691 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0000691
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: aspartic proteinase CDR1
Genome location: chr11:23186194..23189431
RNA-Seq Expression: IVF0000691
Synteny: IVF0000691
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
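
The genome location above uses the common `seqid:start..end` notation. As a minimal sketch, the string can be split and the gene span computed as follows. The helper name `parse_location` and the `Location` type are hypothetical, and the span assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (illustrative type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("chr11:23186194..23189431")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under a 0-based, half-open convention the `+ 1` in `length` would be dropped, so the convention should be confirmed before the value is reused.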


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa] (e value: 0.0; % identity: 100)
Query:  MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
        MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
Subjt:  MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS

Query:  DLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLT
        DLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLT
Subjt:  DLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLT

Query:  NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGD
        NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGD
Subjt:  NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGD

Query:  PYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVF
        PYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVF
Subjt:  PYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
        EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
Subjt:  EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
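
In the alignment blocks on this page, the unlabeled middle line follows the usual BLAST pairwise layout: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, a percent-identity figure could be recomputed from that line as shown here. The helper name `percent_identity` is hypothetical, and dividing by the full alignment length (gaps included) is an assumption that may not match how the reported values were derived.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Hypothetical helper: letters count as identities, while '+' and
    spaces (positives, mismatches, gaps) do not.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Last block of the alignment above: query row and its midline.
query = "EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI"
midline = "EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI"
print(percent_identity(midline, len(query)))
```

For a whole hit, the midline segments of all blocks would be concatenated (without the leading indentation) before counting.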

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus] (e value: 0.0; % identity: 88.08)
Query:  MLGYRKPMSPISNFCFFF---LLLFFLSFSSSFLFALGDEANNYNNN----DDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRH
        MLGYRKPMSPISNFCFFF   LL FFLSFSSSFLFALGDE NN+NNN    DDEDEQ+ I+FDLLHRHHPQV+EK++GDMKIQD+ ERMKDIHEHD NRH
Subjt:  MLGYRKPMSPISNFCFFF---LLLFFLSFSSSFLFALGDEANNYNNN----DDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRH

Query:  RSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVN
        RSISKSMNQKQ+EDARLRAEAEAAT+ EVAKSAILPPATSTPIGM+MISGADFGSSEYFV+LKVGTPAQTFMLIADTGSDLTW+KCRYRRCFGNCS NVN
Subjt:  RSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVN

Query:  HKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGN
        HKSKNEKKQRFRHA LAN SS+FKTVSCSSTMCTN+LA+LFAV EC  PTSPCVYDYSY GGASAKGIFAWETLTVGLTNGKEKQL NSIIGCTE VQG+
Subjt:  HKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGN

Query:  VFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLN
        VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLG+PTPSTSASTSSAK PAKM+YTKLYVGDPYSSFYGVDLIGISA+G MLN
Subjt:  VFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLN

Query:  IPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIV
        IP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQ+EIEPF+FCFNNSQYTH+MAPKLRFHFGDGTVFEPPTKSYIVSVG+FISCIG V
Subjt:  IPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIV

Query:  SMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
        SMPFP+ NIIGNILQQNHLWQFDFQKRRVGFA SECI
Subjt:  SMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo] (e value: 0.0; % identity: 100)
Query:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
        MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
Subjt:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM

Query:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
        NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
Subjt:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK

Query:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
        KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
Subjt:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG

Query:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD
        VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD
Subjt:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD

Query:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL
        SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL
Subjt:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL

Query:  NIIGNILQQNHLWQFDFQKRRVGFAASECI
        NIIGNILQQNHLWQFDFQKRRVGFAASECI
Subjt:  NIIGNILQQNHLWQFDFQKRRVGFAASECI

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata] (e value: 1.44e-193; % identity: 53.58)
Query:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
        MLGY  PMSPIS    FF   FFLS   +F    GDE           E   ++ D++HRHHP V EKL G+ +     +R +DIHEHD NR RSIS SM
Subjt:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM

Query:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
           + +                     LP  +S PI +K+ SG DFG++EYFVQ +VGTP Q F+LI DTGSDLTWLKCRYRRC GNC+ + +HKS+ E 
Subjt:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK

Query:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
        K +F H  LAN SS+FK ++C S  C  +L  LFA+ +C  P++PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQL +++IGCTE+       G DG
Subjt:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG

Query:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD
        ++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S  P   M++  L++G P++S+YGV LIGIS DG  LNIPPRVWD
Subjt:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD

Query:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL
          KG GTI+DSGTSL++L  PAFDV ME +  +LK+FQQI  +PF +CFN + Y+H+MAPKLRFHF  G VFEPP KSYIV V + + C+G  S+PFP  
Subjt:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL

Query:  NIIGNILQQNHLWQFDFQKRRVGFAASECI
        NIIGNILQQN LWQFDF  ++VGFA S+CI
Subjt:  NIIGNILQQNHLWQFDFQKRRVGFAASECI

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida] (e value: 8.55e-268; % identity: 69.74)
Query:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
        MLGYRKPMSPIS+FC     LFFL F  S   A GD +++         Q+ ++ DLLHRHHPQVSEKL+GD+K++++++R+KDI EHD+ R+++IS S+
Subjt:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM

Query:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
        N+ ++ D +LR EA    +    K   LPP +STPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS N NHK++NE+
Subjt:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK

Query:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
        K RFR+A LAN SS+FKT+ CSS MCTN+LA+LF++ EC TPTSPC+YDYSY+GGASAKG+FA ETLTVGLTNGKEKQL NSIIGCTE VQG +F GADG
Subjt:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG

Query:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPST--SASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRV
        V+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +A+ SS  P   MS+TKL++GDPYSSFYGVDL+GISADG MLNIPPRV
Subjt:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPST--SASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRV

Query:  WDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFP
        WD   G GTI+DSGTSLT+LA PAFD+VME L  +LK F+ IEIEPF+FCFNNS+YTH+MAPKLRFHFGDGTVF+PP KSYIVSVGE+ISCIG VSMPFP
Subjt:  WDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFP

Query:  SLNIIGNILQQNHLWQFDFQKRRVGFAASECI
        + NIIGNILQQNHLW+FDF    VGFA SEC+
Subjt:  SLNIIGNILQQNHLWQFDFQKRRVGFAASECI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KG92 Peptidase A1 domain-containing protein (e value: 5.9e-273; % identity: 88.08)
Query:  MLGYRKPMSPISNFC---FFFLLLFFLSFSSSFLFALGDEANNYNN----NDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRH
        MLGYRKPMSPISNFC   FFFLL FFLSFSSSFLFALGDE NN+NN    NDDEDEQ+ I+FDLLHRHHPQV+EK++GDMKIQD+ ERMKDIHEHD NRH
Subjt:  MLGYRKPMSPISNFC---FFFLLLFFLSFSSSFLFALGDEANNYNN----NDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRH

Query:  RSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVN
        RSISKSMNQKQ+EDARLRAEAEAAT+ EVAKSAILPPATSTPIGM+MISGADFGSSEYFV+LKVGTPAQTFMLIADTGSDLTW+KCRYRRCFGNCS NVN
Subjt:  RSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVN

Query:  HKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGN
        HKSKNEKKQRFRHA LAN SS+FKTVSCSSTMCTN+LA+LFAV EC  PTSPCVYDYSY GGASAKGIFAWETLTVGLTNGKEKQL NSIIGCTE VQG+
Subjt:  HKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGN

Query:  VFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLN
        VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLG+PTPSTSASTSSAK PAKM+YTKLYVGDPYSSFYGVDLIGISA+G MLN
Subjt:  VFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLN

Query:  IPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIV
        IP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQ+EIEPF+FCFNNSQYTH+MAPKLRFHFGDGTVFEPPTKSYIVSVG+FISCIG V
Subjt:  IPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIV

Query:  SMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
        SMPFP+ NIIGNILQQNHLWQFDFQKRRVGFA SECI
Subjt:  SMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI

A0A1S3C2F3 aspartic proteinase CDR1 (e value: 3.7e-307; % identity: 100)
Query:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
        MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
Subjt:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM

Query:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
        NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
Subjt:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK

Query:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
        KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
Subjt:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG

Query:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD
        VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD
Subjt:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD

Query:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL
        SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL
Subjt:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL

Query:  NIIGNILQQNHLWQFDFQKRRVGFAASECI
        NIIGNILQQNHLWQFDFQKRRVGFAASECI
Subjt:  NIIGNILQQNHLWQFDFQKRRVGFAASECI

A0A5D3B701 Aspartic proteinase CDR1 (e value: 1.0e-264; % identity: 100)
Query:  MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
        MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
Subjt:  MKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS

Query:  DLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLT
        DLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLT
Subjt:  DLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLT

Query:  NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGD
        NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGD
Subjt:  NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGD

Query:  PYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVF
        PYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVF
Subjt:  PYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
        EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI
Subjt:  EPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X2 (e value: 9.5e-154; % identity: 52.96)
Query:  MSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIED
        MSPIS    FF   FFLS   +F      + +         E   ++ D++HRHHP V EKL G+ +     +R +DIHEHD NR RSIS SM   + + 
Subjt:  MSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIED

Query:  ARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHA
                            LP  +S PI +K+ SG DFG++EYFVQ +VGTP Q F+LI DTGSDLTWLKCRYRRC GNC+ + +HKS+ E K +F H 
Subjt:  ARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHA

Query:  LLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTS
         LAN SS+FK ++C S  C  +L  LFA+ +C  P++PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQL +++IGCTE+       G DG++GLGT 
Subjt:  LLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTS

Query:  SYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGT
        ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S  P   M++  L++G P++S+YGV LIGIS DG  LNIPPRVWD  KG GT
Subjt:  SYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGT

Query:  IIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNIL
        I+DSGTSL++L  PAFDV ME +  +LK+FQQI  +PF +CFN + Y+H+MAPKLRFHF  G VFEPP KSYIV V + I C+G  S+PFP  NIIGNIL
Subjt:  IIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNIL

Query:  QQNHLWQFDFQKRRVGFAASECI
        QQN LWQFDF  ++VGFA S+CI
Subjt:  QQNHLWQFDFQKRRVGFAASECI

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X1 (e value: 3.1e-157; % identity: 53.21)
Query:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM
        MLGY  PMSPIS    FF   FFLS   +F      + +         E   ++ D++HRHHP V EKL G+ +     +R +DIHEHD NR RSIS SM
Subjt:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSM

Query:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK
           + +                     LP  +S PI +K+ SG DFG++EYFVQ +VGTP Q F+LI DTGSDLTWLKCRYRRC GNC+ + +HKS+ E 
Subjt:  NQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEK

Query:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG
        K +F H  LAN SS+FK ++C S  C  +L  LFA+ +C  P++PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQL +++IGCTE+       G DG
Subjt:  KQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADG

Query:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD
        ++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S  P   M++  L++G P++S+YGV LIGIS DG  LNIPPRVWD
Subjt:  VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWD

Query:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL
          KG GTI+DSGTSL++L  PAFDV ME +  +LK+FQQI  +PF +CFN + Y+H+MAPKLRFHF  G VFEPP KSYIV V + I C+G  S+PFP  
Subjt:  SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSL

Query:  NIIGNILQQNHLWQFDFQKRRVGFAASECI
        NIIGNILQQN LWQFDF  ++VGFA S+CI
Subjt:  NIIGNILQQNHLWQFDFQKRRVGFAASECI

SwissProt top hits (e value, % identity, alignment)
O04496 Aspartyl protease AED3 (e value: 1.2e-31; % identity: 28.82)
Query:  PPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTN
        P  TS P+     SG       Y V+ K+GTP Q   ++ DT +D  WL C        CSG  N  +          +   N SST+ TVSCS+  CT 
Subjt:  PPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTN

Query:  NLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCL
          A            S C ++ SY G +S       +TLT+         + N   GC     GN      G+MGLG    SL  +   +   G FSYCL
Subjt:  NLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCL

Query:  VDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPP--RVWDSYKGCGTIIDSGTSLTVLATPAFDVV
            +      YF       S S        P  + YT L       S Y V+L G+S     + + P    +D+  G GTIIDSGT +T  A P ++ +
Subjt:  VDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPP--RVWDSYKGCGTIIDSGTSLTVLATPAFDVV

Query:  MEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCI---GIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVG
         +    ++       +  F+ CF  S    ++APK+  H     +  P   + I S    ++C+   GI       LN+I N+ QQN    FD    R+G
Subjt:  MEVLTSRLKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCI---GIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVG

Query:  FAASEC
         A   C
Subjt:  FAASEC

Q6XBF8 Aspartic proteinase CDR1 (e value: 1.2e-31; % identity: 29.87)
Query:  SSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR-YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPC
        S EY + + +GTP    M IADTGSDL W +C     C+       + K+                SST+K VSCSS+ CT     L   A C T  + C
Subjt:  SSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR-YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPC

Query:  VYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVP
         Y  SY   +  KG  A +TLT+G ++ +  QL+N IIGC     G       G++GLG    SL  +  ++ + G FSYCLV   + +   S    G  
Subjt:  VYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVP

Query:  TPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRL-KQFQQIEIEPF
           + +   S    AK S           +FY + L  IS   + +       +S +G   IIDSGT+LT+L T  +  + + + S +  + +Q      
Subjt:  TPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRL-KQFQQIEIEPF

Query:  NFCFNNSQYTHDM-APKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
        + C++    T D+  P +  HF DG   +  + +  V V E + C        PS +I GN+ Q N L  +D   + V F  ++C
Subjt:  NFCFNNSQYTHDM-APKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

Q9LNJ3 Aspartyl protease family protein 2 (e value: 2.8e-30; % identity: 28.04)
Query:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSF--LFALGDE---ANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRS
        M+G RK +  + + CFFFL L   S   SF  LF        A+  +   D D +  +  +       + S  +  ++   D     K   E   +R   
Subjt:  MLGYRKPMSPISNFCFFFLLLFFLSFSSSF--LFALGDE---ANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRS

Query:  ISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR-YRRCFGNCSGNVNH
                Q +  R+++ A  A Q+        P          ++SG   GS EYF +L VGTPA+   ++ DTGSD+ WL+C   RRC+       + 
Subjt:  ISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR-YRRCFGNCSGNVNH

Query:  KSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNV
        +                +S T+ T+ CSS  C          A C+T    C+Y  SY  G+   G F+ ETLT      +  +++   +GC    +G +
Subjt:  KSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNV

Query:  FDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADG-QMLN
        F GA G++GLG    S   +     N   FSYCLVD     +  S  V G    S  A            +T L       +FY V L+GIS  G ++  
Subjt:  FDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADG-QMLN

Query:  IPPRVW--DSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQI-EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSV---GEFI
        +   ++  D     G IIDSGTS+T L  PA+  + +      K  ++  +   F+ CF+ S       P +  HF    V  P T +Y++ V   G+F 
Subjt:  IPPRVW--DSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQI-EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSV---GEFI

Query:  SCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
         C          L+IIGNI QQ     +D    RVGFA   C
Subjt:  SCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 (e value: 2.9e-35; % identity: 29.29)
Query:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR-YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAE
        ++SGA  GS EYF ++ VGTPA+   L+ DTGSD+ W++C     C+       N  S                SST+K+++CS+  C+     L   + 
Subjt:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCR-YRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAE

Query:  CDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAV
        C   ++ C+Y  SY  G+   G  A +T+T G  +GK   + N  +GC    +G +F GA G++GLG    S+T           FSYCLVD  + +   
Subjt:  CDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAV

Query:  SYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVW--DSYKGCGTIIDSGTSLTVLATPAFDVVMEV---LTSR
                  S+S   +S +     +   L       +FY V L G S  G+ + +P  ++  D+    G I+D GT++T L T A++ + +    LT  
Subjt:  SYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVW--DSYKGCGTIIDSGTSLTVLATPAFDVVMEV---LTSR

Query:  LKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
        LK+     I  F+ C++ S  +    P + FHF  G   + P K+Y++ V +  +     +    SL+IIGN+ QQ     +D  K  +G + ++C
Subjt:  LKQFQQIEIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

Q9LTW4 Aspartic proteinase NANA, chloroplast (e value: 2.7e-81; % identity: 37.58)
Query:  ERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWL
        + MKD     +  HR         +IED             +  + +++    ++ +G+KM   SG D+G+++YF +++VGTPA+ F ++ DTGS+LTW+
Subjt:  ERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWL

Query:  KCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEK
         CRYR       G  N           R    A++S +FKTV C +  C  +L  LF++  C TP++PC YDY YA G++A+G+FA ET+TVGLTNG+  
Subjt:  KCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEK

Query:  QLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSF
        +L   +IGC+    G  F GADGV+GL  S +S T   A +  G  FSYCLVDHL+++   +Y + G    S+ ++ ++ +    +  T++        F
Subjt:  QLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSF

Query:  YGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIE--PFNFCFN-NSQYTHDMAPKLRFHFGDGTVFEP
        Y +++IGIS    ML+IP +VWD+  G GTI+DSGTSLT+LA  A+  V+  L   L + ++++ E  P  +CF+  S +     P+L FH   G  FEP
Subjt:  YGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIE--PFNFCFN-NSQYTHDMAPKLRFHFGDGTVFEP

Query:  PTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
          KSY+V     + C+G VS   P+ N+IGNI+QQN+LW+FD     + FA S C
Subjt:  PTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

Arabidopsis top hits (e value, % identity, alignment)
AT2G42980.1 Eukaryotic aspartyl protease family protein (e value: 1.5e-42; % identity: 28.66)
Query:  DMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTG
        D++IQDL  R+K +H       +  ++ + +K   D  L    E +    +A                + SG   GS EYF+ + VGTP + F LI DTG
Subjt:  DMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTG

Query:  SDLTWLKC-RYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVG
        SDL WL+C     CF       + K+                S++FK ++C+   C + ++      +C++    C Y Y Y   ++  G FA ET TV 
Subjt:  SDLTWLKC-RYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVG

Query:  LT----NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYT
        LT       E ++ N + GC    +G +F GA G++GLG    S +    ++  G  FSYCLVD  ++    S  + G                  +++T
Subjt:  LT----NGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYT

Query:  KLYVGDPYS--SFYGVDLIGISADGQMLNIPPRVWD--SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQI--EIEPFNFCFNNS--QYTHDM
            G   S  +FY + +  I   G+ L+IP   W+  S    GTIIDSGT+L+  A PA++++      ++K+   I  +    + CFN S  +  +  
Subjt:  KLYVGDPYS--SFYGVDLIGISADGQMLNIPPRVWD--SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQI--EIEPFNFCFNNS--QYTHDM

Query:  APKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
         P+L   F DGTV+  P ++  + + E + C+ I+  P  + +IIGN  QQN    +D ++ R+GF  ++C
Subjt:  APKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

AT3G12700.1 Eukaryotic aspartyl protease family protein (e value: 1.9e-82; % identity: 37.58)
Query:  ERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWL
        + MKD     +  HR         +IED             +  + +++    ++ +G+KM   SG D+G+++YF +++VGTPA+ F ++ DTGS+LTW+
Subjt:  ERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMI--SGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWL

Query:  KCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEK
         CRYR       G  N           R    A++S +FKTV C +  C  +L  LF++  C TP++PC YDY YA G++A+G+FA ET+TVGLTNG+  
Subjt:  KCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEK

Query:  QLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSF
        +L   +IGC+    G  F GADGV+GL  S +S T   A +  G  FSYCLVDHL+++   +Y + G    S+ ++ ++ +    +  T++        F
Subjt:  QLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSF

Query:  YGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIE--PFNFCFN-NSQYTHDMAPKLRFHFGDGTVFEP
        Y +++IGIS    ML+IP +VWD+  G GTI+DSGTSLT+LA  A+  V+  L   L + ++++ E  P  +CF+  S +     P+L FH   G  FEP
Subjt:  YGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQIEIE--PFNFCFN-NSQYTHDMAPKLRFHFGDGTVFEP

Query:  PTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
          KSY+V     + C+G VS   P+ N+IGNI+QQN+LW+FD     + FA S C
Subjt:  PTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

AT3G25700.1 Eukaryotic aspartyl protease family protein (e value: 7.6e-55; % identity: 34.49)
Query:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMC-TNNLAELFAVAE
        ++SGA  GS +YFV L++G P Q+ +LIADTGSDL W+KC   R   NCS    H S                SSTF    C   +C      +   +  
Subjt:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMC-TNNLAELFAVAE

Query:  CDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGC-----TEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLT
             S C Y+Y YA G+   G+FA ET ++  ++GKE +L++   GC      + V G  F+GA+GVMGLG    S   +      G  FSYCL+D+  
Subjt:  CDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGC-----TEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLT

Query:  DQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVW---DSYKGCGTIIDSGTSLTVLATPAFDVVMEVL
             SY ++G          +     +K+ +T L       +FY V L  +  +G  L I P +W   DS  G GT++DSGT+L  LA PA+  V+  +
Subjt:  DQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVW---DSYKGCGTIIDSGTSLTVLATPAFDVVMEVL

Query:  TSRLKQFQQIEIEP-FNFCFNNSQYT--HDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSM-PFPSLNIIGNILQQNHLWQFDFQKRRVGFAA
          R+K      + P F+ C N S  T    + P+L+F F  G VF PP ++Y +   E I C+ I S+ P    ++IGN++QQ  L++FD  + R+GF+ 
Subjt:  TSRLKQFQQIEIEP-FNFCFNNSQYT--HDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSM-PFPSLNIIGNILQQNHLWQFDFQKRRVGFAA

Query:  SEC
          C
Subjt:  SEC

AT3G59080.1 Eukaryotic aspartyl protease family protein (e value: 9.3e-45; % identity: 28.28)
Query:  EQQTIRFDLLHRHHPQVSEKLNG---DMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGAD
        E +T++F L  R      +       +++I+DL  R++ +       H+ + +  NQ  +   + + + E  T   VA S       +  +   + SG  
Subjt:  EQQTIRFDLLHRHHPQVSEKLNG---DMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGAD

Query:  FGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKC-RYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTS
         GS EYF+ + VG+P + F LI DTGSDL W++C     CF       + K+                S+++K ++C+   C N ++       C +   
Subjt:  FGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKC-RYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTS

Query:  PCVYDYSYAGGASAKGIFAWETLTVGL-TNGKEKQL---RNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSY
         C Y Y Y   ++  G FA ET TV L TNG   +L    N + GC    +G +F GA G++GLG    S +    ++  G  FSYCLVD  +D    S 
Subjt:  PCVYDYSYAGGASAKGIFAWETLTVGL-TNGKEKQL---RNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSY

Query:  FVLGVPTPSTSASTSSAKPPAKMSYTKLYVG--DPYSSFYGVDLIGISADGQMLNIPPRVWD--SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQ
         + G      S           +++T    G  +   +FY V +  I   G++LNIP   W+  S    GTIIDSGT+L+  A PA++ +   +  + K 
Subjt:  FVLGVPTPSTSASTSSAKPPAKMSYTKLYVG--DPYSSFYGVDLIGISADGQMLNIPPRVWD--SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQ

Query:  FQQI--EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
           +  +    + CFN S   +   P+L   F DG V+  PT++  + + E + C+ ++  P  + +IIGN  QQN    +D ++ R+G+A ++C
Subjt:  FQQI--EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC

AT3G59080.2 Eukaryotic aspartyl protease family protein    e value: 2.6e-39    % identity: 27.13
Query:  EQQTIRFDLLHRHHPQVSEKLNG---DMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGAD
        E +T++F L  R      +       +++I+DL  R++ +       H+ + +  NQ  +   + + + E  T   VA S       +  +   + SG  
Subjt:  EQQTIRFDLLHRHHPQVSEKLNG---DMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARLRAEAEAATQVEVAKSAILPPATSTPIGMKMISGAD

Query:  FGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSP
         GS EYF+ + VG+P + F LI DTGSDL W++C    C+     N N                                                    
Subjt:  FGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDTPTSP

Query:  CVYDYSYAGGASAKGIFAWETLTVGL-TNGKEKQL---RNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYF
        C Y Y Y   ++  G FA ET TV L TNG   +L    N + GC    +G +F GA G++GLG    S +    ++  G  FSYCLVD  +D    S  
Subjt:  CVYDYSYAGGASAKGIFAWETLTVGL-TNGKEKQL---RNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYF

Query:  VLGVPTPSTSASTSSAKPPAKMSYTKLYVG--DPYSSFYGVDLIGISADGQMLNIPPRVWD--SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQF
        + G      S           +++T    G  +   +FY V +  I   G++LNIP   W+  S    GTIIDSGT+L+  A PA++ +   +  + K  
Subjt:  VLGVPTPSTSASTSSAKPPAKMSYTKLYVG--DPYSSFYGVDLIGISADGQMLNIPPRVWD--SYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQF

Query:  QQI--EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC
          +  +    + CFN S   +   P+L   F DG V+  PT++  + + E + C+ ++  P  + +IIGN  QQN    +D ++ R+G+A ++C
Subjt:  QQI--EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC


Sequences
CDS sequence
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCAAATTTTTGTTTCTTCTTCCTCCTCCTCTTCTTCTTATCCTTCTCCTCCTCCTTCCTCTTTGCATTGGGCGACGA
AGCCAACAACTACAACAACAATGACGACGAAGACGAACAACAAACAATCAGATTTGATCTACTTCACCGTCACCATCCACAAGTCTCCGAAAAGCTTAACGGTGATATGA
AAATCCAAGATCTTCATGAAAGAATGAAAGACATTCATGAACACGACCGCAATCGTCACCGCTCCATCTCCAAATCCATGAACCAGAAGCAAATTGAGGATGCGAGGTTG
AGGGCAGAGGCAGAGGCGGCGACACAGGTAGAGGTGGCAAAGAGTGCAATACTTCCACCGGCAACGTCGACGCCGATAGGAATGAAAATGATTTCGGGTGCAGATTTTGG
GAGTAGTGAGTATTTCGTACAATTGAAGGTGGGAACACCGGCGCAAACGTTTATGTTGATTGCGGATACAGGGAGCGATTTAACGTGGTTGAAATGTAGATATCGAAGGT
GTTTTGGGAATTGTAGCGGTAACGTGAATCATAAAAGCAAAAATGAAAAGAAACAGAGATTTAGACATGCGTTATTGGCGAATCAGTCGTCTACTTTTAAGACAGTTTCT
TGCAGCTCAACGATGTGTACAAATAATCTTGCGGAATTGTTTGCTGTTGCGGAATGCGACACCCCAACTAGTCCTTGTGTCTATGATTACAGCTACGCAGGAGGTGCAAG
TGCAAAGGGAATATTCGCATGGGAGACTCTAACTGTAGGCTTAACAAACGGAAAAGAAAAACAACTCCGTAATTCCATAATAGGATGTACGGAAATCGTCCAAGGCAACG
TTTTCGATGGAGCCGACGGTGTCATGGGCTTAGGCACTAGCTCCTATTCTTTAACTTACAAAGCCGCAGAAAACGCCAACGGCGGCGGCTTCTCTTACTGCCTTGTCGAT
CATCTCACCGATCAAAGAGCTGTCAGTTATTTCGTCCTCGGCGTTCCTACCCCTTCCACTTCCGCCTCCACTTCCTCTGCTAAACCTCCCGCCAAAATGTCCTACACCAA
ACTCTACGTTGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCATCGGTATCTCCGCCGACGGCCAAATGCTCAACATCCCTCCTCGTGTTTGGGACAGCTATAAAG
GCTGCGGTACCATCATCGATTCCGGTACTAGTCTCACTGTGCTTGCCACTCCTGCTTTCGATGTGGTAATGGAAGTTTTGACTTCGAGATTGAAGCAATTTCAGCAAATT
GAGATCGAACCTTTCAATTTTTGCTTCAATAATAGCCAATACACTCACGACATGGCGCCGAAGCTCCGATTCCATTTCGGTGACGGAACGGTGTTTGAGCCGCCGACCAA
AAGCTACATTGTGTCGGTGGGGGAATTCATTAGTTGTATTGGAATCGTTTCGATGCCTTTTCCGTCCCTCAATATCATTGGAAATATTCTTCAGCAAAATCACCTTTGGC
AATTTGATTTCCAAAAGAGAAGAGTCGGTTTTGCCGCTTCCGAATGCATCTAA
mRNA sequence
CTCTAACATTACACTCTCTGTTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCAAATTTTTGTTTCTTCTTCCTCCTCCTCTTCTTCTTATCCTTCTCCTCC
TCCTTCCTCTTTGCATTGGGCGACGAAGCCAACAACTACAACAACAATGACGACGAAGACGAACAACAAACAATCAGATTTGATCTACTTCACCGTCACCATCCACAAGT
CTCCGAAAAGCTTAACGGTGATATGAAAATCCAAGATCTTCATGAAAGAATGAAAGACATTCATGAACACGACCGCAATCGTCACCGCTCCATCTCCAAATCCATGAACC
AGAAGCAAATTGAGGATGCGAGGTTGAGGGCAGAGGCAGAGGCGGCGACACAGGTAGAGGTGGCAAAGAGTGCAATACTTCCACCGGCAACGTCGACGCCGATAGGAATG
AAAATGATTTCGGGTGCAGATTTTGGGAGTAGTGAGTATTTCGTACAATTGAAGGTGGGAACACCGGCGCAAACGTTTATGTTGATTGCGGATACAGGGAGCGATTTAAC
GTGGTTGAAATGTAGATATCGAAGGTGTTTTGGGAATTGTAGCGGTAACGTGAATCATAAAAGCAAAAATGAAAAGAAACAGAGATTTAGACATGCGTTATTGGCGAATC
AGTCGTCTACTTTTAAGACAGTTTCTTGCAGCTCAACGATGTGTACAAATAATCTTGCGGAATTGTTTGCTGTTGCGGAATGCGACACCCCAACTAGTCCTTGTGTCTAT
GATTACAGCTACGCAGGAGGTGCAAGTGCAAAGGGAATATTCGCATGGGAGACTCTAACTGTAGGCTTAACAAACGGAAAAGAAAAACAACTCCGTAATTCCATAATAGG
ATGTACGGAAATCGTCCAAGGCAACGTTTTCGATGGAGCCGACGGTGTCATGGGCTTAGGCACTAGCTCCTATTCTTTAACTTACAAAGCCGCAGAAAACGCCAACGGCG
GCGGCTTCTCTTACTGCCTTGTCGATCATCTCACCGATCAAAGAGCTGTCAGTTATTTCGTCCTCGGCGTTCCTACCCCTTCCACTTCCGCCTCCACTTCCTCTGCTAAA
CCTCCCGCCAAAATGTCCTACACCAAACTCTACGTTGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCATCGGTATCTCCGCCGACGGCCAAATGCTCAACATCCC
TCCTCGTGTTTGGGACAGCTATAAAGGCTGCGGTACCATCATCGATTCCGGTACTAGTCTCACTGTGCTTGCCACTCCTGCTTTCGATGTGGTAATGGAAGTTTTGACTT
CGAGATTGAAGCAATTTCAGCAAATTGAGATCGAACCTTTCAATTTTTGCTTCAATAATAGCCAATACACTCACGACATGGCGCCGAAGCTCCGATTCCATTTCGGTGAC
GGAACGGTGTTTGAGCCGCCGACCAAAAGCTACATTGTGTCGGTGGGGGAATTCATTAGTTGTATTGGAATCGTTTCGATGCCTTTTCCGTCCCTCAATATCATTGGAAA
TATTCTTCAGCAAAATCACCTTTGGCAATTTGATTTCCAAAAGAGAAGAGTCGGTTTTGCCGCTTCCGAATGCATCTAAAAGAAATAATAATAATACAAACTTCATCATA
ATTAATTATTTTCATTTTCAATTTTTATTATAATTTTTTGTTTCTATTATTAACTAATACATCTATCTTATTATTGAAATATATACCATCTCAAATTTCTCTTTAATTTT
CCTTTTCTATACATACACAATATATGTCTTTTCTTTTTATTTTTTCCTTTTTCATTTTGGAATAACTTGGAGAGTGGAGATTTTTTTTTAATTTTCTTTCTTGTGTAAAG
AGTTGAAGAAGAAGATGATTGTATCTTGAATAATATTTACTAGAAAAGATAGAAAGAAGAGATATTCTTTGGTTCCTC
Protein sequence
MLGYRKPMSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDEDEQQTIRFDLLHRHHPQVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMNQKQIEDARL
RAEAEAATQVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYRRCFGNCSGNVNHKSKNEKKQRFRHALLANQSSTFKTVS
CSSTMCTNNLAELFAVAECDTPTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCTEIVQGNVFDGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVD
HLTDQRAVSYFVLGVPTPSTSASTSSAKPPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVLATPAFDVVMEVLTSRLKQFQQI
EIEPFNFCFNNSQYTHDMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASECI