; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0012854 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0012854
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionaspartic proteinase CDR1
Genome locationchr11:8284767..8287016
RNA-Seq ExpressionPI0012854
SyntenyPI0012854
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa]6.7e-22989.04Show/hide
Query:  MKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
        MKIQDL+ER+KDIHEHD  RHRSISK+MNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
Subjt:  MKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS

Query:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLT
        DLTW+KCRYRRCFGNCS NVNHKSKNEKK RFR+A LAN SS+FKTVSCSSTMCTN+LA+LFA+AECDTP SPCVYDYSYAGGASAKGIFA ETLTVGLT
Subjt:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLT

Query:  NGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGD
        NGKEKQLRNS+IGCTE VQG+VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLGVP+PSTSA+TSSAK P KMSYTKLYVGD
Subjt:  NGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGD

Query:  PYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVF
        PYSSFYGVDLIGISA+G  LNIP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQIEI+PF+FCFNNSQYTH+MAPKLRFHFGDGTVF
Subjt:  PYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFQR
        EPPTKSYIVSVGEFISCIG VSMPFP+ NIIGNILQQNHLWQFDFQ+
Subjt:  EPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFQR

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus]1.1e-27691.44Show/hide
Query:  MLGYRKPMSPISNFCFFFF---LLFFFSFSSSFLFALGDEDNNFNNN---NDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRH
        MLGYRKPMSPISNFCFFFF   L FF SFSSSFLFALGDEDNNFNNN   NDDEDEQ+ IK DL HRHHPQV+EK+HGDMKIQD++ER+KDIHEHDH RH
Subjt:  MLGYRKPMSPISNFCFFFF---LLFFFSFSSSFLFALGDEDNNFNNN---NDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRH

Query:  RSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSISK+MNQKQVEDARLRAEAEAATE EVAKSAILPPATSTPIGM+MISGADFGSSEYFV+LKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
Subjt:  RSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGS
        HKSKNEKK RFR+AFLANHSSSFKTVSCSSTMCTNDLADLFA+ EC  P SPCVYDYSY GGASAKGIFA ETLTVGLTNGKEKQL NS+IGCTESVQGS
Subjt:  HKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLN
        VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLG+P+PSTSA+TSSAKLP KM+YTKLYVGDPYSSFYGVDLIGISANGI LN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLN

Query:  IPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFV
        IPSRVWDINSGGGTIIDSGTSLT+LAAPAFDMVMEALTPRLKKFQQ+EI+PFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVG+FISCIGFV
Subjt:  IPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFV

Query:  SMPFPATNIIGNILQQNHLWQFDFQR
        SMPFPA NIIGNILQQNHLWQFDFQ+
Subjt:  SMPFPATNIIGNILQQNHLWQFDFQR

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo]9.9e-26588.85Show/hide
Query:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA
        MLGYRKPMSPISNFCFFF LLFF SFSSSFLFALGDE NN+ NNNDDEDEQQTI+ DL HRHHPQVSEKL+GDMKIQDL+ER+KDIHEHD  RHRSISK+
Subjt:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA

Query:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE
        MNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTW+KCRYRRCFGNCS NVNHKSKNE
Subjt:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE

Query:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD
        KK RFR+A LAN SS+FKTVSCSSTMCTN+LA+LFA+AECDTP SPCVYDYSYAGGASAKGIFA ETLTVGLTNGKEKQLRNS+IGCTE VQG+VF GAD
Subjt:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD

Query:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW
        GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLGVP+PSTSA+TSSAK P KMSYTKLYVGDPYSSFYGVDLIGISA+G  LNIP RVW
Subjt:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW

Query:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA
        D   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQIEI+PF+FCFNNSQYTH+MAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIG VSMPFP+
Subjt:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA

Query:  TNIIGNILQQNHLWQFDFQR
         NIIGNILQQNHLWQFDFQ+
Subjt:  TNIIGNILQQNHLWQFDFQR

XP_022943788.1 aspartic proteinase NANA, chloroplast-like isoform X1 [Cucurbita moschata]1.5e-15655.41Show/hide
Query:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA
        MLGY  PMSPIS    FFF  FF S   +F       D +         E   +KLD+ HRHHP V EKL+G+ +     +R +DIHEHDH R RSIS +
Subjt:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA

Query:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE
        M   + +                     LP  +S PI +K+ SG DFG++EYFVQ +VGTP Q F+LI DTGSDLTW+KCRYRRC GNC+++ +HKS+ E
Subjt:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE

Query:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD
         K++F + FLANHSSSFK ++C S  C  DL  LFAI +C  P +PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQL +++IGCTE        G D
Subjt:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD

Query:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW
        G++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S    G M++  L++G P++S+YGV LIGIS +G+TLNIP RVW
Subjt:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW

Query:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA
        DI  GGGTI+DSGTSL+ML APAFD+ MEA+  +LKKFQQI  DPF +CFN + Y+HEMAPKLRFHF  G VFEPP KSYIV V + I C+GF S+PFP 
Subjt:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA

Query:  TNIIGNILQQNHLWQFDF
        TNIIGNILQQN LWQFDF
Subjt:  TNIIGNILQQNHLWQFDF

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida]6.7e-22174.23Show/hide
Query:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA
        MLGYRKPMSPIS+FC  FFL FF S   +F               D   +Q+ +KLDL HRHHPQVSEKLHGD+K++++N+R+KDI EHD KR+++IS +
Subjt:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA

Query:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE
        +N+ ++ D +LR EA    E    K   LPP +STPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTWMKCRYRRC GNCSSN NHK++NE
Subjt:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE

Query:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD
        +K+RFRNAFLAN+SSSFKT+ CSS MCTNDLADLF+I EC TP SPC+YDYSY+GGASAKG+FA+ETLTVGLTNGKEKQL NS+IGCTESVQG +FGGAD
Subjt:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD

Query:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPST--SAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSR
        GV+GLGTSSYS TYKAAENANGGGF+YCLVDHL+D+ A SYF+LG P  ST  +AA SS    G MS+TKL++GDPYSSFYGVDL+GISA+G+ LNIP R
Subjt:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPST--SAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSR

Query:  VWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPF
        VWDINSGGGTI+DSGTSLTMLAAPAFDMVMEAL P+LK F+ IEI+PFDFCFNNS+YTHEMAPKLRFHFGDGTVF+PP KSYIVSVGE+ISCIGFVSMPF
Subjt:  VWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPF

Query:  PATNIIGNILQQNHLWQFDF
        PATNIIGNILQQNHLW+FDF
Subjt:  PATNIIGNILQQNHLWQFDF

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein5.5e-27791.44Show/hide
Query:  MLGYRKPMSPISNFCFFFF---LLFFFSFSSSFLFALGDEDNNFNNN---NDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRH
        MLGYRKPMSPISNFCFFFF   L FF SFSSSFLFALGDEDNNFNNN   NDDEDEQ+ IK DL HRHHPQV+EK+HGDMKIQD++ER+KDIHEHDH RH
Subjt:  MLGYRKPMSPISNFCFFFF---LLFFFSFSSSFLFALGDEDNNFNNN---NDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRH

Query:  RSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
        RSISK+MNQKQVEDARLRAEAEAATE EVAKSAILPPATSTPIGM+MISGADFGSSEYFV+LKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
Subjt:  RSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGS
        HKSKNEKK RFR+AFLANHSSSFKTVSCSSTMCTNDLADLFA+ EC  P SPCVYDYSY GGASAKGIFA ETLTVGLTNGKEKQL NS+IGCTESVQGS
Subjt:  HKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLN
        VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLG+P+PSTSA+TSSAKLP KM+YTKLYVGDPYSSFYGVDLIGISANGI LN
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLN

Query:  IPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFV
        IPSRVWDINSGGGTIIDSGTSLT+LAAPAFDMVMEALTPRLKKFQQ+EI+PFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVG+FISCIGFV
Subjt:  IPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFV

Query:  SMPFPATNIIGNILQQNHLWQFDFQR
        SMPFPA NIIGNILQQNHLWQFDFQ+
Subjt:  SMPFPATNIIGNILQQNHLWQFDFQR

A0A1S3C2F3 aspartic proteinase CDR14.8e-26588.85Show/hide
Query:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA
        MLGYRKPMSPISNFCFFF LLFF SFSSSFLFALGDE NN+ NNNDDEDEQQTI+ DL HRHHPQVSEKL+GDMKIQDL+ER+KDIHEHD  RHRSISK+
Subjt:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA

Query:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE
        MNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTW+KCRYRRCFGNCS NVNHKSKNE
Subjt:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE

Query:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD
        KK RFR+A LAN SS+FKTVSCSSTMCTN+LA+LFA+AECDTP SPCVYDYSYAGGASAKGIFA ETLTVGLTNGKEKQLRNS+IGCTE VQG+VF GAD
Subjt:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD

Query:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW
        GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLGVP+PSTSA+TSSAK P KMSYTKLYVGDPYSSFYGVDLIGISA+G  LNIP RVW
Subjt:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW

Query:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA
        D   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQIEI+PF+FCFNNSQYTH+MAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIG VSMPFP+
Subjt:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA

Query:  TNIIGNILQQNHLWQFDFQR
         NIIGNILQQNHLWQFDFQ+
Subjt:  TNIIGNILQQNHLWQFDFQR

A0A5D3B701 Aspartic proteinase CDR13.3e-22989.04Show/hide
Query:  MKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
        MKIQDL+ER+KDIHEHD  RHRSISK+MNQKQ+EDARLRAEAEAAT+VEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS
Subjt:  MKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGS

Query:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLT
        DLTW+KCRYRRCFGNCS NVNHKSKNEKK RFR+A LAN SS+FKTVSCSSTMCTN+LA+LFA+AECDTP SPCVYDYSYAGGASAKGIFA ETLTVGLT
Subjt:  DLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLT

Query:  NGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGD
        NGKEKQLRNS+IGCTE VQG+VF GADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRA+SYFVLGVP+PSTSA+TSSAK P KMSYTKLYVGD
Subjt:  NGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGD

Query:  PYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVF
        PYSSFYGVDLIGISA+G  LNIP RVWD   G GTIIDSGTSLT+LA PAFD+VME LT RLK+FQQIEI+PF+FCFNNSQYTH+MAPKLRFHFGDGTVF
Subjt:  PYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVF

Query:  EPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFQR
        EPPTKSYIVSVGEFISCIG VSMPFP+ NIIGNILQQNHLWQFDFQ+
Subjt:  EPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFQR

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X21.7e-15355.19Show/hide
Query:  MSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVE
        MSPIS    FFF  FF S   +F       D +         E   +KLD+ HRHHP V EKL+G+ +     +R +DIHEHDH R RSIS +M   + +
Subjt:  MSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVE

Query:  DARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRN
                             LP  +S PI +K+ SG DFG++EYFVQ +VGTP Q F+LI DTGSDLTW+KCRYRRC GNC+++ +HKS+ E K++F +
Subjt:  DARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRN

Query:  AFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGT
         FLANHSSSFK ++C S  C  DL  LFAI +C  P +PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQL +++IGCTE        G DG++GLGT
Subjt:  AFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGT

Query:  SSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGG
         ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S    G M++  L++G P++S+YGV LIGIS +G+TLNIP RVWDI  GGG
Subjt:  SSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGG

Query:  TIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNI
        TI+DSGTSL+ML APAFD+ MEA+  +LKKFQQI  DPF +CFN + Y+HEMAPKLRFHF  G VFEPP KSYIV V + I C+GF S+PFP TNIIGNI
Subjt:  TIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNI

Query:  LQQNHLWQFDF
        LQQN LWQFDF
Subjt:  LQQNHLWQFDF

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X17.4e-15755.41Show/hide
Query:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA
        MLGY  PMSPIS    FFF  FF S   +F       D +         E   +KLD+ HRHHP V EKL+G+ +     +R +DIHEHDH R RSIS +
Subjt:  MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKA

Query:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE
        M   + +                     LP  +S PI +K+ SG DFG++EYFVQ +VGTP Q F+LI DTGSDLTW+KCRYRRC GNC+++ +HKS+ E
Subjt:  MNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNE

Query:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD
         K++F + FLANHSSSFK ++C S  C  DL  LFAI +C  P +PCVYDYSY GG +A G+FA ET+TVGLTNGKEKQL +++IGCTE        G D
Subjt:  KKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGAD

Query:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW
        G++GLGT ++S  ++AA + NGGGFSYCL+DHL+   A SYF+LG P     A   S    G M++  L++G P++S+YGV LIGIS +G+TLNIP RVW
Subjt:  GVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVW

Query:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA
        DI  GGGTI+DSGTSL+ML APAFD+ MEA+  +LKKFQQI  DPF +CFN + Y+HEMAPKLRFHF  G VFEPP KSYIV V + I C+GF S+PFP 
Subjt:  DINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPA

Query:  TNIIGNILQQNHLWQFDF
        TNIIGNILQQN LWQFDF
Subjt:  TNIIGNILQQNHLWQFDF

SwissProt top hitse value%identityAlignment
Q766C3 Aspartic proteinase nepenthesin-11.3e-2829.79Show/hide
Query:  GSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISP
        G  EY + L +GTPAQ F  I DTGSDL W +C+   +CF   +   N +                 SSSF T+ CSS +C        A++      + 
Subjt:  GSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISP

Query:  CVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGV
        C Y Y Y  G+  +G    ETLT G  +     + N   GC E+ QG   G   G++G+G    SL        +   FSYC+          S  +LG 
Subjt:  CVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGV

Query:  PSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINSG---GGTIIDSGTSLTMLAAPAFDMVMEALTPRLK-KFQQIE
         + S +A + +         T L       +FY + L G+S     L I    + +NS    GG IIDSGT+LT     A+  V +    ++        
Subjt:  PSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINSG---GGTIIDSGTSLTMLAAPAFDMVMEALTPRLK-KFQQIE

Query:  IDPFDFCFNN-SQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFD
           FD CF   S  ++   P    HF DG   E P+++Y +S    + C+   S      +I GNI QQN L  +D
Subjt:  IDPFDFCFNN-SQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFD

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 28.8e-3026.84Show/hide
Query:  FFFFLLFFFSFSSSFLFALGDED---------------NNFNNNNDDEDEQQTIKLDLFHR-HHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISK
        FFFFL      SSS   +  D                  +FNN +  ++      L L HR   P V+ +                   H H+ H     
Subjt:  FFFFLLFFFSFSSSFLFALGDED---------------NNFNNNNDDEDEQQTIKLDLFHR-HHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISK

Query:  AMNQKQVEDARLRAEAEAATEV-EVAKSAILPPATS----TPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN
                 AR+R + +  + +       ++P + S       G  ++SG D GS EYFV++ VG+P +   ++ D+GSD+ W++C+  +     S  V 
Subjt:  AMNQKQVEDARLRAEAEAATEV-EVAKSAILPPATS----TPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVN

Query:  HKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGS
          +K               S S+  VSC S++C     D    + C +    C Y+  Y  G+  KG  ALETLT   T      +RN  +GC    +G 
Subjt:  HKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGS

Query:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLN
        +F GA G++G+G  S S   + +    GG F YCLV   TD              + S       LP   S+  L       SFY V L G+   G+ + 
Subjt:  VFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLN

Query:  IPSRVWDI--NSGGGTIIDSGTSLTML---AAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEF-I
        +P  V+D+     GG ++D+GT++T L   A  AF    ++ T  L +   + I  FD C++ S +     P + F+F +G V   P +++++ V +   
Subjt:  IPSRVWDI--NSGGGTIIDSGTSLTML---AAPAFDMVMEALTPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEF-I

Query:  SCIGFVSMPFPATNIIGNILQQNHLWQFD
         C  F + P    +IIGNI Q+     FD
Subjt:  SCIGFVSMPFPATNIIGNILQQNHLWQFD

Q9LNJ3 Aspartyl protease family protein 26.5e-3330.13Show/hide
Query:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAE
        ++SG   GS EYF +L VGTPA+   ++ DTGSD+ W++C   RRC+                      F    S ++ T+ CSS  C          A 
Subjt:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAE

Query:  CDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAI
        C+T    C+Y  SY  G+   G F+ ETLT      +  +++   +GC    +G +F GA G++GLG    S   +     N   FSYCLVD        
Subjt:  CDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAI

Query:  SYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITL-NIPSRVWDIN--SGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLK
                S  +S    +A +     +T L       +FY V L+GIS  G  +  + + ++ ++    GG IIDSGTS+T L  PA+  + +A     K
Subjt:  SYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITL-NIPSRVWDIN--SGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLK

Query:  KFQQI-EIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV---GEFISCIGFVSMPFPATNIIGNILQQNHLWQFD
          ++  +   FD CF+ S       P +  HF    V  P T +Y++ V   G+F  C  F        +IIGNI QQ     +D
Subjt:  KFQQI-EIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSV---GEFISCIGFVSMPFPATNIIGNILQQNHLWQFD

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 15.3e-3530.05Show/hide
Query:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAEC
        ++SGA  GS EYF ++ VGTPA+   L+ DTGSD+ W++C        C+                  F    SS++K+++CS+  C+     L   + C
Subjt:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAEC

Query:  DTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAIS
         +  + C+Y  SY  G+   G  A +T+T G  +GK   + N  +GC    +G +F GA G++GLG    S+T           FSYCLVD  + +    
Subjt:  DTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAIS

Query:  YFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINS--GGGTIIDSGTSLTMLAAPAFDMVMEA---LTPRL
                 S+S   +S +L G  +   L       +FY V L G S  G  + +P  ++D+++   GG I+D GT++T L   A++ + +A   LT  L
Subjt:  YFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINS--GGGTIIDSGTSLTMLAAPAFDMVMEA---LTPRL

Query:  KKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFQR
        KK     I  FD C++ S  +    P + FHF  G   + P K+Y++ V +    C  F      + +IIGN+ QQ     +D  +
Subjt:  KKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFQR

Q9LTW4 Aspartic proteinase NANA, chloroplast2.3e-8338.89Show/hide
Query:  HGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIAD
        H D  +     R++D+   D KRH  IS+  N                              ++  + M + SG D+G+++YF +++VGTPA+ F ++ D
Subjt:  HGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIAD

Query:  TGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTV
        TGS+LTW+ CRYR            + K+      R  F A+ S SFKTV C +  C  DL +LF++  C TP +PC YDY YA G++A+G+FA ET+TV
Subjt:  TGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTV

Query:  GLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLY
        GLTNG+  +L   +IGC+ S  G  F GADGV+GL  S +S T   A +  G  FSYCLVDHL+++   +Y + G    S+ +  ++ +    +  T++ 
Subjt:  GLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLY

Query:  VGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEID--PFDFCFN-NSQYTHEMAPKLRFHF
               FY +++IGIS     L+IPS+VWD  SGGGTI+DSGTSLT+LA  A+  V+  L   L + ++++ +  P ++CF+  S +     P+L FH 
Subjt:  VGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEID--PFDFCFN-NSQYTHEMAPKLRFHF

Query:  GDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFD
          G  FEP  KSY+V     + C+GFVS   PATN+IGNI+QQN+LW+FD
Subjt:  GDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFD

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein3.3e-4028.29Show/hide
Query:  SFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVA
        S S+   F+  + D +  +  +    Q  IK +     H  V      D++IQDL  R+K +H   +K  +  ++ + +K   D  L    E +    +A
Subjt:  SFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVA

Query:  KSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKC-RYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCS
                        + SG   GS EYF+ + VGTP + F LI DTGSDL W++C     CF       + K+                S+SFK ++C+
Subjt:  KSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKC-RYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCS

Query:  STMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLT----NGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAEN
           C+  ++      +C++    C Y Y Y   ++  G FA+ET TV LT       E ++ N + GC    +G +F GA G++GLG    S +    ++
Subjt:  STMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLT----NGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAEN

Query:  ANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYS--SFYGVDLIGISANGITLNIPSRVWDINS--GGGTIIDSGTS
          G  FSYCLVD  ++    S  + G      +           +++T    G   S  +FY + +  I   G  L+IP   W+I+S   GGTIIDSGT+
Subjt:  ANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYS--SFYGVDLIGISANGITLNIPSRVWDINS--GGGTIIDSGTS

Query:  LTMLAAPAFDMVMEALTPRLKKFQQI--EIDPFDFCFNNS--QYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQN
        L+  A PA++++      ++K+   I  +    D CFN S  +  +   P+L   F DGTV+  P ++  + + E + C+  +  P    +IIGN  QQN
Subjt:  LTMLAAPAFDMVMEALTPRLKKFQQI--EIDPFDFCFNNS--QYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQN

Query:  HLWQFDFQR
            +D +R
Subjt:  HLWQFDFQR

AT3G12700.1 Eukaryotic aspartyl protease family protein1.7e-8438.89Show/hide
Query:  HGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIAD
        H D  +     R++D+   D KRH  IS+  N                              ++  + M + SG D+G+++YF +++VGTPA+ F ++ D
Subjt:  HGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIAD

Query:  TGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTV
        TGS+LTW+ CRYR            + K+      R  F A+ S SFKTV C +  C  DL +LF++  C TP +PC YDY YA G++A+G+FA ET+TV
Subjt:  TGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTV

Query:  GLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLY
        GLTNG+  +L   +IGC+ S  G  F GADGV+GL  S +S T   A +  G  FSYCLVDHL+++   +Y + G    S+ +  ++ +    +  T++ 
Subjt:  GLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLY

Query:  VGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEID--PFDFCFN-NSQYTHEMAPKLRFHF
               FY +++IGIS     L+IPS+VWD  SGGGTI+DSGTSLT+LA  A+  V+  L   L + ++++ +  P ++CF+  S +     P+L FH 
Subjt:  VGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQIEID--PFDFCFN-NSQYTHEMAPKLRFHF

Query:  GDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFD
          G  FEP  KSY+V     + C+GFVS   PATN+IGNI+QQN+LW+FD
Subjt:  GDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFD

AT3G25700.1 Eukaryotic aspartyl protease family protein4.3e-5635.97Show/hide
Query:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMC-TNDLADLFAIAE
        ++SGA  GS +YFV L++G P Q+ +LIADTGSDL W+KC   R   NCS    H S           F   HSS+F    C   +C      D   I  
Subjt:  MISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMC-TNDLADLFAIAE

Query:  CDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGC-----TESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLT
             S C Y+Y YA G+   G+FA ET ++  ++GKE +L++   GC      +SV G+ F GA+GVMGLG    S   +      G  FSYCL+D+  
Subjt:  CDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGC-----TESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLT

Query:  DQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDI--NSGGGTIIDSGTSLTMLAAPAFDMVMEALT
             SY ++G      S          K+ +T L       +FY V L  +  NG  L I   +W+I  +  GGT++DSGT+L  LA PA+  V+ A+ 
Subjt:  DQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDI--NSGGGTIIDSGTSLTMLAAPAFDMVMEALT

Query:  PRLKKFQQIEIDP-FDFCFNNSQYT--HEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSM-PFPATNIIGNILQQNHLWQFDFQR
         R+K      + P FD C N S  T   ++ P+L+F F  G VF PP ++Y +   E I C+   S+ P    ++IGN++QQ  L++FD  R
Subjt:  PRLKKFQQIEIDP-FDFCFNNSQYT--HEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSM-PFPATNIIGNILQQNHLWQFDFQR

AT3G59080.1 Eukaryotic aspartyl protease family protein5.1e-4129.1Show/hide
Query:  FFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHG---DMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAAT
        F S SSS     G          +   E +T+K  L  R      +       +++I+DL  R++ +       H+ + +  NQ  V   + + + E  T
Subjt:  FFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHG---DMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDARLRAEAEAAT

Query:  EVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKC-RYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFK
           VA S       +  +   + SG   GS EYF+ + VG+P + F LI DTGSDL W++C     CF    +  + K+                S+S+K
Subjt:  EVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKC-RYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFK

Query:  TVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGL-TNGKEKQL---RNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTY
         ++C+   C N ++       C +    C Y Y Y   ++  G FA+ET TV L TNG   +L    N + GC    +G +F GA G++GLG    S + 
Subjt:  TVSCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGL-TNGKEKQL---RNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTY

Query:  KAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVG--DPYSSFYGVDLIGISANGITLNIPSRVWDINS--GGGTII
           ++  G  FSYCLVD  +D    S  + G      S           +++T    G  +   +FY V +  I   G  LNIP   W+I+S   GGTII
Subjt:  KAAENANGGGFSYCLVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVG--DPYSSFYGVDLIGISANGITLNIPSRVWDINS--GGGTII

Query:  DSGTSLTMLAAPAFDMVMEALTPRLKKFQQI--EIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNIL
        DSGT+L+  A PA++ +   +  + K    +  +    D CFN S   +   P+L   F DG V+  PT++  + + E + C+  +  P  A +IIGN  
Subjt:  DSGTSLTMLAAPAFDMVMEALTPRLKKFQQI--EIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNIL

Query:  QQNHLWQFDFQR
        QQN    +D +R
Subjt:  QQNHLWQFDFQR

AT3G61820.1 Eukaryotic aspartyl protease family protein3.4e-3731.57Show/hide
Query:  PATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTN
        P T+      +ISG   GS EYF++L VGTPA    ++ DTGSD+ W++C   + C+    +  + K                 S +F TV C S +C  
Subjt:  PATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCR-YRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTVSCSSTMCTN

Query:  DLADLFAIAECDTPIS-PCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYC
            L   +EC T  S  C+Y  SY  G+  +G F+ ETLT         ++ +  +GC    +G +F GA G++GLG    S      +N   G FSYC
Subjt:  DLADLFAIAECDTPIS-PCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYC

Query:  LVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSR---VWDINSGGGTIIDSGTSLTMLAAPAFD
        LVD  +   +          P ++    +A +P    +T L       +FY + L+GIS  G  +   S      D    GG IIDSGTS+T L  PA+ 
Subjt:  LVDHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSR---VWDINSGGGTIIDSGTSLTMLAAPAFD

Query:  MVMEAL---TPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVG-EFISCIGFVSMPFPATNIIGNILQQNHLWQFD
         + +A      +LK+     +  FD CF+ S  T    P + FHFG G V   P  +Y++ V  E   C  F      + +IIGNI QQ     +D
Subjt:  MVMEAL---TPRLKKFQQIEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVG-EFISCIGFVSMPFPATNIIGNILQQNHLWQFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCAAATTTTTGTTTCTTCTTCTTCCTCCTCTTCTTCTTCTCCTTCTCCTCCTCCTTCCTCTTTGCATTGGGTGACGA
AGACAACAACTTCAACAACAATAATGATGACGAAGATGAACAACAAACAATCAAATTGGATCTATTTCACCGTCACCATCCACAAGTCTCGGAAAAGCTTCATGGTGATA
TGAAAATCCAAGATCTAAACGAGCGAGTAAAAGACATTCACGAACACGACCACAAACGTCACCGCTCGATCTCGAAAGCGATGAACCAGAAGCAAGTTGAGGATGCGAGA
TTGAGGGCAGAGGCAGAGGCGGCGACAGAGGTAGAGGTAGCAAAGAGTGCAATACTTCCACCGGCAACGTCGACGCCGATAGGAATGAAAATGATTTCGGGTGCGGATTT
CGGGAGTAGTGAGTATTTCGTACAATTGAAAGTGGGAACGCCGGCGCAAACGTTTATGTTGATTGCGGATACAGGGAGTGATTTAACGTGGATGAAATGTAGATATCGGA
GGTGTTTTGGGAATTGTAGCAGTAACGTGAATCATAAAAGCAAAAATGAAAAGAAAATGAGATTTAGAAATGCGTTTTTGGCGAATCATTCGTCTTCTTTTAAGACGGTT
TCTTGCAGCTCAACGATGTGTACCAATGATCTTGCGGATTTGTTTGCTATTGCGGAATGCGATACCCCAATCAGTCCTTGTGTCTATGATTACAGCTACGCTGGAGGAGC
AAGTGCAAAGGGAATATTCGCATTGGAGACTCTAACCGTAGGCTTAACAAACGGAAAAGAAAAACAACTCCGTAATTCTGTAATAGGCTGTACGGAATCCGTCCAAGGCA
GCGTTTTCGGTGGAGCCGACGGCGTCATGGGCTTAGGCACTAGCTCCTATTCTTTAACTTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCTTACTGCCTTGTC
GATCATCTCACCGATCAAAGAGCCATCAGTTACTTCGTCCTCGGCGTTCCTTCCCCTTCCACTTCCGCTGCCACCTCCTCCGCCAAGCTTCCCGGCAAAATGTCCTACAC
CAAACTCTACGTAGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCATCGGCATCTCCGCCAACGGCATCACGCTCAACATCCCTTCCCGTGTTTGGGACATTAATT
CCGGCGGCGGTACTATCATCGATTCCGGTACTAGTCTCACTATGCTTGCCGCCCCTGCTTTCGATATGGTAATGGAAGCTCTGACTCCGAGATTGAAGAAATTTCAGCAA
ATTGAGATCGATCCTTTCGATTTTTGCTTCAATAATAGCCAATACACTCACGAAATGGCGCCGAAGCTCCGATTCCATTTCGGTGACGGTACGGTGTTTGAGCCGCCGAC
GAAAAGCTACATTGTGTCGGTGGGGGAATTCATTAGCTGTATTGGATTCGTTTCAATGCCTTTTCCGGCGACCAATATCATTGGGAACATTCTTCAGCAAAATCACCTTT
GGCAATTTGATTTCCAAAGAGAAGAGTCGGTTTTGCCCCTTCCGAATGCATCTAAAAAAAAAAAAAACAAACTTCTCATCATAATTAATTATTTTCATTTTCAATTATTT
TTATTATTATTATTTTTTCTTTTATGTTTCTATTATTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAACCGATTAATAGGCAATCTAAATTTAACCATTTTCAGTTAAATTTCTTAGTTTTTAAGTCCGCCATTATCATCTTTCTTCTTGCTCCATCTCTCTAACATTACAT
TCTCTGTTTTGTATGTTAGGTTACAGGAAGCCAATGTCGCCTATTTCAAATTTTTGTTTCTTCTTCTTCCTCCTCTTCTTCTTCTCCTTCTCCTCCTCCTTCCTCTTTGC
ATTGGGTGACGAAGACAACAACTTCAACAACAATAATGATGACGAAGATGAACAACAAACAATCAAATTGGATCTATTTCACCGTCACCATCCACAAGTCTCGGAAAAGC
TTCATGGTGATATGAAAATCCAAGATCTAAACGAGCGAGTAAAAGACATTCACGAACACGACCACAAACGTCACCGCTCGATCTCGAAAGCGATGAACCAGAAGCAAGTT
GAGGATGCGAGATTGAGGGCAGAGGCAGAGGCGGCGACAGAGGTAGAGGTAGCAAAGAGTGCAATACTTCCACCGGCAACGTCGACGCCGATAGGAATGAAAATGATTTC
GGGTGCGGATTTCGGGAGTAGTGAGTATTTCGTACAATTGAAAGTGGGAACGCCGGCGCAAACGTTTATGTTGATTGCGGATACAGGGAGTGATTTAACGTGGATGAAAT
GTAGATATCGGAGGTGTTTTGGGAATTGTAGCAGTAACGTGAATCATAAAAGCAAAAATGAAAAGAAAATGAGATTTAGAAATGCGTTTTTGGCGAATCATTCGTCTTCT
TTTAAGACGGTTTCTTGCAGCTCAACGATGTGTACCAATGATCTTGCGGATTTGTTTGCTATTGCGGAATGCGATACCCCAATCAGTCCTTGTGTCTATGATTACAGCTA
CGCTGGAGGAGCAAGTGCAAAGGGAATATTCGCATTGGAGACTCTAACCGTAGGCTTAACAAACGGAAAAGAAAAACAACTCCGTAATTCTGTAATAGGCTGTACGGAAT
CCGTCCAAGGCAGCGTTTTCGGTGGAGCCGACGGCGTCATGGGCTTAGGCACTAGCTCCTATTCTTTAACTTACAAAGCCGCCGAAAACGCCAACGGCGGCGGCTTCTCT
TACTGCCTTGTCGATCATCTCACCGATCAAAGAGCCATCAGTTACTTCGTCCTCGGCGTTCCTTCCCCTTCCACTTCCGCTGCCACCTCCTCCGCCAAGCTTCCCGGCAA
AATGTCCTACACCAAACTCTACGTAGGCGACCCTTACAGCAGCTTCTACGGCGTCGATCTCATCGGCATCTCCGCCAACGGCATCACGCTCAACATCCCTTCCCGTGTTT
GGGACATTAATTCCGGCGGCGGTACTATCATCGATTCCGGTACTAGTCTCACTATGCTTGCCGCCCCTGCTTTCGATATGGTAATGGAAGCTCTGACTCCGAGATTGAAG
AAATTTCAGCAAATTGAGATCGATCCTTTCGATTTTTGCTTCAATAATAGCCAATACACTCACGAAATGGCGCCGAAGCTCCGATTCCATTTCGGTGACGGTACGGTGTT
TGAGCCGCCGACGAAAAGCTACATTGTGTCGGTGGGGGAATTCATTAGCTGTATTGGATTCGTTTCAATGCCTTTTCCGGCGACCAATATCATTGGGAACATTCTTCAGC
AAAATCACCTTTGGCAATTTGATTTCCAAAGAGAAGAGTCGGTTTTGCCCCTTCCGAATGCATCTAAAAAAAAAAAAAACAAACTTCTCATCATAATTAATTATTTTCAT
TTTCAATTATTTTTATTATTATTATTTTTTCTTTTATGTTTCTATTATTAATCAATACATCTTATTATTGAAAATATACAATGTCACATTTCTCTTTAATATTTTCCTTT
TCATTTGGAATAAACTTGGAGAGTGGAGATATTTTCTTTTTTTTCTTTTTGTGTGAAGAGTTAAGAAGAAGAAGAAGAAGAAGAAGAAGAGAAGATGATTGTATCTTGAA
TAATATTTATTTGAAAAGAGAAAAAGAAGGGATATTCATTGGCTCTTTCTAATTAA
Protein sequenceShow/hide protein sequence
MLGYRKPMSPISNFCFFFFLLFFFSFSSSFLFALGDEDNNFNNNNDDEDEQQTIKLDLFHRHHPQVSEKLHGDMKIQDLNERVKDIHEHDHKRHRSISKAMNQKQVEDAR
LRAEAEAATEVEVAKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWMKCRYRRCFGNCSSNVNHKSKNEKKMRFRNAFLANHSSSFKTV
SCSSTMCTNDLADLFAIAECDTPISPCVYDYSYAGGASAKGIFALETLTVGLTNGKEKQLRNSVIGCTESVQGSVFGGADGVMGLGTSSYSLTYKAAENANGGGFSYCLV
DHLTDQRAISYFVLGVPSPSTSAATSSAKLPGKMSYTKLYVGDPYSSFYGVDLIGISANGITLNIPSRVWDINSGGGTIIDSGTSLTMLAAPAFDMVMEALTPRLKKFQQ
IEIDPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFQREESVLPLPNASKKKKNKLLIIINYFHFQLF
LLLLFFLLCFYY