CuGenDBv2

ClCG01G021780 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G021780
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: aspartic proteinase-like
Genome location: CG_Chr01:35519428..35530333
RNA-Seq Expression: ClCG01G021780
Synteny: ClCG01G021780
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR007856 - Saposin-like type B, region 1
IPR008138 - Saposin B type, region 2
IPR008139 - Saposin B type domain
IPR011001 - Saposin-like
IPR021109 - Aspartic peptidase domain superfamily
IPR033121 - Peptidase family A1 domain
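
The Genome location field in this record has the form chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be split and the gene span computed as follows. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("CG_Chr01:35519428..35530333")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 10906 bp on CG_Chr01; with 0-based or half-open coordinates the figure would differ by one.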


Homology
GenBank top hits (e value, %identity, alignment)
XP_004134774.1 aspartic proteinase [Cucumis sativus] (e value: 4.3e-206; %identity: 77.78)
Query:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLLLLI+ YSS A+S+SNE  LRIGLKKIK DQNSR KALLESKKGEFLGSSVGKHNQW NNL ES+N+DIV LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD
        KFTVIFDTGSSNLWVPS+KCIFS+                   A  F  K+ S           STY  +    A      YG+GAI+GFFSYDNV+VGD
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD

Query:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ
        V+VR+Q+LIEATSMS+ TFMAAKFDGILGLGFQEI+TG AVPVWYNMVKQKLVKEQVFSFWLNRNA+E+EGGE+VFGGVDPKHFKGQHTYVPVT KGYWQ
Subjt:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ

Query:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN
        FDIGDILIGGETT+YCA GCSAIADSGTSLLAGPS IV  INRAIGAA VAHPECKAI+SQ+G+ IMDLLLAKAQPEKICSKIGVCT DETH VSLKIEN
Subjt:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN

Query:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        VV+DKDGRSSGGFS+AMCSACEMAV W+QDELKQNKT+E II+ VN+LCDRGLNQ  TLVDCGRI QMP VSFTIGDR+FEL+SKD
Subjt:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_008440021.1 PREDICTED: aspartic proteinase-like isoform X1 [Cucumis melo] (e value: 7.3e-206; %identity: 78.05)
Query:  VTMRR-SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG
        V MR+ SF  LLVSLLLLI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLG SVGKHNQW NNLGES+N+D V LKNYLDAQYYGEIG
Subjt:  VTMRR-SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG

Query:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYD
        IGTPPQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY  +    A      YG+GAIAGFFS D
Subjt:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYD

Query:  NVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVT
        NVRVGDVVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVT
Subjt:  NVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVT

Query:  TKGYWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGV
         KGYWQFDIGDILIGGETT+YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS IGVCT D+T  V
Subjt:  TKGYWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGV

Query:  SLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        SLKIENVV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELSSKD
Subjt:  SLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_023544281.1 aspartic proteinase-like [Cucurbita pepo subsp. pepo] (e value: 9.5e-206; %identity: 76.43)
Query:  MRRSFKPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP
        MR S KPLLVSLLLLI+YSS ASS+SNE L+RIGLKKIKV++N  LKAL+ESKK +FLGS   KH+QW N++GES+NSDIVALKNY+DAQYYGEIGIGTP
Subjt:  MRRSFKPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP

Query:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRV
        PQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY  +    A      YGTGAI+GFFSYDNV+V
Subjt:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRV

Query:  GDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGY
        GDVVVRDQQ IE TSMSS TF+AAKFDGILGLGFQEISTGDAVPVWYNMV QKLVKE VFSFWLNRNA EEEGGEIVFGGVDPKHFKGQHTYVPVTTKGY
Subjt:  GDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGY

Query:  WQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKI
        WQF+IGDILIGGE TEYCARGCSAIADSGTSLLAGPSTIV LINRAIGAA +  PECKA++SQHG++IMDLLLAK QPEKICSKIGVC  D THGVS+KI
Subjt:  WQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKI

Query:  ENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        E+V N+KDGRSSGGFSDAMCSACEMAVSWM DELKQNKT+E++IDYVNKLCDR  NQG TLVDCGRI QMPTVSFTIGD+VFEL+++D
Subjt:  ENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_038880987.1 aspartic proteinase-like isoform X1 [Benincasa hispida] (e value: 4.9e-218; %identity: 80.73)
Query:  LSVTMRRSFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEI
        L V MR SFKPLLVSLLLLI+ YSS ASS+SNE  LRIGLKKIK DQN R KALLESKKGEFLGSSVGKHNQW NN+GESRN+DIVALKNYLDAQYYGEI
Subjt:  LSVTMRRSFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEI

Query:  GIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSY
        GIGTPPQKFTV+FDTGSSNLWVPSSKCIFS+                   A  F  ++ S           STY        +     YG+GAIAGFFSY
Subjt:  GIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSY

Query:  DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPV
        DNVRVGDVVV DQ+LIEATSMSS TFM AKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA E EGGE+VFGGVDPKHFKGQHTYVPV
Subjt:  DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPV

Query:  TTKGYWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHG
        TTKGYWQFDIGDILIGGETTEYCA GCSAIADSGTSLLAGPSTIVALINRAIGAAEVA PECKAI+SQHGQ IMDLLL  AQPEKICSKIGVCT D+T G
Subjt:  TTKGYWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHG

Query:  VSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        V LKIE +VNDKDG+SSGGFSDAMCSACEMAVSWMQDELKQNKT+E+IIDYVN+LCDRGLNQGATLVDCGRI +MPTVSFTIGDRVFELSSKD
Subjt:  VSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_038880988.1 aspartic proteinase-like isoform X2 [Benincasa hispida] (e value: 1.1e-217; %identity: 80.98)
Query:  MRRSFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGT
        MR SFKPLLVSLLLLI+ YSS ASS+SNE  LRIGLKKIK DQN R KALLESKKGEFLGSSVGKHNQW NN+GESRN+DIVALKNYLDAQYYGEIGIGT
Subjt:  MRRSFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGT

Query:  PPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVR
        PPQKFTV+FDTGSSNLWVPSSKCIFS+                   A  F  ++ S           STY        +     YG+GAIAGFFSYDNVR
Subjt:  PPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVR

Query:  VGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKG
        VGDVVV DQ+LIEATSMSS TFM AKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA E EGGE+VFGGVDPKHFKGQHTYVPVTTKG
Subjt:  VGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKG

Query:  YWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLK
        YWQFDIGDILIGGETTEYCA GCSAIADSGTSLLAGPSTIVALINRAIGAAEVA PECKAI+SQHGQ IMDLLL  AQPEKICSKIGVCT D+T GV LK
Subjt:  YWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLK

Query:  IENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        IE +VNDKDG+SSGGFSDAMCSACEMAVSWMQDELKQNKT+E+IIDYVN+LCDRGLNQGATLVDCGRI +MPTVSFTIGDRVFELSSKD
Subjt:  IENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
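
In these BLAST-style alignments, the unlabeled middle line repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and is blank elsewhere. The following is a minimal sketch of counting those positions, using the first 15 columns of the last alignment block above as input. The helper name `midline_counts` is hypothetical, and the result applies to this fragment only, not to the %identity reported for the whole hit.

```python
def midline_counts(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, similar, aligned_length) for one alignment row."""
    # Trailing blanks of the midline are often stripped; pad back to query width.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)  # letters mark identities
    similar = midline.count("+")                     # '+' marks conservative changes
    return identical, similar, len(query)


# Fragment copied from the alignment block above (first 15 columns).
query = "IENVVNDKDGRSSGG"
midline = "IE +VNDKDG+SSGG"

identical, similar, length = midline_counts(query, midline)
percent_identity = 100.0 * identical / length
print(identical, similar, length, percent_identity)
```

Summing the counts over every row of a hit and dividing by the total aligned length should approximate the reported %identity, although the exact denominator the database uses is not stated on the page.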

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KMZ9 Uncharacterized protein (e value: 2.1e-206; %identity: 77.78)
Query:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLLLLI+ YSS A+S+SNE  LRIGLKKIK DQNSR KALLESKKGEFLGSSVGKHNQW NNL ES+N+DIV LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD
        KFTVIFDTGSSNLWVPS+KCIFS+                   A  F  K+ S           STY  +    A      YG+GAI+GFFSYDNV+VGD
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD

Query:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ
        V+VR+Q+LIEATSMS+ TFMAAKFDGILGLGFQEI+TG AVPVWYNMVKQKLVKEQVFSFWLNRNA+E+EGGE+VFGGVDPKHFKGQHTYVPVT KGYWQ
Subjt:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ

Query:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN
        FDIGDILIGGETT+YCA GCSAIADSGTSLLAGPS IV  INRAIGAA VAHPECKAI+SQ+G+ IMDLLLAKAQPEKICSKIGVCT DETH VSLKIEN
Subjt:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN

Query:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        VV+DKDGRSSGGFS+AMCSACEMAV W+QDELKQNKT+E II+ VN+LCDRGLNQ  TLVDCGRI QMP VSFTIGDR+FEL+SKD
Subjt:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A1S3B040 aspartic proteinase-like isoform X2 (e value: 6.0e-206; %identity: 78.4)
Query:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLLLLI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLG SVGKHNQW NNLGES+N+D V LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD
        KFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY  +    A      YG+GAIAGFFS DNVRVGD
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD

Query:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ
        VVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVT KGYWQ
Subjt:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ

Query:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN
        FDIGDILIGGETT+YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS IGVCT D+T  VSLKIEN
Subjt:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN

Query:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        VV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELSSKD
Subjt:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A1S3B058 aspartic proteinase-like isoform X1 (e value: 3.5e-206; %identity: 78.05)
Query:  VTMRR-SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG
        V MR+ SF  LLVSLLLLI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLG SVGKHNQW NNLGES+N+D V LKNYLDAQYYGEIG
Subjt:  VTMRR-SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG

Query:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYD
        IGTPPQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY  +    A      YG+GAIAGFFS D
Subjt:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYD

Query:  NVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVT
        NVRVGDVVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVT
Subjt:  NVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVT

Query:  TKGYWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGV
         KGYWQFDIGDILIGGETT+YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS IGVCT D+T  V
Subjt:  TKGYWQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGV

Query:  SLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        SLKIENVV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELSSKD
Subjt:  SLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A5D3CRY9 Aspartic proteinase-like isoform X2 (e value: 7.8e-206; %identity: 78.4)
Query:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLLLLI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLGSSVGK+NQW NNLGES+N+D V LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLLLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD
        KFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY  +    A      YG+GAIAGFFS DNVRVGD
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGD

Query:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ
        VVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVT KGYWQ
Subjt:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ

Query:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN
        FDIGDILIGGETT+YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS IGVCT D+T  VSLKIEN
Subjt:  FDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIEN

Query:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        VV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELSSKD
Subjt:  VVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A6J1IKS1 aspartic proteinase-like (e value: 3.9e-205; %identity: 76.02)
Query:  MRRSFKPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP
        MR S KPLLVSLLLLI+YSS ASS+SNE L+RIGLKKIKV++N  LKAL+ESKK EFLGS   KH+QW N+LGES+NSDIVALKNY+DAQYYGEIGIGTP
Subjt:  MRRSFKPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP

Query:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRV
        PQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY  +    A      YG+GAI+GFFSYDNV+V
Subjt:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRV

Query:  GDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGY
        GDVVVR+QQ IE TSMSS TF+AAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKE VFSFWLNRNA+EEEGGEIVFGGVDPKHFKGQHTYVPVTTKGY
Subjt:  GDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGY

Query:  WQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKI
        WQF+IGDILIGG+ TEYCARGCSAIADSGTSLLAGPSTIV LINRAIGAA +  PECK ++SQHG++IMDLLLAK QPEKICSKIGVC  D +HGVS KI
Subjt:  WQFDIGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKI

Query:  ENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        E+VVN+KDG SSGGFSDAMCSACEMAVSWM DELKQNKT+E++IDYVNKLCDR LN+G TLVDCGRI QMPTVSFTIGD+VFEL+++D
Subjt:  ENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

SwissProt top hits (e value, %identity, alignment)
O04057 Aspartic proteinase (e value: 3.3e-153; %identity: 58.07)
Query:  LLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGS
        L L++  +  SS+SN+ LLR+GLKKIK+D  +RL A +ESK  E L ++  K+N    NLGES ++DIVALKNYLDAQYYGEI IGTPPQKFTVIFDTGS
Subjt:  LLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGS

Query:  SNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVRDQQLIE
        SNLWV   +C+FSV+     H   ++ +  S S   +K    S                            YGTGA++GFFSYDNV+VGD+VV++Q  IE
Subjt:  SNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVRDQQLIE

Query:  ATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGG
        AT   S TF+ AKFDG+LGLGFQEI+ G+AVPVWYNMV+Q LVKE VFSFWLNRN +EEEGGEIVFGGVDPKH++G+HTYVPVT KGYWQFD+GD+LI G
Subjt:  ATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGG

Query:  ETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSS
        E T +C  GCSAIADSGTSLLAGP+ ++ +IN AIGA  V   +CKA+++Q+GQTIMDLLL++A P+KICS+I +CT D T GVS+ IE+VV++  G+SS
Subjt:  ETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSS

Query:  GGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
            D MCS CEM V WMQ++L+QN+T+E II+Y+N+LCDR  +  G + VDCG++  MPTVSFTIG ++F+L+ ++
Subjt:  GGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

O65390 Aspartic proteinase A1 (e value: 8.5e-149; %identity: 56.29)
Query:  KPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT
        + + VSL++  +   +A +  N+   R+GLKK+K+D  +RL A +ESK+ + L +           LG+S ++D+V LKNYLDAQYYGEI IGTPPQKFT
Subjt:  KPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT

Query:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVV
        V+FDTGSSNLWVPSSKC FS++                          C LH ++ S+    STY  ++  +A+ +  +YGTGAIAGFFS D V VGD+V
Subjt:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVV

Query:  VRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFD
        V+DQ+ IEAT     TF+ AKFDGILGLGFQEIS G A PVWYNM+KQ L+KE VFSFWLNRNADEEEGGE+VFGGVDP HFKG+HTYVPVT KGYWQFD
Subjt:  VRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFD

Query:  IGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVV
        +GD+LIGG  T +C  GCSAIADSGTSLLAGP+TI+ +IN AIGAA V   +CK ++ Q+GQTI+DLLL++ QP+KICS+IG+CT D T GVS+ IE+VV
Subjt:  IGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVV

Query:  NDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        + ++ + S G  DA CSACEMAV W+Q +L+QN T+E I++YVN+LC+R     G + VDC ++  MPTVS TIG +VF+L+ ++
Subjt:  NDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

P42210 Phytepsin (e value: 3.0e-138; %identity: 54.15)
Query:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVI
        LL ++LLL     AAS +  E L+RI LKK  +D+NSR+   L   + + L S         N L      DIVALKNY++AQY+GEIG+GTPPQKFTVI
Subjt:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVI

Query:  FDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVRD
        FDTGSSNLWVPS+KC FS++     ++  ++ A  S                       STY  +    A      YGTG+IAG+FS D+V VGD+VV+D
Subjt:  FDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVRD

Query:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD
        Q+ IEAT     TF+ AKFDGILGLGF+EIS G AVPVWY M++Q LV + VFSFWLNR+ DE EGGEI+FGG+DPKH+ G+HTYVPVT KGYWQFD+GD
Subjt:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD

Query:  ILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDK
        +L+GG++T +CA GC+AIADSGTSLLAGP+ I+  IN  IGAA V   ECK I+SQ+GQ I+DLLLA+ QP+KICS++G+CT D T GVS  I +VV+D+
Subjt:  ILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDK

Query:  DGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
          +S+G  +D MCSACEMAV WMQ++L QNKT++ I+DYVN+LC+R     G + VDCG +  MP + FTIG + F L  ++
Subjt:  DGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

Q42456 Aspartic proteinase oryzasin-1 (e value: 4.2e-140; %identity: 54.91)
Query:  LLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDT
        LL  ++  +   +S+ E L+RI LKK  +D+NSR+ A L  ++G      +G      N+L G     DIVALKNY++AQY+GEIG+GTPPQKFTVIFDT
Subjt:  LLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDT

Query:  GSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVRDQQL
        GSSNLWVPS+KC FS+                   A  F  ++ S           STY  +    A      YGTG+IAGFFS D+V VGD+VV+DQ+ 
Subjt:  GSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVRDQQL

Query:  IEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILI
        IEAT     TFM AKFDGILGLGFQEIS GDAVPVWY MV+Q LV E VFSFW NR++DE EGGEIVFGG+DP H+KG HTYVPV+ KGYWQF++GD+LI
Subjt:  IEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILI

Query:  GGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGR
        GG+TT +CA GCSAIADSGTSLLAGP+ I+  IN  IGA  V   ECK ++SQ+GQ I+DLLLA+ QP KICS++G+CT D  HGVS  I++VV+D+ G 
Subjt:  GGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGR

Query:  SSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        S+G  S  MC+ACEMAV WMQ++L QNKT++ I++Y+N+LCD+     G + VDCG +  MP +SFTIG + F L  ++
Subjt:  SSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

Q8VYL3 Aspartic proteinase A2 (e value: 9.4e-148; %identity: 56.11)
Query:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        + VS LL       A S  N+   R+GLKK+K+D N+RL     SK+ E L SS+  +N   NNL G+S ++DIV LKNYLDAQYYGEI IGTPPQKFTV
Subjt:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR
        IFDTGSSNLWVPS KC FS+S                     F  K+ S           STY         R   +YG+G+I+GFFSYD V VGD+VV+
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR

Query:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG
        DQ+ IE TS    TF+ AKFDG+LGLGFQEI+ G+A PVWYNM+KQ L+K  VFSFWLNR+   EEGGEIVFGGVDPKHF+G+HT+VPVT +GYWQFD+G
Subjt:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG

Query:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND
        ++LI GE+T YC  GCSAIADSGTSLLAGP+ +VA+IN+AIGA+ V   +CK ++ Q+GQTI+DLLLA+ QP+KICS+IG+C  D THGVS+ IE+VV+ 
Subjt:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND

Query:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        ++ RSS G  DA C ACEMAV W+Q +L+QN T+E I++Y+N++C+R  +  G + VDC ++ +MPTVSFTIG +VF+L+ ++
Subjt:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

Arabidopsis top hits (e value, %identity, alignment)
AT1G11910.1 aspartic proteinase A1 (e value: 6.0e-150; %identity: 56.29)
Query:  KPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT
        + + VSL++  +   +A +  N+   R+GLKK+K+D  +RL A +ESK+ + L +           LG+S ++D+V LKNYLDAQYYGEI IGTPPQKFT
Subjt:  KPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT

Query:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVV
        V+FDTGSSNLWVPSSKC FS++                          C LH ++ S+    STY  ++  +A+ +  +YGTGAIAGFFS D V VGD+V
Subjt:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVV

Query:  VRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFD
        V+DQ+ IEAT     TF+ AKFDGILGLGFQEIS G A PVWYNM+KQ L+KE VFSFWLNRNADEEEGGE+VFGGVDP HFKG+HTYVPVT KGYWQFD
Subjt:  VRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFD

Query:  IGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVV
        +GD+LIGG  T +C  GCSAIADSGTSLLAGP+TI+ +IN AIGAA V   +CK ++ Q+GQTI+DLLL++ QP+KICS+IG+CT D T GVS+ IE+VV
Subjt:  IGDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVV

Query:  NDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        + ++ + S G  DA CSACEMAV W+Q +L+QN T+E I++YVN+LC+R     G + VDC ++  MPTVS TIG +VF+L+ ++
Subjt:  NDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT1G62290.1 Saposin-like aspartyl protease family protein (e value: 6.7e-149; %identity: 56.11)
Query:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        + VS LL       A S  N+   R+GLKK+K+D N+RL     SK+ E L SS+  +N   NNL G+S ++DIV LKNYLDAQYYGEI IGTPPQKFTV
Subjt:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR
        IFDTGSSNLWVPS KC FS+S                     F  K+ S           STY         R   +YG+G+I+GFFSYD V VGD+VV+
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR

Query:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG
        DQ+ IE TS    TF+ AKFDG+LGLGFQEI+ G+A PVWYNM+KQ L+K  VFSFWLNR+   EEGGEIVFGGVDPKHF+G+HT+VPVT +GYWQFD+G
Subjt:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG

Query:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND
        ++LI GE+T YC  GCSAIADSGTSLLAGP+ +VA+IN+AIGA+ V   +CK ++ Q+GQTI+DLLLA+ QP+KICS+IG+C  D THGVS+ IE+VV+ 
Subjt:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND

Query:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        ++ RSS G  DA C ACEMAV W+Q +L+QN T+E I++Y+N++C+R  +  G + VDC ++ +MPTVSFTIG +VF+L+ ++
Subjt:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT1G62290.2 Saposin-like aspartyl protease family protein (e value: 6.7e-149; %identity: 56.11)
Query:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        + VS LL       A S  N+   R+GLKK+K+D N+RL     SK+ E L SS+  +N   NNL G+S ++DIV LKNYLDAQYYGEI IGTPPQKFTV
Subjt:  LLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR
        IFDTGSSNLWVPS KC FS+S                     F  K+ S           STY         R   +YG+G+I+GFFSYD V VGD+VV+
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR

Query:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG
        DQ+ IE TS    TF+ AKFDG+LGLGFQEI+ G+A PVWYNM+KQ L+K  VFSFWLNR+   EEGGEIVFGGVDPKHF+G+HT+VPVT +GYWQFD+G
Subjt:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG

Query:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND
        ++LI GE+T YC  GCSAIADSGTSLLAGP+ +VA+IN+AIGA+ V   +CK ++ Q+GQTI+DLLLA+ QP+KICS+IG+C  D THGVS+ IE+VV+ 
Subjt:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND

Query:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        ++ RSS G  DA C ACEMAV W+Q +L+QN T+E I++Y+N++C+R  +  G + VDC ++ +MPTVSFTIG +VF+L+ ++
Subjt:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT4G04460.1 Saposin-like aspartyl protease family protein (e value: 9.4e-135; %identity: 53)
Query:  LLVSLL-LLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        LLV LL  LI+ S+A+   + +  +RIGLKK K+D+++RL + L  K     GS     + +  N     N+D+V LKNYLDAQYYG+I IGTPPQKFTV
Subjt:  LLVSLL-LLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR
        IFDTGSSNLW+PS+KC  SV+     +   K+ A  S                       S+Y  +    + R    YGTGAI+G+FS D+V+VGD+VV+
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR

Query:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG
        +Q+ IEATS    TF+ AKFDGILGLGF+EIS G++ PVWYNMV++ LVKE +FSFWLNRN  + EGGEIVFGGVDPKHFKG+HT+VPVT KGYWQFD+G
Subjt:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG

Query:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND
        D+ I G+ T YCA+GCSAIADSGTSLL GPST++ +IN AIGA  +   ECKA++ Q+G+T+++ LLA+  P+K+CS+IGVC  D T  VS+ I++VV+D
Subjt:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND

Query:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD
            +SG  + AMCSACEMA  WM+ EL QN+T+E I+ Y  +LCD    Q   + VDCGR+  MP V+F+IG R F+L+ +D
Subjt:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT4G04460.2 Saposin-like aspartyl protease family protein | e value: 3.3e-132 | % identity: 52.8
Query:  LLVSLL-LLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        LLV LL  LI+ S+A+   + +  +RIGLKK K+D+++RL + L  K     GS     + +  N     N+D+V LKNYLDAQYYG+I IGTPPQKFTV
Subjt:  LLVSLL-LLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR
        IFDTGSSNLW+PS+KC  SV+     +   K+ A  S                       S+Y  +    + R    YGTGAI+G+FS D+V+VGD+VV+
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSYDNVRVGDVVVR

Query:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG
        +Q+ IEATS    TF+ AKFDGILGLGF+EIS G++ PVWYNMV++ LVKE +FSFWLNRN  + EGGEIVFGGVDPKHFKG+HT+VPVT KGYWQFD+G
Subjt:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG

Query:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND
        D+ I G+ T YCA+GCSAIADSGTSLL GPST++ +IN AIGA  +   ECKA++ Q+G+T+++ LLA    +K+CS+IGVC  D T  VS+ I++VV+D
Subjt:  DILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVND

Query:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD
            +SG  + AMCSACEMA  WM+ EL QN+T+E I+ Y  +LCD    Q   + VDCGR+  MP V+F+IG R F+L+ +D
Subjt:  KDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD


Sequences
CDS sequence
ATGTTTCTAGTTATCACCCTCATTTTTCTGGATGAATGGATTGCATATTTGGAAAATAGATTGTTGAGTGTTACAATGAGAAGAAGCTTTAAACCCCTCTTGGTTTCTTT
GTTGCTCTTAATCATATATTCCTCTGCAGCATCCTCTTCTTCTAATGAAGAGTTGTTAAGGATTGGATTGAAGAAGATAAAAGTTGATCAAAACAGTCGGTTAAAAGCAT
TGCTTGAGTCAAAGAAAGGAGAGTTTTTAGGATCTTCTGTTGGAAAACATAATCAATGGGATAATAATCTTGGAGAATCTAGAAATAGTGATATTGTAGCATTAAAGAAT
TATTTGGATGCTCAATACTATGGAGAGATTGGCATTGGCACTCCACCTCAAAAGTTCACTGTAATTTTCGATACTGGAAGCTCTAATTTGTGGGTGCCATCTTCGAAATG
TATTTTCTCGGTATCAATCAGGACGGTCAAGCACATACAGAAGAAATGGTCTGCTTTCTTCTCTTTGTCTGCAATAGTCTTCAAAATGAAATTTTGTTCACTGCACGAGT
TTGATTCAACCTTATTTCATTTATCCACATATTGGATTGATCAACTCTTAAGGGCTAGTCGGTTATGGGCGAATTATGGTACAGGAGCTATTGCTGGCTTCTTTAGTTAT
GACAACGTTCGAGTTGGTGATGTTGTTGTCCGTGATCAGCAACTCATTGAGGCAACTAGCATGTCTAGTACGACATTCATGGCTGCCAAATTTGATGGTATATTGGGACT
TGGATTTCAAGAGATCTCGACTGGTGACGCTGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCAAGTTTTCTCATTTTGGCTGAATCGCAATGCCG
ATGAGGAAGAAGGAGGTGAAATTGTGTTTGGAGGGGTTGATCCTAAGCACTTCAAAGGCCAACATACATATGTGCCTGTGACAACCAAAGGGTACTGGCAGTTTGACATC
GGCGATATTCTTATTGGTGGTGAAACGACAGAATATTGTGCTCGTGGTTGCTCTGCGATTGCGGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTACTATAGTGGCATT
AATTAATAGAGCCATTGGAGCAGCTGAAGTTGCTCATCCAGAATGCAAAGCAATTATTTCACAGCATGGACAGACTATTATGGATTTGCTTTTAGCAAAAGCACAACCAG
AGAAGATATGTTCCAAAATTGGCGTGTGTACCTCTGATGAAACTCATGGTGTTAGTTTGAAAATTGAGAATGTGGTGAATGATAAAGATGGTAGATCATCTGGTGGCTTC
TCAGATGCTATGTGCTCAGCTTGTGAGATGGCGGTTTCCTGGATGCAAGATGAGCTGAAGCAGAATAAAACTCGAGAGTATATCATTGATTATGTCAATAAGCTATGTGA
TCGTGGGTTGAACCAAGGAGCAACATTGGTTGACTGTGGACGGATCCCTCAAATGCCTACCGTGTCCTTCACCATTGGCGACAGAGTTTTTGAGCTTAGCTCAAAAGATC
TTAAGCATTTAGCCTTATTCTCTTTAACCCCATTGCTTTCTGTGAACAGTACGTTCTCAAGGTTGGTGAGGGATCTGCAGCTCAATGCATCAGTGGATTCATACCTTTGG
ACATTCCTCCTCCTCGTGGACCCCTATGGATCTTGGGAGACGTCTTCATGGGACGTTATCACACAGTCTTTGATTTTGGCAAAACAAGAGTTGGATTTGCGGACGCTGCT
TGAAGAACATATTCAATTCGTTCTAAAAGGTGAATTAGGAAGAAAAAAACATCCTCCTCAGGACACCAAAGCAAAATACATATACAAAAGGCTCCAACCGGGAATATGTG
GAATCAAGCCATTGCAAAACCAATTTCTACGTTTGCTCCCTTCTGTTCTAAGGAACGGAGTGAATAAAGAAGAACAATTGCCAAAGTTTGTTCCCTTCTCCTTCATCACC
CTCTCTTTCTCTCCATCCCTCCCTTTCACTTCAGTCCTTCTGGCGAAACCACACAGCTCCGGTGAATATCCACGACCGAGGATGGCGGTAGGATGGGTCGACCCAGGCGA
CGGTGTGGGTCTGTTTGGGGCGAAAATGGTTGCAAGGGACCTTGAGGCCCTGCGCAGAGGTGAGGTCGATCTCCAAGACGGACATGGGGGTAGCCAGAGGGTTCTTGGGA
ACCAAGTCTCCGTCGACCTCCATGGATCAAGAACGAGAACATCAGCTGCTATTCGATATGGTTCAGGAGATATTGCCGGTTTCTTTAGTTGTGACAACGTTCGAGTCGGT
GATGTTGTTGTTCGTGATCAGCAACTCATTGAGGCAACTAGAATGTCTGGTGAGATATTCATGGCTGCCAAATTTGATGGTATATTGGGACTTGGATTCCAAGAGATCTT
GACTGGTGGCGCTGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCCAGTTTTCTCATTTTGGCTGAATCGCAATGCCAAGGAGAAAGGTGGTGAAC
TTGTGTTTGGCGGGCTTGATCCTAAGCACTTCAAAGGCCAGCATTCATATGTTCCTGTGAAAACCAAAGGGTATTGGCAGTTTGACATCGGTGATATTCTTATTGCTGGT
GAAACGACAGAATATTGTGCTCGTGGTTGCTCTGCGATTGCAGATTCTGGAACTTCTTTGTTGGCTGGTCCAGCTGCTATAGTGGAAAAAATTAATAAAGCCATTGGAGC
AGCTGCAGTTGCTCATCCAGAATGCAAGGCAATTGTTTCACAACATGGACAGGCTATTATGGATTTGCTTTTAGCCAAGGCACAACCAGAGAAGATATGTTCCAAAATTG
GCGTGTGTACCTTTGTTGAAACTCATGGTGTTAATTTGAAAATTGAGAGTATGGTGAATGATAAAGATGGTAGATCATCTGGTGGCTTCTCAGATGCTATGTGCTCAGCT
TGTGAGATGGCAGTTTCCTGGATGCAAGATGAGCTGAAGCAGAACGAAACTCAAGAACATATCATTGATTATGTCAATGAGCTATGCAATCGTGGGTTGAACCAAGGAGC
AACATTGGTTGACTGTGGATGGATCTCTCAAATGCCTAATGTGTCCTTCACCATTGGCGACAGAGTTTTTGATCTTAGCTCAAAAGATTACATTCTCAAGATAGGTGAGG
GATCTGCAGCTCAATGCACCAGTGGATTCCAACCTGTGGTCATTCCCTTCTGGTACTTCTATTTTCATGTTCTTGGTTTCTTTCAGTTCTTTAAAACTGAGAATCATGTT
GCTACTTCTCTTCAGTTTTTTCTCGTTGGTGAAGTAAGCCACTTTAATGATTGGGTTAACTTGAAGGAAAAGCAGCGTCATCCTCCAAAATGCAGACGTAAGGCTTTCTC
ATAG
mRNA sequence
ATGTTTCTAGTTATCACCCTCATTTTTCTGGATGAATGGATTGCATATTTGGAAAATAGATTGTTGAGTGTTACAATGAGAAGAAGCTTTAAACCCCTCTTGGTTTCTTT
GTTGCTCTTAATCATATATTCCTCTGCAGCATCCTCTTCTTCTAATGAAGAGTTGTTAAGGATTGGATTGAAGAAGATAAAAGTTGATCAAAACAGTCGGTTAAAAGCAT
TGCTTGAGTCAAAGAAAGGAGAGTTTTTAGGATCTTCTGTTGGAAAACATAATCAATGGGATAATAATCTTGGAGAATCTAGAAATAGTGATATTGTAGCATTAAAGAAT
TATTTGGATGCTCAATACTATGGAGAGATTGGCATTGGCACTCCACCTCAAAAGTTCACTGTAATTTTCGATACTGGAAGCTCTAATTTGTGGGTGCCATCTTCGAAATG
TATTTTCTCGGTATCAATCAGGACGGTCAAGCACATACAGAAGAAATGGTCTGCTTTCTTCTCTTTGTCTGCAATAGTCTTCAAAATGAAATTTTGTTCACTGCACGAGT
TTGATTCAACCTTATTTCATTTATCCACATATTGGATTGATCAACTCTTAAGGGCTAGTCGGTTATGGGCGAATTATGGTACAGGAGCTATTGCTGGCTTCTTTAGTTAT
GACAACGTTCGAGTTGGTGATGTTGTTGTCCGTGATCAGCAACTCATTGAGGCAACTAGCATGTCTAGTACGACATTCATGGCTGCCAAATTTGATGGTATATTGGGACT
TGGATTTCAAGAGATCTCGACTGGTGACGCTGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCAAGTTTTCTCATTTTGGCTGAATCGCAATGCCG
ATGAGGAAGAAGGAGGTGAAATTGTGTTTGGAGGGGTTGATCCTAAGCACTTCAAAGGCCAACATACATATGTGCCTGTGACAACCAAAGGGTACTGGCAGTTTGACATC
GGCGATATTCTTATTGGTGGTGAAACGACAGAATATTGTGCTCGTGGTTGCTCTGCGATTGCGGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTACTATAGTGGCATT
AATTAATAGAGCCATTGGAGCAGCTGAAGTTGCTCATCCAGAATGCAAAGCAATTATTTCACAGCATGGACAGACTATTATGGATTTGCTTTTAGCAAAAGCACAACCAG
AGAAGATATGTTCCAAAATTGGCGTGTGTACCTCTGATGAAACTCATGGTGTTAGTTTGAAAATTGAGAATGTGGTGAATGATAAAGATGGTAGATCATCTGGTGGCTTC
TCAGATGCTATGTGCTCAGCTTGTGAGATGGCGGTTTCCTGGATGCAAGATGAGCTGAAGCAGAATAAAACTCGAGAGTATATCATTGATTATGTCAATAAGCTATGTGA
TCGTGGGTTGAACCAAGGAGCAACATTGGTTGACTGTGGACGGATCCCTCAAATGCCTACCGTGTCCTTCACCATTGGCGACAGAGTTTTTGAGCTTAGCTCAAAAGATC
TTAAGCATTTAGCCTTATTCTCTTTAACCCCATTGCTTTCTGTGAACAGTACGTTCTCAAGGTTGGTGAGGGATCTGCAGCTCAATGCATCAGTGGATTCATACCTTTGG
ACATTCCTCCTCCTCGTGGACCCCTATGGATCTTGGGAGACGTCTTCATGGGACGTTATCACACAGTCTTTGATTTTGGCAAAACAAGAGTTGGATTTGCGGACGCTGCT
TGAAGAACATATTCAATTCGTTCTAAAAGGTGAATTAGGAAGAAAAAAACATCCTCCTCAGGACACCAAAGCAAAATACATATACAAAAGGCTCCAACCGGGAATATGTG
GAATCAAGCCATTGCAAAACCAATTTCTACGTTTGCTCCCTTCTGTTCTAAGGAACGGAGTGAATAAAGAAGAACAATTGCCAAAGTTTGTTCCCTTCTCCTTCATCACC
CTCTCTTTCTCTCCATCCCTCCCTTTCACTTCAGTCCTTCTGGCGAAACCACACAGCTCCGGTGAATATCCACGACCGAGGATGGCGGTAGGATGGGTCGACCCAGGCGA
CGGTGTGGGTCTGTTTGGGGCGAAAATGGTTGCAAGGGACCTTGAGGCCCTGCGCAGAGGTGAGGTCGATCTCCAAGACGGACATGGGGGTAGCCAGAGGGTTCTTGGGA
ACCAAGTCTCCGTCGACCTCCATGGATCAAGAACGAGAACATCAGCTGCTATTCGATATGGTTCAGGAGATATTGCCGGTTTCTTTAGTTGTGACAACGTTCGAGTCGGT
GATGTTGTTGTTCGTGATCAGCAACTCATTGAGGCAACTAGAATGTCTGGTGAGATATTCATGGCTGCCAAATTTGATGGTATATTGGGACTTGGATTCCAAGAGATCTT
GACTGGTGGCGCTGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCCAGTTTTCTCATTTTGGCTGAATCGCAATGCCAAGGAGAAAGGTGGTGAAC
TTGTGTTTGGCGGGCTTGATCCTAAGCACTTCAAAGGCCAGCATTCATATGTTCCTGTGAAAACCAAAGGGTATTGGCAGTTTGACATCGGTGATATTCTTATTGCTGGT
GAAACGACAGAATATTGTGCTCGTGGTTGCTCTGCGATTGCAGATTCTGGAACTTCTTTGTTGGCTGGTCCAGCTGCTATAGTGGAAAAAATTAATAAAGCCATTGGAGC
AGCTGCAGTTGCTCATCCAGAATGCAAGGCAATTGTTTCACAACATGGACAGGCTATTATGGATTTGCTTTTAGCCAAGGCACAACCAGAGAAGATATGTTCCAAAATTG
GCGTGTGTACCTTTGTTGAAACTCATGGTGTTAATTTGAAAATTGAGAGTATGGTGAATGATAAAGATGGTAGATCATCTGGTGGCTTCTCAGATGCTATGTGCTCAGCT
TGTGAGATGGCAGTTTCCTGGATGCAAGATGAGCTGAAGCAGAACGAAACTCAAGAACATATCATTGATTATGTCAATGAGCTATGCAATCGTGGGTTGAACCAAGGAGC
AACATTGGTTGACTGTGGATGGATCTCTCAAATGCCTAATGTGTCCTTCACCATTGGCGACAGAGTTTTTGATCTTAGCTCAAAAGATTACATTCTCAAGATAGGTGAGG
GATCTGCAGCTCAATGCACCAGTGGATTCCAACCTGTGGTCATTCCCTTCTGGTACTTCTATTTTCATGTTCTTGGTTTCTTTCAGTTCTTTAAAACTGAGAATCATGTT
GCTACTTCTCTTCAGTTTTTTCTCGTTGGTGAAGTAAGCCACTTTAATGATTGGGTTAACTTGAAGGAAAAGCAGCGTCATCCTCCAAAATGCAGACGTAAGGCTTTCTC
ATAG
Protein sequence
MFLVITLIFLDEWIAYLENRLLSVTMRRSFKPLLVSLLLLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKN
YLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRASRLWANYGTGAIAGFFSY
DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNADEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDI
GDILIGGETTEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSSGGF
SDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKDLKHLALFSLTPLLSVNSTFSRLVRDLQLNASVDSYLW
TFLLLVDPYGSWETSSWDVITQSLILAKQELDLRTLLEEHIQFVLKGELGRKKHPPQDTKAKYIYKRLQPGICGIKPLQNQFLRLLPSVLRNGVNKEEQLPKFVPFSFIT
LSFSPSLPFTSVLLAKPHSSGEYPRPRMAVGWVDPGDGVGLFGAKMVARDLEALRRGEVDLQDGHGGSQRVLGNQVSVDLHGSRTRTSAAIRYGSGDIAGFFSCDNVRVG
DVVVRDQQLIEATRMSGEIFMAAKFDGILGLGFQEILTGGAVPVWYNMVKQKLVKEPVFSFWLNRNAKEKGGELVFGGLDPKHFKGQHSYVPVKTKGYWQFDIGDILIAG
ETTEYCARGCSAIADSGTSLLAGPAAIVEKINKAIGAAAVAHPECKAIVSQHGQAIMDLLLAKAQPEKICSKIGVCTFVETHGVNLKIESMVNDKDGRSSGGFSDAMCSA
CEMAVSWMQDELKQNETQEHIIDYVNELCNRGLNQGATLVDCGWISQMPNVSFTIGDRVFDLSSKDYILKIGEGSAAQCTSGFQPVVIPFWYFYFHVLGFFQFFKTENHV
ATSLQFFLVGEVSHFNDWVNLKEKQRHPPKCRRKAFS