CuGenDBv2

CaUC01G021400 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G021400
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: aspartic proteinase-like
Genome location: Ciama_Chr01:33993438..34003428
RNA-Seq Expression: CaUC01G021400
Synteny: CaUC01G021400
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR007856 - Saposin-like type B, region 1
IPR008138 - Saposin B type, region 2
IPR008139 - Saposin B type domain
IPR011001 - Saposin-like
IPR021109 - Aspartic peptidase domain superfamily
IPR033121 - Peptidase family A1 domain
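
The genome location above follows a "sequence:start..end" pattern. As a minimal sketch, the coordinates can be split out and the gene span computed as shown below. The function name parse_location is hypothetical, and the span assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; not part of any CuGenDB tool.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Ciama_Chr01:33993438..34003428")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene region covers 9991 bp on Ciama_Chr01; with 0-based half-open coordinates the span would be one base shorter.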


Homology
GenBank top hits
KAG6603789.1 Aspartic proteinase A1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.4e-209; % identity: 72.71)
Query:  MKPLQNPPRKGISTFGSLLNFITLYFIFLDEWIAVTMRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGK
        MKPL N PR GIST G+LL  +            VTM  S KPLLVSLL LI+YSS ASS+SNE L+RIGLKKIKV++N  LKAL+ESKK +FLGS   K
Subjt:  MKPLQNPPRKGISTFGSLLNFITLYFIFLDEWIAVTMRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGK

Query:  HNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLF
        H+QW N++GES+NSDIVALKNY+DAQYYGEIGIGTPPQKFT IFDTGSSNLWVPSSKC+FS+                   A  F  ++ S         
Subjt:  HNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLF

Query:  HLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFW
          STY      R  TSAAIQYG+GAI+GFFSYDNV+VGDVVVRDQQ IE TSMSS TF+AAKFDGILGLGFQEISTGDAVPVWYNMV QKLVKE VFSFW
Subjt:  HLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFW

Query:  LNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVA
        LNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVTTKGYWQF+IGDILIGG+ T          EYCA GCSAIADSGTSLLAGPSTIV LINRAIGAA + 
Subjt:  LNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVA

Query:  HPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR
         PECKA++SQHG++IMDLLLAK QPEKICSKIGVCT D THGVS+KIE++VN+K GRSSGGFSDAMCSACEMAVSWM DELKQNKT+E++IDYVNKLCDR
Subjt:  HPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR

Query:  GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
         LNQG TLVDCGRI QMPTVSFTIGD+VFEL+++D
Subjt:  GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_008440021.1 PREDICTED: aspartic proteinase-like isoform X1 [Cucumis melo] (e value: 1.4e-206; % identity: 77.14)
Query:  VTMRR-SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG
        V MR+ SF  LLVSLL LI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLG SVGKHNQW NNLGES+N+D V LKNYLDAQYYGEIG
Subjt:  VTMRR-SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG

Query:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSY
        IGTPPQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY      +  TSAAIQYG+GAIAGFFS 
Subjt:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSY

Query:  DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPV
        DNVRVGDVVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPV
Subjt:  DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPV

Query:  TTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKI
        T KGYWQFDIGDILIGGETT          +YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS I
Subjt:  TTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKI

Query:  GVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELS
        GVCT D+T  VSLKIENVV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELS
Subjt:  GVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELS

Query:  SKD
        SKD
Subjt:  SKD

XP_023544281.1 aspartic proteinase-like [Cucurbita pepo subsp. pepo] (e value: 1.1e-206; % identity: 75.75)
Query:  MRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP
        MR S KPLLVSLL LI+YSS ASS+SNE L+RIGLKKIKV++N  LKAL+ESKK +FLGS   KH+QW N++GES+NSDIVALKNY+DAQYYGEIGIGTP
Subjt:  MRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP

Query:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVR
        PQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY      R  TSAAIQYGTGAI+GFFSYDNV+
Subjt:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVR

Query:  VGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKG
        VGDVVVRDQQ IE TSMSS TF+AAKFDGILGLGFQEISTGDAVPVWYNMV QKLVKE VFSFWLNRNA EEEGGEIVFGGVDPKHFKGQHTYVPVTTKG
Subjt:  VGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKG

Query:  YWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCT
        YWQF+IGDILIGGE T          EYCARGCSAIADSGTSLLAGPSTIV LINRAIGAA +  PECKA++SQHG++IMDLLLAK QPEKICSKIGVC 
Subjt:  YWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCT

Query:  SDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
         D THGVS+KIE+V N+KDGRSSGGFSDAMCSACEMAVSWM DELKQNKT+E++IDYVNKLCDR  NQG TLVDCGRI QMPTVSFTIGD+VFEL+++D
Subjt:  SDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_038880987.1 aspartic proteinase-like isoform X1 [Benincasa hispida] (e value: 3.3e-227; % identity: 78.36)
Query:  MKPLQNPPRKGISTFGSLLNFITLYFIFLDEWIAVTMRRSFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVG
        MKPLQNPPRKGISTFG+LL               V MR SFKPLLVSLL LI+ YSS ASS+SNE  LRIGLKKIK DQN R KALLESKKGEFLGSSVG
Subjt:  MKPLQNPPRKGISTFGSLLNFITLYFIFLDEWIAVTMRRSFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVG

Query:  KHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTL
        KHNQW NN+GESRN+DIVALKNYLDAQYYGEIGIGTPPQKFTV+FDTGSSNLWVPSSKCIFS+                   A  F  ++ S        
Subjt:  KHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTL

Query:  FHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSF
           STY      R  TSA+IQYG+GAIAGFFSYDNVRVGDVVV DQ+LIEATSMSS TFM AKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSF
Subjt:  FHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSF

Query:  WLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEV
        WLNRNA E EGGE+VFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGGETT          EYCA GCSAIADSGTSLLAGPSTIVALINRAIGAAEV
Subjt:  WLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEV

Query:  AHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCD
        A PECKAI+SQHGQ IMDLLL  AQPEKICSKIGVCT D+T GV LKIE +VNDKDG+SSGGFSDAMCSACEMAVSWMQDELKQNKT+E+IIDYVN+LCD
Subjt:  AHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCD

Query:  RGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        RGLNQGATLVDCGRI +MPTVSFTIGDRVFELSSKD
Subjt:  RGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

XP_038880988.1 aspartic proteinase-like isoform X2 [Benincasa hispida] (e value: 1.6e-218; % identity: 80.2)
Query:  MRRSFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGT
        MR SFKPLLVSLL LI+ YSS ASS+SNE  LRIGLKKIK DQN R KALLESKKGEFLGSSVGKHNQW NN+GESRN+DIVALKNYLDAQYYGEIGIGT
Subjt:  MRRSFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGT

Query:  PPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNV
        PPQKFTV+FDTGSSNLWVPSSKCIFS+                   A  F  ++ S           STY      R  TSA+IQYG+GAIAGFFSYDNV
Subjt:  PPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNV

Query:  RVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTK
        RVGDVVV DQ+LIEATSMSS TFM AKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA E EGGE+VFGGVDPKHFKGQHTYVPVTTK
Subjt:  RVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTK

Query:  GYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVC
        GYWQFDIGDILIGGETT          EYCA GCSAIADSGTSLLAGPSTIVALINRAIGAAEVA PECKAI+SQHGQ IMDLLL  AQPEKICSKIGVC
Subjt:  GYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVC

Query:  TSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        T D+T GV LKIE +VNDKDG+SSGGFSDAMCSACEMAVSWMQDELKQNKT+E+IIDYVN+LCDRGLNQGATLVDCGRI +MPTVSFTIGDRVFELSSKD
Subjt:  TSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

TrEMBL top hits
A0A0A0KMZ9 Uncharacterized protein (e value: 6.9e-207; % identity: 77.06)
Query:  SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLL LI+ YSS A+S+SNE  LRIGLKKIK DQNSR KALLESKKGEFLGSSVGKHNQW NNL ES+N+DIV LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVG
        KFTVIFDTGSSNLWVPS+KCIFS+                   A  F  K+ S           STY      R  TSAAIQYG+GAI+GFFSYDNV+VG
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVG

Query:  DVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYW
        DV+VR+Q+LIEATSMS+ TFMAAKFDGILGLGFQEI+TG AVPVWYNMVKQKLVKEQVFSFWLNRNA E+EGGE+VFGGVDPKHFKGQHTYVPVT KGYW
Subjt:  DVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYW

Query:  QFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSD
        QFDIGDILIGGETT          +YCA GCSAIADSGTSLLAGPS IV  INRAIGAA VAHPECKAI+SQ+G+ IMDLLLAKAQPEKICSKIGVCT D
Subjt:  QFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSD

Query:  ETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        ETH VSLKIENVV+DKDGRSSGGFS+AMCSACEMAV W+QDELKQNKT+E II+ VN+LCDRGLNQ  TLVDCGRI QMP VSFTIGDR+FEL+SKD
Subjt:  ETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A1S3B040 aspartic proteinase-like isoform X2 (e value: 2.0e-206; % identity: 77.46)
Query:  SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLL LI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLG SVGKHNQW NNLGES+N+D V LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVG
        KFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY      +  TSAAIQYG+GAIAGFFS DNVRVG
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVG

Query:  DVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYW
        DVVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVT KGYW
Subjt:  DVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYW

Query:  QFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSD
        QFDIGDILIGGETT          +YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS IGVCT D
Subjt:  QFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSD

Query:  ETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        +T  VSLKIENVV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELSSKD
Subjt:  ETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A1S3B058 aspartic proteinase-like isoform X1 (e value: 6.9e-207; % identity: 77.14)
Query:  VTMRR-SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG
        V MR+ SF  LLVSLL LI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLG SVGKHNQW NNLGES+N+D V LKNYLDAQYYGEIG
Subjt:  VTMRR-SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIG

Query:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSY
        IGTPPQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY      +  TSAAIQYG+GAIAGFFS 
Subjt:  IGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSY

Query:  DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPV
        DNVRVGDVVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPV
Subjt:  DNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPV

Query:  TTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKI
        T KGYWQFDIGDILIGGETT          +YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS I
Subjt:  TTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKI

Query:  GVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELS
        GVCT D+T  VSLKIENVV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELS
Subjt:  GVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELS

Query:  SKD
        SKD
Subjt:  SKD

A0A5D3CRY9 Aspartic proteinase-like isoform X2 (e value: 2.6e-206; % identity: 77.46)
Query:  SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ
        SF  LLVSLL LI+ YSS A+S+SNE  LRIGLKKI+ DQNSR KALLESKKGEFLGSSVGK+NQW NNLGES+N+D V LKNYLDAQYYGEIGIGTPPQ
Subjt:  SFKPLLVSLLFLII-YSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQ

Query:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVG
        KFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY      +  TSAAIQYG+GAIAGFFS DNVRVG
Subjt:  KFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVG

Query:  DVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYW
        DVVVR+Q LIEATSMSS TFMAAKFDGILGLGFQEISTG AVPVWYNMVKQKLVKEQVFSFWLNRNA EEEGGE+VFGGVDPKHFKGQHTYVPVT KGYW
Subjt:  DVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYW

Query:  QFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSD
        QFDIGDILIGGETT          +YCA GCSAIADSGTSLLAGPS IV LINRAIGAA VAHPECKAI+SQHG+ IMDLLLAKAQPEKICS IGVCT D
Subjt:  QFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSD

Query:  ETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        +T  VSLKIENVV+DKDGRSSGGFS+AMCSACEMAVSW+QDEL+QNKT+E IID VN+LCDRG NQ  TLVDCGRI QMP+VSFTIGDRVFELSSKD
Subjt:  ETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

A0A6J1IKS1 aspartic proteinase-like (e value: 1.0e-205; % identity: 75.35)
Query:  MRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP
        MR S KPLLVSLL LI+YSS ASS+SNE L+RIGLKKIKV++N  LKAL+ESKK EFLGS   KH+QW N+LGES+NSDIVALKNY+DAQYYGEIGIGTP
Subjt:  MRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTP

Query:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVR
        PQKFTVIFDTGSSNLWVPSSKC+FS+                   A  F  ++ S           STY      R  TSAAIQYG+GAI+GFFSYDNV+
Subjt:  PQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVR

Query:  VGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKG
        VGDVVVR+QQ IE TSMSS TF+AAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKE VFSFWLNRNA EEEGGEIVFGGVDPKHFKGQHTYVPVTTKG
Subjt:  VGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKG

Query:  YWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCT
        YWQF+IGDILIGG+ T          EYCARGCSAIADSGTSLLAGPSTIV LINRAIGAA +  PECK ++SQHG++IMDLLLAK QPEKICSKIGVC 
Subjt:  YWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCT

Query:  SDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
         D +HGVS KIE+VVN+KDG SSGGFSDAMCSACEMAVSWM DELKQNKT+E++IDYVNKLCDR LN+G TLVDCGRI QMPTVSFTIGD+VFEL+++D
Subjt:  SDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

SwissProt top hits
O04057 Aspartic proteinase (e value: 1.0e-154; % identity: 57.58)
Query:  LFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGS
        LFL++  +  SS+SN+ LLR+GLKKIK+D  +RL A +ESK  E L ++  K+N    NLGES ++DIVALKNYLDAQYYGEI IGTPPQKFTVIFDTGS
Subjt:  LFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGS

Query:  SNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLI
        SNLWV   +C+FSV+     H   ++ +  S S                              +  TSA+I+YGTGA++GFFSYDNV+VGD+VV++Q  I
Subjt:  SNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLI

Query:  EATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIG
        EAT   S TF+ AKFDG+LGLGFQEI+ G+AVPVWYNMV+Q LVKE VFSFWLNRN  EEEGGEIVFGGVDPKH++G+HTYVPVT KGYWQFD+GD+LI 
Subjt:  EATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIG

Query:  GETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIE
        GE TG          +C  GCSAIADSGTSLLAGP+ ++ +IN AIGA  V   +CKA+++Q+GQTIMDLLL++A P+KICS+I +CT D T GVS+ IE
Subjt:  GETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLKIE

Query:  NVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        +VV++  G+SS    D MCS CEM V WMQ++L+QN+T+E II+Y+N+LCDR  +  G + VDCG++  MPTVSFTIG ++F+L+ ++
Subjt:  NVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

O65390 Aspartic proteinase A1 (e value: 9.1e-148; % identity: 55.44)
Query:  KPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT
        + + VSL+   +   +A +  N+   R+GLKK+K+D  +RL A +ESK+ + L +           LG+S ++D+V LKNYLDAQYYGEI IGTPPQKFT
Subjt:  KPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT

Query:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDV
        V+FDTGSSNLWVPSSKC FS++                          C LH ++ S+    STY      +   +AAI YGTGAIAGFFS D V VGD+
Subjt:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDV

Query:  VVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQF
        VV+DQ+ IEAT     TF+ AKFDGILGLGFQEIS G A PVWYNM+KQ L+KE VFSFWLNRNA+EEEGGE+VFGGVDP HFKG+HTYVPVT KGYWQF
Subjt:  VVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQF

Query:  DIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDET
        D+GD+LIGG  TG          +C  GCSAIADSGTSLLAGP+TI+ +IN AIGAA V   +CK ++ Q+GQTI+DLLL++ QP+KICS+IG+CT D T
Subjt:  DIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDET

Query:  HGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
         GVS+ IE+VV+ ++ + S G  DA CSACEMAV W+Q +L+QN T+E I++YVN+LC+R     G + VDC ++  MPTVS TIG +VF+L+ ++
Subjt:  HGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

P42210 Phytepsin (e value: 6.6e-138; % identity: 53.35)
Query:  LLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVI
        LL ++L L     AAS +  E L+RI LKK  +D+NSR+   L   + + L S         N L      DIVALKNY++AQY+GEIG+GTPPQKFTVI
Subjt:  LLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVI

Query:  FDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVR
        FDTGSSNLWVPS+KC FS++     ++  ++ A  S                       STY      +    AAIQYGTG+IAG+FS D+V VGD+VV+
Subjt:  FDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVR

Query:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG
        DQ+ IEAT     TF+ AKFDGILGLGF+EIS G AVPVWY M++Q LV + VFSFWLNR+ +E EGGEI+FGG+DPKH+ G+HTYVPVT KGYWQFD+G
Subjt:  DQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIG

Query:  DILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGV
        D+L+GG++TG          +CA GC+AIADSGTSLLAGP+ I+  IN  IGAA V   ECK I+SQ+GQ I+DLLLA+ QP+KICS++G+CT D T GV
Subjt:  DILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGV

Query:  SLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        S  I +VV+D+  +S+G  +D MCSACEMAV WMQ++L QNKT++ I+DYVN+LC+R     G + VDCG +  MP + FTIG + F L  ++
Subjt:  SLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

Q42456 Aspartic proteinase oryzasin-1 (e value: 5.4e-140; % identity: 54.29)
Query:  LLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDT
        LL  ++  +   +S+ E L+RI LKK  +D+NSR+ A L  ++G      +G      N+L G     DIVALKNY++AQY+GEIG+GTPPQKFTVIFDT
Subjt:  LLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDT

Query:  GSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQ
        GSSNLWVPS+KC FS+                   A  F  ++ S           STY      +    AAIQYGTG+IAGFFS D+V VGD+VV+DQ+
Subjt:  GSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQ

Query:  LIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDIL
         IEAT     TFM AKFDGILGLGFQEIS GDAVPVWY MV+Q LV E VFSFW NR+++E EGGEIVFGG+DP H+KG HTYVPV+ KGYWQF++GD+L
Subjt:  LIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGDIL

Query:  IGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLK
        IGG+TTG          +CA GCSAIADSGTSLLAGP+ I+  IN  IGA  V   ECK ++SQ+GQ I+DLLLA+ QP KICS++G+CT D  HGVS  
Subjt:  IGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVSLK

Query:  IENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        I++VV+D+ G S+G  S  MC+ACEMAV WMQ++L QNKT++ I++Y+N+LCD+     G + VDCG +  MP +SFTIG + F L  ++
Subjt:  IENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

Q8VYL3 Aspartic proteinase A2 (e value: 4.1e-148; % identity: 55.49)
Query:  VSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIF
        V + FL+ ++  A S  N+   R+GLKK+K+D N+RL     SK+ E L SS+  +N   NNL G+S ++DIV LKNYLDAQYYGEI IGTPPQKFTVIF
Subjt:  VSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIF

Query:  DTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRD
        DTGSSNLWVPS KC FS+S                     F  K+ S           STY      ++   AAI YG+G+I+GFFSYD V VGD+VV+D
Subjt:  DTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRD

Query:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD
        Q+ IE TS    TF+ AKFDG+LGLGFQEI+ G+A PVWYNM+KQ L+K  VFSFWLNR+   EEGGEIVFGGVDPKHF+G+HT+VPVT +GYWQFD+G+
Subjt:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD

Query:  ILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVS
        +LI GE+TG          YC  GCSAIADSGTSLLAGP+ +VA+IN+AIGA+ V   +CK ++ Q+GQTI+DLLLA+ QP+KICS+IG+C  D THGVS
Subjt:  ILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVS

Query:  LKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        + IE+VV+ ++ RSS G  DA C ACEMAV W+Q +L+QN T+E I++Y+N++C+R  +  G + VDC ++ +MPTVSFTIG +VF+L+ ++
Subjt:  LKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

Arabidopsis top hits
AT1G11910.1 aspartic proteinase A1 (e value: 6.5e-149; % identity: 55.44)
Query:  KPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT
        + + VSL+   +   +A +  N+   R+GLKK+K+D  +RL A +ESK+ + L +           LG+S ++D+V LKNYLDAQYYGEI IGTPPQKFT
Subjt:  KPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFT

Query:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDV
        V+FDTGSSNLWVPSSKC FS++                          C LH ++ S+    STY      +   +AAI YGTGAIAGFFS D V VGD+
Subjt:  VIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLH-EFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDV

Query:  VVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQF
        VV+DQ+ IEAT     TF+ AKFDGILGLGFQEIS G A PVWYNM+KQ L+KE VFSFWLNRNA+EEEGGE+VFGGVDP HFKG+HTYVPVT KGYWQF
Subjt:  VVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQF

Query:  DIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDET
        D+GD+LIGG  TG          +C  GCSAIADSGTSLLAGP+TI+ +IN AIGAA V   +CK ++ Q+GQTI+DLLL++ QP+KICS+IG+CT D T
Subjt:  DIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDET

Query:  HGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
         GVS+ IE+VV+ ++ + S G  DA CSACEMAV W+Q +L+QN T+E I++YVN+LC+R     G + VDC ++  MPTVS TIG +VF+L+ ++
Subjt:  HGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDR-GLNQGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT1G62290.1 Saposin-like aspartyl protease family protein (e value: 2.9e-149; % identity: 55.49)
Query:  VSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIF
        V + FL+ ++  A S  N+   R+GLKK+K+D N+RL     SK+ E L SS+  +N   NNL G+S ++DIV LKNYLDAQYYGEI IGTPPQKFTVIF
Subjt:  VSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIF

Query:  DTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRD
        DTGSSNLWVPS KC FS+S                     F  K+ S           STY      ++   AAI YG+G+I+GFFSYD V VGD+VV+D
Subjt:  DTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRD

Query:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD
        Q+ IE TS    TF+ AKFDG+LGLGFQEI+ G+A PVWYNM+KQ L+K  VFSFWLNR+   EEGGEIVFGGVDPKHF+G+HT+VPVT +GYWQFD+G+
Subjt:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD

Query:  ILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVS
        +LI GE+TG          YC  GCSAIADSGTSLLAGP+ +VA+IN+AIGA+ V   +CK ++ Q+GQTI+DLLLA+ QP+KICS+IG+C  D THGVS
Subjt:  ILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVS

Query:  LKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        + IE+VV+ ++ RSS G  DA C ACEMAV W+Q +L+QN T+E I++Y+N++C+R  +  G + VDC ++ +MPTVSFTIG +VF+L+ ++
Subjt:  LKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT1G62290.2 Saposin-like aspartyl protease family protein (e value: 2.9e-149; % identity: 55.49)
Query:  VSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIF
        V + FL+ ++  A S  N+   R+GLKK+K+D N+RL     SK+ E L SS+  +N   NNL G+S ++DIV LKNYLDAQYYGEI IGTPPQKFTVIF
Subjt:  VSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNL-GESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIF

Query:  DTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRD
        DTGSSNLWVPS KC FS+S                     F  K+ S           STY      ++   AAI YG+G+I+GFFSYD V VGD+VV+D
Subjt:  DTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRD

Query:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD
        Q+ IE TS    TF+ AKFDG+LGLGFQEI+ G+A PVWYNM+KQ L+K  VFSFWLNR+   EEGGEIVFGGVDPKHF+G+HT+VPVT +GYWQFD+G+
Subjt:  QQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQFDIGD

Query:  ILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVS
        +LI GE+TG          YC  GCSAIADSGTSLLAGP+ +VA+IN+AIGA+ V   +CK ++ Q+GQTI+DLLLA+ QP+KICS+IG+C  D THGVS
Subjt:  ILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDETHGVS

Query:  LKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        + IE+VV+ ++ RSS G  DA C ACEMAV W+Q +L+QN T+E I++Y+N++C+R  +  G + VDC ++ +MPTVSFTIG +VF+L+ ++
Subjt:  LKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLN-QGATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT4G04460.1 Saposin-like aspartyl protease family protein | e value: 9.1e-135 | % identity: 52.11
Query:  LLVSLL-FLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        LLV LL  LI+ S+A+   + +  +RIGLKK K+D+++RL + L  K     GS     + +  N     N+D+V LKNYLDAQYYG+I IGTPPQKFTV
Subjt:  LLVSLL-FLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTS---AAIQYGTGAIAGFFSYDNVRVGD
        IFDTGSSNLW+PS+KC  SV+                          C  H         S Y   Q    R +   A+I+YGTGAI+G+FS D+V+VGD
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTS---AAIQYGTGAIAGFFSYDNVRVGD

Query:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ
        +VV++Q+ IEATS    TF+ AKFDGILGLGF+EIS G++ PVWYNMV++ LVKE +FSFWLNRN  + EGGEIVFGGVDPKHFKG+HT+VPVT KGYWQ
Subjt:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ

Query:  FDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDE
        FD+GD+ I G+ TG          YCA+GCSAIADSGTSLL GPST++ +IN AIGA  +   ECKA++ Q+G+T+++ LLA+  P+K+CS+IGVC  D 
Subjt:  FDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDE

Query:  THGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        T  VS+ I++VV+D    +SG  + AMCSACEMA  WM+ EL QN+T+E I+ Y  +LCD    Q   + VDCGR+  MP V+F+IG R F+L+ +D
Subjt:  THGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD

AT4G04460.2 Saposin-like aspartyl protease family protein | e value: 3.2e-132 | % identity: 51.91
Query:  LLVSLL-FLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV
        LLV LL  LI+ S+A+   + +  +RIGLKK K+D+++RL + L  K     GS     + +  N     N+D+V LKNYLDAQYYG+I IGTPPQKFTV
Subjt:  LLVSLL-FLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWDNNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTV

Query:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTS---AAIQYGTGAIAGFFSYDNVRVGD
        IFDTGSSNLW+PS+KC  SV+                          C  H         S Y   Q    R +   A+I+YGTGAI+G+FS D+V+VGD
Subjt:  IFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQLLRARTS---AAIQYGTGAIAGFFSYDNVRVGD

Query:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ
        +VV++Q+ IEATS    TF+ AKFDGILGLGF+EIS G++ PVWYNMV++ LVKE +FSFWLNRN  + EGGEIVFGGVDPKHFKG+HT+VPVT KGYWQ
Subjt:  VVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVFGGVDPKHFKGQHTYVPVTTKGYWQ

Query:  FDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDE
        FD+GD+ I G+ TG          YCA+GCSAIADSGTSLL GPST++ +IN AIGA  +   ECKA++ Q+G+T+++ LLA    +K+CS+IGVC  D 
Subjt:  FDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLLAKAQPEKICSKIGVCTSDE

Query:  THGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD
        T  VS+ I++VV+D    +SG  + AMCSACEMA  WM+ EL QN+T+E I+ Y  +LCD    Q   + VDCGR+  MP V+F+IG R F+L+ +D
Subjt:  THGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQG-ATLVDCGRIPQMPTVSFTIGDRVFELSSKD


Sequences
CDS sequence
ATGAAGCCATTGCAAAACCCACCAAGAAAAGGAATTTCTACTTTTGGGTCTCTTCTGAATTTTATCACCCTCTATTTCATTTTTCTGGATGAATGGATTGCAGTT
ACAATGAGAAGAAGCTTTAAGCCCCTCTTGGTTTCTTTGTTGTTCTTAATCATATATTCCTCTGCAGCATCCTCTTCTTCTAATGAAGAGTTGTTAAGGATTGGA
TTGAAGAAGATAAAAGTTGATCAAAACAGTCGGTTAAAAGCATTGCTTGAGTCAAAGAAAGGAGAGTTTTTAGGATCTTCTGTTGGAAAACATAATCAATGGGAT
AATAATCTTGGAGAATCTAGAAATAGTGATATTGTAGCATTAAAGAATTATTTGGATGCTCAATACTATGGAGAGATTGGCATTGGCACTCCACCTCAAAAGTTC
ACTGTAATTTTCGATACTGGAAGCTCTAATTTGTGGGTGCCATCTTCGAAATGTATTTTCTCGGTATCAATCAGGACGGTCAAGCACATACAGAAGAAATGGTCT
GCTTTCTTCTCTTTGTCTGCAATAGTCTTCAAAATGAAATTTTGTTCACTGCACGAGTTTGATTCAACCTTATTTCATTTATCCACATATTGGATTGATCAACTC
TTAAGGGCTAGAACATCTGCTGCTATTCAGTATGGTACAGGAGCTATTGCTGGCTTCTTTAGTTATGACAACGTTCGAGTTGGTGATGTTGTTGTCCGTGATCAG
CAACTCATTGAGGCAACTAGCATGTCTAGTACGACATTCATGGCTGCCAAATTTGATGGTATATTGGGACTTGGATTTCAAGAGATCTCGACTGGTGACGCTGTT
CCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCAAGTTTTCTCATTTTGGCTGAATCGCAATGCCAATGAGGAAGAAGGAGGTGAAATTGTGTTT
GGAGGGGTTGATCCTAAGCACTTCAAAGGCCAACATACATATGTGCCTGTGACAACCAAAGGGTACTGGCAGTTTGACATCGGCGATATTCTTATTGGTGGTGAA
ACGACAGGTATGTTCTCTTTACTTATCTCGATTTTGGAATATTGTGCTCGTGGTTGCTCTGCGATTGCGGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTACT
ATAGTGGCATTAATTAATAGAGCCATTGGAGCAGCTGAAGTTGCTCATCCAGAATGCAAAGCAATTATTTCACAGCATGGACAGACTATTATGGATTTGCTTTTA
GCAAAAGCACAACCAGAGAAGATATGTTCCAAAATTGGCGTGTGTACCTCTGATGAAACTCATGGTGTTAGTTTGAAAATTGAGAATGTGGTGAATGATAAAGAT
GGTAGATCATCTGGTGGCTTCTCAGATGCTATGTGCTCAGCTTGTGAGATGGCGGTTTCCTGGATGCAAGATGAGCTGAAGCAGAATAAAACTCGAGAGTATATC
ATTGATTATGTCAATAAGCTATGTGATCGTGGGTTGAACCAAGGAGCAACATTGGTTGACTGTGGACGGATCCCTCAAATGCCTACCGTGTCCTTCACCATTGGC
GACAGAGTTTTTGAGCTTAGCTCAAAAGATCTTAAGCATTTAGCCTTATTCTCTTTAACCCCATTGCTTTCTGTGAACAGTACGTTCTCAAGGTGGGTGAGGGAT
CTGCAGCTCAATGCATCAGTGGATTCATACCTTTGGACATTCCTCCTCCTCGTGGACCCCTATGGATCTTGGGAGACGTCTTCATGGGACGTTATCACACAGTCT
TTGATTTTGGCAAAACAAGAGTCGGATTTGCGGACGCTGCTTGAAGAACATATTCTGATGCTCTACAGTGCCATATACAACCAAAATTATTATTCCTTGAAAGCT
AAAATACATATACAAAAGGCTCCAACCGGGAATATGTGGAATCAAGCCATTGCAAAACCAATTTCTACGTTTGCTCCCTTCTGTTCTAAGGAATGGAGTGAAGTG
ACACCTCCTCTTTCAACCATTAAACGATTGCCAAAGTTTGTTCCCTTCTCCTTCGTCACCCTCTCTTTCTCTCCGTCCCTCCCTTTCACTTCAGTCCTTCTGGCG
AAACCACACAGCTCCGGTGAATATCCACGACCGAGGATGGCGGTAGGATGGGTCGACCCAGGCGACGGTGTGGGTCTGTTTGGGGCGAAAATGGTTGCAAGGGAC
CTTGAGGCCCTGCGCAGAGGTGAGGTCGATCTCCAAGACGGACATGGGGGTAGCCAGAGGGTTCTTGGGAACCAAGTCTCCGTCGACCTCCATGGATCAAGTACG
AGAATTGTGTCTACAAAATTGATGGTGGAACTAATTTTGCTATCAGTGATTAGACTTTCAGCTCACACATTTTATATCCATGGCCTGGAAGTTGTATATTTACAT
CACATGAACACTGCTATTCAGTATGGAACATCAGCTGCTATTCGATATGGTTCAGGAGATATTGCCGGTTTCTTTAGTTGTGACAATGTTCGAGTCGGTGATGTT
GTTGTTCGTGATCAGCAACTCATTGAGGCAACTAGAATGTCTGGTGAGATATTCATGGCTGCCAAATTTGATGGTATATTGGGACTTGGATTCCAAGAGATCTCG
ACTGGTGGCGCGGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCCAGTTTTCTCATTTTGGCTGAATCGCAATGCCAAGGAGAAAGGTGGT
GAACTTGTGTTTGGCGGGCTTGATCCTAAGCACTTCAAAGGCCAGCATTCATTTGTGCCTGTGAAAACCAAAGGGTATTGGCAGTTTGACATCGGTGATATTCTT
ATTGCTGGTGAAACGACAGGTATGTTCCCTCTACTTATCTCAGTTTTGGAATATTGTGCTCGTGGTTGCTCTGCGATTGCAGATTCTGGAACTTCTTTGTTGGCT
GGTCCATCTGCTATAGTGGAAAAAATTAATAAAGCCATTGGAGCAGCTGCAGTTGCTCATCCAGAATGCAAGGCAATTGTTTCACAACATGGACAGGCTATTATG
GATTTGCTTTTAGCCAAGGCACAACCAGAGAAGATATGTTCCAAAATTGGCGTGTGTACCTTTGTTGAAACTCATGGTGTTAAGTTAGTCTTAGAAAGCTTCAAC
AAAGACAATTTGAAAATTGAGAGTATGGTGAATGATAAAGATGGTAGATCATCTGGTGGCTTCTCAGATGCTATGTGCTCAGCTTGTGAGATGGCAGTTTCCTGG
ATGCAAGATGAGCTGAAGCAGAACGAAACTCAAGAACATATCATTGATTATGTCAATGAGGTATACTAG
mRNA sequence
ATGAAGCCATTGCAAAACCCACCAAGAAAAGGAATTTCTACTTTTGGGTCTCTTCTGAATTTTATCACCCTCTATTTCATTTTTCTGGATGAATGGATTGCAGTT
ACAATGAGAAGAAGCTTTAAGCCCCTCTTGGTTTCTTTGTTGTTCTTAATCATATATTCCTCTGCAGCATCCTCTTCTTCTAATGAAGAGTTGTTAAGGATTGGA
TTGAAGAAGATAAAAGTTGATCAAAACAGTCGGTTAAAAGCATTGCTTGAGTCAAAGAAAGGAGAGTTTTTAGGATCTTCTGTTGGAAAACATAATCAATGGGAT
AATAATCTTGGAGAATCTAGAAATAGTGATATTGTAGCATTAAAGAATTATTTGGATGCTCAATACTATGGAGAGATTGGCATTGGCACTCCACCTCAAAAGTTC
ACTGTAATTTTCGATACTGGAAGCTCTAATTTGTGGGTGCCATCTTCGAAATGTATTTTCTCGGTATCAATCAGGACGGTCAAGCACATACAGAAGAAATGGTCT
GCTTTCTTCTCTTTGTCTGCAATAGTCTTCAAAATGAAATTTTGTTCACTGCACGAGTTTGATTCAACCTTATTTCATTTATCCACATATTGGATTGATCAACTC
TTAAGGGCTAGAACATCTGCTGCTATTCAGTATGGTACAGGAGCTATTGCTGGCTTCTTTAGTTATGACAACGTTCGAGTTGGTGATGTTGTTGTCCGTGATCAG
CAACTCATTGAGGCAACTAGCATGTCTAGTACGACATTCATGGCTGCCAAATTTGATGGTATATTGGGACTTGGATTTCAAGAGATCTCGACTGGTGACGCTGTT
CCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCAAGTTTTCTCATTTTGGCTGAATCGCAATGCCAATGAGGAAGAAGGAGGTGAAATTGTGTTT
GGAGGGGTTGATCCTAAGCACTTCAAAGGCCAACATACATATGTGCCTGTGACAACCAAAGGGTACTGGCAGTTTGACATCGGCGATATTCTTATTGGTGGTGAA
ACGACAGGTATGTTCTCTTTACTTATCTCGATTTTGGAATATTGTGCTCGTGGTTGCTCTGCGATTGCGGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTACT
ATAGTGGCATTAATTAATAGAGCCATTGGAGCAGCTGAAGTTGCTCATCCAGAATGCAAAGCAATTATTTCACAGCATGGACAGACTATTATGGATTTGCTTTTA
GCAAAAGCACAACCAGAGAAGATATGTTCCAAAATTGGCGTGTGTACCTCTGATGAAACTCATGGTGTTAGTTTGAAAATTGAGAATGTGGTGAATGATAAAGAT
GGTAGATCATCTGGTGGCTTCTCAGATGCTATGTGCTCAGCTTGTGAGATGGCGGTTTCCTGGATGCAAGATGAGCTGAAGCAGAATAAAACTCGAGAGTATATC
ATTGATTATGTCAATAAGCTATGTGATCGTGGGTTGAACCAAGGAGCAACATTGGTTGACTGTGGACGGATCCCTCAAATGCCTACCGTGTCCTTCACCATTGGC
GACAGAGTTTTTGAGCTTAGCTCAAAAGATCTTAAGCATTTAGCCTTATTCTCTTTAACCCCATTGCTTTCTGTGAACAGTACGTTCTCAAGGTGGGTGAGGGAT
CTGCAGCTCAATGCATCAGTGGATTCATACCTTTGGACATTCCTCCTCCTCGTGGACCCCTATGGATCTTGGGAGACGTCTTCATGGGACGTTATCACACAGTCT
TTGATTTTGGCAAAACAAGAGTCGGATTTGCGGACGCTGCTTGAAGAACATATTCTGATGCTCTACAGTGCCATATACAACCAAAATTATTATTCCTTGAAAGCT
AAAATACATATACAAAAGGCTCCAACCGGGAATATGTGGAATCAAGCCATTGCAAAACCAATTTCTACGTTTGCTCCCTTCTGTTCTAAGGAATGGAGTGAAGTG
ACACCTCCTCTTTCAACCATTAAACGATTGCCAAAGTTTGTTCCCTTCTCCTTCGTCACCCTCTCTTTCTCTCCGTCCCTCCCTTTCACTTCAGTCCTTCTGGCG
AAACCACACAGCTCCGGTGAATATCCACGACCGAGGATGGCGGTAGGATGGGTCGACCCAGGCGACGGTGTGGGTCTGTTTGGGGCGAAAATGGTTGCAAGGGAC
CTTGAGGCCCTGCGCAGAGGTGAGGTCGATCTCCAAGACGGACATGGGGGTAGCCAGAGGGTTCTTGGGAACCAAGTCTCCGTCGACCTCCATGGATCAAGTACG
AGAATTGTGTCTACAAAATTGATGGTGGAACTAATTTTGCTATCAGTGATTAGACTTTCAGCTCACACATTTTATATCCATGGCCTGGAAGTTGTATATTTACAT
CACATGAACACTGCTATTCAGTATGGAACATCAGCTGCTATTCGATATGGTTCAGGAGATATTGCCGGTTTCTTTAGTTGTGACAATGTTCGAGTCGGTGATGTT
GTTGTTCGTGATCAGCAACTCATTGAGGCAACTAGAATGTCTGGTGAGATATTCATGGCTGCCAAATTTGATGGTATATTGGGACTTGGATTCCAAGAGATCTCG
ACTGGTGGCGCGGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCCAGTTTTCTCATTTTGGCTGAATCGCAATGCCAAGGAGAAAGGTGGT
GAACTTGTGTTTGGCGGGCTTGATCCTAAGCACTTCAAAGGCCAGCATTCATTTGTGCCTGTGAAAACCAAAGGGTATTGGCAGTTTGACATCGGTGATATTCTT
ATTGCTGGTGAAACGACAGGTATGTTCCCTCTACTTATCTCAGTTTTGGAATATTGTGCTCGTGGTTGCTCTGCGATTGCAGATTCTGGAACTTCTTTGTTGGCT
GGTCCATCTGCTATAGTGGAAAAAATTAATAAAGCCATTGGAGCAGCTGCAGTTGCTCATCCAGAATGCAAGGCAATTGTTTCACAACATGGACAGGCTATTATG
GATTTGCTTTTAGCCAAGGCACAACCAGAGAAGATATGTTCCAAAATTGGCGTGTGTACCTTTGTTGAAACTCATGGTGTTAAGTTAGTCTTAGAAAGCTTCAAC
AAAGACAATTTGAAAATTGAGAGTATGGTGAATGATAAAGATGGTAGATCATCTGGTGGCTTCTCAGATGCTATGTGCTCAGCTTGTGAGATGGCAGTTTCCTGG
ATGCAAGATGAGCTGAAGCAGAACGAAACTCAAGAACATATCATTGATTATGTCAATGAGGTATACTAG
Protein sequence
MKPLQNPPRKGISTFGSLLNFITLYFIFLDEWIAVTMRRSFKPLLVSLLFLIIYSSAASSSSNEELLRIGLKKIKVDQNSRLKALLESKKGEFLGSSVGKHNQWD
NNLGESRNSDIVALKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCIFSVSIRTVKHIQKKWSAFFSLSAIVFKMKFCSLHEFDSTLFHLSTYWIDQL
LRARTSAAIQYGTGAIAGFFSYDNVRVGDVVVRDQQLIEATSMSSTTFMAAKFDGILGLGFQEISTGDAVPVWYNMVKQKLVKEQVFSFWLNRNANEEEGGEIVF
GGVDPKHFKGQHTYVPVTTKGYWQFDIGDILIGGETTGMFSLLISILEYCARGCSAIADSGTSLLAGPSTIVALINRAIGAAEVAHPECKAIISQHGQTIMDLLL
AKAQPEKICSKIGVCTSDETHGVSLKIENVVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNKTREYIIDYVNKLCDRGLNQGATLVDCGRIPQMPTVSFTIG
DRVFELSSKDLKHLALFSLTPLLSVNSTFSRWVRDLQLNASVDSYLWTFLLLVDPYGSWETSSWDVITQSLILAKQESDLRTLLEEHILMLYSAIYNQNYYSLKA
KIHIQKAPTGNMWNQAIAKPISTFAPFCSKEWSEVTPPLSTIKRLPKFVPFSFVTLSFSPSLPFTSVLLAKPHSSGEYPRPRMAVGWVDPGDGVGLFGAKMVARD
LEALRRGEVDLQDGHGGSQRVLGNQVSVDLHGSSTRIVSTKLMVELILLSVIRLSAHTFYIHGLEVVYLHHMNTAIQYGTSAAIRYGSGDIAGFFSCDNVRVGDV
VVRDQQLIEATRMSGEIFMAAKFDGILGLGFQEISTGGAVPVWYNMVKQKLVKEPVFSFWLNRNAKEKGGELVFGGLDPKHFKGQHSFVPVKTKGYWQFDIGDIL
IAGETTGMFPLLISVLEYCARGCSAIADSGTSLLAGPSAIVEKINKAIGAAAVAHPECKAIVSQHGQAIMDLLLAKAQPEKICSKIGVCTFVETHGVKLVLESFN
KDNLKIESMVNDKDGRSSGGFSDAMCSACEMAVSWMQDELKQNETQEHIIDYVNEVY
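
The protein sequence above should correspond to the CDS sequence read codon by codon. The following is a minimal Python sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` and the short test fragments are illustrative and are not part of the database record. It is not the pipeline used to produce this annotation.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated with bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides and last 9 nucleotides of the CDS listed above.
cds_start = "ATGAAGCCATTGCAAAACCCACCAAGAAAA"
cds_end = "GTATACTAG"

print(translate(cds_start))  # MKPLQNPPRK, the first ten residues of the protein
print(translate(cds_end))    # VY, the last two residues before the stop codon
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence, joined into a single string, extends the same check to the whole gene model.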