CuGenDBv2

PI0019107 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0019107
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Aspartic proteinase-like protein 1
Genome location: chr09:12681748..12685342
RNA-Seq Expression: PI0019107
Synteny: PI0019107
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
    IPR001461 - Aspartic peptidase A1 family
    IPR001969 - Aspartic peptidase, active site
    IPR021109 - Aspartic peptidase domain superfamily
    IPR032799 - Xylanase inhibitor, C-terminal
    IPR032861 - Xylanase inhibitor, N-terminal
    IPR033121 - Peptidase family A1 domain
    IPR034164 - Pepsin-like domain


Homology
GenBank top hits    e value    % identity    Alignment
KAA0062411.1 aspartic proteinase-like protein 1 [Cucumis melo var. makuwa]    1.3e-252    88.78
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANCALL LLIACLFVDCSLGLTLSLKLVHRFSDEAKSLW+S R  NVSAKFWPPRNSLKYFQML+DYDLKRRRLKIGSKYD+LFPSEGSQV+FFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTP VPFLVALDVGSDLLWVPCDC+QCAPLSASYYSVLDRDLSEYNP+LSSTSK LFCGHQL                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        +MIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RI+FGDDGP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G +AAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQ FIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS
        VYILPANQGYKVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSSTTEHAKPPSNNGN KSPIALPPTN QAIAPTAAR SSKSS
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS

XP_004140314.1 aspartic proteinase-like protein 1 [Cucumis sativus]    6.3e-247    87.17
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANCALL L IA LFV+CSL LTLSL LVHRFSDEAKSLWESRR  NVSAKFWPP NSLKYFQMLMDYDLKRRRL IGSKYDVLFPSEGSQV+FFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWID+GTPSVPFLVALDVGSDLLWVPCDCIQCAPLSA+YYSVLDRDLSEYNP+LSSTSK LFCGHQL                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        FMIEDKL LTSFSKHGTHSLLQASVV GCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RILFGDDGP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G +AAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVK NATRIVL+ELPWNYCYN+STLVSFNIPSM+LVFPLNQ FIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS
        VY+LPANQGYKVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSSTTEHAKPPSNNGN KSPIALPPTN QAIAPTAAR SSKSS
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS

XP_008460494.1 PREDICTED: aspartic proteinase-like protein 1 [Cucumis melo]    2.9e-252    88.58
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANC+LL LLIACLFVDCSLGLTLSLKLVHRFSDEAKSLW+S R  NVSAKFWPPRNSLKYFQML+DYDLKRRRLKIGSKYD+LFPSEGSQV+FFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTP VPFLVALDVGSDLLWVPCDC+QCAPLSASYYSVLDRDLSEYNP+LSSTSK LFCGHQL                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        +MIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RI+FGDDGP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G +AAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQ FIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS
        VYILPANQGYKVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSSTTEHAKPPSNNGN KSPIALPPTN QAIAPTAAR SSKSS
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS

XP_022141351.1 aspartic proteinase-like protein 1 [Momordica charantia]    1.5e-208    74.46
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANC ++ LL+AC   D S+  T S KL+HRFS+EAK+LWES R+ NVS KFWP R+SLKYF++LMD DLKRRRLKIGSK ++L PSEGS+V+FFGNEF+
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTPSV FLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNP+LSSTSK L C HQL                   YYSDNTS+SG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        FMIEDKLHL SFSKH   SL+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFD NGS RILFGD+G            
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G + AYF+GVESFCVGSSCLQ+SGFQALVDSGSSFTYLPAEVY++IVFEFDKQVK NATRI LQE PW+YCYN+S+L S NIPSMKLVFPLNQSFIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALP------PTNGQAIAPTAARM
        VY+LP NQGYK+F LTLEETD+DYG+IGQNLMVGY+MVFDRENLKLGWSKS+CLDINS  + +AKPPSN+G   SPIA+P      P N QAIAPTA+R 
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALP------PTNGQAIAPTAARM

Query:  SSKSS
        +S  S
Subjt:  SSKSS

XP_038875404.1 aspartic proteinase-like protein 1 isoform X2 [Benincasa hispida]    5.5e-235    81.26
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANC L++LL+A LFVD SL LTL+ KLVHRFSDEAKSLWESR+  NVS KFWPPRNS KYF+MLMDYDLKRRRLK GSKYDVLFPSEGSQVMFFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCI CAPLSASYYSVLDRDLSEYNP+LSSTSK L CGH+L                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        FMIEDKLHL SFSKHGT SLLQASV+LGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RILFGD+GP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G + AYF+GVESFCVGSSCLQ+SGFQALVDSGSSFTYLP+EVYKKIVFEFDKQVKFNATRIVLQELPWNYCYN S+LVSF+IPSMKLVFPLNQSFIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSSC
        VYILPAN+G KVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSS T+HAKPPS++GN  SPIALPP N QAIAPTAAR S KSS 
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSSC

Query:  ILFFSLVATVACSFSGCLLDLLS
                T +C FS  LL LL+
Subjt:  ILFFSLVATVACSFSGCLLDLLS

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KN37 Peptidase A1 domain-containing protein    3.0e-247    87.17
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANCALL L IA LFV+CSL LTLSL LVHRFSDEAKSLWESRR  NVSAKFWPP NSLKYFQMLMDYDLKRRRL IGSKYDVLFPSEGSQV+FFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWID+GTPSVPFLVALDVGSDLLWVPCDCIQCAPLSA+YYSVLDRDLSEYNP+LSSTSK LFCGHQL                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        FMIEDKL LTSFSKHGTHSLLQASVV GCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RILFGDDGP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G +AAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVK NATRIVL+ELPWNYCYN+STLVSFNIPSM+LVFPLNQ FIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS
        VY+LPANQGYKVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSSTTEHAKPPSNNGN KSPIALPPTN QAIAPTAAR SSKSS
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS

A0A1S3CD36 aspartic proteinase-like protein 1    1.4e-252    88.58
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANC+LL LLIACLFVDCSLGLTLSLKLVHRFSDEAKSLW+S R  NVSAKFWPPRNSLKYFQML+DYDLKRRRLKIGSKYD+LFPSEGSQV+FFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTP VPFLVALDVGSDLLWVPCDC+QCAPLSASYYSVLDRDLSEYNP+LSSTSK LFCGHQL                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        +MIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RI+FGDDGP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G +AAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQ FIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS
        VYILPANQGYKVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSSTTEHAKPPSNNGN KSPIALPPTN QAIAPTAAR SSKSS
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS

A0A5A7V9R0 Aspartic proteinase-like protein 1    6.3e-253    88.78
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANCALL LLIACLFVDCSLGLTLSLKLVHRFSDEAKSLW+S R  NVSAKFWPPRNSLKYFQML+DYDLKRRRLKIGSKYD+LFPSEGSQV+FFGNEFN
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTP VPFLVALDVGSDLLWVPCDC+QCAPLSASYYSVLDRDLSEYNP+LSSTSK LFCGHQL                   YYSDNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        +MIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGS RI+FGDDGP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G +AAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQ FIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS
        VYILPANQGYKVF LTLEETDEDYGVIGQNLMVGY+MVFDRENLKLGWSKS+CLDINSSTTEHAKPPSNNGN KSPIALPPTN QAIAPTAAR SSKSS
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAARMSSKSS

A0A6J1CJM3 aspartic proteinase-like protein 1    7.3e-209    74.46
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MANC ++ LL+AC   D S+  T S KL+HRFS+EAK+LWES R+ NVS KFWP R+SLKYF++LMD DLKRRRLKIGSK ++L PSEGS+V+FFGNEF+
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTPSV FLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNP+LSSTSK L C HQL                   YYSDNTS+SG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        FMIEDKLHL SFSKH   SL+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFD NGS RILFGD+G            
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G + AYF+GVESFCVGSSCLQ+SGFQALVDSGSSFTYLPAEVY++IVFEFDKQVK NATRI LQE PW+YCYN+S+L S NIPSMKLVFPLNQSFIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALP------PTNGQAIAPTAARM
        VY+LP NQGYK+F LTLEETD+DYG+IGQNLMVGY+MVFDRENLKLGWSKS+CLDINS  + +AKPPSN+G   SPIA+P      P N QAIAPTA+R 
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALP------PTNGQAIAPTAARM

Query:  SSKSS
        +S  S
Subjt:  SSKSS

A0A6J1HK50 aspartic proteinase-like protein 1    1.2e-206    72.11
Query:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN
        MAN  ++ L +AC FVD S+ L LS +L+HRFSDEAK+LW+SR   N S KFWP RNSLKYF+ L DYDLKRRRLKIGSKY+V+FPSEG++V+FFGNEF+
Subjt:  MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFN

Query:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG
        WLHYTWIDIGTPSV FLVALD GSDLLWVPCDCIQCAPLSAS+YS LDRDLS YNP+LS+TS+ L C HQL                   YY+DNTSTSG
Subjt:  WLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLY------------------YYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------
        FMIEDKLHL SFSKHGT  LLQASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGS RILFGD+GP           
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP-----------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL
         G + AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK NATRI+LQE PWNYCYN S+L S  IPSMKLVFPLNQSFIHD 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDL

Query:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALP------PTNGQAIAPTAARM
        VY LP +QGYK+F LTLEETD+DYGVIGQNLMVGY++VFDRENL+LGWSKS+CLDIN     HAKPPSN+G   SP ALP      P N Q IAPTAAR 
Subjt:  VYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALP------PTNGQAIAPTAARM

Query:  SSKSSCILFFSLVATVACSFSGCLLDL
         SKS      SL A     FS C L L
Subjt:  SSKSSCILFFSLVATVACSFSGCLLDL

SwissProt top hits    e value    % identity    Alignment
Q4V3D2 Aspartic proteinase 36    1.6e-19    25.89
Query:  LHYTWIDIGTPSVPFLVALDVGSDLLWVPC-DCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQL-------------------YYYSDNTSTS
        L++T I +G+P   + V +D GSD+LWV C  C +C P+       L   LS Y+   SSTSK++ C                         Y D +++ 
Subjt:  LHYTWIDIGTPSVPFLVALDVGSDLLWVPC-DCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQL-------------------YYYSDNTSTS

Query:  GFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSY-LDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN-NGSRRILFGD-DGP-------
        G  I+D + L   + +   + L   VV GCG+ QSG      +A DG+MG G  N S+ + LA  G  +  FS C DN NG      G+ + P       
Subjt:  GFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSY-LDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN-NGSRRILFGD-DGP-------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQA--------LVDSGSSFTYLPAEVYKKIVFEF--DKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVF
              Y + ++   V    +      A        ++DSG++  YLP  +Y  ++ +    +QVK +    ++QE    + +  +T  +F + ++    
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQA--------LVDSGSSFTYLPAEVYKKIVFEF--DKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVF

Query:  PLNQS-FIHDLVYILPANQ---GYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQC
         L  S + HD ++ L  +    G++   +T ++   D  ++G  ++    +V+D EN  +GW+   C
Subjt:  PLNQS-FIHDLVYILPANQ---GYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQC

Q8VYV9 Aspartyl protease family protein 1    2.7e-59    34.03
Query:  ANCALLFLLIACLFVD------CSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD--LKRRRLKIGSKYDVLFPSEGSQVM
        ++C +LFL +  L         C        +  HRFSD+   +        +     P R+S KY++++   D  ++ RRL    +  V F S+G++ +
Subjt:  ANCALLFLLIACLFVD------CSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD--LKRRRLKIGSKYDVLFPSEGSQVM

Query:  FFGNEFNWLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPSLSSTSKDLFCG------------------HQLYYY
           +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   S L  DL+ Y+P+ SSTS  + C                   +Q+ Y 
Subjt:  FFGNEFNWLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPSLSSTSKDLFCG------------------HQLYYY

Query:  SDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP---
        S+ TS++G ++ED LHL S  K  +   + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LA+EG+  N+FS+CF N+G+ RI FGD G    
Subjt:  SDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP---

Query:  --------GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS-TLVSFNIPSMKLVFPL
                  +  Y I V    VG +      F A+ DSG+SFTYL    Y  I   F+        +    ELP+ YCY LS    SF  P++ L    
Subjt:  --------GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS-TLVSFNIPSMKLVFPL

Query:  NQSF--IHDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSS--------TTEHAKPPSNNGNVKSPIALPPT
          S+   H LV I    +   V+ L + +  ED  +IGQN M GY++VFDRE L LGW +S C    +S        ++  A+PP+++ + ++   +P  
Subjt:  NQSF--IHDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSS--------TTEHAKPPSNNGNVKSPIALPPT

Query:  NGQAIAPTAARMSSKSSCILFFSLVA
               +AA   S S  + FFS++A
Subjt:  NGQAIAPTAARMSSKSSCILFFSLVA

Q9LX20 Aspartic proteinase-like protein 1    4.7e-112    44.23
Query:  FLLIACLFV--DCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLHYT
        FLL   LF+  + +L    S +L+HRFSDE ++  ++      S+   P + SL+Y+++L + D +R+R+ +G+K   L PSEGS+ +  GN+F WLHYT
Subjt:  FLLIACLFV--DCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLHYT

Query:  WIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPSLSSTSKDLFCGHQL------------------YYYSDNTSTSGFMIE
        WIDIGTPSV FLVALD GS+LLW+PC+C+QCAPL+++YYS L  +DL+EYNPS SSTSK   C H+L                   Y S NTS+SG ++E
Subjt:  WIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPSLSSTSKDLFCGHQL------------------YYYSDNTSTSGFMIE

Query:  DKLHLTSFSKH---GTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP------------
        D LHLT  + +      S ++A VV+GCG+KQSG YLDG APDG+MGLGP  ISVP+ L++ GL+RN+FSLCFD   S RI FGD GP            
Subjt:  DKLHLTSFSKH---GTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP------------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSF-IHD
           Y+ Y +GVE+ C+G+SCL+++ F   +DSG SFTYLP E+Y+K+  E D+ +  NAT    + + W YCY  S      +P++KL F  N +F IH 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSF-IHD

Query:  LVYILPANQGYKVFLLTLEET-DEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPP-SNNGNVKSPIALP-----PTNGQAIAP---
         +++   +QG   F L +  +  E  G IGQN M GY+MVFDREN+KLGWS S+C +      +  +PP ++ G+  SP  LP        G A++P   
Subjt:  LVYILPANQGYKVFLLTLEET-DEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPP-SNNGNVKSPIALP-----PTNGQAIAP---

Query:  --TAARMSSKSSCILFFSLV
          T ++  S SS   F S++
Subjt:  --TAARMSSKSSCILFFSLV

Q9M9A8 Aspartyl protease APCB1    3.8e-21    28.25
Query:  DVLFPSEG---SQVMFF---GNEF-NWLHYTWIDIGTP--SVPFLVALDVGSDLLWVPCD--CIQCAPLSASYYSV----LDRDLSEYNPSLSSTSKDLF
        DVL  S G   S    F   GN + + L+YT I +G P     + + +D GS+L W+ CD  C  CA  +   Y      L R    +   +        
Subjt:  DVLFPSEG---SQVMFF---GNEF-NWLHYTWIDIGTP--SVPFLVALDVGSDLLWVPCD--CIQCAPLSASYYSV----LDRDLSEYNPSLSSTSKDLF

Query:  CG--HQLYY---YSDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG-AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCF--DN
        C   HQ  Y   Y+D++ + G + +DK HL    K    SL ++ +V GCG  Q G  L+     DG++GL    IS+P+ LA  G++ N    C   D 
Subjt:  CG--HQLYY---YSDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG-AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCF--DN

Query:  NGSRRILFGDDGPGTYA-------------AYFIGVESFCVGSSCLQRSG-----FQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQE-LP-
        NG   I  G D   ++              AY + V     G   L   G      + L D+GSS+TY P + Y ++V    +      TR    E LP 
Subjt:  NGSRRILFGDDGPGTYA-------------AYFIGVESFCVGSSCLQRSG-----FQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQE-LP-

Query:  -W----NYCY-NLSTLVSFNIP-----SMKLVFPLNQSFIHDLVYILPANQGYKVFLLTLEETDEDYG---VIGQNLMVGYQMVFDRENLKLGWSKSQCL
         W    N+ + +LS +  F  P       K +    +  I    Y++ +N+G  V L  L+ +    G   ++G   M G+ +V+D    ++GW KS C+
Subjt:  -W----NYCY-NLSTLVSFNIP-----SMKLVFPLNQSFIHDLVYILPANQGYKVFLLTLEETDEDYG---VIGQNLMVGYQMVFDRENLKLGWSKSQCL

Q9S9K4 Aspartic proteinase 39    1.6e-19    26.43
Query:  LHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQL-------------------YYYSDNTSTSG
        L++T I +G+P   + V +D GSD+LW+ C      P   +    L+  LS ++ + SSTSK + C                         Y+D +++ G
Subjt:  LHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQL-------------------YYYSDNTSTSG

Query:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG-AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN--NGSRRILFGDDGPGT------
          I D L L   +       L   VV GCG  QSG   +G +A DGVMG G  N SV + LA  G  +  FS C DN   G    +   D P        
Subjt:  FMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG-AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN--NGSRRILFGDDGPGT------

Query:  -----YAAYFIGVE----SFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEF--DKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPL
             Y    +G++    S  +  S ++  G   +VDSG++  Y P  +Y  ++      + VK +      Q      C++ ST V    P +   F  
Subjt:  -----YAAYFIGVE----SFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEF--DKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPL

Query:  NQS---FIHDLVYILPANQ---GYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQC
        +     + HD ++ L       G++   LT +E  E   ++G  ++    +V+D +N  +GW+   C
Subjt:  NQS---FIHDLVYILPANQ---GYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQC

Arabidopsis top hits    e value    % identity    Alignment
AT2G17760.1 Eukaryotic aspartyl protease family protein    1.9e-60    34.03
Query:  ANCALLFLLIACLFVD------CSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD--LKRRRLKIGSKYDVLFPSEGSQVM
        ++C +LFL +  L         C        +  HRFSD+   +        +     P R+S KY++++   D  ++ RRL    +  V F S+G++ +
Subjt:  ANCALLFLLIACLFVD------CSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD--LKRRRLKIGSKYDVLFPSEGSQVM

Query:  FFGNEFNWLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPSLSSTSKDLFCG------------------HQLYYY
           +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   S L  DL+ Y+P+ SSTS  + C                   +Q+ Y 
Subjt:  FFGNEFNWLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPSLSSTSKDLFCG------------------HQLYYY

Query:  SDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP---
        S+ TS++G ++ED LHL S  K  +   + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LA+EG+  N+FS+CF N+G+ RI FGD G    
Subjt:  SDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP---

Query:  --------GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS-TLVSFNIPSMKLVFPL
                  +  Y I V    VG +      F A+ DSG+SFTYL    Y  I   F+        +    ELP+ YCY LS    SF  P++ L    
Subjt:  --------GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS-TLVSFNIPSMKLVFPL

Query:  NQSF--IHDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSS--------TTEHAKPPSNNGNVKSPIALPPT
          S+   H LV I    +   V+ L + +  ED  +IGQN M GY++VFDRE L LGW +S C    +S        ++  A+PP+++ + ++   +P  
Subjt:  NQSF--IHDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSS--------TTEHAKPPSNNGNVKSPIALPPT

Query:  NGQAIAPTAARMSSKSSCILFFSLVA
               +AA   S S  + FFS++A
Subjt:  NGQAIAPTAARMSSKSSCILFFSLVA

AT3G51330.1 Eukaryotic aspartyl protease family protein    9.0e-50    31.79
Query:  LLFLLIACLFVD-CSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD--LKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWL
        LL LL+ C  ++ C      S ++ H FSD  K   +S   D++     P + SL+YF++L   D  ++ R L   ++   +    G++ +   +   +L
Subjt:  LLFLLIACLFVD-CSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD--LKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWL

Query:  HYTWIDIGTPSVPFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCG------------------HQLYYYSDNTSTSGF
        HY  + +GTP+  FLVALD GSDL W+PC+C   C            R L+ Y+P+ SSTS  + C                   +Q+ Y S +T T+G 
Subjt:  HYTWIDIGTPSVPFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCG------------------HQLYYYSDNTSTSGF

Query:  MIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN--NGSRRILFGDDG-----------
        + ED LHL +    G    ++A++ LGCG+ Q+G     AA +G++GLG  + SVP++LA+  +  N+FS+CF N  +   RI FGD G           
Subjt:  MIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN--NGSRRILFGDDG-----------

Query:  PGTYAAYFIGVESFCVGSSCLQRSGFQ--ALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS-TLVSFNIPSMKLVFP-LNQSF
              Y + V    VG   +   G Q  AL D+G+SFT+L    Y  I   FD  V  +  R +  ELP+ +CY+LS    +   P + + F   +Q F
Subjt:  PGTYAAYFIGVESFCVGSSCLQRSGFQ--ALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS-TLVSFNIPSMKLVFP-LNQSF

Query:  IHDLVYILPANQGYKVFLL-TLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAA
        + + ++I+       ++ L  L+  D    +IGQN M GY++VFDRE + LGW +S C +    + E   PP       SP A  P       P AA
Subjt:  IHDLVYILPANQGYKVFLL-TLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNNGNVKSPIALPPTNGQAIAPTAA

AT3G51350.1 Eukaryotic aspartyl protease family protein    5.1e-45    30.69
Query:  LLFLLIACL-FVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD-LKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLH
        LL +L+ C  F  C        ++ H FSD  K        D V     P + SL+YF++L   D L R R    +  +     +G  +         L+
Subjt:  LLFLLIACL-FVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYD-LKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLH

Query:  YTWIDIGTPSVPFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLYY-----------------YSDNTSTSGFMI
        Y  + +GTP   FLVALD GSDL W+PC+C   C              L+ Y P+ S+TS  + C  +  +                 YS++T T G ++
Subjt:  YTWIDIGTPSVPFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLYY-----------------YSDNTSTSGFMI

Query:  EDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN--NGSRRILFGDDG-------------
        +D LHL +  ++ T   ++A+V LGCG+KQ+G +    + +GV+GLG    SVP+LLA+  +  N+FS+CF        RI FGD G             
Subjt:  EDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDN--NGSRRILFGDDG-------------

Query:  PGTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS---TLVSFNIPSMKLVFPLNQSFI
        P T  AY + +    V    +    F A  D+GSSFT+L    Y  +   FD+ V+ +  R V  ELP+ +CY+LS   T + F +  M  +   ++  +
Subjt:  PGTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLS---TLVSFNIPSMKLVFPLNQSFI

Query:  HDLVYILPANQGYKVFLL-TLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINS--STT---EHAKPPSNNGNVKSPIALPPTNGQAIAPTA
        ++  +     +G  ++ L  L+       VIGQN + GY++VFDRE + LGW +S C +  S  STT      + P+ + +   P +LPPT      P  
Subjt:  HDLVYILPANQGYKVFLL-TLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINS--STT---EHAKPPSNNGNVKSPIALPPTNGQAIAPTA

Query:  ARMSS
         R S+
Subjt:  ARMSS

AT4G35880.1 Eukaryotic aspartyl protease family protein    8.6e-69    37.28
Query:  LLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQ--MLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGN------
        L+ +L+   F  C+ G   + ++ HRFSDE K  W    +    AKF PP+ S +YF   +L D+ ++ RRL           SE S     GN      
Subjt:  LLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQ--MLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGN------

Query:  EFNWLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQL------------------YYYSDNTS
           +LHYT + +GTP + F+VALD GSDL WVPCDC +CAP   + Y+  + +LS YNP +S+T+K + C + L                   Y S  TS
Subjt:  EFNWLHYTWIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQL------------------YYYSDNTS

Query:  TSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGPG-------
        TSG ++ED +HLT+  K+     ++A V  GCG+ QSGS+LD AAP+G+ GLG   ISVP++LA+EGLV ++FS+CF ++G  RI FGD G         
Subjt:  TSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGPG-------

Query:  ----TYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFN-IPSMKLVFPLNQSFI
            ++  Y I V    VG++ +    F AL D+G+SFTYL   +Y  +   F  Q + +        +P+ YCY++S   + + IPS+ L    N  F 
Subjt:  ----TYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFN-IPSMKLVFPLNQSFI

Query:  HDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTT
         +   I+ + +G  V+ L + ++ E   +IGQN M GY++VFDRE L L W K  C DI  + T
Subjt:  HDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTT

AT5G10080.1 Eukaryotic aspartyl protease family protein | e-value: 3.3e-113 | % identity: 44.23
Query:  FLLIACLFV--DCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLHYT
        FLL   LF+  + +L    S +L+HRFSDE ++  ++      S+   P + SL+Y+++L + D +R+R+ +G+K   L PSEGS+ +  GN+F WLHYT
Subjt:  FLLIACLFV--DCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLHYT

Query:  WIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPSLSSTSKDLFCGHQL------------------YYYSDNTSTSGFMIE
        WIDIGTPSV FLVALD GS+LLW+PC+C+QCAPL+++YYS L  +DL+EYNPS SSTSK   C H+L                   Y S NTS+SG ++E
Subjt:  WIDIGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPSLSSTSKDLFCGHQL------------------YYYSDNTSTSGFMIE

Query:  DKLHLTSFSKH---GTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP------------
        D LHLT  + +      S ++A VV+GCG+KQSG YLDG APDG+MGLGP  ISVP+ L++ GL+RN+FSLCFD   S RI FGD GP            
Subjt:  DKLHLTSFSKH---GTHSLLQASVVLGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGP------------

Query:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSF-IHD
           Y+ Y +GVE+ C+G+SCL+++ F   +DSG SFTYLP E+Y+K+  E D+ +  NAT    + + W YCY  S      +P++KL F  N +F IH 
Subjt:  -GTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNATRIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSF-IHD

Query:  LVYILPANQGYKVFLLTLEET-DEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPP-SNNGNVKSPIALP-----PTNGQAIAP---
         +++   +QG   F L +  +  E  G IGQN M GY+MVFDREN+KLGWS S+C +      +  +PP ++ G+  SP  LP        G A++P   
Subjt:  LVYILPANQGYKVFLLTLEET-DEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPP-SNNGNVKSPIALP-----PTNGQAIAP---

Query:  --TAARMSSKSSCILFFSLV
          T ++  S SS   F S++
Subjt:  --TAARMSSKSSCILFFSLV


Sequences
CDS sequence
ATGGCAAATTGCGCCCTTCTTTTCCTCTTAATTGCTTGTCTCTTCGTGGACTGCTCTCTTGGTCTTACGCTCTCCTTGAAGCTAGTACATCGATTCTCCGATGAGGCTAA
ATCACTTTGGGAGTCCAGGAGGGCTGACAATGTCTCTGCAAAGTTCTGGCCGCCGAGGAATAGCTTGAAGTATTTTCAAATGCTTATGGACTATGACTTGAAGAGGCGAC
GGCTGAAGATCGGATCCAAGTACGACGTACTTTTTCCTTCTGAAGGAAGCCAAGTTATGTTCTTCGGAAACGAGTTTAATTGGTTACATTACACATGGATTGATATAGGA
ACACCGAGTGTTCCGTTTCTGGTTGCGTTGGATGTTGGAAGTGACCTTCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTTATTATAGCGTTCT
GGATAGGGATCTGAGTGAGTACAATCCATCTTTATCAAGCACCAGCAAGGACCTTTTTTGCGGTCATCAATTATATTATTACTCAGATAATACATCGACATCTGGATTTA
TGATTGAAGATAAATTGCATTTAACGTCTTTCAGTAAACATGGGACACATAGCCTTTTGCAAGCCTCAGTTGTATTAGGTTGTGGTAGGAAACAGAGTGGCAGCTATTTG
GATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAATATTTCAGTGCCAACCTTATTGGCACAAGAAGGATTAGTTAGAAACACATTTTCGCTTTGTTTTGA
TAATAACGGTTCTAGGAGAATCCTCTTTGGGGACGATGGTCCTGGCACCTATGCTGCCTATTTTATTGGGGTGGAGTCTTTTTGTGTTGGGAGTTCCTGTCTGCAGAGAA
GTGGATTCCAGGCGTTGGTTGACAGTGGCTCATCTTTTACATATCTTCCTGCAGAAGTCTATAAAAAGATTGTCTTCGAATTTGATAAACAAGTAAAATTTAATGCTACC
AGGATAGTTCTCCAGGAACTTCCCTGGAATTACTGCTATAATCTTAGTACGCTGGTGTCCTTTAATATTCCTAGCATGAAACTCGTGTTTCCATTGAATCAAAGCTTTAT
ACATGATCTGGTGTATATCCTCCCTGCAAACCAAGGGTATAAAGTGTTTTTGTTAACTTTAGAGGAGACAGATGAAGATTATGGTGTAATTGGACAAAACTTGATGGTGG
GTTACCAGATGGTTTTTGACAGAGAAAACCTTAAATTGGGTTGGTCCAAGTCCCAATGCCTAGATATCAACAGTAGTACGACAGAGCATGCCAAACCACCTTCAAATAAT
GGAAATGTCAAATCGCCAATTGCATTACCACCAACAAATGGGCAAGCAATTGCACCCACTGCTGCAAGAATGTCTTCTAAATCTTCCTGCATCCTATTTTTCTCCCTTGT
TGCTACTGTTGCTTGCAGCTTTTCTGGTTGCTTGCTGGATTTGTTGAGTCCATAA
mRNA sequence
ATGGCAAATTGCGCCCTTCTTTTCCTCTTAATTGCTTGTCTCTTCGTGGACTGCTCTCTTGGTCTTACGCTCTCCTTGAAGCTAGTACATCGATTCTCCGATGAGGCTAA
ATCACTTTGGGAGTCCAGGAGGGCTGACAATGTCTCTGCAAAGTTCTGGCCGCCGAGGAATAGCTTGAAGTATTTTCAAATGCTTATGGACTATGACTTGAAGAGGCGAC
GGCTGAAGATCGGATCCAAGTACGACGTACTTTTTCCTTCTGAAGGAAGCCAAGTTATGTTCTTCGGAAACGAGTTTAATTGGTTACATTACACATGGATTGATATAGGA
ACACCGAGTGTTCCGTTTCTGGTTGCGTTGGATGTTGGAAGTGACCTTCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTTATTATAGCGTTCT
GGATAGGGATCTGAGTGAGTACAATCCATCTTTATCAAGCACCAGCAAGGACCTTTTTTGCGGTCATCAATTATATTATTACTCAGATAATACATCGACATCTGGATTTA
TGATTGAAGATAAATTGCATTTAACGTCTTTCAGTAAACATGGGACACATAGCCTTTTGCAAGCCTCAGTTGTATTAGGTTGTGGTAGGAAACAGAGTGGCAGCTATTTG
GATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAATATTTCAGTGCCAACCTTATTGGCACAAGAAGGATTAGTTAGAAACACATTTTCGCTTTGTTTTGA
TAATAACGGTTCTAGGAGAATCCTCTTTGGGGACGATGGTCCTGGCACCTATGCTGCCTATTTTATTGGGGTGGAGTCTTTTTGTGTTGGGAGTTCCTGTCTGCAGAGAA
GTGGATTCCAGGCGTTGGTTGACAGTGGCTCATCTTTTACATATCTTCCTGCAGAAGTCTATAAAAAGATTGTCTTCGAATTTGATAAACAAGTAAAATTTAATGCTACC
AGGATAGTTCTCCAGGAACTTCCCTGGAATTACTGCTATAATCTTAGTACGCTGGTGTCCTTTAATATTCCTAGCATGAAACTCGTGTTTCCATTGAATCAAAGCTTTAT
ACATGATCTGGTGTATATCCTCCCTGCAAACCAAGGGTATAAAGTGTTTTTGTTAACTTTAGAGGAGACAGATGAAGATTATGGTGTAATTGGACAAAACTTGATGGTGG
GTTACCAGATGGTTTTTGACAGAGAAAACCTTAAATTGGGTTGGTCCAAGTCCCAATGCCTAGATATCAACAGTAGTACGACAGAGCATGCCAAACCACCTTCAAATAAT
GGAAATGTCAAATCGCCAATTGCATTACCACCAACAAATGGGCAAGCAATTGCACCCACTGCTGCAAGAATGTCTTCTAAATCTTCCTGCATCCTATTTTTCTCCCTTGT
TGCTACTGTTGCTTGCAGCTTTTCTGGTTGCTTGCTGGATTTGTTGAGTCCATAA
Protein sequence
MANCALLFLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWESRRADNVSAKFWPPRNSLKYFQMLMDYDLKRRRLKIGSKYDVLFPSEGSQVMFFGNEFNWLHYTWIDIG
TPSVPFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPSLSSTSKDLFCGHQLYYYSDNTSTSGFMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYL
DGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSRRILFGDDGPGTYAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNAT
RIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQSFIHDLVYILPANQGYKVFLLTLEETDEDYGVIGQNLMVGYQMVFDRENLKLGWSKSQCLDINSSTTEHAKPPSNN
GNVKSPIALPPTNGQAIAPTAARMSSKSSCILFFSLVATVACSFSGCLLDLLSP