; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1200 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1200
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionaspartyl protease family protein 1-like
Genome locationMC01:17210176..17213128
RNA-Seq ExpressionMC01g1200
SyntenyMC01g1200
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034164 - Pepsin-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594755.1 Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. sororia]5.96e-25469.34Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MAWTFSSG QMLL LSV+LLA  LRSGEA SFKF+IHHRFS+SIKGIL SEGLPEK +P YYATMVHRD LV GRRLA++NGD  LTF YGN+TF I NL
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        G+LYYANISVG+P L FLVALDTGSDL WLPCEC SCLTYLNTT+GGKF LNHYSP DSTTS  VPCSNSLCEL+NQC+S T+TCPYEINYLSANTSS+G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLVQDVLHLATDD +L PV++KITFGCG +QTG+F   AAPNGLIGLGM++ISVPS LA+QGLT+DSFSMCFG DG GRIDFGD GT GQRETPFNTM N
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
        +PSYNVT T+IIVGGK+NN++F+AIFDSGTSF+Y+ +P YS+I+EQM+AGMKL+R   DPDFPFEYCY+LP N +    P LNFTM GGD++  +D F+ 
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSD-SPPTDSSPPTDSSPPS-DSPQSGDSPP
         P D    A CL ++KSTD I+LIG+                        YD+ A TPS ++PPA +SPP+D SPP D SPP D SPP+ DSP + DSPP
Subjt:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSD-SPPTDSSPPTDSSPPS-DSPQSGDSPP

Query:  SGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
        + DSP  PS  GGS G   LP+IG GDATRLNPL  VFVA+LAIL VV
Subjt:  SGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

XP_016899154.1 PREDICTED: aspartyl protease family protein 1-like [Cucumis melo]1.98e-25670.78Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MA TFSSGAQMLL LSVF+LAG LRSG+A SFKF+IHHRFSDS+K +L SEGLPEK +PGYYATMVHRDRLV GRRLA TN D  LTF YGNDT  I  L
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYAN+SVGTP + FLVALDTGSDLFWLPCECSSC TYLNT+NGGKF LNHYSP DSTTS+ VPC++SLC    QC+S  +TCPYEINY+SANTSS G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLV+DVLHLA DD  LKPV+AKITFGCG +QTGIFA +AAPNGLIGLGMEKISVPSFLA QGLTSDSFSMCFG DG GRIDFGD G +GQ++TPFNTM++
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
          SYNVTF +I VGGK+NN+ F+AIFDSGTSF+Y+T+P YS I+EQMDAGMKL+R  F P FPF+YCY++ P+A     P+LNFTM+GGD +  +D FVV
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDS----SPPTDSSPPS-DSPQSGD
        +P D    AACL + KSTD IDLIG+                        YDNG +TPSD SPPAD SPPSDSPPTDS    SPPTD +PPS DSP S D
Subjt:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDS----SPPTDSSPPS-DSPQSGD

Query:  SPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
        SPPSGDSPPAPST GG NG   LPRIGA  A RLNPLG VFVA+LAIL VV
Subjt:  SPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

XP_022132579.1 aspartyl protease family protein 1-like [Momordica charantia]0.094.88Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
        FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTDV--GLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPS
        VPTD   GLAACLGIVKSTDPIDLIG+                        YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPS
Subjt:  VPTDV--GLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPS

Query:  GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
        GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
Subjt:  GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

XP_023518254.1 aspartyl protease family protein 1-like [Cucurbita pepo subsp. pepo]8.29e-25368.17Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MAWTFSSG QMLL LSV+LLA  LR+GEA SFKF+IHHRFS+SIKGIL SEGLPEK +P YYATMVHRDRLV GRRLA++NGD  LTF YGN+TF I NL
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        G+LYYANISVG+P L FLVALDTGSDL WLPCEC SCLTYLNTT+GGKF LNHYSP DSTTS  VPCSNSLCEL+NQC+S T+TCPYEINYLSANTSS+G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLVQDVLHLATDD +L PV+AKITFGCG +QTG+F   AAPNGLIGLGM++ISVPS LA+QGL++DSFSMCFG DG GRIDFGD GT GQRETPFNTM N
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
        +PSYNVT T+IIVGGK+NN++F+AIFDSGTSF+Y+ +P YS+I++QM+AGMKL+R   DPDFPFEYCY+LP N +    P LNFTM GGD++  LD F+ 
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPS-------DSPQS
         P D    A CL ++KS D I+LIG+                        YD+ A TPS ++PPAD+ P  DSPP D SPP + SPP+       DSP +
Subjt:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPS-------DSPQS

Query:  GDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
         DSPP+ DSP  PS  GGS G   LP+IG GDATRLNPL  VFVA+LAIL VV
Subjt:  GDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

XP_038882816.1 aspartyl protease family protein 1-like [Benincasa hispida]1.85e-25869.33Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MAWTF+SGAQMLL LS+FLLAG LRSG+A SFKFSIHHRFSDS+KGIL SEGLPEK +PGYYATMVHRDR V GRRLA    D  LTF YGNDTF I  L
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYAN+SVGTP L F VALDTGSDLFWLPCEC SC TYLNT++GG+F LNHYSP DSTTS++VPCS+SLCEL+NQC+S  +TCPYEINYLSANTSS G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLV+DVLHLATDD  LKPV+AKITFGCGK+QTGIFA SAAPNGLIGLGMEKISVPSFLA QGLTSDSFSMCFG D  GRIDFGD G +GQ+ETPFN+M++
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
        F SYNV+F QIIVG K+NN+ F+AIFDSGTSF+Y+T+P YS I+EQMDAGM L+R  FD  FPFEYCY+LP N+     P LNFTMKGGD++  LD F+V
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD---VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDN---------------SPPADNSPPSDSPPTDSSPPTD
        VP D      AACL I+KSTD IDLIG+                        YDNG  TPS +               SPP D+SPP+DSPPTD SPP++
Subjt:  VPTD---VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDN---------------SPPADNSPPSDSPPTDSSPPTD

Query:  SSPPS-DSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
         SPPS DSP S DSPPS DSPPAPST GG  G   LP    GDATRLNPL  VFVA+LAIL VV
Subjt:  SSPPS-DSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

TrEMBL top hitse value%identityAlignment
A0A0A0KML0 Peptidase A1 domain-containing protein6.67e-24365.22Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MA TFSSGAQMLL LSVF+LAG LRSG+A SFKF IHHRFSDSIKGI  SEGLPEK +PGYYATMVHRDRLV GRRLA ++ D  LTF YGNDT  I +L
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYAN+SVGTP L FLVALDTGSDLFWLPCECSSC TYLNT+NGGKF LNHYSP DSTTS++VPC++SLC   N+C+S  + CPYE+ YLSANTSS G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLV+DVLHLATDD  LKPV+AKITFGCG +QTGIFA +AAPNGLIGLGMEKISVPSFLA QGLTS+SFSMCFG DG GRIDFGD G + Q++TPFNTM+ 
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVK-FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFV
        + SYNVTF  I VGG+ N++ F+AIFDSGTSF+Y+T+P YS I +QMDAGMKL+R   F P+FPFEYCY++PP A  F +  LNFTMKGGD +   D FV
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVK-FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFV

Query:  VVPTDVGL-AACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADN------------------SPPSDSPPTDSSPP
         +P D     ACL I KSTD IDLIG+                        YDNG  TPS ++PPAD+                  SPPSDSPPTD +PP
Subjt:  VVPTDVGL-AACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADN------------------SPPSDSPPTDSSPP

Query:  TDSSPPS-------------DSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
        +D SPPS             DSP S DSPPSGDSPPAPST GG  G   LP  G G A +LNPLG VF A+LAIL +V
Subjt:  TDSSPPS-------------DSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

A0A1S4DT41 aspartyl protease family protein 1-like9.57e-25770.78Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MA TFSSGAQMLL LSVF+LAG LRSG+A SFKF+IHHRFSDS+K +L SEGLPEK +PGYYATMVHRDRLV GRRLA TN D  LTF YGNDT  I  L
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYAN+SVGTP + FLVALDTGSDLFWLPCECSSC TYLNT+NGGKF LNHYSP DSTTS+ VPC++SLC    QC+S  +TCPYEINY+SANTSS G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLV+DVLHLA DD  LKPV+AKITFGCG +QTGIFA +AAPNGLIGLGMEKISVPSFLA QGLTSDSFSMCFG DG GRIDFGD G +GQ++TPFNTM++
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
          SYNVTF +I VGGK+NN+ F+AIFDSGTSF+Y+T+P YS I+EQMDAGMKL+R  F P FPF+YCY++ P+A     P+LNFTM+GGD +  +D FVV
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDS----SPPTDSSPPS-DSPQSGD
        +P D    AACL + KSTD IDLIG+                        YDNG +TPSD SPPAD SPPSDSPPTDS    SPPTD +PPS DSP S D
Subjt:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDS----SPPTDSSPPS-DSPQSGD

Query:  SPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
        SPPSGDSPPAPST GG NG   LPRIGA  A RLNPLG VFVA+LAIL VV
Subjt:  SPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

A0A5A7SKI7 Aspartyl protease family protein 1-like8.17e-25368.37Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MA TFSSGAQMLL LSVF+LAG LRSG+A SFKF+IHHRFSDS+K +L SEGLPEK +PGYYATMVHRDRLV GRRLA TN D  LTF YGNDT  I  L
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYAN+SVGTP + FLVALDTGSDLFWLPCECSSC TYLNT+NGGKF LNHYSP DSTTS+ VPC++SLC    QC+S  + CPYEINY+SANTSS G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLV+DVLHLA DD  LKPV+AKITFGCG +QTGIFA +AAPNGLIGLGMEKISVPSFLA QGLTSDSFSMCFG DG GRIDFGD G +GQ++TPFNTM++
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
          SYNVTF +I VGGK+NN+ F+AIFDSGTSF+Y+T+P YS I+EQMDAGMKL+R  F P FPF+YCY++ P+A     P+LNFTM+GGD +  +D FVV
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDS----SPPTDSSPPS--------
        +P D    AACL + KSTD IDLIG+                        YDNG +TPSD SPPAD SPPSDSPPTDS    SPPTD +PPS        
Subjt:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDS----SPPTDSSPPS--------

Query:  -----------DSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
                   DSP S DSPPSGDSPPAPST GG NG   LPRIGA  A RLNPLG VFVA+LAIL VV
Subjt:  -----------DSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

A0A6J1BTF7 aspartyl protease family protein 1-like0.094.88Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
        FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTDV--GLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPS
        VPTD   GLAACLGIVKSTDPIDLIG+                        YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPS
Subjt:  VPTDV--GLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPS

Query:  GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
        GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
Subjt:  GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

A0A6J1EEK3 aspartyl protease family protein 1-like4.43e-25268.56Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL
        MAWTFSSG QMLL LSV+LLA  LRSGEA SFKF+IHHRFS+SIKGIL SEGLPEK +P YYATMVHRD LV GRRLA++NGD  LTF YGN+TF I NL
Subjt:  MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNL

Query:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG
        G+LYYANISVG+P L FLVALDTGSDL WLPCEC SCLTYLNTT+GGKF LNHYSP DSTTS  VPCSNSLCEL+NQC+S T+TCPYEINYLSANTSS+G
Subjt:  GFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSG

Query:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN
        YLVQDVLHLATDD +L PV++KITFGCG +QTG+F   AAPNGLIGLGM++ISVPS LA+QGLT+DSFSMCFG DG GRIDFGD GT GQRETPFNTM N
Subjt:  YLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVN

Query:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV
        +PSYNVT T+IIVGGK+NN++F+AIFDSGTSF+Y+ +P YS+I+EQM+AGMKL+R   DPDFPFEYCY+LP N +    P LNFTM GGD++  +D F+ 
Subjt:  FPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVV

Query:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSD-SPPTDSSPPTDSSPPSDSPQSGDSPPS
         P D    A CL ++KSTD I+LIG+                        YD+ A TPS ++PPA +SPP+D SPP + SPP + SPP++     DSPP+
Subjt:  VPTD-VGLAACLGIVKSTDPIDLIGR------------------------YDNGAATPSDNSPPADNSPPSD-SPPTDSSPPTDSSPPSDSPQSGDSPPS

Query:  GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV
         DSP  PS+ GGS G   LP+IG GDATRLNPL  VFVA+LAIL VV
Subjt:  GDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 363.3e-2328.15Show/hide
Query:  AGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYANISVGTPELSFLVALDTGSDLF
        +G+F F++ H+F+             EKQ     + +   D   H R LA  N D PL    G D+    ++G LY+  I +G+P   + V +DTGSD+ 
Subjt:  AGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYANISVGTPELSFLVALDTGSDLF

Query:  WLPC-ECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCE--LANQCSSTTSTCPYEINYLSANTSSSGYLVQDV-LHLATDDKQLKPVDAKIT
        W+ C  C  C   + T  G    L+ Y  K S+TS +V C +  C   + ++       C Y + Y   +TS   ++  ++ L   T + +  P+  ++ 
Subjt:  WLPC-ECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCE--LANQCSSTTSTCPYEINYLSANTSSSGYLVQDV-LHLATDDKQLKPVDAKIT

Query:  FGCGKIQTGIFADS-AAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCF-GYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNVTFTQIIVGG-------
        FGCGK Q+G    + +A +G++G G    S+ S LA+ G T   FS C    +G G    G++ +   + TP   + N   YNV    + V G       
Subjt:  FGCGKIQTGIFADS-AAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCF-GYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNVTFTQIIVGG-------

Query:  --KSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLE
           S N     I DSGT+ +Y+   +Y+ + E++ A  +++
Subjt:  --KSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLE

Q766C2 Aspartic proteinase nepenthesin-29.0e-2125.74Show/hide
Query:  YYANISVGTPELSFLVALDTGSDLFWLPCE-CSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYL
        Y  N+++GTP+ SF   +DTGSDL W  CE C+ C +              ++P+DS++ +++PC +  C+     +   + C Y   Y   +T + GY+
Subjt:  YYANISVGTPELSFLVALDTGSDLFWLPCE-CSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYL

Query:  VQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSM-CFGYDGQGRIDFGDIGTSGQRETPFNTMV--
          +     T           I FGCG+   G    + A  GLIG+G   +S+PS L   G+   S+ M  +G      +  G   +     +P  T++  
Subjt:  VQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSM-CFGYDGQGRIDFGDIGTSGQRETPFNTMV--

Query:  --NFPSYNVTFTQIIVGGKSNNLQFSA-----------IFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTM
          N   Y +T   I VGG +  +  S            I DSGT+ +Y+    Y+ +A+     + L  V  +       C+Q P + ++   P ++   
Subjt:  --NFPSYNVTFTQIIVGGKSNNLQFSA-----------IFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTM

Query:  KGG
         GG
Subjt:  KGG

Q8VYV9 Aspartyl protease family protein 12.5e-11647.46Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGEL--RSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPL-TFVYGNDTFLI
        M W +SS   + L L + L +  +  R    G F F  HHRFSD + G+L  +GLP + S  YY  M HRDRL+ GRRLA  N D  L TF  GN+T  +
Subjt:  MAWTFSSGAQMLLFLSVFLLAGEL--RSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPL-TFVYGNDTFLI

Query:  GNLGFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTS
          LGFL+YAN++VGTP   F+VALDTGSDLFWLPC+C++C+  L    G    LN YSP  S+TST VPC+++LC   ++C+S  S CPY+I YLS  TS
Subjt:  GNLGFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTS

Query:  SSGYLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNT
        S+G LV+DVLHL ++DK  K + A++TFGCG++QTG+F D AAPNGL GLG+E ISVPS LA +G+ ++SFSMCFG DG GRI FGD G+  QRETP N 
Subjt:  SSGYLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNT

Query:  MVNFPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVK-FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLD
            P+YN+T T+I VGG + +L+F A+FDSGTSF+Y+TD  Y+LI+E  ++    +R +  D + PFEYCY L PN +SF +P +N TMKGG +Y    
Subjt:  MVNFPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVK-FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLD

Query:  PFVVVPTDVGLAACLGIVKSTDPIDLIGR---------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQS
        P VV+P       CL I+K  D I +IG+                      D      S  + P++ S  S  PP  S  P  ++ PS  P +
Subjt:  PFVVVPTDVGLAACLGIVKSTDPIDLIGR---------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQS

Q9LX20 Aspartic proteinase-like protein 14.6e-5735.94Show/hide
Query:  SGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSD----SIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGN-LG
        S +  LLF  +FL   E     A  F   + HRFSD    SIK    S+ LP KQS  YY  +   D     +R+        L    G+ T   GN  G
Subjt:  SGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSD----SIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGN-LG

Query:  FLYYANISVGTPELSFLVALDTGSDLFWLPCECSSC--LTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSS
        +L+Y  I +GTP +SFLVALDTGS+L W+PC C  C  LT    ++     LN Y+P  S+TS    CS+ LC+ A+ C S    CPY +NYLS NTSSS
Subjt:  FLYYANISVGTPELSFLVALDTGSDLFWLPCECSSC--LTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSS

Query:  GYLVQDVLHLA--TDDKQL---KPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETP
        G LV+D+LHL   T+++ +     V A++  GCGK Q+G + D  AP+GL+GLG  +ISVPSFL+  GL  +SFS+CF  +  GRI FGD+G S Q+ TP
Subjt:  GYLVQDVLHLA--TDDKQL---KPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETP

Query:  FNTMVN--FPSYNVTFTQIIVGGKS-NNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDN
        F  + N  +  Y V      +G        F+   DSG SF+Y+ + +Y  +A ++D  +      F+    +EYCY+   ++     P +       + 
Subjt:  FNTMVN--FPSYNVTFTQIIVGGKS-NNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDN

Query:  YGTLDPFVVVPTDVGLAA-CLGIVKS-TDPIDLIGR-YDNGAATPSDNSPPADNSPPSDS-----PPTDSSPPTDSSP---PSDSPQSGDSPPSGDSPPA
        +    P  V     GL   CL I  S  + I  IG+ Y  G     D         PS        P  +SP + SSP   P+D  QS      G    +
Subjt:  YGTLDPFVVVPTDVGLAA-CLGIVKS-TDPIDLIGR-YDNGAATPSDNSPPADNSPPSDS-----PPTDSSPPTDSSP---PSDSPQSGDSPPSGDSPPA

Query:  PSTAGGSNGTQP
        P+ AG +    P
Subjt:  PSTAGGSNGTQP

Q9S9K4 Aspartic proteinase 391.4e-2128.94Show/hide
Query:  QMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYANIS
        ++ + ++VF++  E  S    +F F   H+F+   K +   +                 D   H R LA+   D PL    G D+  + ++G LY+  I 
Subjt:  QMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYANIS

Query:  VGTPELSFLVALDTGSDLFWLPCE-CSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTST--CPYEINYLSANTSSSGYLVQDV
        +G+P   + V +DTGSD+ W+ C+ C  C T  N      F L+ +    S+TS  V C +  C   +Q  S      C Y I Y   +T S G  ++D+
Subjt:  VGTPELSFLVALDTGSDLFWLPCE-CSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTST--CPYEINYLSANTSSSGYLVQDV

Query:  LHL--ATDDKQLKPVDAKITFGCGKIQTGIFAD-SAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCF-GYDGQGRIDFGDIGTSGQRETPFNTMVNFP
        L L   T D +  P+  ++ FGCG  Q+G   +  +A +G++G G    SV S LA+ G     FS C     G G    G + +   + TP   + N  
Subjt:  LHL--ATDDKQLKPVDAKITFGCGKIQTGIFAD-SAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCF-GYDGQGRIDFGDIGTSGQRETPFNTMVNFP

Query:  SYNVTFTQIIVGGKSNNLQFS------AIFDSGTSFSYITDPVYSLIAE
         YNV    + V G S +L  S       I DSGT+ +Y    +Y  + E
Subjt:  SYNVTFTQIIVGGKSNNLQFS------AIFDSGTSFSYITDPVYSLIAE

Arabidopsis top hitse value%identityAlignment
AT2G17760.1 Eukaryotic aspartyl protease family protein1.8e-11747.46Show/hide
Query:  MAWTFSSGAQMLLFLSVFLLAGEL--RSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPL-TFVYGNDTFLI
        M W +SS   + L L + L +  +  R    G F F  HHRFSD + G+L  +GLP + S  YY  M HRDRL+ GRRLA  N D  L TF  GN+T  +
Subjt:  MAWTFSSGAQMLLFLSVFLLAGEL--RSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPL-TFVYGNDTFLI

Query:  GNLGFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTS
          LGFL+YAN++VGTP   F+VALDTGSDLFWLPC+C++C+  L    G    LN YSP  S+TST VPC+++LC   ++C+S  S CPY+I YLS  TS
Subjt:  GNLGFLYYANISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTS

Query:  SSGYLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNT
        S+G LV+DVLHL ++DK  K + A++TFGCG++QTG+F D AAPNGL GLG+E ISVPS LA +G+ ++SFSMCFG DG GRI FGD G+  QRETP N 
Subjt:  SSGYLVQDVLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNT

Query:  MVNFPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVK-FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLD
            P+YN+T T+I VGG + +L+F A+FDSGTSF+Y+TD  Y+LI+E  ++    +R +  D + PFEYCY L PN +SF +P +N TMKGG +Y    
Subjt:  MVNFPSYNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVK-FDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLD

Query:  PFVVVPTDVGLAACLGIVKSTDPIDLIGR---------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQS
        P VV+P       CL I+K  D I +IG+                      D      S  + P++ S  S  PP  S  P  ++ PS  P +
Subjt:  PFVVVPTDVGLAACLGIVKSTDPIDLIGR---------------------YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQS

AT3G51330.1 Eukaryotic aspartyl protease family protein1.1e-9543.19Show/hide
Query:  QMLLFLSVFLLAGELRSGEA-GSFKFSIHHRFSDSIKGILDSEGL-PEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYAN
        Q+ + LS+ ++   L   EA G F F +HH FSD +K  L  + L PEK S  Y+  +  RDRL+ GR LA+ N + P+TF+ GN T  I  LGFL+YAN
Subjt:  QMLLFLSVFLLAGELRSGEA-GSFKFSIHHRFSDSIKGILDSEGL-PEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYAN

Query:  ISVGTPELSFLVALDTGSDLFWLPCEC-SSCLTYLNTTN-GGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQD
        +SVGTP   FLVALDTGSDLFWLPC C S+C+  L          LN YSP  S+TS+S+ CS+  C  +++CSS  S+CPY+I YLS +T ++G L +D
Subjt:  ISVGTPELSFLVALDTGSDLFWLPCEC-SSCLTYLNTTN-GGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQD

Query:  VLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFG--YDGQGRIDFGDIGTSGQRETPFNTMVNFPS
        VLHL T+D+ L+PV A IT GCGK QTG    SAA NGL+GLG++  SVPS LA   +T++SFSMCFG   D  GRI FGD G + Q ETP       P+
Subjt:  VLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFG--YDGQGRIDFGDIGTSGQRETPFNTMVNFPS

Query:  YNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDP-FVVVP
        Y V+ T++ VGG +  +Q  A+FD+GTSF+++ +P Y LI +  D  +  +R   DP+ PFE+CY L PN  +   PR+  T +GG      +P F+V  
Subjt:  YNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDP-FVVVP

Query:  TDVGLAACLGIVKSTD-PIDLIGR-YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPP---SDSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIG
         D     CLGI+KS D  I++IG+ + +G     D          SD    +S   T   PP   + SP +    PS   PPA +T    +        G
Subjt:  TDVGLAACLGIVKSTD-PIDLIGR-YDNGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPP---SDSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIG

Query:  AGDATRLNPLGCVFVAILAIL
         G A  L PL    + +L +L
Subjt:  AGDATRLNPLGCVFVAILAIL

AT3G51350.1 Eukaryotic aspartyl protease family protein3.0e-8839.89Show/hide
Query:  QMLLFLSVFLLA-GELRSGEAGSFKFSIHHRFSDSIKGILD-SEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYAN
        Q+ + LSV ++  G  R    G F F +HH FSDS+K  L   + +PE+ S  Y+  + HRDRL+ GR LA+ N + P+TF  GN T  +  LG LYYAN
Subjt:  QMLLFLSVFLLA-GELRSGEAGSFKFSIHHRFSDSIKGILD-SEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYAN

Query:  ISVGTPELSFLVALDTGSDLFWLPCEC-SSCLTYLNTTN-GGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQD
        +SVGTP  SFLVALDTGSDLFWLPC C ++C+  L          LN Y+P  STTS+S+ CS+  C  + +CSS +S CPY+I+Y S +T + G L+QD
Subjt:  ISVGTPELSFLVALDTGSDLFWLPCEC-SSCLTYLNTTN-GGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQD

Query:  VLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFG--YDGQGRIDFGDIGTSGQRETPFNTMVNFPS
        VLHLAT+D+ L PV A +T GCG+ QTG+F  + + NG++GLG++  SVPS LA   +T++SFSMCFG      GRI FGD G + Q ETPF ++    +
Subjt:  VLHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFG--YDGQGRIDFGDIGTSGQRETPFNTMVNFPS

Query:  YNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPT
        Y V  + + V G   +++  A FD+G+SF+++ +P Y ++ +  D  ++  R   DP+ PFE+CY L PNA +   P +  T  GG      +PF    T
Subjt:  YNVTFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPT

Query:  DVG-LAACLGIVKSTD-PIDLIGR---------YD--------NGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPSGDSPPAPSTA
          G +  CLG++KS    I++IG+         +D          +    D S  +   PP   P  ++  P+ S+PP   P+S   PP+  + P P   
Subjt:  DVG-LAACLGIVKSTD-PIDLIGR---------YD--------NGAATPSDNSPPADNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPSGDSPPAPSTA

Query:  GGSNGTQPLPRIGAGDATRLNPLGCVFVAILAIL
          S G       G G A  L PL    + +L +L
Subjt:  GGSNGTQPLPRIGAGDATRLNPLGCVFVAILAIL

AT3G51360.1 Eukaryotic aspartyl protease family protein2.3e-8842.11Show/hide
Query:  LRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGD-PPLTFVYGNDTFLIGNLGFLYYANISVGTPELSFLVALD
        L S  +GS  F IHHRFS+ +K +L   GLPE  S  YY  +VHRDR   GR+L + N +   ++F  GN T     + FL+YAN+++GTP   FLVALD
Subjt:  LRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGD-PPLTFVYGNDTFLIGNLGFLYYANISVGTPELSFLVALD

Query:  TGSDLFWLPCEC-SSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQDVLHLATDDKQLKPVDA
        TGSDLFWLPC C S+C+  + T  G +  LN Y+P  S +S+ V C+++LC L N+C S  S CPY I YLS  + S+G LV+DV+H++T++ + +  DA
Subjt:  TGSDLFWLPCEC-SSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQDVLHLATDDKQLKPVDA

Query:  KITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNVTFTQIIVGGKSNNLQ
        +ITFGC + Q G+F +  A NG++GL +  I+VP+ L   G+ SDSFSMCFG +G+G I FGD G+S Q ETP +  ++   Y+V+ T+  VG  + + +
Subjt:  KITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNVTFTQIIVGGKSNNLQ

Query:  FSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPTDVG--LAACLGIVKSTD
        F+A FDSGT+ +++ +P Y+ +       +   R+    D PFE+CY +   ++    P ++F MKGG  Y    P +V  T  G     CL ++K  +
Subjt:  FSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPTDVG--LAACLGIVKSTD

AT4G35880.1 Eukaryotic aspartyl protease family protein1.1e-11148.69Show/hide
Query:  LFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEG----LPEKQSPGYYATMVHRDRLVHGRRL--ATTNGDPPLTFVYGNDTFLIGNLGFLYYA
        LFL   L+     S     F F +HHRFSD +K   DS G     P K S  Y+  +V RD L+ GRRL  + +  +  LTF  GN T  I +LGFL+Y 
Subjt:  LFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEG----LPEKQSPGYYATMVHRDRLVHGRRL--ATTNGDPPLTFVYGNDTFLIGNLGFLYYA

Query:  NISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQDV
         + +GTP + F+VALDTGSDLFW+PC+C  C      T   +F L+ Y+PK STT+  V C+NSLC   NQC  T STCPY ++Y+SA TS+SG L++DV
Subjt:  NISVGTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQDV

Query:  LHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNV
        +HL T+DK  + V+A +TFGCG++Q+G F D AAPNGL GLGMEKISVPS LA +GL +DSFSMCFG+DG GRI FGD G+S Q ETPFN   + P+YN+
Subjt:  LHLATDDKQLKPVDAKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNV

Query:  TFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPTDVG
        T T++ VG    + +F+A+FD+GTSF+Y+ DP+Y+ ++E   +  + +R   D   PFEYCY +  +AN+   P L+ TMKG  ++   DP +V+ T+  
Subjt:  TFTQIIVGGKSNNLQFSAIFDSGTSFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPTDVG

Query:  LAACLGIVKSTDPIDLIGR
        L  CL IVKS++ +++IG+
Subjt:  LAACLGIVKSTDPIDLIGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTGGACGTTCAGTTCCGGTGCCCAAATGTTACTGTTTCTTTCTGTTTTCCTTCTCGCCGGCGAGCTGAGGAGCGGCGAGGCCGGTTCGTTTAAGTTCAGTATCCA
CCATCGATTTTCGGATTCGATTAAGGGGATTCTCGATTCCGAAGGTCTGCCGGAGAAGCAGAGCCCTGGATACTATGCTACCATGGTCCACCGCGATCGGTTAGTTCACG
GCCGGCGATTGGCAACCACTAACGGTGATCCGCCGCTGACGTTCGTTTATGGAAACGATACCTTCCTCATCGGCAATTTGGGATTTTTATACTACGCCAATATATCGGTC
GGAACGCCGGAGTTATCTTTTCTAGTGGCGTTGGATACCGGTAGTGATTTGTTCTGGTTACCGTGTGAATGCAGCAGTTGTCTTACTTACTTGAACACGACCAATGGTGG
AAAGTTTGGGTTGAATCATTACAGTCCAAAGGATTCAACAACGAGCACAAGTGTCCCTTGCAGCAATTCTTTGTGTGAACTTGCAAACCAATGCTCTTCAACCACAAGTA
CTTGTCCTTACGAAATTAATTACCTGTCTGCCAATACCTCATCCTCTGGGTACTTGGTACAGGACGTATTGCACTTGGCCACCGATGATAAACAATTAAAACCCGTTGAT
GCTAAGATTACTTTTGGGTGCGGTAAGATCCAGACTGGTATATTTGCAGATAGTGCAGCTCCCAATGGTCTTATTGGACTTGGAATGGAAAAGATATCGGTTCCAAGCTT
CTTAGCGAGCCAAGGGCTCACTTCAGATTCATTCTCCATGTGTTTTGGATACGATGGTCAGGGGAGGATCGATTTTGGAGACATAGGCACGTCAGGCCAGAGAGAAACAC
CCTTCAATACCATGGTGAACTTTCCATCCTACAATGTCACCTTCACTCAGATAATTGTGGGAGGAAAATCCAACAATCTTCAATTTTCTGCAATTTTCGACTCTGGTACC
TCGTTTTCATACATAACCGACCCAGTTTACTCTCTTATTGCCGAGCAAATGGATGCAGGGATGAAATTAGAGCGCGTTAAATTTGATCCTGATTTCCCATTTGAGTACTG
CTACCAACTTCCTCCAAATGCAAACAGTTTTTCTCATCCGAGACTGAACTTCACGATGAAGGGTGGAGATAACTATGGCACCTTGGATCCATTTGTTGTTGTTCCTACTG
ATGTGGGATTGGCTGCTTGTTTAGGCATTGTCAAAAGCACTGATCCAATTGATTTAATTGGACGCTACGACAATGGCGCCGCCACTCCTTCCGACAACTCTCCTCCGGCC
GACAACTCTCCGCCGTCCGACTCCCCTCCAACCGACAGCTCTCCTCCGACCGACAGTTCCCCTCCATCCGACTCTCCTCAGTCCGGCGACTCTCCCCCGTCCGGTGACTC
TCCTCCGGCTCCTTCTACCGCAGGAGGAAGCAACGGTACTCAACCATTGCCGAGAATTGGAGCGGGTGATGCCACGCGGTTGAACCCACTTGGCTGTGTATTTGTTGCCA
TTCTAGCAATTTTGGTGGTTGTTTGA
mRNA sequenceShow/hide mRNA sequence
CCGGATTTCTCAGGTTCTTTCCCGGAAAGTTTTTGAATCCTCTAATCGGCGTCCATGGCGTGGACGTTCAGTTCCGGTGCCCAAATGTTACTGTTTCTTTCTGTTTTCCT
TCTCGCCGGCGAGCTGAGGAGCGGCGAGGCCGGTTCGTTTAAGTTCAGTATCCACCATCGATTTTCGGATTCGATTAAGGGGATTCTCGATTCCGAAGGTCTGCCGGAGA
AGCAGAGCCCTGGATACTATGCTACCATGGTCCACCGCGATCGGTTAGTTCACGGCCGGCGATTGGCAACCACTAACGGTGATCCGCCGCTGACGTTCGTTTATGGAAAC
GATACCTTCCTCATCGGCAATTTGGGATTTTTATACTACGCCAATATATCGGTCGGAACGCCGGAGTTATCTTTTCTAGTGGCGTTGGATACCGGTAGTGATTTGTTCTG
GTTACCGTGTGAATGCAGCAGTTGTCTTACTTACTTGAACACGACCAATGGTGGAAAGTTTGGGTTGAATCATTACAGTCCAAAGGATTCAACAACGAGCACAAGTGTCC
CTTGCAGCAATTCTTTGTGTGAACTTGCAAACCAATGCTCTTCAACCACAAGTACTTGTCCTTACGAAATTAATTACCTGTCTGCCAATACCTCATCCTCTGGGTACTTG
GTACAGGACGTATTGCACTTGGCCACCGATGATAAACAATTAAAACCCGTTGATGCTAAGATTACTTTTGGGTGCGGTAAGATCCAGACTGGTATATTTGCAGATAGTGC
AGCTCCCAATGGTCTTATTGGACTTGGAATGGAAAAGATATCGGTTCCAAGCTTCTTAGCGAGCCAAGGGCTCACTTCAGATTCATTCTCCATGTGTTTTGGATACGATG
GTCAGGGGAGGATCGATTTTGGAGACATAGGCACGTCAGGCCAGAGAGAAACACCCTTCAATACCATGGTGAACTTTCCATCCTACAATGTCACCTTCACTCAGATAATT
GTGGGAGGAAAATCCAACAATCTTCAATTTTCTGCAATTTTCGACTCTGGTACCTCGTTTTCATACATAACCGACCCAGTTTACTCTCTTATTGCCGAGCAAATGGATGC
AGGGATGAAATTAGAGCGCGTTAAATTTGATCCTGATTTCCCATTTGAGTACTGCTACCAACTTCCTCCAAATGCAAACAGTTTTTCTCATCCGAGACTGAACTTCACGA
TGAAGGGTGGAGATAACTATGGCACCTTGGATCCATTTGTTGTTGTTCCTACTGATGTGGGATTGGCTGCTTGTTTAGGCATTGTCAAAAGCACTGATCCAATTGATTTA
ATTGGACGCTACGACAATGGCGCCGCCACTCCTTCCGACAACTCTCCTCCGGCCGACAACTCTCCGCCGTCCGACTCCCCTCCAACCGACAGCTCTCCTCCGACCGACAG
TTCCCCTCCATCCGACTCTCCTCAGTCCGGCGACTCTCCCCCGTCCGGTGACTCTCCTCCGGCTCCTTCTACCGCAGGAGGAAGCAACGGTACTCAACCATTGCCGAGAA
TTGGAGCGGGTGATGCCACGCGGTTGAACCCACTTGGCTGTGTATTTGTTGCCATTCTAGCAATTTTGGTGGTTGTTTGACTTTGATTATTATTAATCTCTGGGTTTTTT
CATATTTGCAGTGTATTTAATTTTTATTCTTGAAATAATTATTCTAGAAGGAAACAGTTTTATTGTTTTTCTCTGCTTTGGGTTTTGTAAATCAATGTTATTCTAATCAA
TTACATATTGGTTCAATTTCAATCAAAGTTCACAAA
Protein sequenceShow/hide protein sequence
MAWTFSSGAQMLLFLSVFLLAGELRSGEAGSFKFSIHHRFSDSIKGILDSEGLPEKQSPGYYATMVHRDRLVHGRRLATTNGDPPLTFVYGNDTFLIGNLGFLYYANISV
GTPELSFLVALDTGSDLFWLPCECSSCLTYLNTTNGGKFGLNHYSPKDSTTSTSVPCSNSLCELANQCSSTTSTCPYEINYLSANTSSSGYLVQDVLHLATDDKQLKPVD
AKITFGCGKIQTGIFADSAAPNGLIGLGMEKISVPSFLASQGLTSDSFSMCFGYDGQGRIDFGDIGTSGQRETPFNTMVNFPSYNVTFTQIIVGGKSNNLQFSAIFDSGT
SFSYITDPVYSLIAEQMDAGMKLERVKFDPDFPFEYCYQLPPNANSFSHPRLNFTMKGGDNYGTLDPFVVVPTDVGLAACLGIVKSTDPIDLIGRYDNGAATPSDNSPPA
DNSPPSDSPPTDSSPPTDSSPPSDSPQSGDSPPSGDSPPAPSTAGGSNGTQPLPRIGAGDATRLNPLGCVFVAILAILVVV