; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1216 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1216
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionaspartic proteinase-like protein 1
Genome locationMC04:20235931..20241099
RNA-Seq ExpressionMC04g1216
SyntenyMC04g1216
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032744.1 Aspartic proteinase-like protein 1 [Cucurbita argyrosperma subsp. argyrosperma]2.29e-31182.91Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
        MAN +VVLL + CFL DSSVA   SS+L+HRFS+EAKALW+SR+GN S KFWPRR+SLKYFE L D DLKRRRLKIGSK E++ PSEG+EV+FFGNEFDW
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW

Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        LHYTWIDIGTPSVSFLVALD GSDLLWVPCDCIQCAPLSAS+YS LDRDLS YNPALS+TS++LSC HQLCAWS TCK PD+PCTYKRDYY+DNTS+SGF
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
        MIEDKLHLASFSKH  + L+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFD NGSGRILFGDNG ATQQTT+FLPLF
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF

Query:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
        GEFDAYFV VESFCVGSSCLQKSGF ALVDSGSSFTYLP E+Y++IVFEFDKQVKLNATRI LQEFPW+YCYN SSLES+ IPSMKLVFPLNQSFIHDPV
Subjt:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV

Query:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK
        Y LP +QGYK+FCLTLEETDDDYG+IGQNLMVGYR+VFDRENL+LGWSKSKCLDIN  ++ +AKPPSNDGSP A+P++   SPPNRQ IAPTA+R    K
Subjt:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK

Query:  SSPTASHFS
        SS TA HFS
Subjt:  SSPTASHFS

XP_022141351.1 aspartic proteinase-like protein 1 [Momordica charantia]0.0100Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
        MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW

Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
        MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF

Query:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
        GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
Subjt:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV

Query:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKS
        YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKS
Subjt:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKS

Query:  SPTASHFSPLLLLFPVFLVVC
        SPTASHFSPLLLLFPVFLVVC
Subjt:  SPTASHFSPLLLLFPVFLVVC

XP_022963449.1 aspartic proteinase-like protein 1 [Cucurbita moschata]1.96e-31283.3Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
        MAN +VVLL +ACF  DSSVA   SS+LIHRFS+EAKALW+SR+GN S KFWPRR+SLKYFE L D DLKRRRLKIGSK E++ PSEG+EV+FFGNEFDW
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW

Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        LHYTWIDIGTPSVSFLVALD GSDLLWVPCDCIQCAPLSAS+YS LDRDLS YNPALS+TS++LSC HQLCAWS TCK PD+PCTYKRDYY+DNTS+SGF
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
        MIEDKLHLASFSKH  + L+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFD NGSGRILFGDNG ATQQTT+FLPLF
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF

Query:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
        GEFDAYFV VESFCVGSSCLQKSGF ALVDSGSSFTYLP E+Y++IVFEFDKQVKLNATRI LQEFPW+YCYN SSLES+ IPSMKLVFPLNQSFIHDPV
Subjt:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV

Query:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK
        Y LP +QGYK+FCLTLEETDDDYG+IGQNLMVGYR+VFDRENL+LGWSKSKCLDIN  ++ +AKPPSNDGSP A+P++   SPPNRQ IAPTA+R   SK
Subjt:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK

Query:  SSPTASHFS
        SS TA HFS
Subjt:  SSPTASHFS

XP_023545003.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita pepo subsp. pepo]2.79e-31283.1Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
        MAN +VVLL +ACFL DSSVA   SS+L+HRFS+EAKALW+SR+GN S KFWPRR+SLKYFE L D DLKRRRLKIGSK E++ PSEG+EV+FFGNEFDW
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW

Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        LHYTWIDIGTPSVSFLVALD GSDLLWVPCDCIQCAPLSAS+YS LDRDLS YNPALS+TS++LSC HQLCAWS TCK PD+PC+YKRDYY+DNTS+SGF
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
        MIEDKLHLASFSKH  + L+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFD NGSGRILFGDNG ATQQTT+FLPLF
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF

Query:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
        GEFDAYFV VESFCVGSSCLQKSGF ALVDSGSSFTYLP E+Y++IVFEFDKQVKLNATRI LQEFPW+YCYN SSLES+ IPSMKLVFPLNQSFIHDPV
Subjt:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV

Query:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK
        Y LP +QGYK+FCLTLEETDDDYG+IGQNLMVGYR+VFDRENL+LGWSKSKCLDIN  ++ +AKPPSNDGSP A+P++   SPPNRQ IAPTA+R   SK
Subjt:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK

Query:  SSPTASHFS
        SS TA HFS
Subjt:  SSPTASHFS

XP_038875404.1 aspartic proteinase-like protein 1 isoform X2 [Benincasa hispida]2.95e-31582.92Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESR-SGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD
        MANC++V LLMA    DSS+A T +SKL+HRFS+EAK+LWESR +GNVS KFWP R+S KYFE+LMD DLKRRRLK GSK ++L PSEGS+V+FFGNEF+
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESR-SGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD

Query:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG
        WLHYTWIDIGTPSV FLVALDVGSDLLWVPCDCI CAPLSASYYSVLDRDLSEYNPALSSTSKHLSC H+LCAWS TCK PDEPCTYKRDYYSDNTS+SG
Subjt:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG

Query:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL
        FMIEDKLHLASFSKH  +SL+QASV+LGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFD NGSGRILFGDNG ATQQTT+FLPL
Subjt:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL

Query:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP
        FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLP+EVY++IVFEFDKQVK NATRI LQE PW+YCYN SSL S +IPSMKLVFPLNQSFIHDP
Subjt:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP

Query:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINS-KSYNAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT
        VY+LP N+G K+FCLTLEETD+DYG+IGQNLMVGYRMVFDRENLKLGWSKSKCLDINS K+ +AKPPS+DG   SPIA+P      PPNRQAIAPTA+RT
Subjt:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINS-KSYNAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT

Query:  TSSKSSPTASHFSP-LLLLFPVFLVVC
         S KSS TAS FSP LLLL  VFLV C
Subjt:  TSSKSSPTASHFSP-LLLLFPVFLVVC

TrEMBL top hitse value%identityAlignment
A0A0A0KN37 Peptidase A1 domain-containing protein1.80e-30479.7Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESR-SGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD
        MANC ++LL +A    + S+A T S  L+HRFS+EAK+LWESR +GNVS KFWP  +SLKYF++LMD DLKRRRL IGSK ++L PSEGS+V+FFGNEF+
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESR-SGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD

Query:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG
        WLHYTWID+GTPSV FLVALDVGSDLLWVPCDCIQCAPLSA+YYSVLDRDLSEYNPALSSTSKHL C HQLCAWS TCK  ++PCTYKRDYYSDNTS+SG
Subjt:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG

Query:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL
        FMIEDKL L SFSKH   SL+QASVV GCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFD NGSGRILFGD+G ATQQTT+FLPL
Subjt:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL

Query:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP
        FGEF AYF+GVESFCVGSSCLQ+SGFQALVDSGSSFTYLPAEVY++IVFEFDKQVK+NATRI L+E PW+YCYN+S+L S NIPSM+LVFPLNQ FIHDP
Subjt:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP

Query:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSY-NAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT
        VYVLP NQGYK+FCLTLEETD+DYG+IGQNLMVGYRMVFDRENLKLGWSKSKCLDINS +  +AKPPSN+G   SPIA+P      P NRQAIAPTA+RT
Subjt:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSY-NAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT

Query:  TSSKSSPTASHFSPLLLLF-PVFLVVC
         SSKSS +ASHFSPLLLL    FLV C
Subjt:  TSSKSSPTASHFSPLLLLF-PVFLVVC

A0A1S3CD36 aspartic proteinase-like protein 12.95e-31080.65Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWES-RSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD
        MANC ++LLL+AC   D S+  T S KL+HRFS+EAK+LW+S R+GNVS KFWP R+SLKYF++L+D DLKRRRLKIGSK +ML PSEGS+V+FFGNEF+
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWES-RSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD

Query:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG
        WLHYTWIDIGTP V FLVALDVGSDLLWVPCDC+QCAPLSASYYSVLDRDLSEYNPALSSTSKHL C HQLCAWS TCK P++PCTYKRDYYSDNTS+SG
Subjt:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG

Query:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL
        +MIEDKLHL SFSKH   SL+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFD NGSGRI+FGD+G ATQQTT+FLPL
Subjt:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL

Query:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP
        FGEF AYF+GVESFCVGSSCLQ+SGFQALVDSGSSFTYLPAEVY++IVFEFDKQVK NATRI LQE PW+YCYN+S+L S NIPSMKLVFPLNQ FIHDP
Subjt:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP

Query:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSY-NAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT
        VY+LP NQGYK+FCLTLEETD+DYG+IGQNLMVGYRMVFDRENLKLGWSKSKCLDINS +  +AKPPSN+G   SPIA+P      P NRQAIAPTA+RT
Subjt:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSY-NAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT

Query:  TSSKSSPTASHFSPLLLLF-PVFLVVC
         SSKSS +AS+FSPLLLL    FLV C
Subjt:  TSSKSSPTASHFSPLLLLF-PVFLVVC

A0A5A7V9R0 Aspartic proteinase-like protein 11.46e-31080.65Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWES-RSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD
        MANC ++LLL+AC   D S+  T S KL+HRFS+EAK+LW+S R+GNVS KFWP R+SLKYF++L+D DLKRRRLKIGSK +ML PSEGS+V+FFGNEF+
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWES-RSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFD

Query:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG
        WLHYTWIDIGTP V FLVALDVGSDLLWVPCDC+QCAPLSASYYSVLDRDLSEYNPALSSTSKHL C HQLCAWS TCK P++PCTYKRDYYSDNTS+SG
Subjt:  WLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSG

Query:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL
        +MIEDKLHL SFSKH   SL+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFD NGSGRI+FGD+G ATQQTT+FLPL
Subjt:  FMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL

Query:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP
        FGEF AYF+GVESFCVGSSCLQ+SGFQALVDSGSSFTYLPAEVY++IVFEFDKQVK NATRI LQE PW+YCYN+S+L S NIPSMKLVFPLNQ FIHDP
Subjt:  FGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDP

Query:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSY-NAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT
        VY+LP NQGYK+FCLTLEETD+DYG+IGQNLMVGYRMVFDRENLKLGWSKSKCLDINS +  +AKPPSN+G   SPIA+P      P NRQAIAPTA+RT
Subjt:  VYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSY-NAKPPSNDG---SPIAIPSNEQKSPPNRQAIAPTASRT

Query:  TSSKSSPTASHFSPLLLLF-PVFLVVC
         SSKSS +AS+FSPLLLL    FLV C
Subjt:  TSSKSSPTASHFSPLLLLF-PVFLVVC

A0A6J1CJM3 aspartic proteinase-like protein 10.0100Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
        MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW

Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
        MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF

Query:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
        GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
Subjt:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV

Query:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKS
        YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKS
Subjt:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKS

Query:  SPTASHFSPLLLLFPVFLVVC
        SPTASHFSPLLLLFPVFLVVC
Subjt:  SPTASHFSPLLLLFPVFLVVC

A0A6J1HK50 aspartic proteinase-like protein 19.51e-31383.3Show/hide
Query:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW
        MAN +VVLL +ACF  DSSVA   SS+LIHRFS+EAKALW+SR+GN S KFWPRR+SLKYFE L D DLKRRRLKIGSK E++ PSEG+EV+FFGNEFDW
Subjt:  MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDW

Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        LHYTWIDIGTPSVSFLVALD GSDLLWVPCDCIQCAPLSAS+YS LDRDLS YNPALS+TS++LSC HQLCAWS TCK PD+PCTYKRDYY+DNTS+SGF
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF
        MIEDKLHLASFSKH  + L+QASVVLGCGRKQSG YLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFD NGSGRILFGDNG ATQQTT+FLPLF
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF

Query:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV
        GEFDAYFV VESFCVGSSCLQKSGF ALVDSGSSFTYLP E+Y++IVFEFDKQVKLNATRI LQEFPW+YCYN SSLES+ IPSMKLVFPLNQSFIHDPV
Subjt:  GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPV

Query:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK
        Y LP +QGYK+FCLTLEETDDDYG+IGQNLMVGYR+VFDRENL+LGWSKSKCLDIN  ++ +AKPPSNDGSP A+P++   SPPNRQ IAPTA+R   SK
Subjt:  YVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK

Query:  SSPTASHFS
        SS TA HFS
Subjt:  SSPTASHFS

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 361.3e-3029.03Show/hide
Query:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPC-DCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAW---SATCKRPDEPCTYKRDYYSDNTS
        L++T I +G+P   + V +D GSD+LWV C  C +C P+       L   LS Y+   SSTSK++ C+   C++   S TC    +PC+Y    Y D ++
Subjt:  LHYTWIDIGTPSVSFLVALDVGSDLLWVPC-DCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAW---SATCKRPDEPCTYKRDYYSDNTS

Query:  SSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSG--GYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDG-NGSGRILFGDNGSATQQT
        S G  I+D + L   + + + + +   VV GCG+ QSG  G  D A  DG+MG G  N S+ + LA  G  +  FS C D  NG G    G+  S   +T
Subjt:  SSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSG--GYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDG-NGSGRILFGDNGSATQQT

Query:  TRFLPLFGEFDAYFVGV----ESFCVGSSCLQKSG-FQALVDSGSSFTYLPAEVYRRIVFEF--DKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMK
        T  +P    ++    G+    +   +  S    +G    ++DSG++  YLP  +Y  ++ +    +QVKL+  + T        C++ +S      P + 
Subjt:  TRFLPLFGEFDAYFVGV----ESFCVGSSCLQKSG-FQALVDSGSSFTYLPAEVYRRIVFEF--DKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMK

Query:  LVFPLN---QSFIHDPVYVLPVNQ---GYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKC
        L F  +     + HD ++ L  +    G++   +T ++   D  ++G  ++    +V+D EN  +GW+   C
Subjt:  LVFPLN---QSFIHDPVYVLPVNQ---GYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKC

Q8VYV9 Aspartyl protease family protein 12.7e-7336.16Show/hide
Query:  ANCVVVLLLMACFLADSSVAH------TFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLP-SEGSEVLFF
        ++C ++ L +   LA S V         F  +  HRFS++         G +     P RDS KY+ ++   D   R  ++ ++D+ L+  S+G+E +  
Subjt:  ANCVVVLLLMACFLADSSVAH------TFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLP-SEGSEVLFF

Query:  GNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSD
         +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   S L  DL+ Y+P  SSTS  + C+  LC     C  P+  C Y+  Y S+
Subjt:  GNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSD

Query:  NTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQT
         TSS+G ++ED LHL S  K +K   + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LAK G+  N+FS+CF  +G+GRI FGD GS  Q+ 
Subjt:  NTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQT

Query:  TRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVS-SLESANIPSMKLVFPLN
        T  L +      Y + V    VG +      F A+ DSG+SFTYL    Y  I   F+        + T  E P+ YCY +S + +S   P++ L     
Subjt:  TRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVS-SLESANIPSMKLVFPLN

Query:  QSF-IHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKC---------LDINSKSYNAKPPSNDGSPIAIPSNEQKSP
         S+ ++ P+ V+P+ +   ++CL + +  +D  IIGQN M GYR+VFDRE L LGW +S C         L  N  S +A+PP++   P      E  + 
Subjt:  QSF-IHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKC---------LDINSKSYNAKPPSNDGSPIAIPSNEQKSP

Query:  PNRQAIAPTASRTTSSKSSPTASHFSPLLLL
        P+++    T S   S   S +   FS L +L
Subjt:  PNRQAIAPTASRTTSSKSSPTASHFSPLLLL

Q9LX20 Aspartic proteinase-like protein 17.9e-13449.52Show/hide
Query:  LLLMACFLA-DSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWI
        LL    FLA + ++A  FSS+LIHRFS+E +A  ++ S + S    P + SL+Y+ LL ++D +R+R+ +G+K + L+PSEGS+ +  GN+F WLHYTWI
Subjt:  LLLMACFLA-DSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWI

Query:  DIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDK
        DIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YYS L  +DL+EYNP+ SSTSK   C H+LC  ++ C+ P E C Y  +Y S NTSSSG ++ED 
Subjt:  DIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDK

Query:  LHLASFSKHA---KRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL-FG
        LHL   + +      S V+A VV+GCG+KQSG YLDG APDG+MGLGP  ISVP+ L+KAGL+RN+FSLCFD   SGRI FGD G + QQ+T FL L   
Subjt:  LHLASFSKHA---KRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL-FG

Query:  EFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSF-IHDPV
        ++  Y VGVE+ C+G+SCL+++ F   +DSG SFTYLP E+YR++  E D+ +  NAT    +   W YCY  S+     +P++KL F  N +F IH P+
Subjt:  EFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSF-IHDPV

Query:  YVLPVNQGYKIFCLTLEET-DDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK
        +V   +QG   FCL +  +  +  G IGQN M GYRMVFDREN+KLGWS SKC +   +   A P S   SP  +P++EQ+S     A++P  +  T SK
Subjt:  YVLPVNQGYKIFCLTLEET-DDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK

Query:  --SSPTASHFSPLLLLFPVFLVV
          SS ++  FS ++ LF   L++
Subjt:  --SSPTASHFSPLLLLFPVFLVV

Q9M9A8 Aspartyl protease APCB17.9e-2528.32Show/hide
Query:  GNEF-DWLHYTWIDIGTPSVS--FLVALDVGSDLLWVPCD--CIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRD
        GN + D L+YT I +G P     + + +D GS+L W+ CD  C  CA  +   Y     +L   + A     +     +QL      C +    C Y+ +
Subjt:  GNEF-DWLHYTWIDIGTPSVS--FLVALDVGSDLLWVPCD--CIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRD

Query:  YYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDG-AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCF--DGNGSGRILFGDN
         Y+D++ S G + +DK HL    K    SL ++ +V GCG  Q G  L+     DG++GL    IS+P+ LA  G++ N    C   D NG G I  G +
Subjt:  YYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDG-AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCF--DGNGSGRILFGDN

Query:  GSATQQTTRFLPLF--GEFDAYFVGVESFCVGSSCLQKSG-----FQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQE-----------FPW
           +   T ++P+      DAY + V     G   L   G      + L D+GSS+TY P + Y ++V    +   L  TR    E           FP+
Subjt:  GSATQQTTRFLPLF--GEFDAYFVGVESFCVGSSCLQKSG-----FQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQE-----------FPW

Query:  SYCYNVSSL---ESANIPSMKLVFPLNQSFIHDPVYVLPVNQGYKIFCLTLEE----TDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCL
        S   +V       +  I S  L+    +  I    Y++  N+G    CL + +     D    I+G   M G+ +V+D    ++GW KS C+
Subjt:  SYCYNVSSL---ESANIPSMKLVFPLNQSFIHDPVYVLPVNQGYKIFCLTLEE----TDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCL

Q9S9K4 Aspartic proteinase 392.2e-2727.65Show/hide
Query:  KFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEG-SEVLFFGNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDR
        KF  ++ +L++F+    +D +R    + S D   LP  G S V   G     L++T I +G+P   + V +D GSD+LW+ C      P   +    L+ 
Subjt:  KFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEG-SEVLFFGNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDR

Query:  DLSEYNPALSSTSKHLSCDHQLCAW---SATCKRPDEPCTYKRDYYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDG-AAPDG
         LS ++   SSTSK + CD   C++   S +C +P   C+Y    Y+D ++S G  I D L L   +   K   +   VV GCG  QSG   +G +A DG
Subjt:  DLSEYNPALSSTSKHLSCDHQLCAW---SATCKRPDEPCTYKRDYYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDG-AAPDG

Query:  VMGLGPGNISVPTLLAKAGLVRNTFSLCFDG-NGSGRILFGDNGSATQQTTRFLPLFGEFDAYFVGVE----SFCVGSSCLQKSGFQALVDSGSSFTYLP
        VMG G  N SV + LA  G  +  FS C D   G G    G   S   +TT  +P    ++   +G++    S  +  S ++  G   +VDSG++  Y P
Subjt:  VMGLGPGNISVPTLLAKAGLVRNTFSLCFDG-NGSGRILFGDNGSATQQTTRFLPLFGEFDAYFVGVE----SFCVGSSCLQKSGFQALVDSGSSFTYLP

Query:  AEVYRRIVFEF--DKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQS---FIHDPVYVLPVNQ---GYKIFCLTLEETDDDYGIIGQNLM
          +Y  ++      + VKL+    T Q      C++ S+      P +   F  +     + HD ++ L       G++   LT +E  +   ++G  ++
Subjt:  AEVYRRIVFEF--DKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQS---FIHDPVYVLPVNQ---GYKIFCLTLEETDDDYGIIGQNLM

Query:  VGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPP
            +V+D +N  +GW+   C      S + K     G   ++ ++   S P
Subjt:  VGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPP

Arabidopsis top hitse value%identityAlignment
AT2G17760.1 Eukaryotic aspartyl protease family protein1.9e-7436.16Show/hide
Query:  ANCVVVLLLMACFLADSSVAH------TFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLP-SEGSEVLFF
        ++C ++ L +   LA S V         F  +  HRFS++         G +     P RDS KY+ ++   D   R  ++ ++D+ L+  S+G+E +  
Subjt:  ANCVVVLLLMACFLADSSVAH------TFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLP-SEGSEVLFF

Query:  GNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSD
         +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   S L  DL+ Y+P  SSTS  + C+  LC     C  P+  C Y+  Y S+
Subjt:  GNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCA-PLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSD

Query:  NTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQT
         TSS+G ++ED LHL S  K +K   + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LAK G+  N+FS+CF  +G+GRI FGD GS  Q+ 
Subjt:  NTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQT

Query:  TRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVS-SLESANIPSMKLVFPLN
        T  L +      Y + V    VG +      F A+ DSG+SFTYL    Y  I   F+        + T  E P+ YCY +S + +S   P++ L     
Subjt:  TRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVS-SLESANIPSMKLVFPLN

Query:  QSF-IHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKC---------LDINSKSYNAKPPSNDGSPIAIPSNEQKSP
         S+ ++ P+ V+P+ +   ++CL + +  +D  IIGQN M GYR+VFDRE L LGW +S C         L  N  S +A+PP++   P      E  + 
Subjt:  QSF-IHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKC---------LDINSKSYNAKPPSNDGSPIAIPSNEQKSP

Query:  PNRQAIAPTASRTTSSKSSPTASHFSPLLLL
        P+++    T S   S   S +   FS L +L
Subjt:  PNRQAIAPTASRTTSSKSSPTASHFSPLLLL

AT3G51330.1 Eukaryotic aspartyl protease family protein5.3e-6133.33Show/hide
Query:  VVVLLLMACF-LADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDND--LKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWL
        V++ LL+ C+ L     +  FS ++ H FS+  K     +S  +     P + SL+YF++L   D  ++ R L   +++  +    G+  +   +   +L
Subjt:  VVVLLLMACF-LADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDND--LKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWL

Query:  HYTWIDIGTPSVSFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF
        HY  + +GTP+  FLVALD GSDL W+PC+C   C            R L+ Y+P  SSTS  + C    C  S+ C  P   C Y+  Y S +T ++G 
Subjt:  HYTWIDIGTPSVSFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGF

Query:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDG--NGSGRILFGDNGSATQQTTRFLP
        + ED LHL +  +  +   V+A++ LGCG+ Q+G     AA +G++GLG  + SVP++LAKA +  N+FS+CF    +  GRI FGD G   Q  T  LP
Subjt:  MIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDG--NGSGRILFGDNGSATQQTTRFLP

Query:  LFGEFDAYFVGVESFCVGSSCLQKSGFQ--ALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANI-PSMKLVFP-LNQS
               Y V V    VG   +   G Q  AL D+G+SFT+L    Y  I   FD  V  +  R    E P+ +CY++S  ++  + P + + F   +Q 
Subjt:  LFGEFDAYFVGVESFCVGSSCLQKSGFQ--ALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANI-PSMKLVFP-LNQS

Query:  FIHDPVYVLPVNQGYKIFCL-TLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKS--PPNRQAIAPT
        F+ +P++++       ++CL  L+  D    IIGQN M GYR+VFDRE + LGW +S C +  S      PP    +P    S    S  PP   A  P 
Subjt:  FIHDPVYVLPVNQGYKIFCL-TLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKS--PPNRQAIAPT

Query:  ASRTTSSKSSPTASH------FSPLLLLFPV
             S+++S T +        S LLLL P+
Subjt:  ASRTTSSKSSPTASH------FSPLLLLFPV

AT3G51350.1 Eukaryotic aspartyl protease family protein3.2e-5833.61Show/hide
Query:  PRRDSLKYFELLMDND-LKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDL
        P + SL+YF++L   D L R R    + DE  +  +G  +         L+Y  + +GTP  SFLVALD GSDL W+PC+C   C              L
Subjt:  PRRDSLKYFELLMDND-LKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDC-IQCAPLSASYYSVLDRDL

Query:  SEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGP
        + Y P  S+TS  + C  + C  S  C  P   C Y+   YS++T + G +++D LHLA+  ++   + V+A+V LGCG+KQ+G +    + +GV+GLG 
Subjt:  SEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGP

Query:  GNISVPTLLAKAGLVRNTFSLCFD---GNGSGRILFGDNGSATQQTTRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIV
           SVP+LLAKA +  N+FS+CF    GN  GRI FGD G   Q+ T F+ +     AY V +    V    +    F A  D+GSSFT+L    Y  + 
Subjt:  GNISVPTLLAKAGLVRNTFSLCFD---GNGSGRILFGDNGSATQQTTRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIV

Query:  FEFDKQVKLNATRITLQEFPWSYCYNVS-SLESANIPSMKLVFPLNQSFI-HDPVYVLPVNQGYKIFCL-TLEETDDDYGIIGQNLMVGYRMVFDRENLK
          FD+ V+ +  R    E P+ +CY++S +  +   P +++ F      I ++P +     +G  ++CL  L+       +IGQN + GYR+VFDRE + 
Subjt:  FEFDKQVKLNATRITLQEFPWSYCYNVS-SLESANIPSMKLVFPLNQSFI-HDPVYVLPVNQGYKIFCL-TLEETDDDYGIIGQNLMVGYRMVFDRENLK

Query:  LGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRT---------TSSKSSPTASHFSP----LLLLFPV
        LGW +S C +  S      PP      +  P+    +PP R ++ PT S T         T +  +  A++  P    LLLL P+
Subjt:  LGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRT---------TSSKSSPTASHFSP----LLLLFPV

AT4G35880.1 Eukaryotic aspartyl protease family protein3.1e-7736.81Show/hide
Query:  VVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDND--LKRRRLKIGSKDEMLLPSEGSEVLFFGN------
        ++ +L++  F   S     F+ ++ HRFS+E K  W   +G  ++  +P + S +YF  L+  D  ++ RRL     +     SE S     GN      
Subjt:  VVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDND--LKRRRLKIGSKDEMLLPSEGSEVLFFGN------

Query:  EFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTS
           +LHYT + +GTP + F+VALD GSDL WVPCDC +CAP   + Y+  + +LS YNP +S+T+K ++C++ LCA    C      C Y   Y S  TS
Subjt:  EFDWLHYTWIDIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTS

Query:  SSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRF
        +SG ++ED +HL +  K+ +R  V+A V  GCG+ QSG +LD AAP+G+ GLG   ISVP++LA+ GLV ++FS+CF  +G GRI FGD GS+ Q+ T F
Subjt:  SSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRF

Query:  LPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESAN-IPSMKLVFPLNQSF
          L      Y + V    VG++ +    F AL D+G+SFTYL   +Y  +   F  Q + +         P+ YCY++S+  +A+ IPS+ L    N  F
Subjt:  LPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESAN-IPSMKLVFPLNQSF

Query:  -IHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASR
         I+DP+ V+   +G  ++CL + ++  +  IIGQN M GYR+VFDRE L L W K  C DI   +      +   +     +   K+  N   +  T   
Subjt:  -IHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASR

Query:  TTSSKSSP
         + S SSP
Subjt:  TTSSKSSP

AT5G10080.1 Eukaryotic aspartyl protease family protein5.6e-13549.52Show/hide
Query:  LLLMACFLA-DSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWI
        LL    FLA + ++A  FSS+LIHRFS+E +A  ++ S + S    P + SL+Y+ LL ++D +R+R+ +G+K + L+PSEGS+ +  GN+F WLHYTWI
Subjt:  LLLMACFLA-DSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWI

Query:  DIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDK
        DIGTPSVSFLVALD GS+LLW+PC+C+QCAPL+++YYS L  +DL+EYNP+ SSTSK   C H+LC  ++ C+ P E C Y  +Y S NTSSSG ++ED 
Subjt:  DIGTPSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVL-DRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDK

Query:  LHLASFSKHA---KRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL-FG
        LHL   + +      S V+A VV+GCG+KQSG YLDG APDG+MGLGP  ISVP+ L+KAGL+RN+FSLCFD   SGRI FGD G + QQ+T FL L   
Subjt:  LHLASFSKHA---KRSLVQASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPL-FG

Query:  EFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSF-IHDPV
        ++  Y VGVE+ C+G+SCL+++ F   +DSG SFTYLP E+YR++  E D+ +  NAT    +   W YCY  S+     +P++KL F  N +F IH P+
Subjt:  EFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSF-IHDPV

Query:  YVLPVNQGYKIFCLTLEET-DDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK
        +V   +QG   FCL +  +  +  G IGQN M GYRMVFDREN+KLGWS SKC +   +   A P S   SP  +P++EQ+S     A++P  +  T SK
Subjt:  YVLPVNQGYKIFCLTLEET-DDDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSK

Query:  --SSPTASHFSPLLLLFPVFLVV
          SS ++  FS ++ LF   L++
Subjt:  --SSPTASHFSPLLLLFPVFLVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAATTGTGTGGTTGTTTTGCTCTTAATGGCTTGTTTCTTAGCGGACAGCTCGGTTGCGCATACGTTCTCGTCGAAGCTCATACATCGGTTTTCCGAGGAGGCGAA
AGCGCTTTGGGAGTCGAGGAGTGGTAATGTGTCTCGAAAATTTTGGCCACGGAGGGATAGCTTGAAGTATTTTGAATTGCTTATGGATAATGACTTGAAGAGACGGAGGC
TGAAGATCGGATCGAAGGACGAGATGCTTTTGCCTTCCGAAGGAAGCGAAGTTTTGTTCTTCGGGAACGAGTTTGATTGGTTACATTACACATGGATCGATATAGGAACA
CCAAGTGTTTCGTTTCTTGTTGCGTTGGATGTCGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTTATTATAGCGTACTGGA
TAGGGATTTGAGTGAGTACAACCCAGCTTTATCGAGCACCAGCAAGCACCTTTCTTGCGACCATCAATTATGTGCGTGGAGCGCAACTTGCAAAAGACCTGATGAGCCCT
GCACTTATAAACGAGATTATTACTCAGATAATACATCAAGTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGCGAAACGAAGTCTTGTG
CAAGCCTCGGTTGTATTAGGCTGTGGTAGGAAACAGAGTGGTGGCTATTTGGATGGGGCTGCCCCTGATGGTGTCATGGGTTTGGGTCCTGGAAACATTTCAGTGCCAAC
CTTATTGGCAAAAGCAGGATTAGTTAGAAACACGTTCTCACTTTGTTTTGATGGCAATGGTTCTGGGAGAATTCTCTTTGGGGACAATGGTTCTGCTACCCAGCAAACAA
CACGATTTTTGCCCTTATTTGGTGAATTTGATGCCTATTTTGTTGGGGTGGAATCTTTTTGTGTTGGTAGTTCCTGTCTGCAGAAAAGTGGATTCCAGGCATTGGTTGAC
AGTGGCTCATCTTTTACATATCTTCCAGCAGAAGTCTATAGAAGGATTGTCTTTGAGTTTGACAAACAAGTAAAATTAAATGCTACCAGAATAACTCTCCAGGAGTTTCC
CTGGAGTTACTGCTATAATGTCAGTTCGCTGGAGTCCGCCAATATTCCTAGCATGAAACTCGTGTTTCCTTTGAATCAAAGCTTTATACATGATCCTGTATATGTCCTCC
CTGTCAACCAAGGATATAAAATTTTCTGTTTAACTTTAGAGGAGACAGATGACGATTATGGTATAATTGGACAAAACTTGATGGTGGGTTATCGGATGGTTTTTGACAGG
GAAAATCTTAAATTGGGTTGGTCCAAGTCCAAATGCCTGGATATTAATAGCAAGTCATACAATGCCAAGCCTCCTTCAAATGACGGATCGCCAATTGCAATACCATCCAA
TGAACAAAAAAGCCCACCAAATAGGCAAGCAATTGCACCCACTGCTTCAAGAACGACGTCTTCCAAATCTTCCCCAACTGCATCCCATTTTTCTCCCTTGTTACTTTTGT
TTCCGGTTTTCCTGGTCGTTTGTTGA
mRNA sequenceShow/hide mRNA sequence
GTAACTCTCTCACGCCAATTTGGTTTTTCGTCATCAACATTCAACAACCTCTGCTCTGTTTGCAGACGGAACAATTTCATTCGCGTATTTGAAAATCCAGAACTGAGGTG
CTTTTTCCCACCCGAAGCCCGACGCTCAGTCGCCGAACATTCATCCGGCCTTCCCCGGACTTCGCCGTTCCGAAATCTGGTCTTCACCTCGACTTTCCATGGCGAATTGT
GTGGTTGTTTTGCTCTTAATGGCTTGTTTCTTAGCGGACAGCTCGGTTGCGCATACGTTCTCGTCGAAGCTCATACATCGGTTTTCCGAGGAGGCGAAAGCGCTTTGGGA
GTCGAGGAGTGGTAATGTGTCTCGAAAATTTTGGCCACGGAGGGATAGCTTGAAGTATTTTGAATTGCTTATGGATAATGACTTGAAGAGACGGAGGCTGAAGATCGGAT
CGAAGGACGAGATGCTTTTGCCTTCCGAAGGAAGCGAAGTTTTGTTCTTCGGGAACGAGTTTGATTGGTTACATTACACATGGATCGATATAGGAACACCAAGTGTTTCG
TTTCTTGTTGCGTTGGATGTCGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTTATTATAGCGTACTGGATAGGGATTTGAG
TGAGTACAACCCAGCTTTATCGAGCACCAGCAAGCACCTTTCTTGCGACCATCAATTATGTGCGTGGAGCGCAACTTGCAAAAGACCTGATGAGCCCTGCACTTATAAAC
GAGATTATTACTCAGATAATACATCAAGTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGCGAAACGAAGTCTTGTGCAAGCCTCGGTT
GTATTAGGCTGTGGTAGGAAACAGAGTGGTGGCTATTTGGATGGGGCTGCCCCTGATGGTGTCATGGGTTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCAAA
AGCAGGATTAGTTAGAAACACGTTCTCACTTTGTTTTGATGGCAATGGTTCTGGGAGAATTCTCTTTGGGGACAATGGTTCTGCTACCCAGCAAACAACACGATTTTTGC
CCTTATTTGGTGAATTTGATGCCTATTTTGTTGGGGTGGAATCTTTTTGTGTTGGTAGTTCCTGTCTGCAGAAAAGTGGATTCCAGGCATTGGTTGACAGTGGCTCATCT
TTTACATATCTTCCAGCAGAAGTCTATAGAAGGATTGTCTTTGAGTTTGACAAACAAGTAAAATTAAATGCTACCAGAATAACTCTCCAGGAGTTTCCCTGGAGTTACTG
CTATAATGTCAGTTCGCTGGAGTCCGCCAATATTCCTAGCATGAAACTCGTGTTTCCTTTGAATCAAAGCTTTATACATGATCCTGTATATGTCCTCCCTGTCAACCAAG
GATATAAAATTTTCTGTTTAACTTTAGAGGAGACAGATGACGATTATGGTATAATTGGACAAAACTTGATGGTGGGTTATCGGATGGTTTTTGACAGGGAAAATCTTAAA
TTGGGTTGGTCCAAGTCCAAATGCCTGGATATTAATAGCAAGTCATACAATGCCAAGCCTCCTTCAAATGACGGATCGCCAATTGCAATACCATCCAATGAACAAAAAAG
CCCACCAAATAGGCAAGCAATTGCACCCACTGCTTCAAGAACGACGTCTTCCAAATCTTCCCCAACTGCATCCCATTTTTCTCCCTTGTTACTTTTGTTTCCGGTTTTCC
TGGTCGTTTGTTGAGTTTCTCGAGTTCATATTGTACATGCCGCCTTTCATGACTGGGGTTTCAACTTTCAATTACTAAATAGGTTATCCGTTCCTGTAAGTTATTTATCA
TATAGGGAAAAATAAGTTGCTTTTTTTTTTTTGGGTATCTATTAACGGATATATATTTTGTAGATATCTCTTACAATTGATCATGCATAAGTACAACTGCCAGTTCAATG
TGTAATTCTTTCTATGCTCCTCCATCAACCAAATTATTTGAAGAAGTGCCTGGTTTAGACATTGTTAACAAATAACCGAAGTTCTGTAAAACTATATTGGAAAGGTTTAG
GTTTTGTGTACGTCAATAATTTCAGTACATGCAAGGAAAAGAAACTTGGGTGGCGACTGACCAGCTTCCGCACCTCTCGTACTACTTACTATGCTGCAATATTACCAGAC
AGGTTACATGAAGAGGAGCATTTTCGGATAAATTGATTGATTAGCTCTATATGGGCTAGTGCCTTGGTCAGATGAGTCATGCAGGGTGACTCCAAGGTAAATAGACCTCC
CGCAATCAATGAAATATGAGTTGTGAAAGAAAAATTAATATATTATACATTGCCCAAAAACATTGGATAGCTCTTCACTCAAGTGGTTTAACCTTTTAGATCAATTTGAT
TTAGAAAAGAAATGGGAAGATGAAATGAGTTGTTCAAGTCACAAAATTTTAATTCGTAATGCAAAAAGAATGTGATTAGGAGGTTAATCGTATTAAGATGAAGCAAAGCA
ACCTAATGAAGTTGAACTGTGACTGATACCACAGGAGGAGACCACGATCAAGTGGGTGAATTCATGCCAAGGTTTGGCCTTTTCTTTTTCTGGCACTTGCTTTGGGAGAA
AGTTGTACTAAGCTAGTGGTCTACTTTCTTTTTCCCAGAAATATTAAGTTTTTGTTATTGGTGCAAATCTTTTCTTATATTTAAGATACGAAATTGTCTGTTTACTTTTA
AATTTTATTCAAAAG
Protein sequenceShow/hide protein sequence
MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKYFELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWIDIGT
PSVSFLVALDVGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRPDEPCTYKRDYYSDNTSSSGFMIEDKLHLASFSKHAKRSLV
QASVVLGCGRKQSGGYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLFGEFDAYFVGVESFCVGSSCLQKSGFQALVD
SGSSFTYLPAEVYRRIVFEFDKQVKLNATRITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPVYVLPVNQGYKIFCLTLEETDDDYGIIGQNLMVGYRMVFDR
ENLKLGWSKSKCLDINSKSYNAKPPSNDGSPIAIPSNEQKSPPNRQAIAPTASRTTSSKSSPTASHFSPLLLLFPVFLVVC