CuGenDBv2

MC03g0574 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0574
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: MC03:12495774..12497333
RNA-Seq Expression: MC03g0574
Synteny: MC03g0574
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0005576 - extracellular region (cellular component)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR033121 - Peptidase family A1 domain
  IPR034161 - Pepsin-like domain, plant


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus]  e-value: 2.10e-297  % identity: 79.73
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+ + ++K L+ FLLL+ +   ++T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M++  G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGV---SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS
        TYGASGVVTG+LT+DV+  HG    + N+  QIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Subjt:  TYGASGVVTGTLTKDVILMHGV---SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS

Query:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL
        KD +LQFTPLLKSP+YPNYYYIGLESITIG+G  +NN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE V+ YPRAKQVE+NTGFDL
Subjt:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL

Query:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ
        CYK+PCKNN   SS +DD     LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN+EVVYDL+
Subjt:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ

Query:  KERIGFQPMDCASSAASQGLHKN
        KER+GFQPMDC S AA QGLHKN
Subjt:  KERIGFQPMDCASSAASQGLHKN

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]  e-value: 3.63e-300  % identity: 79.7
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M+++ G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL
        TYGASGVVTG+LT+DV+ MHG        + N+  Q+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Subjt:  TYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL

Query:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT
        AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+G  NNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NT
Subjt:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT

Query:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV
        GFDLCYK+PCKNN   SS +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN++VV
Subjt:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV

Query:  YDLQKERIGFQPMDCASSAASQGLHKN
        YDL+KER+GFQ MDC S AA+QGLHKN
Subjt:  YDLQKERIGFQPMDCASSAASQGLHKN

XP_022142611.1 probable aspartyl protease At4g16563 [Momordica charantia]  e-value: 0.0  % identity: 99.62
Query:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGT
        SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGT
Subjt:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH
        TYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH
Subjt:  TYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH

Query:  SLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYK
        SLQFTPLLKSPLYPNYYYIGLES+TIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYK
Subjt:  SLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYK

Query:  IPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKER
        IPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKER
Subjt:  IPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKER

Query:  IGFQPMDCASSAASQGLHKN
        IGFQ MDCASSAASQGLHKN
Subjt:  IGFQPMDCASSAASQGLHKN

XP_023520027.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]  e-value: 3.01e-292  % identity: 80.27
Query:  FFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQV
        FF+L+++L+ VS     +T A P +  F   DSLVLGLVHSRTSLLTPKRGY S    S    KPME +G+D+VIEPLREIRDGYL+SLTLGTPPQVIQV
Subjt:  FFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQV

Query:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGV
        YMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG+
Subjt:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGV

Query:  VTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPL
        V GTLTKDVI +HG SPNS+ +IP+FCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISSK+H L+FTPL
Subjt:  VTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPL

Query:  LKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNT
        LKSP YPNYYYIGLESITIG+G   N SRFGVSL+LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+++YPRAK+ E+NTGFDLCYK+P KNNT
Subjt:  LKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNT

Query:  NFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMD
         FS   +      LPSITFHFLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMD         GD DGPAGIFGSFQQQN+EVVYDL+KER+GF+ MD
Subjt:  NFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMD

Query:  CASSAASQGLHK
        CAS A SQGLHK
Subjt:  CASSAASQGLHK

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida]  e-value: 2.57e-296  % identity: 80.8
Query:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGY--FSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL
        S +T+ + K+L++FLLL+ +    +T A   + N P  DSLV+GLVHSRT+LLTPK+GY   SRK       K ME    DNVIEPLREIRDGYL+SLTL
Subjt:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGY--FSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL

Query:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
        GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSF
Subjt:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF

Query:  AYTYGASGVVTGTLTKDVILMHGV---SPNSTTQ-IPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLA
        AYTYGASGVV GTLT+DV+LMH     SPNS+T+  PRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  AYTYGASGVVTGTLTKDVILMHGV---SPNSTTQ-IPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLA

Query:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG
        +SSKD  LQFTPLLKSP+YPNYYYIGLESITIG+G  N+N RFGVS  LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTG
Subjt:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG

Query:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY
        FDLCYK+PCKNN NFS  +DD   + LPSITFHFLNNVSVVLPQ NNFYAMAAP NSTVVKCLLFQSMDG GGD D   D DGPAGIFGSFQQQN+EVVY
Subjt:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY

Query:  DLQKERIGFQPMDCASSAASQGLHKN
        DL+KER+GFQPMDCA  AA+QGLHKN
Subjt:  DLQKERIGFQPMDCASSAASQGLHKN

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0LYP0 Peptidase A1 domain-containing protein  e-value: 1.02e-297  % identity: 79.73
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+ + ++K L+ FLLL+ +   ++T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M++  G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGV---SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS
        TYGASGVVTG+LT+DV+  HG    + N+  QIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Subjt:  TYGASGVVTGTLTKDVILMHGV---SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS

Query:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL
        KD +LQFTPLLKSP+YPNYYYIGLESITIG+G  +NN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE V+ YPRAKQVE+NTGFDL
Subjt:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL

Query:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ
        CYK+PCKNN   SS +DD     LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN+EVVYDL+
Subjt:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ

Query:  KERIGFQPMDCASSAASQGLHKN
        KER+GFQPMDC S AA QGLHKN
Subjt:  KERIGFQPMDCASSAASQGLHKN

A0A1S3CAK9 aspartic proteinase nepenthesin-2  e-value: 1.76e-300  % identity: 79.7
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M+++ G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL
        TYGASGVVTG+LT+DV+ MHG        + N+  Q+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Subjt:  TYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL

Query:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT
        AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+G  NNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NT
Subjt:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT

Query:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV
        GFDLCYK+PCKNN   SS +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN++VV
Subjt:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV

Query:  YDLQKERIGFQPMDCASSAASQGLHKN
        YDL+KER+GFQ MDC S AA+QGLHKN
Subjt:  YDLQKERIGFQPMDCASSAASQGLHKN

A0A5A7TNC9 Aspartic proteinase nepenthesin-2  e-value: 1.76e-300  % identity: 79.7
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M+++ G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL
        TYGASGVVTG+LT+DV+ MHG        + N+  Q+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Subjt:  TYGASGVVTGTLTKDVILMHGV-------SPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL

Query:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT
        AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+G  NNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NT
Subjt:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT

Query:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV
        GFDLCYK+PCKNN   SS +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN++VV
Subjt:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV

Query:  YDLQKERIGFQPMDCASSAASQGLHKN
        YDL+KER+GFQ MDC S AA+QGLHKN
Subjt:  YDLQKERIGFQPMDCASSAASQGLHKN

A0A6J1CMP8 probable aspartyl protease At4g16563  e-value: 0.0  % identity: 99.62
Query:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGT
        SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGT
Subjt:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH
        TYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH
Subjt:  TYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDH

Query:  SLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYK
        SLQFTPLLKSPLYPNYYYIGLES+TIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYK
Subjt:  SLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYK

Query:  IPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKER
        IPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKER
Subjt:  IPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKER

Query:  IGFQPMDCASSAASQGLHKN
        IGFQ MDCASSAASQGLHKN
Subjt:  IGFQPMDCASSAASQGLHKN

A0A6J1EHM1 probable aspartyl protease At4g16563  e-value: 4.16e-292  % identity: 80.04
Query:  FFLLLILLLSVSETAARPHRNNFPNT----DSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVY
        F L+L+L+L   E   +   N  P T    DSLVLGLVHSRTSLLTPKRGY S         KPME +G+D+VIEPLREIRDGYL+SLTLGTPPQVIQVY
Subjt:  FFLLLILLLSVSETAARPHRNNFPNT----DSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVY

Query:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVV
        MDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG+V
Subjt:  MDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVV

Query:  TGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLL
         GTLTKDVI +HG SPNS+ +IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISSK+H L+FTP L
Subjt:  TGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLL

Query:  KSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTN
        KSP YPNYYYIGLESITIG+G   N SRFGVSL+LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+++YPRAK+ E+NTGFDLCYK+P KNNT 
Subjt:  KSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTN

Query:  FSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC
        FS   +      LPSITFHFLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMD         GD DGPAGIFGSFQQQN+EVVYDL+KER+GF+ MDC
Subjt:  FSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC

Query:  ASSAASQGLHK
        AS A SQGLHK
Subjt:  ASSAASQGLHK

SwissProt top hits (columns: hit, e-value, % identity, alignment)
O04496 Aspartyl protease AED3  e-value: 1.2e-28  % identity: 26.73
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG-T
        Y++   LGTPPQ++ + +DT +D  W+PC      C  C    +N S      F+   SST    +C ++ C                     T  +G T
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG-T

Query:  CPRPCP-----SFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQLGFSHKG-FSHCFLPFKFSNN
        CP   P     SF  +YG     + +L +D + +      +   IP F FGC+ +       P G+ G GRG +SL SQ    + G FS+C         
Subjt:  CPRPCP-----SFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQLGFSHKG-FSHCFLPFKFSNN

Query:  PNFSSPLILGSLAIS--SKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES
        P+F S    GSL +    +  S+++TPLL++P  P+ YY+ L  +++G      + +  V       D     G +IDSGT  T   +P+Y  +      
Subjt:  PNFSSPLILGSLAIS--SKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES

Query:  VVTYPRAKQVEINT-----GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGD
               KQV +++      FD C+            S D+   N+ P IT H + ++ + LP  N     +A T    + CL   SM G         +
Subjt:  VVTYPRAKQVEINT-----GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGD

Query:  DDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC
         +    +  + QQQN+ +++D+   RIG  P  C
Subjt:  DDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC

Q766C2 Aspartic proteinase nepenthesin-2  e-value: 1.0e-32  % identity: 28.29
Query:  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIH
        SIN  ++   S  +  P+      YL+++ +GTP       MDTGSDL W  C      C  C      +       F+P  SS+     C S +C D+ 
Subjt:  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIH

Query:  SSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLG
            P + C    C                + Y YG      G +  +           T+ +P   FGC     G       G+ G G G LSLPSQLG
Subjt:  SSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLG

Query:  FSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITI-GDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTY
             FS+C   +  S+     S L LGS A S        T L+ S L P YYYI L+ IT+ GD +G  +S F       ++   G GGM+IDSGTT 
Subjt:  FSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITI-GDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTY

Query:  THLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSM
        T+LP+  Y+ +       +  P     E ++G   C++ P   +T             +P I+  F   V  +  Q      + +P    +  CL   S 
Subjt:  THLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSM

Query:  DGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASS
           G              IFG+ QQQ  +V+YDLQ   + F P  C +S
Subjt:  DGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASS

Q766C3 Aspartic proteinase nepenthesin-1  e-value: 8.7e-32  % identity: 29.65
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        YL++L++GTP Q     MDTGSDL W         CQ C +  N         F+P  SS+     C S  C  + S                     TC
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
              + Y YG      G++  + +    VS      IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S   N    
Subjt:  PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGD-GIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRA
        L+LGSLA S    S   T L++S   P +YYI L  +++G   +  + S F ++         G GG++IDSGTT T+     Y  +     S +  P  
Subjt:  LILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGD-GIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRA

Query:  KQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQ
             ++GFDLC++ P            D  +  +P+   HF +   + LP  N F    +P+N  +  CL   S   G               IFG+ Q
Subjt:  KQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQ

Query:  QQNMEVVYDLQKERIGFQPMDCASS
        QQNM VVYD     + F    C +S
Subjt:  QQNMEVVYDLQKERIGFQPMDCASS

Q940R4 Probable aspartyl protease At4g16563  e-value: 1.1e-53  % identity: 34.51
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLISL++G+    + +Y+DTGSDL W PC    F C  CE    +   P   P S + S+T++  +C S  C   HSS    D C I+ C L  +  G C
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN
             PCP F Y YG  G +   L  D + +  VS      +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+   
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN

Query:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL
          SPLILG                        K +   FT +L++P +P +Y + L+ I+IG               LR ID  G GG+++DSGTT+T L
Subjt:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL

Query:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL
        P   Y+ ++   +S V   + RA +VE ++G   CY +   N T             +P++  HF  N  SV LP+ N FY              + CL+
Subjt:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL

Query:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS
          +   GG + +  G   G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS

Q9LNJ3 Aspartyl protease family protein 2  e-value: 1.9e-31  % identity: 27.99
Query:  SDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCT
        S +V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S         
Subjt:  SDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCT

Query:  IAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG--FSHK
         AGC+     + TC      +  +YG      G  + + +           ++     GC        VGA      G+ G G+G LS P Q G  F+ K
Subjt:  IAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG--FSHK

Query:  GFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPE
         FS+C +    S+ P   S ++ G+ A+S      +FTPLL +P    +YY+GL  I++G          GV+  L ++D  GNGG++IDSGT+ T L  
Subjt:  GFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPE

Query:  PLYSQLISNLE-SVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGG
        P Y  +         T  RA    +   FD C+ +   N               +P++  HF     V LP  N  Y +   TN     C  F    GG 
Subjt:  PLYSQLISNLE-SVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGG

Query:  GDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCA
                      I G+ QQQ   VVYDL   R+GF P  CA
Subjt:  GDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCA

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT2G03200.1 Eukaryotic aspartyl protease family protein  e-value: 2.5e-34  % identity: 28.73
Query:  SATTISSKVLTFFLLLI-LLLSVSETAA----RPHRNNFPNTDSLVLGLVH-----SRTSLLTPKRGY------FSRKGSSSSINKPMEEIGSDNVIEPL
        ++++ SS +  FFL+L   L+SVS +      R    N P +    L L H     + T +   +RG        +R G+ + +    +   ++N+  P 
Subjt:  SATTISSKVLTFFLLLI-LLLSVSETAA----RPHRNNFPNTDSLVLGLVH-----SRTSLLTPKRGY------FSRKGSSSSINKPMEEIGSDNVIEPL

Query:  REIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAT
              +L+ L++G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  +  S+   D           
Subjt:  REIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAT

Query:  LVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN
          K  C      + YTYG      G L  +         NS + I    FGC     G  + +  G+ G GRG LSL SQL      FS+C    + S  
Subjt:  LVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN

Query:  PNFSSPLILGSLAI-------SSKDHSLQFT-PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQL
           SS L +GSLA        +S D  +  T  LL++P  P++YY+ L+ IT+G        R  V     E+   G GGM+IDSGTT T+L E  +  L
Subjt:  PNFSSPLILGSLAI-------SSKDHSLQFT-PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQL

Query:  ISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL-LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGN
             S ++ P       +TG DLC+K+P            D   N+ +P + FHF     + LP  N   A     +ST V CL   S +G        
Subjt:  ISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL-LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGN

Query:  GDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC
                IFG+ QQQN  V++DL+KE + F P +C
Subjt:  GDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC

AT3G25700.1 Eukaryotic aspartyl protease family protein  e-value: 5.9e-36  % identity: 27.77
Query:  LTFFL---LLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLT--PKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVI
        L FFL   L + LL  S  AA  + N +     L      S T  L    +R +F      S   KP+  + S  V+         Y + L +G PPQ +
Subjt:  LTFFL---LLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLT--PKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVI

Query:  QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGAS
         +  DTGSDL WV C      C++C  +           F P HSST     C    C  +   D        A     T +  TC      + Y Y   
Subjt:  QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGAS

Query:  GVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGSLA
         + +G   ++   +   S     ++    FGC          G ++    G+ G GRG +S  SQLG  F +K FS+C + +  S  P  +S LI+G+  
Subjt:  GVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGSLA

Query:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG
               L FTPLL +PL P +YY+ L+S+ +      N ++  +   + EID  GNGG ++DSGTT   L EP Y  +I+ +   V  P A    +  G
Subjt:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG

Query:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY
        FDLC  +                  +LP + F F      V P  N F           ++CL  QS+D   G             + G+  QQ     +
Subjt:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY

Query:  DLQKERIGFQPMDCA
        D  + R+GF    CA
Subjt:  DLQKERIGFQPMDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein  e-value: 5.0e-43  % identity: 29.66
Query:  ISSKVLTFFLLLILLLSVSETAARP--HRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRD--------GYLIS
        ++S +  FFL+ + ++S  +    P  H +  P    L L     R +  +  R +  + G+S    KP E+  S         ++         GY +S
Subjt:  ISSKVLTFFLLLILLLSVSETAARP--HRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRD--------GYLIS

Query:  LTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSG--PKLAP-FSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCP
        L+ GTP Q I    DTGS L W+PC +  + C  C+      SG  P L P F P +SS+S    C S  C  ++    P   C   GC   T     C 
Subjt:  LTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSG--PKLAP-FSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCP

Query:  RPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGS
          CP +   YG  G   G L  + +      P+ T  +P F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N ++ L L +
Subjt:  RPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGS

Query:  LA---ISSKDHSLQFTPLLKSPLYPN-----YYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVT-Y
         +     SK   L +TP  K+P   N     YYY+ L  I +G           +  K     T G+GG ++DSG+T+T +  P++  +     S ++ Y
Subjt:  LA---ISSKDHSLQFTPLLKSPLYPN-----YYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVT-Y

Query:  PRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFG
         R K +E  TG   C+ I  K +              +P + F F     + LP  +N++     T++  +  +  ++++  GG         GPA I G
Subjt:  PRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFG

Query:  SFQQQNMEVVYDLQKERIGFQPMDCA
        SFQQQN  V YDL+ +R GF    C+
Subjt:  SFQQQNMEVVYDLQKERIGFQPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein (e value: 7.5e-55; %identity: 34.51)
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLISL++G+    + +Y+DTGSDL W PC    F C  CE    +   P   P S + S+T++  +C S  C   HSS    D C I+ C L  +  G C
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN
             PCP F Y YG  G +   L  D + +  VS      +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+   
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN

Query:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL
          SPLILG                        K +   FT +L++P +P +Y + L+ I+IG               LR ID  G GG+++DSGTT+T L
Subjt:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL

Query:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL
        P   Y+ ++   +S V   + RA +VE ++G   CY +   N T             +P++  HF  N  SV LP+ N FY              + CL+
Subjt:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL

Query:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS
          +   GG + +  G   G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS

AT5G45120.1 Eukaryotic aspartyl protease family protein (e value: 5.0e-176; %identity: 60.7)
Query:  VLTFFLLLILLL-SVSETAARPHRNNFPNTDS-LVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQV
        VL  FLL+ LLL + ++T AR H+N   ++ S LVL L  S  SL TPK        +   I KP+  +  D V+EPLRE+RDGYLI+L +GTPPQ +QV
Subjt:  VLTFFLLLILLL-SVSETAARPHRNNFPNTDS-LVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQV

Query:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNN-VSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG
        Y+DTGSDLTWVPCGNLSFDC +C + +NN +  P +  FSP HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPSFAYTYG  G
Subjt:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNN-VSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG

Query:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAIS-SKDHSLQFT
        +++G LT+D++         T  +PRF FGCV +TYREPIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG+ A+S +   SLQFT
Subjt:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAIS-SKDHSLQFT

Query:  PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKN
        P+L +P+YPN YYIGLESITIG  I        V L LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ L+S +TYPRA + E  TGFDLCYK+PC N
Subjt:  PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKN

Query:  NTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQP
        N    +S+++    + PSITFHFLNN +++LPQGN+FYAM+AP++ +VV+CLLFQ+M+ G         D GPAG+FGSFQQQN++VVYDL+KERIGFQ 
Subjt:  NTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQP

Query:  MDCASSAASQGLHK
        MDC   AAS GL++
Subjt:  MDCASSAASQGLHK


Sequences
CDS sequence
TCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTTCCCCAA
CACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAAGGATCATCATCATCAATCAACAAGCCAATGG
AGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTATATGGAC
ACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTTTTCCCC
AACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTTCCCTTG
CTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTGATGCAT
GGAGTGTCCCCAAATTCCACTACCCAAATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATTGCTCTC
TCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGAGCCTTG
CCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGATGGTATT
GGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTTGCCAGA
ACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCCCTTGTA
AAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAAGGGAAC
AATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAATGGGGACGACGACGG
GCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTGCTGCCT
CTCAAGGACTCCACAAGAAT
mRNA sequence
TCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTTCCCCAA
CACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAAGGATCATCATCATCAATCAACAAGCCAATGG
AGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTATATGGAC
ACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTTTTCCCC
AACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTTCCCTTG
CTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTGATGCAT
GGAGTGTCCCCAAATTCCACTACCCAAATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATTGCTCTC
TCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGAGCCTTG
CCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGATGGTATT
GGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTTGCCAGA
ACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCCCTTGTA
AAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAAGGGAAC
AATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAATGGGGACGACGACGG
GCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTGCTGCCT
CTCAAGGACTCCACAAGAAT
Protein sequence
SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMD
TGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMH
GVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGI
GNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGN
NFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKN