CuGenDBv2

MS013369 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013369
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: scaffold402:700256..701824
RNA-Seq Expression: MS013369
Synteny: MS013369
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology
GenBank top hits    e value    % identity    Alignment
XP_004145478.2 probable aspartyl protease At4g16563 [Cucumis sativus]    4.5e-235    79.73
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+ + ++K L+ FLLL+ +   ++T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M++  G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHG---VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS
        TYGASGVVTG+LT+DV+  HG    + N+  QIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Subjt:  TYGASGVVTGTLTKDVILMHG---VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS

Query:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL
        KD +LQFTPLLKSP+YPNYYYIGLESITIG+  G+NN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE V+ YPRAKQVE+NTGFDL
Subjt:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL

Query:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ
        CYK+PCKNN   SS +DD     LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN+EVVYDL+
Subjt:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ

Query:  KERIGFQPMDCASSAASQGLHKN
        KER+GFQPMDC S AA QGLHKN
Subjt:  KERIGFQPMDCASSAASQGLHKN

XP_008459091.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo]    2.8e-237    79.7
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M+++ G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHG-------VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL
        TYGASGVVTG+LT+DV+ MHG        + N+  Q+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Subjt:  TYGASGVVTGTLTKDVILMHG-------VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL

Query:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT
        AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+  GNNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NT
Subjt:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT

Query:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV
        GFDLCYK+PCKNN   SS +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN++VV
Subjt:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV

Query:  YDLQKERIGFQPMDCASSAASQGLHKN
        YDL+KER+GFQ MDC S AA+QGLHKN
Subjt:  YDLQKERIGFQPMDCASSAASQGLHKN

XP_022142611.1 probable aspartyl protease At4g16563 [Momordica charantia]    5.7e-307    99.62
Query:  TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL
        TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL
Subjt:  TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL

Query:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
        GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
Subjt:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF

Query:  AYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSK
        AYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSK
Subjt:  AYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSK

Query:  DHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLC
        DHSLQFTPLLKSPLYPNYYYIGLES+TIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLC
Subjt:  DHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLC

Query:  YKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQK
        YKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQK
Subjt:  YKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQK

Query:  ERIGFQPMDCASSAASQGLHKNF
        ERIGFQ MDCASSAASQGLHKNF
Subjt:  ERIGFQPMDCASSAASQGLHKNF

XP_023520027.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]    5.1e-231    79.92
Query:  TFFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQ
        +FF+L+++L+ VS     +T A P        DSLVLGLVHSRTSLLTPKRGY S    S    KPM E+G+D+VIEPLREIRDGYL+SLTLGTPPQVIQ
Subjt:  TFFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQ

Query:  VYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG
        VYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG
Subjt:  VYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG

Query:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTP
        +V GTLTKDVI +HG SPNS+ +IP+FCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISSK+H L+FTP
Subjt:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTP

Query:  LLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNN
        LLKSP YPNYYYIGLESITIG+  G N SRFGVSL+LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+++YPRAK+ E+NTGFDLCYK+P KNN
Subjt:  LLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNN

Query:  TNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPM
        T FS   +      LPSITFHFLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMD         GD DGPAGIFGSFQQQN+EVVYDL+KER+GF+ M
Subjt:  TNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPM

Query:  DCASSAASQGLHK
        DCAS A SQGLHK
Subjt:  DCASSAASQGLHK

XP_038893627.1 probable aspartyl protease At4g16563 [Benincasa hispida]    2.9e-234    80.8
Query:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGY--FSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL
        S +T+ + K+L++FLLL+ +    +T A   + N P  DSLV+GLVHSRT+LLTPK+GY   SRK       K ME    DNVIEPLREIRDGYL+SLTL
Subjt:  SSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGY--FSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL

Query:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
        GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVK TCPRPCPSF
Subjt:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF

Query:  AYTYGASGVVTGTLTKDVILMH---GVSPNSTT-QIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLA
        AYTYGASGVV GTLT+DV+LMH     SPNS+T + PRFCFGCVGA+YREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LA
Subjt:  AYTYGASGVVTGTLTKDVILMH---GVSPNSTT-QIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLA

Query:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG
        +SSKD  LQFTPLLKSP+YPNYYYIGLESITIG+  GN+N RFGVS  LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NTG
Subjt:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG

Query:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY
        FDLCYK+PCKNN NF S +DD   + LPSITFHFLNNVSVVLPQ NNFYAMAAP NSTVVKCLLFQSMDG GGD D   D DGPAGIFGSFQQQN+EVVY
Subjt:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY

Query:  DLQKERIGFQPMDCASSAASQGLHKN
        DL+KER+GFQPMDCA  AA+QGLHKN
Subjt:  DLQKERIGFQPMDCASSAASQGLHKN

TrEMBL top hits    e value    % identity    Alignment
A0A0A0LYP0 Peptidase A1 domain-containing protein    2.2e-235    79.73
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+ + ++K L+ FLLL+ +   ++T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M++  G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGP+LA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHG---VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS
        TYGASGVVTG+LT+DV+  HG    + N+  QIPRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISS
Subjt:  TYGASGVVTGTLTKDVILMHG---VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISS

Query:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL
        KD +LQFTPLLKSP+YPNYYYIGLESITIG+  G+NN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE V+ YPRAKQVE+NTGFDL
Subjt:  KDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDL

Query:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ
        CYK+PCKNN   SS +DD     LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN+EVVYDL+
Subjt:  CYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQ

Query:  KERIGFQPMDCASSAASQGLHKN
        KER+GFQPMDC S AA QGLHKN
Subjt:  KERIGFQPMDCASSAASQGLHKN

A0A1S3CAK9 aspartic proteinase nepenthesin-2    1.4e-237    79.7
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M+++ G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHG-------VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL
        TYGASGVVTG+LT+DV+ MHG        + N+  Q+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Subjt:  TYGASGVVTGTLTKDVILMHG-------VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL

Query:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT
        AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+  GNNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NT
Subjt:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT

Query:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV
        GFDLCYK+PCKNN   SS +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN++VV
Subjt:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV

Query:  YDLQKERIGFQPMDCASSAASQGLHKN
        YDL+KER+GFQ MDC S AA+QGLHKN
Subjt:  YDLQKERIGFQPMDCASSAASQGLHKN

A0A5A7TNC9 Aspartic proteinase nepenthesin-2    1.4e-237    79.7
Query:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT
        S+T+I++K L+ FLLL+   +  +T A   + NFP  DSLVLGLVHSRTSLLTPK+GY      S    K M+++ G DNVIEPLREIRDGYL+SL++GT
Subjt:  SATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEI-GSDNVIEPLREIRDGYLISLTLGT

Query:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
        PPQV+QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNN+SGPKLA F PTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY
Subjt:  PPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAY

Query:  TYGASGVVTGTLTKDVILMHG-------VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL
        TYGASGVVTG+LT+DV+ MHG        + N+  Q+PRFCFGCVGATYREPIGIAGFGRGLLSLP QLGFS KGFSHCFLPFKFSNNPNFSSPLILG L
Subjt:  TYGASGVVTGTLTKDVILMHG-------VSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSL

Query:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT
        AISSKD +LQFTPLLKSP+YPNYYYIGLESITIG+  GNNN RFGVS KLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESV++YPRAKQVE+NT
Subjt:  AISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINT

Query:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV
        GFDLCYK+PCKNN   SS +DD   + LPSITFHFLNNVSVVLPQGNNFYAMAAP NSTVVKCLL+QSMDG G D D   DD+GPAGIFGSFQQQN++VV
Subjt:  GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVV

Query:  YDLQKERIGFQPMDCASSAASQGLHKN
        YDL+KER+GFQ MDC S AA+QGLHKN
Subjt:  YDLQKERIGFQPMDCASSAASQGLHKN

A0A6J1CMP8 probable aspartyl protease At4g16563    2.8e-307    99.62
Query:  TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL
        TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL
Subjt:  TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTL

Query:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
        GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF
Subjt:  GTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF

Query:  AYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSK
        AYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSK
Subjt:  AYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSK

Query:  DHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLC
        DHSLQFTPLLKSPLYPNYYYIGLES+TIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLC
Subjt:  DHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLC

Query:  YKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQK
        YKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQK
Subjt:  YKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQK

Query:  ERIGFQPMDCASSAASQGLHKNF
        ERIGFQ MDCASSAASQGLHKNF
Subjt:  ERIGFQPMDCASSAASQGLHKNF

A0A6J1EHM1 probable aspartyl protease At4g16563    7.2e-231    79.53
Query:  TFFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQ
        ++F+L+++L+ VS     +T A P        DSLVLGLVHSRTSLLTPKRGY S     +   KPM E+G+D+VIEPLREIRDGYL+SLTLGTPPQVIQ
Subjt:  TFFLLLILLLSVS-----ETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQ

Query:  VYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG
        VYMDTGSDLTWVPCGNLSFDCQDC+EYQNNV GPKLA F PTHSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG
Subjt:  VYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG

Query:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTP
        +V GTLTKDVI +HG SPNS+ +IP+FCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILG+LAISSK+H L+FTP
Subjt:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTP

Query:  LLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNN
         LKSP YPNYYYIGLESITIG+  G N SRFGVSL+LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+++YPRAK+ E+NTGFDLCYK+P KNN
Subjt:  LLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNN

Query:  TNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPM
        T FS   +      LPSITFHFLNNVSVVLPQGN+FYAMAAP+NSTVVKCLLFQSMD         GD DGPAGIFGSFQQQN+EVVYDL+KER+GF+ M
Subjt:  TNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPM

Query:  DCASSAASQGLHK
        DCAS A SQGLHK
Subjt:  DCASSAASQGLHK

SwissProt top hits    e value    % identity    Alignment
O04496 Aspartyl protease AED3    1.2e-28    26.73
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG-T
        Y++   LGTPPQ++ + +DT +D  W+PC      C  C    +N S      F+   SST    +C ++ C                     T  +G T
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKG-T

Query:  CPRPCP-----SFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQLGFSHKG-FSHCFLPFKFSNN
        CP   P     SF  +YG     + +L +D + +      +   IP F FGC+ +       P G+ G GRG +SL SQ    + G FS+C         
Subjt:  CPRPCP-----SFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYRE---PIGIAGFGRGLLSLPSQLGFSHKG-FSHCFLPFKFSNN

Query:  PNFSSPLILGSLAIS--SKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES
        P+F S    GSL +    +  S+++TPLL++P  P+ YY+ L  +++G      + +  V       D     G +IDSGT  T   +P+Y  +      
Subjt:  PNFSSPLILGSLAIS--SKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLES

Query:  VVTYPRAKQVEINT-----GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGD
               KQV +++      FD C+            S D+   N+ P IT H + ++ + LP  N     +A T    + CL   SM G         +
Subjt:  VVTYPRAKQVEINT-----GFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGD

Query:  DDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC
         +    +  + QQQN+ +++D+   RIG  P  C
Subjt:  DDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC

Q766C2 Aspartic proteinase nepenthesin-2    1.0e-32    28.29
Query:  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIH
        SIN  ++   S  +  P+      YL+++ +GTP       MDTGSDL W  C      C  C      +       F+P  SS+     C S +C D+ 
Subjt:  SINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIH

Query:  SSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLG
            P + C    C                + Y YG      G +  +           T+ +P   FGC     G       G+ G G G LSLPSQLG
Subjt:  SSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLG

Query:  FSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITI-GDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTY
             FS+C   +  S+     S L LGS A S        T L+ S L P YYYI L+ IT+ GD +G  +S F       ++   G GGM+IDSGTT 
Subjt:  FSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITI-GDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTY

Query:  THLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSM
        T+LP+  Y+ +       +  P     E ++G   C++ P   +T             +P I+  F   V  +  Q      + +P    +  CL   S 
Subjt:  THLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSM

Query:  DGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASS
           G              IFG+ QQQ  +V+YDLQ   + F P  C +S
Subjt:  DGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASS

Q766C3 Aspartic proteinase nepenthesin-1    8.7e-32    29.65
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        YL++L++GTP Q     MDTGSDL W         CQ C +  N         F+P  SS+     C S  C  + S                     TC
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP
              + Y YG      G++  + +    VS      IP   FGC     G       G+ G GRG LSLPSQL  +   FS+C  P   S   N    
Subjt:  PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSP

Query:  LILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGD-GIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRA
        L+LGSLA S    S   T L++S   P +YYI L  +++G   +  + S F ++         G GG++IDSGTT T+     Y  +     S +  P  
Subjt:  LILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGD-GIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRA

Query:  KQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQ
             ++GFDLC++ P            D  +  +P+   HF +   + LP  N F    +P+N  +  CL   S   G               IFG+ Q
Subjt:  KQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQ

Query:  QQNMEVVYDLQKERIGFQPMDCASS
        QQNM VVYD     + F    C +S
Subjt:  QQNMEVVYDLQKERIGFQPMDCASS

Q940R4 Probable aspartyl protease At4g16563    1.1e-53    34.51
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLISL++G+    + +Y+DTGSDL W PC    F C  CE    +   P   P S + S+T++  +C S  C   HSS    D C I+ C L  +  G C
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN
             PCP F Y YG  G +   L  D + +  VS      +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+   
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN

Query:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL
          SPLILG                        K +   FT +L++P +P +Y + L+ I+IG               LR ID  G GG+++DSGTT+T L
Subjt:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL

Query:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL
        P   Y+ ++   +S V   + RA +VE ++G   CY +   N T             +P++  HF  N  SV LP+ N FY              + CL+
Subjt:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL

Query:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS
          +   GG + +  G   G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS

Q9LNJ3 Aspartyl protease family protein 2    1.9e-31    27.99
Query:  SDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCT
        S +V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +       F P  S T     C S  C  + S         
Subjt:  SDNVIEPLREIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCT

Query:  IAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG--FSHK
         AGC+     + TC      +  +YG      G  + + +           ++     GC        VGA      G+ G G+G LS P Q G  F+ K
Subjt:  IAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC--------VGATYREPIGIAGFGRGLLSLPSQLG--FSHK

Query:  GFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPE
         FS+C +    S+ P   S ++ G+ A+S      +FTPLL +P    +YY+GL  I++G          GV+  L ++D  GNGG++IDSGT+ T L  
Subjt:  GFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPE

Query:  PLYSQLISNLE-SVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGG
        P Y  +         T  RA    +   FD C+ +   N               +P++  HF     V LP  N  Y +   TN     C  F    GG 
Subjt:  PLYSQLISNLE-SVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGG

Query:  GDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCA
                      I G+ QQQ   VVYDL   R+GF P  CA
Subjt:  GDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCA

Arabidopsis top hits    e value    % identity    Alignment
AT2G03200.1 Eukaryotic aspartyl protease family protein    2.5e-34    28.73
Query:  SATTISSKVLTFFLLLI-LLLSVSETAA----RPHRNNFPNTDSLVLGLVH-----SRTSLLTPKRGY------FSRKGSSSSINKPMEEIGSDNVIEPL
        ++++ SS +  FFL+L   L+SVS +      R    N P +    L L H     + T +   +RG        +R G+ + +    +   ++N+  P 
Subjt:  SATTISSKVLTFFLLLI-LLLSVSETAA----RPHRNNFPNTDSLVLGLVH-----SRTSLLTPKRGY------FSRKGSSSSINKPMEEIGSDNVIEPL

Query:  REIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAT
              +L+ L++G P       +DTGSDL W  C      C +C +    +       F P  SS+  +  C S  C  +  S+   D           
Subjt:  REIRDGYLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLAT

Query:  LVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN
          K  C      + YTYG      G L  +         NS + I    FGC     G  + +  G+ G GRG LSL SQL      FS+C    + S  
Subjt:  LVKGTCPRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNN

Query:  PNFSSPLILGSLAI-------SSKDHSLQFT-PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQL
           SS L +GSLA        +S D  +  T  LL++P  P++YY+ L+ IT+G        R  V     E+   G GGM+IDSGTT T+L E  +  L
Subjt:  PNFSSPLILGSLAI-------SSKDHSLQFT-PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQL

Query:  ISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL-LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGN
             S ++ P       +TG DLC+K+P            D   N+ +P + FHF     + LP  N   A     +ST V CL   S +G        
Subjt:  ISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNL-LPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGN

Query:  GDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC
                IFG+ QQQN  V++DL+KE + F P +C
Subjt:  GDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDC

AT3G25700.1 Eukaryotic aspartyl protease family protein    6.0e-36    27.77
Query:  LTFFL---LLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLT--PKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVI
        L FFL   L + LL  S  AA  + N +     L      S T  L    +R +F      S   KP+  + S  V+         Y + L +G PPQ +
Subjt:  LTFFL---LLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLT--PKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVI

Query:  QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGAS
         +  DTGSDL WV C      C++C  +           F P HSST     C    C  +   D        A     T +  TC      + Y Y   
Subjt:  QVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGAS

Query:  GVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGSLA
         + +G   ++   +   S     ++    FGC          G ++    G+ G GRG +S  SQLG  F +K FS+C + +  S  P  +S LI+G+  
Subjt:  GVVTGTLTKDVILMHGVSPNSTTQIPRFCFGC---------VGATYREPIGIAGFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGSLA

Query:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG
               L FTPLL +PL P +YY+ L+S+ +      N ++  +   + EID  GNGG ++DSGTT   L EP Y  +I+ +   V  P A    +  G
Subjt:  ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTG

Query:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY
        FDLC  +                  +LP + F F      V P  N F           ++CL  QS+D   G             + G+  QQ     +
Subjt:  FDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVY

Query:  DLQKERIGFQPMDCA
        D  + R+GF    CA
Subjt:  DLQKERIGFQPMDCA

AT3G52500.1 Eukaryotic aspartyl protease family protein    5.0e-43    29.66
Query:  ISSKVLTFFLLLILLLSVSETAARP--HRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRD--------GYLIS
        ++S +  FFL+ + ++S  +    P  H +  P    L L     R +  +  R +  + G+S    KP E+  S         ++         GY +S
Subjt:  ISSKVLTFFLLLILLLSVSETAARP--HRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRD--------GYLIS

Query:  LTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSG--PKLAP-FSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCP
        L+ GTP Q I    DTGS L W+PC +  + C  C+      SG  P L P F P +SS+S    C S  C  ++    P   C   GC   T     C 
Subjt:  LTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSG--PKLAP-FSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCP

Query:  RPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGS
          CP +   YG  G   G L  + +      P+ T  +P F  GC   + R+P GIAGFGRG +SLPSQ+    K FSHC +  +F ++ N ++ L L +
Subjt:  RPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGS

Query:  LA---ISSKDHSLQFTPLLKSPLYPN-----YYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVT-Y
         +     SK   L +TP  K+P   N     YYY+ L  I +G           +  K     T G+GG ++DSG+T+T +  P++  +     S ++ Y
Subjt:  LA---ISSKDHSLQFTPLLKSPLYPN-----YYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVT-Y

Query:  PRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFG
         R K +E  TG   C+ I  K +              +P + F F     + LP  +N++     T++  +  +  ++++  GG         GPA I G
Subjt:  PRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFG

Query:  SFQQQNMEVVYDLQKERIGFQPMDCA
        SFQQQN  V YDL+ +R GF    C+
Subjt:  SFQQQNMEVVYDLQKERIGFQPMDCA

AT4G16563.1 Eukaryotic aspartyl protease family protein (e value: 7.5e-55; %identity: 34.51)
Query:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC
        YLISL++G+    + +Y+DTGSDL W PC    F C  CE    +   P   P S + S+T++  +C S  C   HSS    D C I+ C L  +  G C
Subjt:  YLISLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTC

Query:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN
             PCP F Y YG  G +   L  D + +  VS      +  F FGC   T  EPIG+AGFGRG LSLP+QL     H G  FS+C +   F S+   
Subjt:  ---PRPCPSFAYTYGASGVVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SHKG--FSHCFLPFKF-SNNPN

Query:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL
          SPLILG                        K +   FT +L++P +P +Y + L+ I+IG               LR ID  G GG+++DSGTT+T L
Subjt:  FSSPLILGSLA------------------ISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHL

Query:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL
        P   Y+ ++   +S V   + RA +VE ++G   CY +   N T             +P++  HF  N  SV LP+ N FY              + CL+
Subjt:  PEPLYSQLISNLESVV--TYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFL-NNVSVVLPQGNNFYAMA----APTNSTVVKCLL

Query:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS
          +   GG + +  G   G   I G++QQQ  EVVYDL   R+GF    CAS
Subjt:  FQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCAS

AT5G45120.1 Eukaryotic aspartyl protease family protein (e value: 3.8e-176; %identity: 60.7)
Query:  VLTFFLLLILLL-SVSETAARPHRNNFPNTDS-LVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQV
        VL  FLL+ LLL + ++T AR H+N   ++ S LVL L  S  SL TPK        +   I KP+  +  D V+EPLRE+RDGYLI+L +GTPPQ +QV
Subjt:  VLTFFLLLILLL-SVSETAARPHRNNFPNTDS-LVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQV

Query:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNN-VSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG
        Y+DTGSDLTWVPCGNLSFDC +C + +NN +  P +  FSP HSSTS RD+C SSFC++IHSSDNPFDPC +AGCS++ L+K TC RPCPSFAYTYG  G
Subjt:  YMDTGSDLTWVPCGNLSFDCQDCEEYQNN-VSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASG

Query:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAIS-SKDHSLQFT
        +++G LT+D++         T  +PRF FGCV +TYREPIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN SSPLILG+ A+S +   SLQFT
Subjt:  VVTGTLTKDVILMHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAIS-SKDHSLQFT

Query:  PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKN
        P+L +P+YPN YYIGLESITIG  I        V L LR+ D++GNGGML+DSGTTYTHLPEP YSQL++ L+S +TYPRA + E  TGFDLCYK+PC N
Subjt:  PLLKSPLYPNYYYIGLESITIGDGIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKN

Query:  NTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQP
        N    +S+++    + PSITFHFLNN +++LPQGN+FYAM+AP++ +VV+CLLFQ+M+ G         D GPAG+FGSFQQQN++VVYDL+KERIGFQ 
Subjt:  NTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQGNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQP

Query:  MDCASSAASQGLHK
        MDC   AAS GL++
Subjt:  MDCASSAASQGLHK


Sequences
CDS sequence
ACAAAATCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTT
CCCCAACACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAGGGATCATCATCATCAATCAACAAGC
CAATGGAGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTAT
ATGGACACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTT
TTCCCCAACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTT
CCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTG
ATGCATGGAGTGTCCCCAAATTCCACTACCCAGATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATT
GCTCTCTCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGA
GCCTTGCCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGAT
GGTATTGGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTT
GCCAGAACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCC
CTTGTAAAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAA
GGGAACAATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAACGGGGACGA
CGACGGGCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTG
CTGCCTCTCAAGGACTCCACAAGAATTTT
mRNA sequence
ACAAAATCATCAGCCACCACTATTTCATCTAAAGTTTTGACCTTTTTCCTTCTTCTAATTCTACTTTTGAGTGTCTCAGAAACAGCTGCAAGGCCTCACAGAAACAATTT
CCCCAACACAGATTCTTTAGTTCTTGGCCTTGTTCATTCAAGAACTTCCCTCCTCACTCCCAAAAGGGGCTACTTTTCAAGGAAGGGATCATCATCATCAATCAACAAGC
CAATGGAGGAAATTGGCAGTGATAATGTGATAGAGCCATTGAGGGAAATTAGGGATGGGTACCTAATTTCTCTCACATTAGGGACACCCCCACAAGTGATTCAAGTGTAT
ATGGACACTGGAAGTGACCTCACATGGGTTCCTTGTGGGAACCTCTCTTTTGATTGCCAAGATTGTGAGGAGTATCAAAACAATGTCTCTGGCCCAAAGTTGGCTCCTTT
TTCCCCAACCCATTCTTCAACCTCCATTAGGGATACTTGTGGCAGCTCCTTTTGCATGGATATCCACAGCTCTGATAACCCTTTTGACCCATGCACAATTGCAGGCTGTT
CCCTTGCTACCCTTGTGAAGGGCACTTGCCCTAGGCCATGCCCTTCATTTGCCTACACTTATGGGGCAAGTGGGGTTGTAACTGGAACCCTAACAAAAGATGTCATTTTG
ATGCATGGAGTGTCCCCAAATTCCACTACCCAGATCCCTAGGTTTTGCTTTGGTTGTGTTGGTGCAACTTATAGAGAGCCAATTGGAATTGCTGGCTTTGGGAGAGGATT
GCTCTCTCTCCCTTCACAATTAGGGTTTTCCCATAAGGGTTTCTCCCATTGCTTCTTGCCCTTTAAGTTCTCAAATAACCCTAATTTCTCAAGCCCTTTGATCCTTGGGA
GCCTTGCCATTTCTTCCAAAGATCATAGCTTGCAATTCACCCCTTTGTTGAAAAGTCCTCTTTACCCTAACTACTACTACATTGGTCTTGAGTCAATTACCATTGGGGAT
GGTATTGGCAATAATAATTCTAGATTTGGGGTTTCTTTGAAGTTGAGGGAAATTGACACAAAGGGGAATGGGGGAATGTTGATTGATTCTGGTACAACATATACTCACTT
GCCAGAACCATTGTATTCACAACTTATCTCTAATCTCGAGTCGGTTGTGACGTATCCTAGAGCTAAACAAGTAGAAATCAATACTGGGTTTGATCTTTGTTACAAAATCC
CTTGTAAAAACAACACTAATTTTTCTTCTTCTATGGATGATTATTGTTCCAATCTTCTGCCTTCTATAACATTCCATTTTTTGAACAACGTGAGTGTGGTTCTGCCCCAA
GGGAACAATTTCTATGCCATGGCTGCTCCTACCAACTCCACTGTGGTGAAATGCTTGCTTTTCCAAAGCATGGATGGCGGGGGCGGGGACGGAGACGGGAACGGGGACGA
CGACGGGCCGGCAGGCATTTTTGGGAGCTTCCAACAGCAAAACATGGAGGTTGTTTATGACTTGCAGAAGGAGAGAATAGGGTTTCAGCCAATGGACTGTGCTTCTTCTG
CTGCCTCTCAAGGACTCCACAAGAATTTT
Protein sequence
TKSSATTISSKVLTFFLLLILLLSVSETAARPHRNNFPNTDSLVLGLVHSRTSLLTPKRGYFSRKGSSSSINKPMEEIGSDNVIEPLREIRDGYLISLTLGTPPQVIQVY
MDTGSDLTWVPCGNLSFDCQDCEEYQNNVSGPKLAPFSPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGVVTGTLTKDVIL
MHGVSPNSTTQIPRFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGSLAISSKDHSLQFTPLLKSPLYPNYYYIGLESITIGD
GIGNNNSRFGVSLKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVVTYPRAKQVEINTGFDLCYKIPCKNNTNFSSSMDDYCSNLLPSITFHFLNNVSVVLPQ
GNNFYAMAAPTNSTVVKCLLFQSMDGGGGDGDGNGDDDGPAGIFGSFQQQNMEVVYDLQKERIGFQPMDCASSAASQGLHKNF