; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001172 (gene) of Snake gourd v1 genome

Gene IDTan0001172
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionaspartic proteinase-like protein 2
Genome locationLG05:1424798..1428600
RNA-Seq ExpressionTan0001172
SyntenyTan0001172
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606020.1 Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia]8.9e-25790.38Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        M AIVR G+SVAVAVVMIQAATVL GFPAKLTLER F TNHGVE+AQLR RDR+RHGR+LQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDF+VQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSC+GCPETSGLQIQLNFFDPGSSSTASLVSCSDQ+CA+GVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD+++T+N
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRA+DGIFGFGQQ LSVISQLSS+G+APKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNG+V
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPINPAVFATSNSQGTIIDSGTTLAYLAE+AY++FV AITN VSQS+QS++L+GNQCY+TS+SISDIFP VSLNFAGGASLVLRPQDYLIQQ+SV GTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCS SVNVSTATKTGKSEFVNAGQ S SGSVQNQPNRVI+ LSILVLFVKLSIF+GF H
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH

XP_022958429.1 aspartic proteinase-like protein 2 [Cucurbita moschata]2.6e-25690.18Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        M AIVR G+SVAVAVVMIQAATVL GFPAKLTLER F TNHGVE+AQLR RDR+RHGR+LQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDF+VQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSC+GCPETSGLQIQLNFFDPGSSSTASLVSCSDQ+CA+GVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD+++T+N
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRA+DGIFGFGQQ LSVISQLSS+G+APKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNG+V
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPINPAVFATSNSQGTIIDSGTTLAYLAE+AY++FV AITN VSQS+QS++L+GNQCY+TS+SISDIFP VSLNFAGGASLVLRPQDYLIQQ+SV GTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGW NYDCS SVNVSTATKTGKSEFVNAGQ S SGSVQNQPNRVI+ LSILVLFVKLSIF+GF H
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH

XP_022995311.1 aspartic proteinase-like protein 2 [Cucurbita maxima]6.8e-25790.58Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        MAAIVR G+SV VAV+MIQ ATVL GFPAKLTLER F TNHGVE+AQLR RDR+RHGR+LQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDF+VQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSC+GCPETSGLQIQLNFFDPGSSSTASLVSCSDQ+CA+GVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD+++T+N
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRA+DGIFGFGQQ LSVISQLSS+G+APKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNG+V
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPINPAVFATSNSQGTIIDSGTTLAYLAE+AY++FV AITN VSQSTQS+VL+GNQCY+TS+SISDIFP VSLNFAGGASLVLRPQDYLIQQ+SV GTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCS SVNVSTATKTGKSEFVNAGQ S SGSVQNQPNRVIL LSILVLFVKLSIF+GF H
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH

XP_023532865.1 aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo]3.4e-25690.38Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        M AIVR G+SV VAVVMIQAATVL GFPAKLTLER F TNHGVE+AQLR RDR+RHGR+LQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDF+VQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSC+GCPETSGLQIQLNFFDPGSSSTASLVSCSDQ+CA+GVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD+++T+N
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRA+DGIFGFGQQ LSVISQLSS+G+APKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNG+V
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPINPAVFATSNSQGTIIDSGTTLAYLAE+AY++FV AITN VSQSTQS+ L+GNQCY+TS+SISDIFP VSLNFAGGASLVLRPQDYLIQQ+SV GTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCS SVNVSTATKTGKSEFVNAGQ S SGSVQNQPNRV+L LSILVLFVKLSIF+GF H
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH

XP_038902575.1 aspartic proteinase 39 [Benincasa hispida]3.8e-26091.4Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        MA IVRAG+SVAV VV+ QA TVL GFPAKLTLER F TNHGVE+A LR RD+ RHGRMLQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSCNGCP TSGLQIQLNFFDPGSS+TASLVSCSDQ+CALGVQSSDSAC GQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD++ TSN
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRAVDGIFGFGQQ LSVISQLSS+GIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPI+PAVFATSNSQGTIIDSGTTLAYLAE+AYNSFV+A+TNIVSQSTQSVVLKGNQCYVTS+S++DIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGW NYDCSMSVNVSTATKTGKSEFVNAGQFS +GSVQNQP+R IL+LSILVLFV+LSIFT F HS
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS

TrEMBL top hitse value%identityAlignment
A0A0A0KER5 Peptidase A1 domain-containing protein4.0e-25589.8Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        MA IV AG+SV V VV++QAA VL GFPAKLTLER F TNHGVE+A LR+RDRVRHGRMLQSSGGV+DF V+GTYDPFLVGLYYT+VQLGNPPKDFYVQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSCNGCP TSGLQI LNFFDPGSS+TASLVSCSDQ+CALGVQSSDSAC GQSNQCAYVFQYGDGSGTSGYYV DMIHLDVV+D+SVTSN
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQ LSVISQLSS+GIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPI+PAVFATS+SQGTIIDSGTTLAYLAE+AYN+FV+A+TNIVSQSTQSVVLKGN+CYVTS+S+SDIFPQVSLNFAGGASLVL  QDYLIQQNSVGGTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS
        WC+GFQKIPGQGITILGDLVLKDKIFIYDLANQRIGW NYDCSMSVNVSTATKTGKSEFVNAGQFS SGS+QNQP+R IL+LSI VLFV+L IFT F HS
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS

A0A1S3C8H1 aspartic proteinase-like protein 22.1e-25690Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        M  IV AG+SV V VV++QAATVL GFPA LTLER F TNHGVE+A LR RDRVRHGRMLQSSGGV+DFPVAGTYDPFLVGLYYT+VQLGNPPKDFYVQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSCNGCP TSGLQIQLNFFDPGSS+TASLVSCSDQ+CALGVQSSDSAC GQSNQCAYVFQYGDGSGTSGYYV DMIHLDVVVD+SVTSN
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRAVDGIFGFGQQ LSVISQLSS+GIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPI+PAVF+TSNSQGTIIDSGTTLAYLAE+AYN+FV+A+TNIVSQSTQSVVLKGN+CYVTS+S+SDIFPQVSLNFAGGASLVL  QDYLIQQNSVGGTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS
        WC+GFQKIPGQGITILGDLVLKDKIFIYDLANQRIGW NYDCSMSVNVSTATKTGKSE+VNAGQFS SGS+QNQP+R IL+LSI VLFV+L IFT F HS
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS

A0A5A7SVM3 Aspartic proteinase-like protein 22.1e-25690Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        M  IV AG+SV V VV++QAATVL GFPA LTLER F TNHGVE+A LR RDRVRHGRMLQSSGGV+DFPVAGTYDPFLVGLYYT+VQLGNPPKDFYVQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSCNGCP TSGLQIQLNFFDPGSS+TASLVSCSDQ+CALGVQSSDSAC GQSNQCAYVFQYGDGSGTSGYYV DMIHLDVVVD+SVTSN
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRAVDGIFGFGQQ LSVISQLSS+GIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPI+PAVF+TSNSQGTIIDSGTTLAYLAE+AYN+FV+A+TNIVSQSTQSVVLKGN+CYVTS+S+SDIFPQVSLNFAGGASLVL  QDYLIQQNSVGGTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS
        WC+GFQKIPGQGITILGDLVLKDKIFIYDLANQRIGW NYDCSMSVNVSTATKTGKSE+VNAGQFS SGS+QNQP+R IL+LSI VLFV+L IFT F HS
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS

A0A6J1H528 aspartic proteinase-like protein 21.2e-25690.18Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        M AIVR G+SVAVAVVMIQAATVL GFPAKLTLER F TNHGVE+AQLR RDR+RHGR+LQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDF+VQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSC+GCPETSGLQIQLNFFDPGSSSTASLVSCSDQ+CA+GVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD+++T+N
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRA+DGIFGFGQQ LSVISQLSS+G+APKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNG+V
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPINPAVFATSNSQGTIIDSGTTLAYLAE+AY++FV AITN VSQS+QS++L+GNQCY+TS+SISDIFP VSLNFAGGASLVLRPQDYLIQQ+SV GTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGW NYDCS SVNVSTATKTGKSEFVNAGQ S SGSVQNQPNRVI+ LSILVLFVKLSIF+GF H
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH

A0A6J1K1L8 aspartic proteinase-like protein 23.3e-25790.58Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI
        MAAIVR G+SV VAV+MIQ ATVL GFPAKLTLER F TNHGVE+AQLR RDR+RHGR+LQSSGGV+DFPVAGTYDPFLVGLYYTKVQLGNPPKDF+VQI
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQI

Query:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN
        DTGSDVLWVSCNSC+GCPETSGLQIQLNFFDPGSSSTASLVSCSDQ+CA+GVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYV DMIHLD+VVD+++T+N
Subjt:  DTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSN

Query:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV
        SSASV+FGCSTSQTGDLTKSDRA+DGIFGFGQQ LSVISQLSS+G+APKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNG+V
Subjt:  SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQV

Query:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV
        LPINPAVFATSNSQGTIIDSGTTLAYLAE+AY++FV AITN VSQSTQS+VL+GNQCY+TS+SISDIFP VSLNFAGGASLVLRPQDYLIQQ+SV GTTV
Subjt:  LPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTV

Query:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH
        WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCS SVNVSTATKTGKSEFVNAGQ S SGSVQNQPNRVIL LSILVLFVKLSIF+GF H
Subjt:  WCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPH

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 362.7e-8338.09Show/hide
Query:  LSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGV-----EMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTG
        +S  VAVV +    V+ G         VF+  H       ++++L++ D  RH RML +    +D P+ G      +GLY+TK++LG+PPK++YVQ+DTG
Subjt:  LSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGV-----EMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTG

Query:  SDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSA
        SD+LWV+C  C  CP  + L I L+ +D  +SST+  V C D  C+  +QS      G    C+Y   YGDGS + G ++ D I L+ V     T+  + 
Subjt:  SDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSA

Query:  SVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPI
         VVFGC  +Q+G L ++D AVDGI GFGQ   S+ISQL++ G   ++FSHCL  + +GGGI  +GE+  P V  TP+VP+Q HYN+ L+ + V+G  + +
Subjt:  SVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPI

Query:  NPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCV
         P++ +T+   GTIIDSGTTLAYL ++ YNS +  IT    Q    +V +   C+  +++    FP V+L+F     L + P DYL          ++C 
Subjt:  NPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCV

Query:  GFQK-----IPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQ---FSGSGSVQNQPNRVILHLSILV
        G+Q        G  + +LGDLVL +K+ +YDL N+ IGWA+++CS S+ V    K G       G     S + SV N    ++  LSIL+
Subjt:  GFQK-----IPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQ---FSGSGSVQNQPNRVILHLSILV

Q766C2 Aspartic proteinase nepenthesin-21.1e-3129.74Show/hide
Query:  VEMAQLRARDRVRH-GRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLV
        ++ A  R   R+R    MLQSS G+     AG       G Y   V +G P   F   +DTGSD++W  C  C  C            F+P  SS+ S +
Subjt:  VEMAQLRARDRVRH-GRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLV

Query:  SCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQL
         C  Q C          C   +N+C Y + YGDGS T GY  T+    +        ++S  ++ FGC     G   + + A  G+ G G   LS+ SQL
Subjt:  SCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQL

Query:  SSQGIAPKVFSHCLKG-DDSGGGILVLGEIV--------EPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATSN--SQGTIIDSGTTLAYLAED
           G+    FS+C+     S    L LG              ++++ L P+  +Y + LQ I+V G  L I  + F   +  + G IIDSGTTL YL +D
Subjt:  SSQGIAPKVFSHCLKG-DDSGGGILVLGEIV--------EPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATSN--SQGTIIDSGTTLAYLAED

Query:  AYNSFVLAITNIVSQSTQSVVLKG-NQCYVTSASISDI-FPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIY
        AYN+   A T+ ++  T      G + C+   +  S +  P++S+ F GG  L L  Q+ LI         V C+        GI+I G++  ++   +Y
Subjt:  AYNSFVLAITNIVSQSTQSVVLKG-NQCYVTSASISDI-FPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIY

Query:  DLANQRIGWANYDCSMS
        DL N  + +    C  S
Subjt:  DLANQRIGWANYDCSMS

Q8VYV9 Aspartyl protease family protein 14.4e-3330.29Show/hide
Query:  LYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGC----PETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQY-GDGS
        L+Y  V +G P   F V +DTGSD+ W+ C+ C  C        G  + LN + P +SST++ V C+  +C  G       C    + C Y  +Y  +G+
Subjt:  LYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGC----PETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQY-GDGS

Query:  GTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVV
         ++G  V D++HL  V +   +    A V FGC   QTG +     A +G+FG G + +SV S L+ +GIA   FS C   D  G G +  G+    +  
Subjt:  GTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVV

Query:  YTPLVPSQPH--YNLNLQSISVNGQVLPIN-PAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIV---SQSTQSVVLKGNQCYVTSASISDI-FP
         TPL   QPH  YN+ +  ISV G    +   AVF          DSGT+  YL + AY     +  ++       T    L    CY  S +     +P
Subjt:  YTPLVPSQPH--YNLNLQSISVNGQVLPIN-PAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIV---SQSTQSVVLKGNQCYVTSASISDI-FP

Query:  QVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDC
         V+L   GG+S    P  + +    +  T V+C+   KI  + I+I+G   +     ++D     +GW   DC
Subjt:  QVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDC

Q9LX20 Aspartic proteinase-like protein 16.4e-3228.8Show/hide
Query:  LYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQI-----QLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDG-
        L+YT + +G P   F V +DTGS++LW+ CN     P TS          LN ++P SSST+ +  CS ++C      S S C     QC Y   Y  G 
Subjt:  LYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQI-----QLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDG-

Query:  SGTSGYYVTDMIHLDVVVDASVTSNSS---ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVE
        + +SG  V D++HL    +  + + SS   A VV GC   Q+GD      A DG+ G G   +SV S LS  G+    FS C   +DSG   +  G++  
Subjt:  SGTSGYYVTDMIHLDVVVDASVTSNSS---ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVE

Query:  PNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATS----NSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVV-LKGNQCYVTSASISDI
             TP         L L +   +G ++ +       S     S  T IDSG +  YL E+ Y    L I   ++ ++++   +    CY +SA     
Subjt:  PNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATS----NSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVV-LKGNQCYVTSASISDI

Query:  FPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDC
         P + L F+   + V+    ++ QQ+   G   +C+       +GI  +G   ++    ++D  N ++GW+   C
Subjt:  FPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDC

Q9S9K4 Aspartic proteinase 396.1e-7537.86Show/hide
Query:  VAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVS
        VAV V++I+ A+    F A+              +   ++ D  RH RML S    +D P+ G      VGLY+TK++LG+PPK+++VQ+DTGSD+LW++
Subjt:  VAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVS

Query:  CNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSA--CLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFG
        C  C  CP  + L  +L+ FD  +SST+  V C D  C+  +  SDS    LG    C+Y   Y D S + G ++ DM+ L+ V     T      VVFG
Subjt:  CNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSA--CLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFG

Query:  CSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVF
        C + Q+G L   D AVDG+ GFGQ   SV+SQL++ G A +VFSHCL  +  GGGI  +G +  P V  TP+VP+Q HYN+ L  + V+G  L +  ++ 
Subjt:  CSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVF

Query:  ATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQ-SVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQK
            + GTI+DSGTTLAY  +  Y+S +  I  +  Q  +  +V +  QC+  S ++ + FP VS  F     L + P DYL          ++C G+Q 
Subjt:  ATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQ-SVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQK

Query:  IPGQGIT--------ILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNV
            G+T        +LGDLVL +K+ +YDL N+ IGWA+++CS S+ +
Subjt:  IPGQGIT--------ILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNV

Arabidopsis top hitse value%identityAlignment
AT1G08210.1 Eukaryotic aspartyl protease family protein1.0e-15758.33Show/hide
Query:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSS-GGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQ
        MA    AG+ +  AV+++ A T+  G  A L LER+   NH + + +LRA D  RHGR+LQS  GGVV+FPV G  DPFLVGLYYTKV+LG PP++F VQ
Subjt:  MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSS-GGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQ

Query:  IDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTS
        IDTGSDVLWVSC SCNGCP+TS LQIQL+FFDPG SS+ASLVSCSD+ C    Q ++S C   +N C+Y F+YGDGSGTSGYY++D +  D V+ +++  
Subjt:  IDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTS

Query:  NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQ
        NSSA  VFGCS  Q+GDL +  RAVDGIFG GQ  LSVISQL+ QG+AP+VFSHCLKGD SGGGI+VLG+I  P+ VYTPLVPSQPHYN+NLQSI+VNGQ
Subjt:  NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQ

Query:  VLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTT
        +LPI+P+VF  +   GTIID+GTTLAYL ++AY+ F+ A+ N VSQ  + +  +  QC+  +A   D+FPQVSL+FAGGAS+VL P+ YL Q  S  G++
Subjt:  VLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTT

Query:  VWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKL
        +WC+GFQ++  + ITILGDLVLKDK+ +YDL  QRIGWA YDCS+ VNVS +      + +N GQ+  SGS     NR    L ++V  V L
Subjt:  VWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKL

AT2G36670.1 Eukaryotic aspartyl protease family protein1.2e-15558.87Show/hide
Query:  IVRAGLSVAVAVVMIQAATVLGGF------PAK-LTLERVFSTNHGVEMAQLRARDRVRHGRML------QSSGGVVDFPVAGTYDPFLVG-----LYYT
        ++ A L+VA+AV    A+ +   +      P K L L+R F  +  VE+++LRARDRVRH R+L       S GGVVDFPV G+ DP+LVG     LY+T
Subjt:  IVRAGLSVAVAVVMIQAATVLGGF------PAK-LTLERVFSTNHGVEMAQLRARDRVRHGRML------QSSGGVVDFPVAGTYDPFLVG-----LYYT

Query:  KVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTD
        KV+LG+PP +F VQIDTGSD+LWV+C+SC+ CP +SGL I L+FFD   S TA  V+CSD +C+   Q++ + C  ++NQC Y F+YGDGSGTSGYY+TD
Subjt:  KVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTD

Query:  MIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQP
          + D ++  S+ +NSSA +VFGCST Q+GDLTKSD+AVDGIFGFG+  LSV+SQLSS+GI P VFSHCLKGD SGGG+ VLGEI+ P +VY+PLVPSQP
Subjt:  MIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQP

Query:  HYNLNLQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRP
        HYNLNL SI VNGQ+LP++ AVF  SN++GTI+D+GTTL YL ++AY+ F+ AI+N VSQ    ++  G QCY+ S SISD+FP VSLNFAGGAS++LRP
Subjt:  HYNLNLQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRP

Query:  QDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQ
        QDYL       G ++WC+GFQK P +  TILGDLVLKDK+F+YDLA QRIGWA+YDCSMSVNVS    T   + VN+GQ
Subjt:  QDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQ

AT2G36670.2 Eukaryotic aspartyl protease family protein1.7e-15759.49Show/hide
Query:  IVRAGLSVAVAVVMIQAATVLGGF------PAK-LTLERVFSTNHGVEMAQLRARDRVRHGRML------QSSGGVVDFPVAGTYDPFLVGLYYTKVQLG
        ++ A L+VA+AV    A+ +   +      P K L L+R F  +  VE+++LRARDRVRH R+L       S GGVVDFPV G+ DP+LVGLY+TKV+LG
Subjt:  IVRAGLSVAVAVVMIQAATVLGGF------PAK-LTLERVFSTNHGVEMAQLRARDRVRHGRML------QSSGGVVDFPVAGTYDPFLVGLYYTKVQLG

Query:  NPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLD
        +PP +F VQIDTGSD+LWV+C+SC+ CP +SGL I L+FFD   S TA  V+CSD +C+   Q++ + C  ++NQC Y F+YGDGSGTSGYY+TD  + D
Subjt:  NPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLD

Query:  VVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLN
         ++  S+ +NSSA +VFGCST Q+GDLTKSD+AVDGIFGFG+  LSV+SQLSS+GI P VFSHCLKGD SGGG+ VLGEI+ P +VY+PLVPSQPHYNLN
Subjt:  VVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLN

Query:  LQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLI
        L SI VNGQ+LP++ AVF  SN++GTI+D+GTTL YL ++AY+ F+ AI+N VSQ    ++  G QCY+ S SISD+FP VSLNFAGGAS++LRPQDYL 
Subjt:  LQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLI

Query:  QQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQ
              G ++WC+GFQK P +  TILGDLVLKDK+F+YDLA QRIGWA+YDCSMSVNVS    T   + VN+GQ
Subjt:  QQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQ

AT3G02740.1 Eukaryotic aspartyl protease family protein3.5e-8639.33Show/hide
Query:  EMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSC
        ++  LRA D  RH R+L +    +D P+ G   P  +GLY+ K+ LG P +DF+VQ+DTGSD+LWV+C  C  CP  S L ++L  +D  +SSTA  VSC
Subjt:  EMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSC

Query:  SDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSS
        SD  C+   Q S+       + C YV  YGDGS T+GY V D++HLD+V     T +++ +++FGC + Q+G L +S  AVDGI GFGQ   S ISQL+S
Subjt:  SDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSS

Query:  QGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIV
        QG   + F+HCL  +++GGGI  +GE+V P V  TP++    HY++NL +I V   VL ++   F + + +G IIDSGTTL YL +  YN  +  I    
Subjt:  QGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIV

Query:  SQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQK-----IPGQGITILGDLVLKDKIFIYDLANQRIGWA
         + T   V +   C+  +  + D FP V+  F    SL + P++YL Q         WC G+Q        G  +TILGD+ L +K+ +YD+ NQ IGW 
Subjt:  SQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQK-----IPGQGITILGDLVLKDKIFIYDLANQRIGWA

Query:  NYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLF
        N++CS  + V    ++G    V A   S S S+     +++  +S+L+ F
Subjt:  NYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLF

AT5G22850.1 Eukaryotic aspartyl protease family protein4.6e-20370.06Show/hide
Query:  AGLSVAVAVV---MIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTG
        A +  A A++   ++ AA +  GFPA L LERV   NH +E++QL+ARD  RHGR+LQS GGV+DFPV GT+DPF+VGLYYTK++LG PP+DFYVQ+DTG
Subjt:  AGLSVAVAVV---MIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTG

Query:  SDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSA
        SDVLWVSC SCNGCP+TSGLQIQLNFFDPGSS TAS +SCSDQ C+ G+QSSDS C  Q+N CAY FQYGDGSGTSG+YV+D++  D++V +S+  NS+A
Subjt:  SDVLWVSCNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSA

Query:  SVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPI
         VVFGCSTSQTGDL KSDRAVDGIFGFGQQG+SVISQL+SQGIAP+VFSHCLKG++ GGGILVLGEIVEPN+V+TPLVPSQPHYN+NL SISVNGQ LPI
Subjt:  SVVFGCSTSQTGDLTKSDRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPI

Query:  NPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCV
        NP+VF+TSN QGTIID+GTTLAYL+E AY  FV AITN VSQS + VV KGNQCYV + S+ DIFP VSLNFAGGAS+ L PQDYLIQQN+VGGT VWC+
Subjt:  NPAVFATSNSQGTIIDSGTTLAYLAEDAYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCV

Query:  GFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIF
        GFQ+I  QGITILGDLVLKDKIF+YDL  QRIGWANYDCS SVNVS  + +G+SE+VNAGQFS + +   + +  I+  ++++L + +++F
Subjt:  GFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANYDCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCGATTGTTCGTGCCGGACTGTCGGTGGCCGTTGCGGTGGTGATGATTCAGGCGGCAACGGTTCTGGGTGGGTTTCCGGCCAAGCTGACGTTGGAGAGGGTTTT
TTCGACGAATCACGGCGTCGAAATGGCTCAACTCCGCGCCCGGGACCGGGTTAGACATGGTAGAATGTTGCAGTCTTCTGGTGGTGTTGTTGATTTTCCTGTGGCTGGAA
CCTACGACCCGTTTCTCGTTGGGCTTTATTACACTAAAGTGCAACTAGGTAATCCTCCAAAGGATTTCTATGTGCAGATCGATACTGGAAGTGATGTTTTGTGGGTTAGC
TGCAACTCTTGCAACGGCTGCCCAGAAACTAGTGGGCTCCAGATTCAGCTCAATTTCTTTGATCCTGGTAGCTCATCAACAGCTTCTTTGGTCTCTTGTTCAGACCAAAT
GTGTGCTCTAGGAGTTCAATCCTCTGACTCTGCCTGTTTGGGCCAGAGCAACCAGTGTGCTTATGTCTTCCAATACGGAGATGGAAGTGGAACGTCGGGCTATTACGTTA
CGGACATGATTCATCTTGATGTAGTAGTTGATGCTTCTGTGACTTCGAATTCTTCAGCTTCAGTTGTGTTTGGGTGTAGCACATCACAGACTGGAGACTTGACTAAGTCA
GATAGAGCAGTCGATGGAATCTTCGGGTTTGGGCAACAGGGCTTGTCTGTAATTTCGCAACTGTCTTCACAAGGAATAGCGCCAAAAGTGTTCTCTCACTGCTTGAAAGG
AGATGATAGTGGTGGGGGTATACTGGTCCTGGGCGAGATTGTGGAGCCAAATGTTGTTTACACTCCTCTAGTACCATCACAGCCCCATTATAACTTGAATCTGCAAAGCA
TCTCCGTTAACGGTCAAGTATTACCCATCAATCCGGCTGTCTTTGCAACATCAAATAGCCAAGGAACCATAATTGACTCTGGCACTACTTTGGCATACCTTGCTGAGGAC
GCTTACAACTCTTTTGTTCTTGCTATCACGAACATAGTTTCACAATCGACACAGTCTGTTGTCCTCAAGGGAAATCAGTGTTATGTAACCTCCGCCAGTATCTCTGATAT
ATTTCCTCAAGTAAGCTTAAACTTCGCCGGTGGTGCATCATTGGTATTGAGACCCCAAGACTACCTCATCCAACAAAACTCTGTTGGTGGTACTACTGTTTGGTGCGTTG
GTTTCCAGAAAATTCCAGGTCAAGGGATTACAATTTTAGGGGACCTTGTTCTGAAAGACAAAATCTTCATTTACGATTTAGCTAATCAACGAATTGGATGGGCTAACTAT
GACTGTTCGATGTCAGTAAATGTTTCTACAGCTACCAAGACTGGAAAGAGTGAATTTGTGAACGCAGGGCAGTTCAGTGGCAGTGGCTCTGTGCAGAATCAGCCAAACAG
AGTTATTTTACATTTAAGCATTCTTGTATTGTTTGTTAAATTATCCATTTTTACCGGCTTCCCTCACTCATAG
mRNA sequenceShow/hide mRNA sequence
CTTTGTGACAGAGCATCGGTAGCGTAAGCCCAATTTGTTCACCGAAACTTCTGTTTTTGCTCCATTTTTCGAACCTTTCCTTTCTCTCTCTACACTATATGACATTCCTT
CTCTCTCTCTCTCTAGAATGACCATCGACAACATTCCCTACCTCGGTTACTTCCCCCTTGTTCTCTTCGTCCTTGTCTTCTGATATTTGTCTTTCTTCTTTGATTTGAGT
ATTTAATCGCTTTGTTTGGTGGGTATCCAAATTTGAAGACGAGGGATTGAGGAATGGCTGCGATTGTTCGTGCCGGACTGTCGGTGGCCGTTGCGGTGGTGATGATTCAG
GCGGCAACGGTTCTGGGTGGGTTTCCGGCCAAGCTGACGTTGGAGAGGGTTTTTTCGACGAATCACGGCGTCGAAATGGCTCAACTCCGCGCCCGGGACCGGGTTAGACA
TGGTAGAATGTTGCAGTCTTCTGGTGGTGTTGTTGATTTTCCTGTGGCTGGAACCTACGACCCGTTTCTCGTTGGGCTTTATTACACTAAAGTGCAACTAGGTAATCCTC
CAAAGGATTTCTATGTGCAGATCGATACTGGAAGTGATGTTTTGTGGGTTAGCTGCAACTCTTGCAACGGCTGCCCAGAAACTAGTGGGCTCCAGATTCAGCTCAATTTC
TTTGATCCTGGTAGCTCATCAACAGCTTCTTTGGTCTCTTGTTCAGACCAAATGTGTGCTCTAGGAGTTCAATCCTCTGACTCTGCCTGTTTGGGCCAGAGCAACCAGTG
TGCTTATGTCTTCCAATACGGAGATGGAAGTGGAACGTCGGGCTATTACGTTACGGACATGATTCATCTTGATGTAGTAGTTGATGCTTCTGTGACTTCGAATTCTTCAG
CTTCAGTTGTGTTTGGGTGTAGCACATCACAGACTGGAGACTTGACTAAGTCAGATAGAGCAGTCGATGGAATCTTCGGGTTTGGGCAACAGGGCTTGTCTGTAATTTCG
CAACTGTCTTCACAAGGAATAGCGCCAAAAGTGTTCTCTCACTGCTTGAAAGGAGATGATAGTGGTGGGGGTATACTGGTCCTGGGCGAGATTGTGGAGCCAAATGTTGT
TTACACTCCTCTAGTACCATCACAGCCCCATTATAACTTGAATCTGCAAAGCATCTCCGTTAACGGTCAAGTATTACCCATCAATCCGGCTGTCTTTGCAACATCAAATA
GCCAAGGAACCATAATTGACTCTGGCACTACTTTGGCATACCTTGCTGAGGACGCTTACAACTCTTTTGTTCTTGCTATCACGAACATAGTTTCACAATCGACACAGTCT
GTTGTCCTCAAGGGAAATCAGTGTTATGTAACCTCCGCCAGTATCTCTGATATATTTCCTCAAGTAAGCTTAAACTTCGCCGGTGGTGCATCATTGGTATTGAGACCCCA
AGACTACCTCATCCAACAAAACTCTGTTGGTGGTACTACTGTTTGGTGCGTTGGTTTCCAGAAAATTCCAGGTCAAGGGATTACAATTTTAGGGGACCTTGTTCTGAAAG
ACAAAATCTTCATTTACGATTTAGCTAATCAACGAATTGGATGGGCTAACTATGACTGTTCGATGTCAGTAAATGTTTCTACAGCTACCAAGACTGGAAAGAGTGAATTT
GTGAACGCAGGGCAGTTCAGTGGCAGTGGCTCTGTGCAGAATCAGCCAAACAGAGTTATTTTACATTTAAGCATTCTTGTATTGTTTGTTAAATTATCCATTTTTACCGG
CTTCCCTCACTCATAGCGTCTAGAGAAGAAGATTAGATTTACCATTCTTTCACTGTATTATATTCCGATGCCAACATATAGTTATATAGCCATTGAGCCGACGAAGATCC
GTTTGTGCGGCTGCTATGCGTATTGTCGAGTTTTTTTTTTTTTTTTTTCATTGATTTCTTGAATTAATATTATATTCTTCCACTAGAGTTGGTCTCAGCTGAGTTGTAAA
TTTCTTCATATGAACTAGTCGGTTCTATAGAAGTTTAATTTATAGATTATACTTGTTGATCCGAATCTCCTT
Protein sequenceShow/hide protein sequence
MAAIVRAGLSVAVAVVMIQAATVLGGFPAKLTLERVFSTNHGVEMAQLRARDRVRHGRMLQSSGGVVDFPVAGTYDPFLVGLYYTKVQLGNPPKDFYVQIDTGSDVLWVS
CNSCNGCPETSGLQIQLNFFDPGSSSTASLVSCSDQMCALGVQSSDSACLGQSNQCAYVFQYGDGSGTSGYYVTDMIHLDVVVDASVTSNSSASVVFGCSTSQTGDLTKS
DRAVDGIFGFGQQGLSVISQLSSQGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPINPAVFATSNSQGTIIDSGTTLAYLAED
AYNSFVLAITNIVSQSTQSVVLKGNQCYVTSASISDIFPQVSLNFAGGASLVLRPQDYLIQQNSVGGTTVWCVGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWANY
DCSMSVNVSTATKTGKSEFVNAGQFSGSGSVQNQPNRVILHLSILVLFVKLSIFTGFPHS