CuGenDBv2

Carg15229 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg15229
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Peptidase A1
Genome location: Carg_Chr10:1779178..1786450
RNA-Seq Expression: Carg15229
Synteny: Carg15229
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domains:
IPR003406 - Glycosyl transferase, family 14
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
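
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and a span length derived from it. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Pattern for strings such as "Carg_Chr10:1779178..1786450".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Split a location string into sequence ID, start, and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("Carg_Chr10:1779178..1786450")
print(loc.seqid, loc.start, loc.end, loc.length)
```

The length reported this way is the genomic span of the gene model, so it can exceed the CDS length whenever introns or untranslated regions are present.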


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0058227.1 aspartic proteinase CDR1-like [Cucumis melo var. makuwa] (e value: 7.6e-300; % identity: 82.87)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
        MLPEVEEKHFRKGAQWFTMKRQHA+IVLADNLYYSKFR+YC+PGLEG NCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAE IT ELLQ
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ

Query:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI
        NITSIDVSVHVTSD+K S    S A                            +IINM+I L SLHHLLP LTLAFYLSTAII S    TKPSRLAT+LI
Subjt:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI

Query:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS
        HRNSYLHPLYDPNETVEDRSKRE+ SSIERFA+LESKIKELKSVGN ARS+L PFNRGSGFLVNLSIGSPPV QLVV+DTGSSLLWVQCLPCINCF+QS+
Subjt:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS

Query:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI
        SWFDPLKS+SFK LGCGFPGYNY++GY+CNG NQAEYKLRYLGGD+SQG+LAKESLLFET DEGKI+KTNLTFGCGHMNFKTN DD+YNGVFGLGAYP+I
Subjt:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI

Query:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY
        TMATQLGNKFSYCIGDIN+PLYTHN LVLG+G+Y+EGDSTPL+IHFGHYYV L+ ISVG+K L IDP AF+++ DG GGVLIDSGMTYTKLANGGFELLY
Subjt:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY

Query:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK
        DEI+DL  GLLERIPT+R+FEGLCFKGVVSRDL+G P VTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSE+LNLSVIGILAQQNYNV FDLEQMK
Subjt:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK

Query:  VFFSRIDCQLLDD
        VFF RIDCQLLD+
Subjt:  VFFSRIDCQLLDD

KAG6589695.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 90.52)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
        MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ

Query:  NITSIDVSVHVTSDKKRSTQSFS--------------------------VACIIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIH
        NITSIDVSVHVTSDKK+  Q +                           + C   N  +   S+   + F  +  +   AIISSMMTMTKPSRLATRLIH
Subjt:  NITSIDVSVHVTSDKKRSTQSFS--------------------------VACIIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIH

Query:  RNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSS
        RNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSS
Subjt:  RNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSS

Query:  WFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFIT
        WFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFIT
Subjt:  WFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFIT

Query:  MATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYD
        MATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYD
Subjt:  MATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYD

Query:  EILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKV
        EILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKV
Subjt:  EILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKV

Query:  FFSRIDCQLLDD
        FFSRIDCQLLDD
Subjt:  FFSRIDCQLLDD

KAG7023375.1 Aspartic proteinase nepenthesin-2 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 100)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
        MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ

Query:  NITSIDVSVHVTSDKKRSTQSFSVACIIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSS
        NITSIDVSVHVTSDKKRSTQSFSVACIIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSS
Subjt:  NITSIDVSVHVTSDKKRSTQSFSVACIIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSS

Query:  IERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGY
        IERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGY
Subjt:  IERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGY

Query:  RCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQL
        RCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQL
Subjt:  RCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQL

Query:  VLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKG
        VLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKG
Subjt:  VLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKG

Query:  VVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
        VVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
Subjt:  VVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD

TYK28585.1 Peptidase A1 [Cucumis melo var. makuwa] (e value: 4.5e-300; % identity: 83.03)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
        MLPEVEEKHFRKGAQWFTMKRQHA+IVLADNLYYSKFR+YC+PGLEG NCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAE IT ELLQ
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ

Query:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI
        NITSIDVSVHVTSD+K S    S A                            +IINM+I L SLHHLLP LTLAFYLSTAII S   MTKPSRLAT+LI
Subjt:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI

Query:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS
        HRNSYLHPLYDPNETVEDRSKRE+ SSIERFA+LESKIKELKSVGN ARS+L PFNRGSGFLVNLSIGSPPV QLVV+DTGSSLLWVQCLPCINCF+QS+
Subjt:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS

Query:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI
        SWFDPLKS+SFK LGCGFPGYNY++GY+CNG NQAEYKLRYLGGD+SQG+LAKESLLFET DEGKI+KTNLTFGCGHMNFKTN DD+YNGVFGLGAYP I
Subjt:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI

Query:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY
        TMATQLGNKFSYCIGDIN+PLYTHN LVLG+G+Y+EGDSTPL+IHFGHYYV L+ ISVG+K L IDP AF+++ DG GGVLIDSGMTYTKLANGGFELLY
Subjt:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY

Query:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK
        DEI+DL  GLLERIPT+R+FEGLCFKGVVSRDL+G P VTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSE+LNLSVIGILAQQNYNV FDLEQMK
Subjt:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK

Query:  VFFSRIDCQLLDD
        VFF RIDCQLLD+
Subjt:  VFFSRIDCQLLDD

XP_023515827.1 aspartic proteinase CDR1-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.8e-253; % identity: 99.08)
Query:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV
        L  AIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV
Subjt:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV

Query:  IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH
        IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRK NLTFGCGH
Subjt:  IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH

Query:  MNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR
        MNFKTNIDDSYNGVFGLGAYP+ITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR
Subjt:  MNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR

Query:  GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
        GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
Subjt:  GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML

Query:  NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
        NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
Subjt:  NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7UU11 Aspartic proteinase CDR1-like (e value: 3.7e-300; % identity: 82.87)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
        MLPEVEEKHFRKGAQWFTMKRQHA+IVLADNLYYSKFR+YC+PGLEG NCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAE IT ELLQ
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ

Query:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI
        NITSIDVSVHVTSD+K S    S A                            +IINM+I L SLHHLLP LTLAFYLSTAII S    TKPSRLAT+LI
Subjt:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI

Query:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS
        HRNSYLHPLYDPNETVEDRSKRE+ SSIERFA+LESKIKELKSVGN ARS+L PFNRGSGFLVNLSIGSPPV QLVV+DTGSSLLWVQCLPCINCF+QS+
Subjt:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS

Query:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI
        SWFDPLKS+SFK LGCGFPGYNY++GY+CNG NQAEYKLRYLGGD+SQG+LAKESLLFET DEGKI+KTNLTFGCGHMNFKTN DD+YNGVFGLGAYP+I
Subjt:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI

Query:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY
        TMATQLGNKFSYCIGDIN+PLYTHN LVLG+G+Y+EGDSTPL+IHFGHYYV L+ ISVG+K L IDP AF+++ DG GGVLIDSGMTYTKLANGGFELLY
Subjt:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY

Query:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK
        DEI+DL  GLLERIPT+R+FEGLCFKGVVSRDL+G P VTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSE+LNLSVIGILAQQNYNV FDLEQMK
Subjt:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK

Query:  VFFSRIDCQLLDD
        VFF RIDCQLLD+
Subjt:  VFFSRIDCQLLDD

A0A5D3DZ20 Peptidase A1 (e value: 2.2e-300; % identity: 83.03)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ
        MLPEVEEKHFRKGAQWFTMKRQHA+IVLADNLYYSKFR+YC+PGLEG NCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAE IT ELLQ
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQ

Query:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI
        NITSIDVSVHVTSD+K S    S A                            +IINM+I L SLHHLLP LTLAFYLSTAII S   MTKPSRLAT+LI
Subjt:  NITSIDVSVHVTSDKKRSTQSFSVAC---------------------------IIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLI

Query:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS
        HRNSYLHPLYDPNETVEDRSKRE+ SSIERFA+LESKIKELKSVGN ARS+L PFNRGSGFLVNLSIGSPPV QLVV+DTGSSLLWVQCLPCINCF+QS+
Subjt:  HRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSS

Query:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI
        SWFDPLKS+SFK LGCGFPGYNY++GY+CNG NQAEYKLRYLGGD+SQG+LAKESLLFET DEGKI+KTNLTFGCGHMNFKTN DD+YNGVFGLGAYP I
Subjt:  SWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFI

Query:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY
        TMATQLGNKFSYCIGDIN+PLYTHN LVLG+G+Y+EGDSTPL+IHFGHYYV L+ ISVG+K L IDP AF+++ DG GGVLIDSGMTYTKLANGGFELLY
Subjt:  TMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLY

Query:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK
        DEI+DL  GLLERIPT+R+FEGLCFKGVVSRDL+G P VTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSE+LNLSVIGILAQQNYNV FDLEQMK
Subjt:  DEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMK

Query:  VFFSRIDCQLLDD
        VFF RIDCQLLD+
Subjt:  VFFSRIDCQLLDD

A0A6J1E195 probable aspartic protease At2g35615 isoform X1 (e value: 2.2e-252; % identity: 98.39)
Query:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV
        L   IISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKS+GNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV
Subjt:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV

Query:  IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH
        IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGF GYNYVSGY+CNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH
Subjt:  IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH

Query:  MNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR
        MNFKTNIDDSYNGVFGLGAYP+ITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR
Subjt:  MNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR

Query:  GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
        GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
Subjt:  GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML

Query:  NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
        NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
Subjt:  NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD

A0A6J1E1B4 probable aspartic protease At2g35615 isoform X2 (e value: 1.2e-250; % identity: 99.07)
Query:  MMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLL
        MMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKS+GNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLL
Subjt:  MMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLL

Query:  WVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNID
        WVQCLPCINCFRQSSSWFDPLKSSSFKILGCGF GYNYVSGY+CNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNID
Subjt:  WVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNID

Query:  DSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSG
        DSYNGVFGLGAYP+ITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSG
Subjt:  DSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSG

Query:  MTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGIL
        MTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGIL
Subjt:  MTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGIL

Query:  AQQNYNVAFDLEQMKVFFSRIDCQLLDD
        AQQNYNVAFDLEQMKVFFSRIDCQLLDD
Subjt:  AQQNYNVAFDLEQMKVFFSRIDCQLLDD

A0A6J1JE29 aspartic proteinase CDR1-like isoform X1 (e value: 4.1e-251; % identity: 97.94)
Query:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV
        L   IISSMMTM +PSRLATRLIHRNS LHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV
Subjt:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVAQLVV

Query:  IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH
        IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGY+CNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH
Subjt:  IDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGH

Query:  MNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR
        MNFKTN+DDSYNGVFGLGAYP+ITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR
Subjt:  MNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGR

Query:  GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
        GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
Subjt:  GGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML

Query:  NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
        NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
Subjt:  NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD

SwissProt top hits (columns: e value, % identity, alignment)
Q3EBM5 Probable aspartic protease At2g35615 (e value: 6.5e-44; % identity: 28.38)
Query:  LTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPP
        + L F+L  ++  ++ +   P   +  LIHR+S L P+Y+P  TV DR       S+ R      ++ +      +  ++         F ++++IG+PP
Subjt:  LTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPP

Query:  VAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYR--CNGYNQ-AEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRK
        +    + DTGS L WVQC PC  C++++   FD  KSS++K   C       +S     C+  N   +Y+  Y     S+G +A E++  +++    +  
Subjt:  VAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYR--CNGYNQ-AEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRK

Query:  TNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGN----KFSYCIGDINDPLYTHNQLVLGEGAYVEG-------DSTPL--EIHFGHYYVNLEG
            FGCG+ N  T  D++ +G+ GLG    +++ +QLG+    KFSYC+   +      + + LG  +            STPL  +    +YY+ LE 
Subjt:  TNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGN----KFSYCIGDINDPLYTHNQLVLGEGAYVEG-------DSTPL--EIHFGHYYVNLEG

Query:  ISVGTKRLNIDPKAFQMTWDG-----RGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLV
        ISVG K++     ++    DG      G ++IDSG T T L  G F+     + +   G  +R+   +     CFK   +   IGLP +T HF  GAD+ 
Subjt:  ISVGTKRLNIDPKAFQMTWDG-----RGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLV

Query:  LESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC
        L   + F +   D  CL+++P+      +++ G  AQ ++ V +DLE   V F  +DC
Subjt:  LESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC

Q6XBF8 Aspartic proteinase CDR1 (e value: 1.9e-43; % identity: 29.31)
Query:  LSTAIISSMMTMTKPSR----LATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVA
        LS  ++SS+      ++        LIHR+S   P Y+P ET   R +     S+ R  +   K        N  +  +   +    +L+N+SIG+PP  
Subjt:  LSTAIISSMMTMTKPSR----LATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNLSPFNRGSGFLVNLSIGSPPVA

Query:  QLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYR--CNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNL
         + + DTGS LLW QC PC +C+ Q    FDP  SS++K + C       +          N   Y L Y     ++G +A ++L   +SD   ++  N+
Subjt:  QLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYR--CNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNL

Query:  TFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGN----KFSYCIGDINDPLYTHNQLVLGEGAYVEGD---STPLEIHFGH---YYVNLEGISVGTK
          GCGH N  T  +   +G+ GLG  P +++  QLG+    KFSYC+  +       +++  G  A V G    STPL         YY+ L+ ISVG+K
Subjt:  TFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGN----KFSYCIGDINDPLYTHNQLVLGEGAYVEGD---STPLEIHFGH---YYVNLEGISVGTK

Query:  RLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHG
        ++       +      G ++IDSG T T L    +  L D +    +   ++ P  +    LC+        + +P +T HF  GAD+ L+S + F Q  
Subjt:  RLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHG

Query:  GDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC
         D  C A   S S     S+ G +AQ N+ V +D     V F   DC
Subjt:  GDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC

Q766C2 Aspartic proteinase nepenthesin-2 (e value: 3.6e-42; % identity: 30.25)
Query:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHP-LYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNL---SPFNRGSG-FLVNLSIGSPPV
        L  AI+S+++  T  +   T L H      P L    E V+      +   I+R   ++   + ++S+  + +S+    +P   G G +L+N++IG+P  
Subjt:  LSTAIISSMMTMTKPSRLATRLIHRNSYLHP-LYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNVARSNL---SPFNRGSG-FLVNLSIGSPPV

Query:  AQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLT
        +   ++DTGS L+W QC PC  CF Q +  F+P  SSSF  L C       +    CN  N+ +Y   Y  G T+QG +A E+  FETS        N+ 
Subjt:  AQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLT

Query:  FGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLG-NKFSYCIGDINDPLYTHNQLVLGEGA--YVEGDSTPLEIHFG----HYYVNLEGISVGTKRLNI
        FGCG  N       +  G+ G+G  P +++ +QLG  +FSYC+        + + L LG  A    EG  +   IH      +YY+ L+GI+VG   L I
Subjt:  FGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLG-NKFSYCIGDINDPLYTHNQLVLGEGA--YVEGDSTPLEIHFG----HYYVNLEGISVGTKRLNI

Query:  DPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRF
            FQ+  DG GG++IDSG T T L    +  +     D  N  L  +         CF+       + +P ++  F GG   + E   L     G   
Subjt:  DPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRF

Query:  CLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC
        CLA+   +S  L +S+ G + QQ   V +DL+ + V F    C
Subjt:  CLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC

Q766C3 Aspartic proteinase nepenthesin-1 (e value: 2.1e-42; % identity: 33.71)
Query:  FLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFET
        +L+NLSIG+P      ++DTGS L+W QC PC  CF QS+  F+P  SSSF  L C       +S   C+  N  +Y   Y  G  +QG +  E+L F  
Subjt:  FLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFET

Query:  SDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLG-NKFSYCIGDINDPLYTHNQLVLGE--GAYVEGDSTPLEIHFGH----YYVNL
           G +   N+TFGCG  N       +  G+ G+G  P +++ +QL   KFSYC+  I     T + L+LG    +   G      I        YY+ L
Subjt:  SDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLG-NKFSYCIGDINDPLYTHNQLVLGE--GAYVEGDSTPLEIHFGH----YYVNL

Query:  EGISVGTKRLNIDPKAFQM-TWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLE
         G+SVG+ RL IDP AF + + +G GG++IDSG T T   N  ++ +  E +   N  L  +        LCF+       + +P    HF GG DL L 
Subjt:  EGISVGTKRLNIDPKAFQM-TWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLE

Query:  SGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC
        S + F        CLA+    S    +S+ G + QQN  V +D     V F+   C
Subjt:  SGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDC

Q9SV77 Aspartyl protease UND (e value: 4.1e-38; % identity: 31.51)
Query:  RGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQS-SSWFDPLKSSSFKILGC------GFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQG
        RG  F+  +  GSP   Q + +DTGSSL W QC PC +C+ Q     + P  S +++   C        P + +    R   Y Q      YL     +G
Subjt:  RGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQS-SSWFDPLKSSSFKILGC------GFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQG

Query:  LLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSY---NGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHF
         LA+E +  +T D G  R   + FGC  ++     D SY    G+ GLG   + ++  + G+KFS+C+G+I++P  +HN L+LG+GA V+G  T + I  
Subjt:  LLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSY---NGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHF

Query:  GHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGG
        GH    LE I VG           ++T D    V +D+G T + L+      LY + +D  + L+   P   +   LC+K      L  +  V F F  G
Subjt:  GHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGG

Query:  ADLVLESGSLFRQHGGDRF-CLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQL
        A+L +   ++F Q G     CLAI  +N E  +  +IG++A Q YNV +DL     + ++ DC +
Subjt:  ADLVLESGSLFRQHGGDRF-CLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G23945.1 Eukaryotic aspartyl protease family protein (e value: 6.6e-84; % identity: 41.09)
Query:  LLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSY--LHPLYDPNETVEDRSKREETSSIERFAYLESKI-KELKSVGNVARSNLSPFNRGSGFLVN
        LL F+T+++++ T  I       KP+R+A +LIHR S   L+P      T ED  K     S  RF YL++ I KEL S  +  + ++    + S FLVN
Subjt:  LLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSY--LHPLYDPNETVEDRSKREETSSIERFAYLESKI-KELKSVGNVARSNLSPFNRGSGFLVN

Query:  LSIGSPPVAQLVVIDTGSSLLWVQCLPCINCF--RQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSD
         S+G PPV QL ++DTGSSLLW+QC PC +C         F+P  SS+F    C      Y     C   N+  Y+  Y+ G  S+G+LAKE L F T +
Subjt:  LSIGSPPVAQLVVIDTGSSLLWVQCLPCINCF--RQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSD

Query:  EGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGH--YYVNLEGISVGT
           +    + FGCG+ N +  ++  + G+ GLGA P  ++A QLG+KFSYCIGD+ +  Y +NQLVLGE A + GD TP+E    +  YY+NLEGISVG 
Subjt:  EGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGH--YYVNLEGISVGT

Query:  KRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLF---
         +LNI+P  F+     R GV++DSG  YT LA+  +  LY+EI  + +  LER   R   + LC+ G VS +LIG P VTFHFAGGA+L +E+ S+F   
Subjt:  KRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLF---

Query:  -RQHGGDRFCLAILPS---NSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
           +  + FC+++ P+     E    + IG++AQQ YN+ +DL++  ++  RIDC  LDD
Subjt:  -RQHGGDRFCLAILPS---NSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD

AT4G30030.1 Eukaryotic aspartyl protease family protein (e value: 1.9e-86; % identity: 42.68)
Query:  RSKREETSSIERFAYLESKIKELKSVGNV-ARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCG
        R+K +E+S I +  YL SK      + N+   S+++P    + FL N+SIG+PPV QL++IDTGS L W+ CLPC  C+ Q+  +F P +SS+++   C 
Subjt:  RSKREETSSIERFAYLESKIKELKSVGNV-ARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCG

Query:  FPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDI
           +     +R       +Y LRY     ++G+LA+E L FETSD+G I K N+ FGCG  N        Y+GV GLG   F  +    G+KFSYC G +
Subjt:  FPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDI

Query:  NDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTR
         +P Y HN L+LG GA +EGD TPL+I    YY++L+ IS G K L+I+P  FQ  +  +GG +ID+G + T LA   +E L +EI  L   +L R+   
Subjt:  NDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTR

Query:  RQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLF-RQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLD
         Q+   C++G +  DL G P VTFHFAGGA+L L+  SLF     GD FCLA+  +  +  ++SVIG +AQQNYNV ++L  MKV+F R DC+++D
Subjt:  RQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLF-RQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLD

AT4G30040.1 Eukaryotic aspartyl protease family protein (e value: 1.7e-71; % identity: 40.98)
Query:  SSIERFAYLESKIKELKSVGNVARSNLSPFNR--GSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNY
        +S+ER  YL++     K+ G++  ++LSP        FLVN+SIGSPP+ QL+ +DT S LLW+QCLPCINC+ QS   FDP +S + +   C    Y+ 
Subjt:  SSIERFAYLESKIKELKSVGNVARSNLSPFNR--GSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNY

Query:  VSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFET--SDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPL
         S          EY +RY+    S+G+LA+E LLF T   +       ++ FGCGH N+   +  +  G+ GLG Y   ++  + G KFSYC G ++DP 
Subjt:  VSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLLFET--SDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPL

Query:  YTHNQLVLG-EGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWD-GRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRR-
        Y HN LVLG +GA + GD+TPLEIH G YYV +E ISV    L IDP+ F      G GG +ID+G + T L    ++ L + I D+  G        + 
Subjt:  YTHNQLVLG-EGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDPKAFQMTWD-GRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRR-

Query:  -QFEGLCFKGVVSRDLI--GLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFF
           +  C+ G   RDL+  G P VTFHF+ GA+L L+  SLF +   + FCLA+ P      NL+ IG  AQQ+YN+ +DLE M+V F
Subjt:  -QFEGLCFKGVVSRDLI--GLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEMLNLSVIGILAQQNYNVAFDLEQMKVFF

AT5G57270.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 8.1e-50; % identity: 71.07)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLE-GRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELL
        MLPE+  + FRKGAQWFTMKRQHA+IV+AD LYYSKFR YC+PG+E  +NCIADEHYLPTFF+M+DP GI+NWSVT+VDWSER+WHPK+YRA  ++ +LL
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLE-GRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELL

Query:  QNITSIDVSVHVTSDKKRSTQ
        +NITS D+SVHVTS  KR  +
Subjt:  QNITSIDVSVHVTSDKKRSTQ

AT5G57270.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein (e value: 8.1e-50; % identity: 71.07)
Query:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLE-GRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELL
        MLPE+  + FRKGAQWFTMKRQHA+IV+AD LYYSKFR YC+PG+E  +NCIADEHYLPTFF+M+DP GI+NWSVT+VDWSER+WHPK+YRA  ++ +LL
Subjt:  MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLE-GRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELL

Query:  QNITSIDVSVHVTSDKKRSTQ
        +NITS D+SVHVTS  KR  +
Subjt:  QNITSIDVSVHVTSDKKRSTQ


Sequences
CDS sequence
ATGTTACCGGAAGTGGAGGAGAAACACTTTAGAAAAGGTGCACAGTGGTTCACAATGAAACGGCAGCATGCTATAATAGTTTTGGCTGACAATCTTTATTACTCCAAATT
TCGCAACTACTGCCAGCCAGGTTTAGAAGGACGCAATTGCATAGCTGATGAGCACTACTTGCCGACCTTTTTCAATATGATTGACCCGACTGGAATTGCAAATTGGTCAG
TAACGCATGTTGATTGGTCTGAAAGAAAGTGGCATCCAAAATCTTATAGAGCGGAGGCTATTACCTCTGAGCTTTTGCAGAATATTACGTCGATTGATGTGAGTGTGCAT
GTAACAAGTGACAAGAAGAGATCAACACAGTCATTCTCCGTTGCATGCATCATAATCAACATGGTGATTCCGTTATTTTCTCTTCATCATCTTTTACCTTTCCTGACCTT
GGCATTTTATTTGTCAACAGCCATAATCTCATCCATGATGACCATGACAAAGCCTTCACGTTTGGCAACCAGGCTCATCCATCGAAACTCTTATCTACATCCACTATATG
ACCCAAATGAGACAGTTGAAGATCGATCGAAGAGAGAGGAGACGAGCTCAATTGAACGCTTTGCTTATCTTGAGTCGAAGATTAAAGAATTGAAGTCCGTTGGTAATGTG
GCTCGATCAAATCTCAGCCCTTTCAACCGAGGTAGTGGGTTTCTCGTTAATTTGTCGATCGGTTCTCCACCGGTGGCACAACTCGTAGTGATTGACACTGGTAGCTCCCT
CCTGTGGGTGCAATGTCTGCCTTGTATTAACTGTTTTAGGCAATCGAGCTCATGGTTTGATCCACTGAAATCATCAAGCTTCAAAATATTGGGTTGTGGCTTTCCTGGCT
ATAACTACGTCAGTGGTTACAGATGCAATGGTTATAATCAAGCCGAGTATAAGTTGAGGTACCTTGGTGGGGATACCTCCCAAGGACTTCTTGCGAAGGAATCACTTCTC
TTTGAGACATCTGATGAAGGTAAAATCAGAAAGACCAATTTAACATTTGGGTGTGGCCATATGAACTTCAAAACCAATATAGACGACAGCTACAATGGCGTGTTTGGACT
GGGCGCTTATCCCTTCATAACAATGGCCACTCAATTGGGCAACAAATTTTCCTATTGCATTGGCGATATCAACGATCCTCTCTACACTCACAACCAGCTCGTCTTGGGCG
AGGGAGCCTACGTCGAAGGCGATTCCACCCCTCTGGAGATCCACTTCGGCCATTATTATGTCAATTTAGAAGGCATCAGTGTTGGAACAAAGAGGCTCAACATTGACCCA
AAAGCCTTCCAGATGACATGGGATGGCAGAGGTGGAGTTCTAATCGACTCTGGAATGACTTACACCAAGCTCGCCAATGGTGGGTTTGAACTACTTTACGATGAGATACT
CGATTTGGCGAACGGTTTGTTGGAACGAATCCCAACCAGGAGGCAATTTGAAGGGCTATGTTTCAAAGGAGTTGTGAGTCGAGATCTTATTGGTCTGCCGCCGGTGACGT
TTCATTTCGCCGGCGGTGCCGATTTGGTATTGGAATCTGGAAGTTTGTTTAGGCAGCACGGTGGGGATCGGTTTTGTTTGGCCATCTTGCCAAGCAATTCCGAGATGTTG
AATCTGTCTGTGATTGGTATTTTGGCTCAACAGAATTACAATGTGGCTTTTGATCTTGAACAAATGAAAGTGTTCTTCAGCAGGATTGATTGTCAACTTCTAGACGACTA
A
mRNA sequence
AGAATTTTATATATATATATGTGAAATTAAAAAACGTCGTCCTGAAAAATGCTGTGCACTTGAAACTGGAACGCCATTTCTACGCCTGCTCCGTCCAAGGTTCGGCTCTC
CCACATTGTTTTCCCTCCTTTTTTAGAGAGGAGAAGAGAGAGAAGGAGCACGACAACGAAGCGTTTCTGACATGGAAATGGAAATCATAGACTTTACGGGACAAGTTATT
AAATTATCTCCATTTCTTCCCGGCATATGAATGCCTGATTGTTCTATTTCTTTTTTCATTTTTCATGATTTGGAATTTGGATCTCGGAAGGGGACGGGGCGAAGTTCTGA
TACTCGACCGTTGGTTCGTGGGTCGATATCATTCTATTTTCCTTCATCAGTCAACTTTCAGATTTTCTGGTAACTTTTGGCATGTACTTTCTCATTGGAGGAGTGCATAT
TGGTAGATAGCAATTACAATGGAAATATAGATGCAGTACATTTTGTTGAAGATGAAGACAACTCAGGCGTTTTGTCATAGTGAGATGCAGATTTTCCCTGGGCCTCGCTA
TCGTACTCATATGAAACAGCCCTTATGGATTATCATCTTGGTTTCCTTCATCATTGTCTTCCTTATCTGTGCATACATGTATCCACCTCAAACTAGCGATGCCTGTTACA
TTTTTTCTTCTAGAGGCTGTAAGGTCATTACGGACTGGCTTCCGCCTGCTCCTGCTAGAGAACTTACCGATGAAGAGGTTGCTTCTCATGTTTTTATTCGAGAAATTTTG
AATTCACCTATTGTTCCATCAAAAACTCCAAAGTTAGCATTTATGTTTTTGACTCCTGGGTCTTTGCCGTTTGAGAAGCTGTGGGATAAATTTTTCAATGGTCATGAAGG
AAAATTCACTGTTTATGTCCATGCATCTAAGGAGAGACCAACTCATGTCAGCAGCCACTTTTTGGATCGGGATATTCATAGTGATCAGGTGGTGTGGGGTAAAATTACCA
TGGTTGATGCGGAGAGAAGATTGCTGGCAAATGCTCTAATGGATCCAGATAATCACCATTTTGTTTTACTTTCTGATAGTTGTGTGCCTTTGTATGGTTTTGACTATATC
TACAAGTATCTGATGCATTCAAATATAAGTTTTGTAGACTGAATATTTCTAATAGATACCAATTTTCTCAGCTTTAAGGATCCTGGTCCACATGGAAATGGCAGGTATTC
AGAGCACATGTTACCGGAAGTGGAGGAGAAACACTTTAGAAAAGGTGCACAGTGGTTCACAATGAAACGGCAGCATGCTATAATAGTTTTGGCTGACAATCTTTATTACT
CCAAATTTCGCAACTACTGCCAGCCAGGTTTAGAAGGACGCAATTGCATAGCTGATGAGCACTACTTGCCGACCTTTTTCAATATGATTGACCCGACTGGAATTGCAAAT
TGGTCAGTAACGCATGTTGATTGGTCTGAAAGAAAGTGGCATCCAAAATCTTATAGAGCGGAGGCTATTACCTCTGAGCTTTTGCAGAATATTACGTCGATTGATGTGAG
TGTGCATGTAACAAGTGACAAGAAGAGATCAACACAGTCATTCTCCGTTGCATGCATCATAATCAACATGGTGATTCCGTTATTTTCTCTTCATCATCTTTTACCTTTCC
TGACCTTGGCATTTTATTTGTCAACAGCCATAATCTCATCCATGATGACCATGACAAAGCCTTCACGTTTGGCAACCAGGCTCATCCATCGAAACTCTTATCTACATCCA
CTATATGACCCAAATGAGACAGTTGAAGATCGATCGAAGAGAGAGGAGACGAGCTCAATTGAACGCTTTGCTTATCTTGAGTCGAAGATTAAAGAATTGAAGTCCGTTGG
TAATGTGGCTCGATCAAATCTCAGCCCTTTCAACCGAGGTAGTGGGTTTCTCGTTAATTTGTCGATCGGTTCTCCACCGGTGGCACAACTCGTAGTGATTGACACTGGTA
GCTCCCTCCTGTGGGTGCAATGTCTGCCTTGTATTAACTGTTTTAGGCAATCGAGCTCATGGTTTGATCCACTGAAATCATCAAGCTTCAAAATATTGGGTTGTGGCTTT
CCTGGCTATAACTACGTCAGTGGTTACAGATGCAATGGTTATAATCAAGCCGAGTATAAGTTGAGGTACCTTGGTGGGGATACCTCCCAAGGACTTCTTGCGAAGGAATC
ACTTCTCTTTGAGACATCTGATGAAGGTAAAATCAGAAAGACCAATTTAACATTTGGGTGTGGCCATATGAACTTCAAAACCAATATAGACGACAGCTACAATGGCGTGT
TTGGACTGGGCGCTTATCCCTTCATAACAATGGCCACTCAATTGGGCAACAAATTTTCCTATTGCATTGGCGATATCAACGATCCTCTCTACACTCACAACCAGCTCGTC
TTGGGCGAGGGAGCCTACGTCGAAGGCGATTCCACCCCTCTGGAGATCCACTTCGGCCATTATTATGTCAATTTAGAAGGCATCAGTGTTGGAACAAAGAGGCTCAACAT
TGACCCAAAAGCCTTCCAGATGACATGGGATGGCAGAGGTGGAGTTCTAATCGACTCTGGAATGACTTACACCAAGCTCGCCAATGGTGGGTTTGAACTACTTTACGATG
AGATACTCGATTTGGCGAACGGTTTGTTGGAACGAATCCCAACCAGGAGGCAATTTGAAGGGCTATGTTTCAAAGGAGTTGTGAGTCGAGATCTTATTGGTCTGCCGCCG
GTGACGTTTCATTTCGCCGGCGGTGCCGATTTGGTATTGGAATCTGGAAGTTTGTTTAGGCAGCACGGTGGGGATCGGTTTTGTTTGGCCATCTTGCCAAGCAATTCCGA
GATGTTGAATCTGTCTGTGATTGGTATTTTGGCTCAACAGAATTACAATGTGGCTTTTGATCTTGAACAAATGAAAGTGTTCTTCAGCAGGATTGATTGTCAACTTCTAG
ACGACTAA
Protein sequence
MLPEVEEKHFRKGAQWFTMKRQHAIIVLADNLYYSKFRNYCQPGLEGRNCIADEHYLPTFFNMIDPTGIANWSVTHVDWSERKWHPKSYRAEAITSELLQNITSIDVSVH
VTSDKKRSTQSFSVACIIINMVIPLFSLHHLLPFLTLAFYLSTAIISSMMTMTKPSRLATRLIHRNSYLHPLYDPNETVEDRSKREETSSIERFAYLESKIKELKSVGNV
ARSNLSPFNRGSGFLVNLSIGSPPVAQLVVIDTGSSLLWVQCLPCINCFRQSSSWFDPLKSSSFKILGCGFPGYNYVSGYRCNGYNQAEYKLRYLGGDTSQGLLAKESLL
FETSDEGKIRKTNLTFGCGHMNFKTNIDDSYNGVFGLGAYPFITMATQLGNKFSYCIGDINDPLYTHNQLVLGEGAYVEGDSTPLEIHFGHYYVNLEGISVGTKRLNIDP
KAFQMTWDGRGGVLIDSGMTYTKLANGGFELLYDEILDLANGLLERIPTRRQFEGLCFKGVVSRDLIGLPPVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSEML
NLSVIGILAQQNYNVAFDLEQMKVFFSRIDCQLLDD
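
The CDS and protein sequences listed above should correspond to each other through translation. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` and the choice of `X` for unrecognized codons are illustrative and not part of the database record.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this nuclear gene is translated with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed, so the wrapped sequence lines
    can be pasted in directly. Unrecognized codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nucleotides of the CDS sequence listed above.
cds_start = "ATGTTACCGGAAGTGGAGGAGAAACACTTTAGAAAAGGTGCACAGTGGTTCACAATGAAA"
print(translate(cds_start))  # MLPEVEEKHFRKGAQWFTMK
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full CDS to the same function allows the whole record to be compared against the listed protein.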