CuGenDBv2

MS004627 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS004627
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: aspartyl protease family protein 2-like
Genome location: scaffold995:775423..777096
RNA-Seq Expression: MS004627
Synteny: MS004627
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0006508 - proteolysis (biological process)
GO:0005840 - ribosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
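
The genome location in this record follows a sequence:start..end pattern. As a small illustration, the sketch below splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function name parse_location is hypothetical.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a string such as 'scaffold995:775423..777096' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seq, start, end = match.groups()
    return seq, int(start), int(end)


seq, start, end = parse_location("scaffold995:775423..777096")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(seq, start, end, span_length)
```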


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607981.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.6e-291 | % identity: 88.93
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNED-GHEEEENLMADSVKQSVK
        M+FLG Q+ S+RGFQN  V+L LIFLLLFSGVF TIAEAHVRQG N SNRSG+FGIELPEN+SSGIASSSASAPCSF NED   EEEE  MA+SVK+SVK
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNED-GHEEEENLMADSVKQSVK

Query:  LHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFV
        LHLKKRST+R TE KESITESA+RDLARIQTLH+RITERKNQDTTSRLK  NAE+RKPAEAV+P+ASP+SYS YFSGQL+ATLESGVSLGSGEYFIDVFV
Subjt:  LHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFV

Query:  GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS
        GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQ GPYYDPKDSISFRN+TC DPRCQLVSSPDPPQPCK ETQSCPYFYWYGDSSNTTGDFALETFTVNLTS
Subjt:  GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS

Query:  SATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDT
        S T  SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPEL FTSLIGGKENPVDT
Subjt:  SATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDT

Query:  FYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWK
        FYYLQIKSIFVGGE+L+I EENW +SADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVS A+KLEFPEFEI FADGAVWK
Subjt:  FYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWK

Query:  FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        FPVENYFIRIEQ D+ CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLG+APM+CA+V
Subjt:  FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
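
In each alignment block, the unlabeled middle line follows the usual BLAST pairwise convention: a residue letter where the two sequences are identical, '+' for a conservative substitution, and a space for a mismatch or gap. A minimal sketch, using the hypothetical helper midline_counts, tallies these for a single block. The % identity reported for a hit is presumably computed over the full alignment, so one block alone will not reproduce it.

```python
def midline_counts(midline: str, width: int) -> dict:
    """Count identical, positive, and other columns in one BLAST midline.

    `width` is the number of alignment columns in the block. Trailing
    spaces are often stripped from scraped text, so the midline is padded
    to `width` before counting.
    """
    padded = midline.ljust(width)
    identical = sum(ch.isalpha() for ch in padded)
    positive = padded.count("+")
    return {
        "identical": identical,
        "positive": positive,
        "other": width - identical - positive,
    }


# Final block of the alignment above (query line and its midline).
query = "FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV"
midline = "FPVENYFIRIEQ D+ CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLG+APM+CA+V"
counts = midline_counts(midline, len(query))
print(counts, round(100 * counts["identical"] / len(query), 2))
```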

XP_022135435.1 aspartyl protease family protein 2-like [Momordica charantia] | e value: 0.0e+00 | % identity: 99.64
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLH
        MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLH
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLH

Query:  LKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGS
        LKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGS
Subjt:  LKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGS

Query:  PPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSA
        PPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS 
Subjt:  PPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSA

Query:  TGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFY
        TGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFY
Subjt:  TGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFY

Query:  YLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFP
        YLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFP
Subjt:  YLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFP

Query:  VENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        VENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPM+CAEV
Subjt:  VENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

XP_022940746.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita moschata] | e value: 3.3e-292 | % identity: 89.09
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNED-GHEEEENLMADSVKQSVK
        M+FLG Q+ S+RGFQN  V+L LIFLLLFSGVF TIAEAHVRQG N SNRSG+FGIELPEN+SSGIASSSASAPCSF NED   EEEE  MA+SVK+SVK
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNED-GHEEEENLMADSVKQSVK

Query:  LHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFV
        LHLKKRST+R TEPKESITESA+RDLARIQTLH+RITERKNQDTTSRLK  NAE+RKPAEAV+P+ASP+SYS YFSGQL+ATLESGVSLGSGEYFIDVFV
Subjt:  LHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFV

Query:  GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS
        GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQ GPYYDPKDSISFRN+TC DPRCQLVSSPDPPQPCK ETQSCPYFYWYGDSSNTTGDFALETFTVNLTS
Subjt:  GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS

Query:  SATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDT
        S T  SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPEL FTSLIGGKENPVDT
Subjt:  SATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDT

Query:  FYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWK
        FYYLQIKSIFVGGE+L+I EENW +SADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVS A+KLEFPEFEI FADGAVWK
Subjt:  FYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWK

Query:  FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAE
        FPVENYFIRIEQ D+ CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLG+APM+CA+
Subjt:  FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAE

XP_022981710.1 protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucurbita maxima] | e value: 6.6e-293 | % identity: 89.09
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKL
        M+FLG Q+ S+RGFQN  V+L LIFLLLFS VF+TIAEAHVRQG N SNRSG+FGIELPEN+SSGIA+SS SAPCSF NED  EEEE LMA SVK+SVKL
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKL

Query:  HLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVG
        HLKKRST+R TEPKESITESA+RDLARIQTLH+RITERKNQDTTSRLK  NAE+RKPAEAV+PAASP+SYS YFSGQL+ATLESGVSLGSGEYFIDVFVG
Subjt:  HLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVG

Query:  SPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS
        SPPKHFSLILDTGSDLNWIQCVPCHDCFEQ GPYYDPKDSISFRN+TCNDPRCQLVSSPDPPQPCK ETQSCPYFYWYGD SNTTGDFALETFTVNLTSS
Subjt:  SPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS

Query:  ATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTF
         TG SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPEL FTSL GGKENPVDTF
Subjt:  ATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTF

Query:  YYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKF
        YYLQIKSIFVGGE+L+I EENW +SADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVS A+KLEFPEFEI FADGAVWKF
Subjt:  YYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKF

Query:  PVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        PVENYFIRIEQ D+ CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLG+APM+CA+V
Subjt:  PVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

XP_038898915.1 aspartyl protease family protein 2-like [Benincasa hispida] | e value: 3.4e-297 | % identity: 90.34
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGI-NSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKL
        MDFLGNQT SSRGF N KVFLTLIFLLLFSGVF+++ EAHV QG  NSNRSGIFGIELPENLSSGIA+SSASAPCSFG E   +E E LMADSVKQSVKL
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGI-NSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKL

Query:  HLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVG
        HLKKRST+RA EPKESITESA+RDLARIQTLH RI ERKNQDTTSRLKKSN EQ+KP EAV+PA SPESY+DYFSGQL+ATLESGVSLGSGEYFIDVFVG
Subjt:  HLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVG

Query:  SPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS
        SPPKHFSLILDTGSDLNWIQCVPC+DCFEQNGPYYDPKDSISFRN+TCNDPRC LVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS
Subjt:  SPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS

Query:  ATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTF
         TG SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPELNFTSLIGGKENPVDTF
Subjt:  ATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTF

Query:  YYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKF
        YYLQIKSIFVGGE L+I EENWNLSADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVSG ++L+FPEF I F DGAVW F
Subjt:  YYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKF

Query:  PVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        PVENYFIRI+QLDI CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPM+CAEV
Subjt:  PVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L2W1 Peptidase A1 domain-containing protein | e value: 1.0e-291 | % identity: 88.79
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTI--AEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSV
        MDFLGNQ  SSRGFQN K+FLTLIFLLLFSGVF+T+   EAH+ QG + SNRSG+FGIELPENLSSGIASSSASAPCSFGNE    E E+LMADSVKQSV
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTI--AEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSV

Query:  KLHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAV-TPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDV
        KLHLKKRSTN A +PKESITESA+RDLARIQTLH RITERKNQDTTSRLKKSN E++KP E V +PA SPESY+DYFSGQL+ATLESGVSLGSGEYFIDV
Subjt:  KLHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAV-TPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDV

Query:  FVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL
        F+GSPPKHFSLILDTGSDLNWIQCVPC DCFEQNGPYYDPKDSISFRN+TCNDPRCQLVSSPDPP+PCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL
Subjt:  FVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL

Query:  TSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPV
        TSS TG SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLL HPELNFTSLI GKENPV
Subjt:  TSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPV

Query:  DTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAV
        DTFYYLQIKSIFVGGE+L+I EENWNLSADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVSG ++L FPEF I FADGAV
Subjt:  DTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAV

Query:  WKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        W FPVENYFIRI+QLDI CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPM+CAEV
Subjt:  WKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

A0A5A7UD09 Aspartyl protease family protein 2 | e value: 3.8e-286 | % identity: 87.37
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNT--IAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSV
        MDFLG    SS GFQ+ K+FLTLIFLLLF+ VF+T  + EAH+ QG + SNRS +FGIELPENLSSGIASSSASAPCSFGNE    E E+LMADSVKQSV
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNT--IAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSV

Query:  KLHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAV-TPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDV
        KLHLKKRSTN A EP+ESITESA+RDLARIQTLH RI ERKNQDTTSRLKKSN E++KP E V +PA SPESY+DYFSGQL+ATLESGVSLGSGEYFIDV
Subjt:  KLHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAV-TPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDV

Query:  FVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL
        F+GSPPKHFSLILDTGSDLNWIQCVPC+DCFEQNGPYYDPKDSISFRN+TCNDPRCQLVSSPDPPQPCKFE QSCPYFYWYGDSSNTTGDFALETFTVNL
Subjt:  FVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNL

Query:  TSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPV
        TSS TG SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLL HPELNFTSLIGGKENPV
Subjt:  TSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPV

Query:  DTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAV
        DTFYYLQIKSIFVGGE+L+I EENWNLSADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVS  ++L FPEF I FADGAV
Subjt:  DTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAV

Query:  WKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        W FPVENYFIRI+QLDI CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPM+CAEV
Subjt:  WKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

A0A6J1C128 aspartyl protease family protein 2-like | e value: 0.0e+00 | % identity: 99.64
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLH
        MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLH
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLH

Query:  LKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGS
        LKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGS
Subjt:  LKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGS

Query:  PPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSA
        PPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS 
Subjt:  PPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSA

Query:  TGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFY
        TGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFY
Subjt:  TGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFY

Query:  YLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFP
        YLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFP
Subjt:  YLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFP

Query:  VENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        VENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPM+CAEV
Subjt:  VENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

A0A6J1FJB5 protein ASPARTIC PROTEASE IN GUARD CELL 1-like | e value: 1.6e-292 | % identity: 89.09
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNED-GHEEEENLMADSVKQSVK
        M+FLG Q+ S+RGFQN  V+L LIFLLLFSGVF TIAEAHVRQG N SNRSG+FGIELPEN+SSGIASSSASAPCSF NED   EEEE  MA+SVK+SVK
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNED-GHEEEENLMADSVKQSVK

Query:  LHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFV
        LHLKKRST+R TEPKESITESA+RDLARIQTLH+RITERKNQDTTSRLK  NAE+RKPAEAV+P+ASP+SYS YFSGQL+ATLESGVSLGSGEYFIDVFV
Subjt:  LHLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFV

Query:  GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS
        GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQ GPYYDPKDSISFRN+TC DPRCQLVSSPDPPQPCK ETQSCPYFYWYGDSSNTTGDFALETFTVNLTS
Subjt:  GSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS

Query:  SATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDT
        S T  SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPEL FTSLIGGKENPVDT
Subjt:  SATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDT

Query:  FYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWK
        FYYLQIKSIFVGGE+L+I EENW +SADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVS A+KLEFPEFEI FADGAVWK
Subjt:  FYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWK

Query:  FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAE
        FPVENYFIRIEQ D+ CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLG+APM+CA+
Subjt:  FPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAE

A0A6J1J0D9 protein ASPARTIC PROTEASE IN GUARD CELL 1-like | e value: 3.2e-293 | % identity: 89.09
Query:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKL
        M+FLG Q+ S+RGFQN  V+L LIFLLLFS VF+TIAEAHVRQG N SNRSG+FGIELPEN+SSGIA+SS SAPCSF NED  EEEE LMA SVK+SVKL
Subjt:  MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGIN-SNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKL

Query:  HLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVG
        HLKKRST+R TEPKESITESA+RDLARIQTLH+RITERKNQDTTSRLK  NAE+RKPAEAV+PAASP+SYS YFSGQL+ATLESGVSLGSGEYFIDVFVG
Subjt:  HLKKRSTNRATEPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVG

Query:  SPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS
        SPPKHFSLILDTGSDLNWIQCVPCHDCFEQ GPYYDPKDSISFRN+TCNDPRCQLVSSPDPPQPCK ETQSCPYFYWYGD SNTTGDFALETFTVNLTSS
Subjt:  SPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS

Query:  ATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTF
         TG SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPEL FTSL GGKENPVDTF
Subjt:  ATGTSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTF

Query:  YYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKF
        YYLQIKSIFVGGE+L+I EENW +SADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVS A+KLEFPEFEI FADGAVWKF
Subjt:  YYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKF

Query:  PVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        PVENYFIRIEQ D+ CLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLG+APM+CA+V
Subjt:  PVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

SwissProt top hits (e value, % identity, alignment)
Q766C2 Aspartic proteinase nepenthesin-2 | e value: 6.2e-60 | % identity: 37.93
Query:  LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDS
        +E+ V  G GEY ++V +G+P   FS I+DTGSDL W QC PC  CF Q  P ++P+DS SF  + C    CQ +    P + C      C Y Y YGD 
Subjt:  LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDS

Query:  SNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLA
        S T G  A ETFT   +S          V N+ FGCG  N+G   G  AGL+G+G GPLS  SQL       FSYC+    S  S  S L  G     + 
Subjt:  SNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLA

Query:  HPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCY-NVSG
            + T+LI    NP  T+YY+ ++ I VGG+ L I    + L  DG GG IIDSGTTL+Y    AY  + +AF  ++    + E    L  C+   S 
Subjt:  HPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCY-NVSG

Query:  AEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKC
           ++ PE  + F DG V     +N  I   +  + CLAM  + +  +SI GN QQQ   +LYD +N  + + P +C
Subjt:  AEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKC

Q766C3 Aspartic proteinase nepenthesin-1 | e value: 2.3e-62 | % identity: 38.36
Query:  LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDS
        +E+ V  G GEY +++ +G+P + FS I+DTGSDL W QC PC  CF Q+ P ++P+ S SF  + C+   CQ +SSP            C Y Y YGD 
Subjt:  LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDS

Query:  SNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLA
        S T G    ET T    S          + N+ FGCG  N+G   G  AGL+G+GRGPLS  SQL       FSYC+    S T   S L+ G   + + 
Subjt:  SNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLA

Query:  HPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGG-GGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNV-S
            N T+LI   + P  TFYY+ +  + VG   L I    + L+++ G GG IIDSGTTL+YF + AYQ++++ F+ ++    +         C+   S
Subjt:  HPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGG-GGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNV-S

Query:  GAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKC
            L+ P F +HF DG   + P ENYFI      + CLAM G+    +SI GN QQQN  ++YDT NS + +A  +C
Subjt:  GAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2 | e value: 5.0e-62 | % identity: 33.1
Query:  HRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQN
        H R+  R  +DT     + +A  R+ +  V P++      + F   +V    SG+  GSGEYF+ + VGSPP+   +++D+GSD+ W+QC PC  C++Q+
Subjt:  HRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQN

Query:  GPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGL
         P +DP  S S+  V+C    C  + +          +  C Y   YGD S T G  ALET T   T           V NV  GCGH NRG+F GAAGL
Subjt:  GPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGL

Query:  LGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHP-ELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGG
        LG+G G +SF  QL    G +F YCLV R +D++ S  L+FG +    A P   ++  L+     P  +FYY+ +K + VGG  + + +  ++L+  G G
Subjt:  LGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHP-ELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGG

Query:  GTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSII
        G ++D+GT ++     AY   ++ F  +           I   CY++SG   +  P    +F +G V   P  N+ + ++     C A   +P + LSII
Subjt:  GTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSII

Query:  GNYQQQNFHILYDTKNSRLGYAPMKC
        GN QQ+   + +D  N  +G+ P  C
Subjt:  GNYQQQNFHILYDTKNSRLGYAPMKC

Q9LNJ3 Aspartyl protease family protein 2 | e value: 2.7e-71 | % identity: 37.03
Query:  TEPKESITESA--IRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSL
        +E   SIT +   I  L+  +T     + R  +D+      +    + P   VT A  P  +S        +++ SG+S GSGEYF  + VG+P ++  +
Subjt:  TEPKESITESA--IRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSL

Query:  ILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFR
        +LDTGSD+ W+QC PC  C+ Q+ P +DP+ S ++  + C+ P C+ + S      C    ++C Y   YGD S T GDF+ ET T              
Subjt:  ILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFR

Query:  RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENP-VDTFYYLQIKS
        RV+ V  GCGH N GLF GAAGLLGLG+G LSF  Q    +   FSYCLVDR++ +  SS ++FG   +        FT L+    NP +DTFYY+ +  
Subjt:  RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENP-VDTFYYLQIKS

Query:  IFVGGEELK-ISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYF
        I VGG  +  ++   + L   G GG IIDSGT+++    PAY  +++AF    KT K   DF +   C+++S   +++ P   +HF  GA    P  NY 
Subjt:  IFVGGEELK-ISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYF

Query:  IRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCA
        I ++     C A  GT    LSIIGN QQQ F ++YD  +SR+G+AP  CA
Subjt:  IRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCA

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1 | e value: 2.0e-66 | % identity: 35.09
Query:  HRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSD---YFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCF
        ++ +T  + +  +SR+    A+ R   E V  +     Y++   Y +  L   + SG S GSGEYF  + VG+P K   L+LDTGSD+NWIQC PC DC+
Subjt:  HRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSD---YFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCF

Query:  EQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHGA
        +Q+ P ++P  S +++++TC+ P+C L+ +      C+  +  C Y   YGD S T G+ A +T T   +          ++ NV  GCGH N GLF GA
Subjt:  EQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHGA

Query:  AGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGG-------KENPVDTFYYLQIKSIFVGGEELKISEEN
        AGLLGLG G LS ++Q+++    SFSYCLVDR+S  S S               + N   L GG       +   +DTFYY+ +    VGGE++ + +  
Subjt:  AGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGG-------KENPVDTFYYLQIKSIFVGGEELKISEEN

Query:  WNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLR-KVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAML
        +++ A G GG I+D GT ++     AY ++++AFL+  V   K      +   CY+ S    ++ P    HF  G     P +NY I ++     C A  
Subjt:  WNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLR-KVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAML

Query:  GTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKC
         T  S+LSIIGN QQQ   I YD   + +G +  KC
Subjt:  GTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKC

Arabidopsis top hits (e value, % identity, alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein | e value: 1.9e-72 | % identity: 37.03
Query:  TEPKESITESA--IRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSL
        +E   SIT +   I  L+  +T     + R  +D+      +    + P   VT A  P  +S        +++ SG+S GSGEYF  + VG+P ++  +
Subjt:  TEPKESITESA--IRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSL

Query:  ILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFR
        +LDTGSD+ W+QC PC  C+ Q+ P +DP+ S ++  + C+ P C+ + S      C    ++C Y   YGD S T GDF+ ET T              
Subjt:  ILDTGSDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFR

Query:  RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENP-VDTFYYLQIKS
        RV+ V  GCGH N GLF GAAGLLGLG+G LSF  Q    +   FSYCLVDR++ +  SS ++FG   +        FT L+    NP +DTFYY+ +  
Subjt:  RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENP-VDTFYYLQIKS

Query:  IFVGGEELK-ISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYF
        I VGG  +  ++   + L   G GG IIDSGT+++    PAY  +++AF    KT K   DF +   C+++S   +++ P   +HF  GA    P  NY 
Subjt:  IFVGGEELK-ISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYF

Query:  IRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCA
        I ++     C A  GT    LSIIGN QQQ F ++YD  +SR+G+AP  CA
Subjt:  IRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCA

AT2G42980.1 Eukaryotic aspartyl protease family protein | e value: 1.4e-179 | % identity: 59.15
Query:  LTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRATEPKESITESA
        L L  +  FSG   T++  H     + N   +F  +     SS  ASSS S  C F +++ H+  +    +SVK   ++   K+ T R T    S+ +  
Subjt:  LTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRATEPKESITESA

Query:  IRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQC
        I+DL RI+TLH R  + K Q       K+   ++K    ++   +PE       G+L+ATLESG++LGSGEYF+DV VG+PPKHFSLILDTGSDLNW+QC
Subjt:  IRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQC

Query:  VPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWN
        +PC+DCF QNG +YDPK S SF+N+TCNDPRC L+SSPDPP  C+ + QSCPYFYWYGD SNTTGDFA+ETFTVNLT++  G+SE+ +V N+MFGCGHWN
Subjt:  VPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWN

Query:  RGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEEN
        RGLF GA+GLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS+T+VSSKLIFGED+DLL H  LNFTS + GKEN V+TFYY+QIKSI VGG+ L I EE 
Subjt:  RGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEEN

Query:  WNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVK-TYKLVEDFPILHPCYNVSGAEK--LEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLA
        WN+S+DG GGTIIDSGTTLSYF++PAY+ IK  F  K+K  Y +  DFP+L PC+NVSG E+  +  PE  I F DG VW FP EN FI + + D+ CLA
Subjt:  WNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVK-TYKLVEDFPILHPCYNVSGAEK--LEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLA

Query:  MLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        +LGTPKS  SIIGNYQQQNFHILYDTK SRLG+ P KCA++
Subjt:  MLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

AT3G25700.1 Eukaryotic aspartyl protease family protein | e value: 1.5e-72 | % identity: 40.67
Query:  SGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQN-GPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFET--QSCPYFYWYGD
        SG + GSG+YF+D+ +G PP+   LI DTGSDL W++C  C +C   +    + P+ S +F    C DP C+LV  PD    C       +C Y Y Y D
Subjt:  SGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCHDCFEQN-GPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFET--QSCPYFYWYGD

Query:  SSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRG------LFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGE
         S T+G FA ET     TS  T + +  R+++V FGCG    G       F+GA G++GLGRGP+SF+SQL   +G+ FSYCL+D       +S LI G 
Subjt:  SSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRG------LFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGE

Query:  DRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPC
          D ++  +L FT L+    +P  TFYY+++KS+FV G +L+I    W +   G GGT++DSGTTL++ ++PAY+++  A  R+VK        P    C
Subjt:  DRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPC

Query:  YNVSGAEKLE--FPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGT-PKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCA
         NVSG  K E   P  +  F+ GAV+  P  NYFI  E+  I CLA+    PK   S+IGN  QQ F   +D   SRLG++   CA
Subjt:  YNVSGAEKLE--FPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGT-PKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCA

AT3G59080.1 Eukaryotic aspartyl protease family protein    e value: 6.5e-198    % identity: 63.99
Query:  FQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRATE-P
        F      L LIF   F   F+  + A        N SG  GI+ P  +  G ASSS S  C F +    E+E         ++VK HLK+R T    +  
Subjt:  FQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRATE-P

Query:  KESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG
          S+ E  IRDL RIQTLH+R+ E+ NQ+T S+ +K N    K     TP AS        +GQLVATLESG++LGSGEYF+DV VGSPPKHFSLILDTG
Subjt:  KESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG

Query:  SDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENV
        SDLNWIQC+PC+DCF+QNG +YDPK S S++N+TCND RC LVSSPDPP PCK + QSCPY+YWYGDSSNTTGDFA+ETFTVNLT++  G+SE   VEN+
Subjt:  SDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENV

Query:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGE
        MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT+VSSKLIFGED+DLL+HP LNFTS + GKEN VDTFYY+QIKSI V GE
Subjt:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGE

Query:  ELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVK-TYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQL
         L I EE WN+S+DG GGTIIDSGTTLSYF++PAY+ IK     K K  Y +  DFPIL PC+NVSG   ++ PE  I FADGAVW FP EN FI + + 
Subjt:  ELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVK-TYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQL

Query:  DIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        D+ CLAMLGTPKSA SIIGNYQQQNFHILYDTK SRLGYAP KCA++
Subjt:  DIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV

AT3G59080.2 Eukaryotic aspartyl protease family protein    e value: 1.7e-174    % identity: 59.41
Query:  FQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRATE-P
        F      L LIF   F   F+  + A        N SG  GI+ P  +  G ASSS S  C F +    E+E         ++VK HLK+R T    +  
Subjt:  FQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRATE-P

Query:  KESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG
          S+ E  IRDL RIQTLH+R+ E+ NQ+T S+ +K N    K     TP AS        +GQLVATLESG++LGSGEYF+DV VGSPPKHFSLILDTG
Subjt:  KESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG

Query:  SDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENV
        SDLNWIQC+PC+DCF+QN                                    + QSCPY+YWYGDSSNTTGDFA+ETFTVNLT++  G+SE   VEN+
Subjt:  SDLNWIQCVPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENV

Query:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGE
        MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT+VSSKLIFGED+DLL+HP LNFTS + GKEN VDTFYY+QIKSI V GE
Subjt:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGE

Query:  ELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVK-TYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQL
         L I EE WN+S+DG GGTIIDSGTTLSYF++PAY+ IK     K K  Y +  DFPIL PC+NVSG   ++ PE  I FADGAVW FP EN FI + + 
Subjt:  ELKISEENWNLSADGGGGTIIDSGTTLSYFSDPAYQTIKEAFLRKVK-TYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQL

Query:  DIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV
        D+ CLAMLGTPKSA SIIGNYQQQNFHILYDTK SRLGYAP KCA++
Subjt:  DIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMKCAEV


Sequences
CDS sequence
ATGGATTTTCTCGGTAACCAAACAAGCAGTAGCAGAGGTTTTCAGAATCGCAAAGTGTTTCTTACATTGATTTTCCTTTTGCTTTTCTCCGGCGTTTTTAATACGATTGC
AGAAGCGCATGTTCGTCAAGGAATCAACTCCAATCGCTCTGGTATTTTCGGAATCGAATTGCCGGAAAATCTCAGCTCCGGTATTGCATCTTCCTCCGCGAGCGCTCCGT
GTAGTTTCGGTAATGAAGATGGACACGAAGAGGAAGAAAATTTAATGGCGGATTCGGTTAAGCAATCAGTGAAGCTCCACTTGAAAAAGCGGTCAACGAATCGAGCGACG
GAACCGAAAGAATCAATCACCGAATCTGCAATTAGGGATTTGGCGAGAATCCAGACTCTTCATAGGAGAATCACTGAGAGGAAGAATCAAGACACGACTTCGAGATTGAA
GAAGAGCAATGCCGAGCAGCGGAAACCGGCGGAGGCGGTTACTCCGGCAGCATCGCCGGAATCTTACTCCGATTACTTCTCCGGCCAGCTTGTGGCTACTCTCGAATCCG
GCGTCAGTCTCGGCTCCGGAGAGTACTTCATCGATGTCTTCGTCGGTTCTCCGCCCAAACACTTCTCTCTGATTCTCGATACTGGAAGCGACCTAAACTGGATTCAATGC
GTCCCTTGCCACGATTGTTTCGAGCAAAACGGGCCGTATTACGATCCGAAAGATTCAATTTCTTTCAGAAACGTTACCTGTAACGATCCTCGATGTCAATTGGTTTCGTC
TCCAGATCCTCCGCAGCCGTGCAAATTCGAGACGCAATCGTGCCCTTATTTCTACTGGTACGGCGACAGTTCGAACACCACCGGCGATTTTGCGCTCGAGACGTTCACCG
TCAATCTCACCTCGTCGGCGACGGGGACGTCGGAGTTCCGGCGGGTGGAGAATGTGATGTTCGGATGCGGCCACTGGAACAGAGGCCTCTTCCACGGCGCCGCCGGATTG
TTAGGGCTCGGCCGTGGCCCTCTCTCGTTTTCATCGCAGCTTCAATCGCTCTACGGCCATTCCTTCTCCTACTGTCTCGTCGATCGAAACAGCGATACAAGCGTGAGCAG
CAAGCTGATTTTCGGCGAAGATAGAGATCTATTAGCGCATCCAGAACTGAATTTCACATCGCTGATCGGAGGAAAGGAAAATCCAGTCGACACATTCTACTATCTGCAAA
TCAAATCGATCTTCGTCGGAGGAGAGGAGCTCAAAATCTCCGAGGAGAACTGGAACCTCTCCGCCGACGGCGGCGGCGGAACAATCATCGACTCCGGCACCACTCTCAGC
TATTTCTCCGATCCGGCTTACCAGACAATCAAGGAAGCATTTCTGCGGAAAGTGAAAACCTACAAACTGGTGGAAGATTTTCCGATTCTGCATCCTTGTTACAACGTCTC
CGGCGCCGAGAAACTGGAATTTCCAGAATTCGAAATCCACTTCGCCGACGGCGCCGTGTGGAAATTCCCGGTGGAGAATTACTTCATCAGAATCGAGCAATTGGATATCG
CGTGCTTGGCGATGTTAGGGACTCCAAAATCGGCGCTGTCGATCATCGGAAATTACCAGCAGCAAAATTTTCACATACTGTACGATACGAAGAACTCGCGGCTGGGCTAC
GCGCCGATGAAATGTGCCGAAGTT
mRNA sequence
ATGGATTTTCTCGGTAACCAAACAAGCAGTAGCAGAGGTTTTCAGAATCGCAAAGTGTTTCTTACATTGATTTTCCTTTTGCTTTTCTCCGGCGTTTTTAATACGATTGC
AGAAGCGCATGTTCGTCAAGGAATCAACTCCAATCGCTCTGGTATTTTCGGAATCGAATTGCCGGAAAATCTCAGCTCCGGTATTGCATCTTCCTCCGCGAGCGCTCCGT
GTAGTTTCGGTAATGAAGATGGACACGAAGAGGAAGAAAATTTAATGGCGGATTCGGTTAAGCAATCAGTGAAGCTCCACTTGAAAAAGCGGTCAACGAATCGAGCGACG
GAACCGAAAGAATCAATCACCGAATCTGCAATTAGGGATTTGGCGAGAATCCAGACTCTTCATAGGAGAATCACTGAGAGGAAGAATCAAGACACGACTTCGAGATTGAA
GAAGAGCAATGCCGAGCAGCGGAAACCGGCGGAGGCGGTTACTCCGGCAGCATCGCCGGAATCTTACTCCGATTACTTCTCCGGCCAGCTTGTGGCTACTCTCGAATCCG
GCGTCAGTCTCGGCTCCGGAGAGTACTTCATCGATGTCTTCGTCGGTTCTCCGCCCAAACACTTCTCTCTGATTCTCGATACTGGAAGCGACCTAAACTGGATTCAATGC
GTCCCTTGCCACGATTGTTTCGAGCAAAACGGGCCGTATTACGATCCGAAAGATTCAATTTCTTTCAGAAACGTTACCTGTAACGATCCTCGATGTCAATTGGTTTCGTC
TCCAGATCCTCCGCAGCCGTGCAAATTCGAGACGCAATCGTGCCCTTATTTCTACTGGTACGGCGACAGTTCGAACACCACCGGCGATTTTGCGCTCGAGACGTTCACCG
TCAATCTCACCTCGTCGGCGACGGGGACGTCGGAGTTCCGGCGGGTGGAGAATGTGATGTTCGGATGCGGCCACTGGAACAGAGGCCTCTTCCACGGCGCCGCCGGATTG
TTAGGGCTCGGCCGTGGCCCTCTCTCGTTTTCATCGCAGCTTCAATCGCTCTACGGCCATTCCTTCTCCTACTGTCTCGTCGATCGAAACAGCGATACAAGCGTGAGCAG
CAAGCTGATTTTCGGCGAAGATAGAGATCTATTAGCGCATCCAGAACTGAATTTCACATCGCTGATCGGAGGAAAGGAAAATCCAGTCGACACATTCTACTATCTGCAAA
TCAAATCGATCTTCGTCGGAGGAGAGGAGCTCAAAATCTCCGAGGAGAACTGGAACCTCTCCGCCGACGGCGGCGGCGGAACAATCATCGACTCCGGCACCACTCTCAGC
TATTTCTCCGATCCGGCTTACCAGACAATCAAGGAAGCATTTCTGCGGAAAGTGAAAACCTACAAACTGGTGGAAGATTTTCCGATTCTGCATCCTTGTTACAACGTCTC
CGGCGCCGAGAAACTGGAATTTCCAGAATTCGAAATCCACTTCGCCGACGGCGCCGTGTGGAAATTCCCGGTGGAGAATTACTTCATCAGAATCGAGCAATTGGATATCG
CGTGCTTGGCGATGTTAGGGACTCCAAAATCGGCGCTGTCGATCATCGGAAATTACCAGCAGCAAAATTTTCACATACTGTACGATACGAAGAACTCGCGGCTGGGCTAC
GCGCCGATGAAATGTGCCGAAGTT
Protein sequence
MDFLGNQTSSSRGFQNRKVFLTLIFLLLFSGVFNTIAEAHVRQGINSNRSGIFGIELPENLSSGIASSSASAPCSFGNEDGHEEEENLMADSVKQSVKLHLKKRSTNRAT
EPKESITESAIRDLARIQTLHRRITERKNQDTTSRLKKSNAEQRKPAEAVTPAASPESYSDYFSGQLVATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQC
VPCHDCFEQNGPYYDPKDSISFRNVTCNDPRCQLVSSPDPPQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSATGTSEFRRVENVMFGCGHWNRGLFHGAAGL
LGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLAHPELNFTSLIGGKENPVDTFYYLQIKSIFVGGEELKISEENWNLSADGGGGTIIDSGTTLS
YFSDPAYQTIKEAFLRKVKTYKLVEDFPILHPCYNVSGAEKLEFPEFEIHFADGAVWKFPVENYFIRIEQLDIACLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGY
APMKCAEV
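
The protein sequence above should correspond to the CDS sequence read codon by codon. The following is a minimal Python sketch for checking that, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
# Assumption: MS004627 is a nuclear gene translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon)."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last codons of the CDS sequence listed on this page.
cds_start = "ATGGATTTTCTCGGTAACCAA"
cds_end = "GCGCCGATGAAATGTGCCGAAGTT"
protein_start = translate(cds_start)
protein_end = translate(cds_end)
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, after joining its lines, is one way to confirm that the two records agree.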