; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013827 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013827
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionaspartyl protease family protein 2
Genome locationChr02:5129288..5130973
RNA-Seq ExpressionHG10013827
SyntenyHG10013827
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0006508 - proteolysis (biological process)
GO:0005840 - ribosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052377.1 aspartyl protease family protein 2 [Cucumis melo var. makuwa]5.2e-29890.59Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLG    SS GFQ+ K+FLTLI LLLF+ +FD+  +VEAH+PQ F+ SN S +FGIELPENLSSGIASSSASAPCSFGNEGEE E ESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ ANEP+ESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVE RK  + +  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP QPCKFE QSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLIGGKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVS TD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

KGN54937.1 hypothetical protein Csa_012800 [Cucumis sativus]3.2e-30391.83Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSI--VEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLGNQ GSSRGFQN K+FLTLI LLLFSG+F ++  VEAH+PQ F+ SN SG+FGIELPENLSSGIASSSASAPCSFGNEGEE ERESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSI--VEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ AN+PKESITESAVRDLARIQTLHTRI ERKNQDTTSRLKKSNVE RK  + E  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPC+DCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP +PCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLI GKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

XP_008465452.1 PREDICTED: aspartyl protease family protein 2 [Cucumis melo]5.2e-29890.59Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLG    SS GFQ+ K+FLTLI LLLF+ +FD+  +VEAH+PQ F+ SN S +FGIELPENLSSGIASSSASAPCSFGNEGEE E ESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ ANEP+ESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVE RK  + +  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP QPCKFE QSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLIGGKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVS TD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

XP_031740447.1 aspartyl protease family protein 2 [Cucumis sativus]3.2e-30391.83Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSI--VEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLGNQ GSSRGFQN K+FLTLI LLLFSG+F ++  VEAH+PQ F+ SN SG+FGIELPENLSSGIASSSASAPCSFGNEGEE ERESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSI--VEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ AN+PKESITESAVRDLARIQTLHTRI ERKNQDTTSRLKKSNVE RK  + E  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPC+DCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP +PCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLI GKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

XP_038898915.1 aspartyl protease family protein 2-like [Benincasa hispida]0.0e+0094.3Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKL
        MDFLGNQTGSSRGF N KVFLTLI LLLFSG+FDS+VEAHVPQ F+NSN SGIFGIELPENLSSGIA+SSASAPCSFG EGEEDE E+LM DSVKQSVKL
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKL

Query:  HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVF
        HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQ+K    EAVSP  SPES+ DYFSGQL+ATLESGVSLGSGEYFIDVF
Subjt:  HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVF

Query:  VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT
        VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRC LVSSPDP QPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT
Subjt:  VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT

Query:  SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD
        SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD
Subjt:  SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD

Query:  TFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVW
        TFYYLQIKSIFVG E+LQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTD+L+FPEFGIQF DGAVW
Subjt:  TFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVW

Query:  NFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        NFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  NFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

TrEMBL top hitse value%identityAlignment
A0A0A0L2W1 Peptidase A1 domain-containing protein1.5e-30391.83Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSI--VEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLGNQ GSSRGFQN K+FLTLI LLLFSG+F ++  VEAH+PQ F+ SN SG+FGIELPENLSSGIASSSASAPCSFGNEGEE ERESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSI--VEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ AN+PKESITESAVRDLARIQTLHTRI ERKNQDTTSRLKKSNVE RK  + E  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPC+DCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP +PCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLI GKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

A0A1S3CNY2 aspartyl protease family protein 22.5e-29890.59Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLG    SS GFQ+ K+FLTLI LLLF+ +FD+  +VEAH+PQ F+ SN S +FGIELPENLSSGIASSSASAPCSFGNEGEE E ESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ ANEP+ESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVE RK  + +  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP QPCKFE QSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLIGGKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVS TD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

A0A5A7UD09 Aspartyl protease family protein 22.5e-29890.59Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV
        MDFLG    SS GFQ+ K+FLTLI LLLF+ +FD+  +VEAH+PQ F+ SN S +FGIELPENLSSGIASSSASAPCSFGNEGEE E ESLM DSVKQSV
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDS--IVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSV

Query:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID
        KLHLKKRST+ ANEP+ESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVE RK  + +  SP  SPES+ DYFSGQLMATLESGVSLGSGEYFID
Subjt:  KLHLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFID

Query:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN
        VF+GSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDP QPCKFE QSCPYFYWYGDSSNTTGDFALETFTVN
Subjt:  VFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN

Query:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP
        LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR+SDTSVSSKLIFGED+DLLTHPELNFTSLIGGKENP
Subjt:  LTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP

Query:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA
        VDTFYYLQIKSIFVG EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVS TD+L FPEF IQFADGA
Subjt:  VDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGA

Query:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        VWNFPVENYFIRI+QLDIVCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  VWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

A0A6J1C128 aspartyl protease family protein 2-like4.0e-29690.37Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKL
        MDFLGNQT SSRGFQN KVFLTLI LLLFSG+F++I EAHV Q   NSN SGIFGIELPENLSSGIASSSASAPCSFGNE   +E E+LM DSVKQSVKL
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKL

Query:  HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVF
        HLKKRST+RA EPKESITESA+RDLARIQTLH RI ERKNQDTTSRLKKSN EQRK A  EAV+P ASPES++DYFSGQL+ATLESGVSLGSGEYFIDVF
Subjt:  HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVF

Query:  VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT
        VGSPPKHFSLILDTGSDLNWIQCVPC+DCFEQNGPYYDPKDSISFRN+TCNDPRCQLVSSPDP QPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT
Subjt:  VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT

Query:  SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD
        SS TG SEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLL HPELNFTSLIGGKENPVD
Subjt:  SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD

Query:  TFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVW
        TFYYLQIKSIFVG E+L+I EENWNLSADG GGTIIDSGTTLSYFSDPAY+ IKEAFLRKVK YKLVEDFPILHPCYNVSG +KLEFPEF I FADGAVW
Subjt:  TFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVW

Query:  NFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
         FPVENYFIRIEQLDI CLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLGYAPMRCAEV
Subjt:  NFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

A0A6J1J0D9 protein ASPARTIC PROTEASE IN GUARD CELL 1-like1.5e-29088.41Show/hide
Query:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKL
        M+FLG Q+GS+RGFQN  V+L LI LLLFS +FD+I EAHV Q FN SN SG+FGIELPEN+SSGIA+SS SAPCSF NE EE+E E LM  SVK+SVKL
Subjt:  MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKL

Query:  HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVF
        HLKKRSTSR  EPKESITESAVRDLARIQTLH RI ERKNQDTTSRLK  N E+RK A  EAVSP ASP+S++ YFSGQLMATLESGVSLGSGEYFIDVF
Subjt:  HLKKRSTSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVF

Query:  VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT
        VGSPPKHFSLILDTGSDLNWIQCVPC+DCFEQ GPYYDPKDSISFRNITCNDPRCQLVSSPDP QPCK ETQSCPYFYWYGD SNTTGDFALETFTVNLT
Subjt:  VGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLT

Query:  SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD
        SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPEL FTSL GGKENPVD
Subjt:  SSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVD

Query:  TFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVW
        TFYYLQIKSIFVG EKLQIPEENW +SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK YKLVEDFPILHPCYNVS  DKLEFPEF IQFADGAVW
Subjt:  TFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVW

Query:  NFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
         FPVENYFIRIEQ D+VCLAMLGTPKSALSI+GNYQQQNFHILYDTKNSRLG+APMRCA+V
Subjt:  NFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-27.8e-6338.48Show/hide
Query:  QLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFY
        Q  + +E+ V  G GEY ++V +G+P   FS I+DTGSDL W QC PC  CF Q  P ++P+DS SF  + C    CQ +    PS+ C      C Y Y
Subjt:  QLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFY

Query:  WYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGED
         YGD S T G  A ETFT   +S          V N+ FGCG  N+G   G  AGL+G+G GPLS  SQL       FSYC+    S  S  S L  G  
Subjt:  WYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGED

Query:  RDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCY
           +     + T+LI    NP  T+YY+ ++ I VG + L IP   + L  DG GG IIDSGTTL+Y    AY  + +AF  ++    + E    L  C+
Subjt:  RDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCY

Query:  -NVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRC
           S    ++ PE  +QF DG V N   +N  I   +  ++CLAM  + +  +SI GN QQQ   +LYD +N  + + P +C
Subjt:  -NVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRC

Q766C3 Aspartic proteinase nepenthesin-13.9e-6238.62Show/hide
Query:  LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDS
        +E+ V  G GEY +++ +G+P + FS I+DTGSDL W QC PC  CF Q+ P ++P+ S SF  + C+   CQ +SSP  S         C Y Y YGD 
Subjt:  LESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDS

Query:  SNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLT
        S T G    ET T    S          + N+ FGCG  N+G   G  AGL+G+GRGPLS  SQL       FSYC+    S T   S L+ G   + +T
Subjt:  SNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHG-AAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLT

Query:  HPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNL-SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNV-S
            N T+LI   + P  TFYY+ +  + VG  +L I    + L S +G GG IIDSGTTL+YF + AY+ +++ F+ ++    +         C+   S
Subjt:  HPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNL-SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNV-S

Query:  GTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRC
            L+ P F + F DG     P ENYFI      ++CLAM G+    +SI GN QQQN  ++YDT NS + +A  +C
Subjt:  GTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 23.5e-6332.08Show/hide
Query:  HTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFE
        H R+  R  +DT           R +A+   +S    P S + Y      + + SG+  GSGEYF+ + VGSPP+   +++D+GSD+ W+QC PC  C++
Subjt:  HTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFE

Query:  QNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAA
        Q+ P +DP  S S+  ++C    C  + +          +  C Y   YGD S T G  ALET T   T           V NV  GCGH NRG+F GAA
Subjt:  QNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAA

Query:  GLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGA
        GLLG+G G +SF  QL    G +F YCLV R +D++ S  L+FG +         ++  L+     P  +FYY+ +K + VG  ++ +P+  ++L+  G 
Subjt:  GLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGA

Query:  GGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSI
        GG ++D+GT ++     AY   ++ F  +           I   CY++SG   +  P     F +G V   P  N+ + ++     C A   +P + LSI
Subjt:  GGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSI

Query:  MGNYQQQNFHILYDTKNSRLGYAPMRC
        +GN QQ+   + +D  N  +G+ P  C
Subjt:  MGNYQQQNFHILYDTKNSRLGYAPMRC

Q9LNJ3 Aspartyl protease family protein 27.3e-6934.52Show/hide
Query:  PENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANE-PKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKA
        P + S   AS  +  P S      E E ES        S+ L+L       +N+ P E  +    RD  R++++ T  A+   ++ T             
Subjt:  PENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANE-PKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKA

Query:  AVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQL
            A  P            G   +++ SG+S GSGEYF  + VG+P ++  ++LDTGSD+ W+QC PC  C+ Q+ P +DP+ S ++  I C+ P C+ 
Subjt:  AVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQL

Query:  VSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY
        + S      C    ++C Y   YGD S T GDF+ ET T              RV+ V  GCGH N GLF GAAGLLGLG+G LSF  Q    +   FSY
Subjt:  VSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY

Query:  CLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP-VDTFYYLQIKSIFVGREKLQ-IPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKE
        CLVDR++ +  SS ++FG   +        FT L+    NP +DTFYY+ +  I VG  ++  +    + L   G GG IIDSGT+++    PAY  +++
Subjt:  CLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENP-VDTFYYLQIKSIFVGREKLQ-IPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKE

Query:  AFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAP
        AF    K  K   DF +   C+++S  ++++ P   + F  GA  + P  NY I ++     C A  GT    LSI+GN QQQ F ++YD  +SR+G+AP
Subjt:  AFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAP

Query:  MRCA
          CA
Subjt:  MRCA

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.1e-6737.06Show/hide
Query:  TDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQ
        T Y +  L   + SG S GSGEYF  + VG+P K   L+LDTGSD+NWIQC PC DC++Q+ P ++P  S +++++TC+ P+C L+     +  C+  + 
Subjt:  TDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQ

Query:  SCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKL
         C Y   YGD S T G+ A +T    +T   +GK     + NV  GCGH N GLF GAAGLLGLG G LS ++Q+++    SFSYCLVDR+S  S S   
Subjt:  SCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKL

Query:  IFGEDRDLLTHPELNFTSLIGG-------KENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLR-KVKGY
                    + N   L GG       +   +DTFYY+ +    VG EK+ +P+  +++ A G+GG I+D GT ++     AY  +++AFL+  V   
Subjt:  IFGEDRDLLTHPELNFTSLIGG-------KENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLR-KVKGY

Query:  KLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRC
        K      +   CY+ S    ++ P     F  G   + P +NY I ++     C A   T  S+LSI+GN QQQ   I YD   + +G +  +C
Subjt:  KLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRC

Arabidopsis top hitse value%identityAlignment
AT1G25510.1 Eukaryotic aspartyl protease family protein6.6e-7334.3Show/hide
Query:  SNHSGIFGIELPE--NLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANEPKE--SITESAV-RDLARIQTLHTRIAERKNQD
        ++HS +F   LPE    ++ I + + S   +         ++   T S   S  L L  R + R  E  +  S+T + + RD AR+++L TR+    N  
Subjt:  SNHSGIFGIELPE--NLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANEPKE--SITESAV-RDLARIQTLHTRIAERKNQD

Query:  TTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDS
        + + LK              +S   + E         + A L SG + GSGEYF  V +G P +   ++LDTGSD+NW+QC PC DC+ Q  P ++P  S
Subjt:  TTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDS

Query:  ISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLS
         S+  ++C+ P+C  +   +    C+  T  C Y   YGD S T GDFA ET T+  T           V+NV  GCGH N GLF GAAGLLGLG G L+
Subjt:  ISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLS

Query:  FSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTL
          SQL +    SFSYCLVDR+SD+  +S + FG        P+     L+  + + +DTFYYL +  I VG E LQIP+ ++ +   G+GG IIDSGT +
Subjt:  FSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTL

Query:  SYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHI
        +      Y  ++++F++     +      +   CYN+S    +E P     F  G +   P +NY I ++ +   CLA   T  S+L+I+GN QQQ   +
Subjt:  SYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTPKSALSIMGNYQQQNFHI

Query:  LYDTKNSRLGYAPMRC
         +D  NS +G++  +C
Subjt:  LYDTKNSRLGYAPMRC

AT2G42980.1 Eukaryotic aspartyl protease family protein4.7e-18058.91Show/hide
Query:  NSKVFLTLILLLLFS-GLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANEPK
        ++K+ + L+ L+LFS   F         +  + S+   +F  +     SS  ASSS S  C F ++ E D  +    +SVK   ++   K+ T R     
Subjt:  NSKVFLTLILLLLFS-GLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANEPK

Query:  ESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDT
         S+ +  ++DL RI+TLH R  + K Q      KK  +    + VG   +P  SP        G+L+ATLESG++LGSGEYF+DV VG+PPKHFSLILDT
Subjt:  ESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDT

Query:  GSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN
        GSDLNW+QC+PCYDCF QNG +YDPK S SF+NITCNDPRC L+SSPDP   C+ + QSCPYFYWYGD SNTTGDFA+ETFTVNLT++  G SE+ +V N
Subjt:  GSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVEN

Query:  VMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGR
        +MFGCGHWNRGLF GA+GLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNS+T+VSSKLIFGED+DLL H  LNFTS + GKEN V+TFYY+QIKSI VG 
Subjt:  VMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGR

Query:  EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK-GYKLVEDFPILHPCYNVSGTDK--LEFPEFGIQFADGAVWNFPVENYFIRI
        + L IPEE WN+S+DG GGTIIDSGTTLSYF++PAY IIK  F  K+K  Y +  DFP+L PC+NVSG ++  +  PE GI F DG VWNFP EN FI +
Subjt:  EKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVK-GYKLVEDFPILHPCYNVSGTDK--LEFPEFGIQFADGAVWNFPVENYFIRI

Query:  EQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
         + D+VCLA+LGTPKS  SI+GNYQQQNFHILYDTK SRLG+ P +CA++
Subjt:  EQLDIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

AT3G25700.1 Eukaryotic aspartyl protease family protein2.8e-7140.67Show/hide
Query:  SGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQN-GPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFET--QSCPYFYWYGD
        SG + GSG+YF+D+ +G PP+   LI DTGSDL W++C  C +C   +    + P+ S +F    C DP C+LV  PD +  C       +C Y Y Y D
Subjt:  SGVSLGSGEYFIDVFVGSPPKHFSLILDTGSDLNWIQCVPCYDCFEQN-GPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFET--QSCPYFYWYGD

Query:  SSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRG------LFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGE
         S T+G FA ET     TS  T   +  R+++V FGCG    G       F+GA G++GLGRGP+SF+SQL   +G+ FSYCL+D       +S LI G 
Subjt:  SSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRG------LFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGE

Query:  DRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPC
          D ++  +L FT L+    +P  TFYY+++KS+FV   KL+I    W +   G GGT++DSGTTL++ ++PAYR +  A  R+VK        P    C
Subjt:  DRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPC

Query:  YNVSGTDKLE--FPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGT-PKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCA
         NVSG  K E   P    +F+ GAV+  P  NYFI  E+  I CLA+    PK   S++GN  QQ F   +D   SRLG++   CA
Subjt:  YNVSGTDKLE--FPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGT-PKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCA

AT3G59080.1 Eukaryotic aspartyl protease family protein7.0e-20063.8Show/hide
Query:  SKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANE-PKE
        SK    L L+  F   F     A +       N SG  GI+ P  +  G ASSS S  C F +  +E  +E        ++VK HLK+R T+   +    
Subjt:  SKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANE-PKE

Query:  SITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG
        S+ E  +RDL RIQTLH R+ E+ NQ+T S+ +K N ++        V  T    S  +  +GQL+ATLESG++LGSGEYF+DV VGSPPKHFSLILDTG
Subjt:  SITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG

Query:  SDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENV
        SDLNWIQC+PCYDCF+QNG +YDPK S S++NITCND RC LVSSPDP  PCK + QSCPY+YWYGDSSNTTGDFA+ETFTVNLT++  G SE   VEN+
Subjt:  SDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENV

Query:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGRE
        MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT+VSSKLIFGED+DLL+HP LNFTS + GKEN VDTFYY+QIKSI V  E
Subjt:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGRE

Query:  KLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-YKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQL
         L IPEE WN+S+DGAGGTIIDSGTTLSYF++PAY  IK     K KG Y +  DFPIL PC+NVSG   ++ PE GI FADGAVWNFP EN FI + + 
Subjt:  KLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-YKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQL

Query:  DIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        D+VCLAMLGTPKSA SI+GNYQQQNFHILYDTK SRLGYAP +CA++
Subjt:  DIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV

AT3G59080.2 Eukaryotic aspartyl protease family protein2.9e-17759.23Show/hide
Query:  SKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANE-PKE
        SK    L L+  F   F     A +       N SG  GI+ P  +  G ASSS S  C F +  +E  +E        ++VK HLK+R T+   +    
Subjt:  SKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKRSTSRANE-PKE

Query:  SITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG
        S+ E  +RDL RIQTLH R+ E+ NQ+T S+ +K N ++        V  T    S  +  +GQL+ATLESG++LGSGEYF+DV VGSPPKHFSLILDTG
Subjt:  SITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSLILDTG

Query:  SDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENV
        SDLNWIQC+PCYDCF+QN                                    + QSCPY+YWYGDSSNTTGDFA+ETFTVNLT++  G SE   VEN+
Subjt:  SDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENV

Query:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGRE
        MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT+VSSKLIFGED+DLL+HP LNFTS + GKEN VDTFYY+QIKSI V  E
Subjt:  MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGRE

Query:  KLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-YKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQL
         L IPEE WN+S+DGAGGTIIDSGTTLSYF++PAY  IK     K KG Y +  DFPIL PC+NVSG   ++ PE GI FADGAVWNFP EN FI + + 
Subjt:  KLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG-YKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQL

Query:  DIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV
        D+VCLAMLGTPKSA SI+GNYQQQNFHILYDTK SRLGYAP +CA++
Subjt:  DIVCLAMLGTPKSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTCTCGGTAACCAAACAGGCAGTAGCAGAGGATTTCAGAATTCCAAAGTGTTTCTTACATTGATTTTACTTTTGCTTTTCTCTGGCTTATTTGATTCG
ATTGTTGAAGCGCATGTTCCTCAAAAATTCAACAACTCCAATCACTCTGGTATTTTCGGAATCGAATTGCCGGAAAATCTTAGCTCCGGTATTGCTTCTTCATCC
GCCAGCGCTCCTTGTAGCTTTGGAAATGAAGGTGAAGAAGATGAGAGAGAGAGTTTAATGACGGATTCAGTGAAGCAATCGGTGAAGCTTCACTTGAAAAAGCGG
TCAACGAGTCGAGCGAACGAACCAAAGGAATCGATTACTGAATCTGCAGTTAGGGATTTGGCGAGAATCCAGACGCTTCATACCAGAATCGCCGAGAGGAAGAAT
CAAGATACGACTTCGAGATTGAAGAAGAGCAATGTCGAGCAAAGGAAGGCTGCGGTGGGGGAGGCAGTTTCTCCGACCGCATCGCCAGAATCTCATACCGATTAC
TTCTCCGGTCAGCTTATGGCGACTTTGGAATCCGGCGTCAGTCTCGGCTCTGGTGAGTACTTTATCGACGTCTTCGTCGGTTCTCCGCCGAAACACTTCTCTCTG
ATTCTCGATACTGGAAGCGATTTGAACTGGATTCAATGTGTACCTTGCTACGATTGTTTCGAGCAAAACGGACCTTATTACGATCCTAAAGATTCAATTTCTTTC
AGAAACATAACCTGTAACGATCCTCGATGTCAATTAGTTTCGTCTCCAGATCCTTCGCAGCCGTGCAAATTCGAGACGCAATCGTGCCCTTATTTTTACTGGTAC
GGCGACAGTTCGAACACTACCGGCGATTTCGCGCTTGAAACGTTCACTGTCAATCTGACCTCGTCGACGACGGGGAAATCGGAGTTCCGTCGAGTGGAGAATGTT
ATGTTCGGATGCGGCCATTGGAACAGAGGTCTCTTCCATGGCGCCGCCGGACTATTAGGGCTCGGCCGAGGACCTCTCTCTTTCTCATCGCAGCTTCAATCGCTC
TATGGTCATTCCTTCTCCTACTGTCTCGTCGATCGAAACAGCGATACCAGCGTGAGCAGCAAGCTGATCTTCGGCGAAGACAGAGACTTATTAACTCATCCAGAA
CTGAATTTCACATCTCTAATCGGCGGAAAGGAAAATCCAGTCGATACATTCTACTACCTACAAATCAAATCGATCTTCGTTGGAAGAGAGAAACTCCAAATCCCG
GAGGAGAATTGGAACCTCTCCGCCGACGGCGCCGGTGGAACAATCATCGATTCCGGCACAACTCTCAGCTATTTCTCCGATCCGGCTTACCGGATCATCAAGGAA
GCATTCTTGAGGAAAGTGAAAGGCTATAAACTAGTTGAAGATTTTCCGATCTTACATCCTTGCTACAACGTCTCCGGCACCGATAAACTGGAATTTCCAGAATTC
GGAATTCAGTTCGCCGATGGCGCCGTGTGGAACTTTCCGGTGGAGAATTACTTCATAAGAATCGAGCAATTGGATATCGTTTGCTTAGCGATGTTAGGAACTCCA
AAATCAGCACTGTCGATCATGGGAAATTACCAGCAGCAGAATTTCCACATATTGTACGATACGAAGAATTCAAGACTGGGCTACGCGCCGATGAGATGTGCTGAA
GTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTCTCGGTAACCAAACAGGCAGTAGCAGAGGATTTCAGAATTCCAAAGTGTTTCTTACATTGATTTTACTTTTGCTTTTCTCTGGCTTATTTGATTCG
ATTGTTGAAGCGCATGTTCCTCAAAAATTCAACAACTCCAATCACTCTGGTATTTTCGGAATCGAATTGCCGGAAAATCTTAGCTCCGGTATTGCTTCTTCATCC
GCCAGCGCTCCTTGTAGCTTTGGAAATGAAGGTGAAGAAGATGAGAGAGAGAGTTTAATGACGGATTCAGTGAAGCAATCGGTGAAGCTTCACTTGAAAAAGCGG
TCAACGAGTCGAGCGAACGAACCAAAGGAATCGATTACTGAATCTGCAGTTAGGGATTTGGCGAGAATCCAGACGCTTCATACCAGAATCGCCGAGAGGAAGAAT
CAAGATACGACTTCGAGATTGAAGAAGAGCAATGTCGAGCAAAGGAAGGCTGCGGTGGGGGAGGCAGTTTCTCCGACCGCATCGCCAGAATCTCATACCGATTAC
TTCTCCGGTCAGCTTATGGCGACTTTGGAATCCGGCGTCAGTCTCGGCTCTGGTGAGTACTTTATCGACGTCTTCGTCGGTTCTCCGCCGAAACACTTCTCTCTG
ATTCTCGATACTGGAAGCGATTTGAACTGGATTCAATGTGTACCTTGCTACGATTGTTTCGAGCAAAACGGACCTTATTACGATCCTAAAGATTCAATTTCTTTC
AGAAACATAACCTGTAACGATCCTCGATGTCAATTAGTTTCGTCTCCAGATCCTTCGCAGCCGTGCAAATTCGAGACGCAATCGTGCCCTTATTTTTACTGGTAC
GGCGACAGTTCGAACACTACCGGCGATTTCGCGCTTGAAACGTTCACTGTCAATCTGACCTCGTCGACGACGGGGAAATCGGAGTTCCGTCGAGTGGAGAATGTT
ATGTTCGGATGCGGCCATTGGAACAGAGGTCTCTTCCATGGCGCCGCCGGACTATTAGGGCTCGGCCGAGGACCTCTCTCTTTCTCATCGCAGCTTCAATCGCTC
TATGGTCATTCCTTCTCCTACTGTCTCGTCGATCGAAACAGCGATACCAGCGTGAGCAGCAAGCTGATCTTCGGCGAAGACAGAGACTTATTAACTCATCCAGAA
CTGAATTTCACATCTCTAATCGGCGGAAAGGAAAATCCAGTCGATACATTCTACTACCTACAAATCAAATCGATCTTCGTTGGAAGAGAGAAACTCCAAATCCCG
GAGGAGAATTGGAACCTCTCCGCCGACGGCGCCGGTGGAACAATCATCGATTCCGGCACAACTCTCAGCTATTTCTCCGATCCGGCTTACCGGATCATCAAGGAA
GCATTCTTGAGGAAAGTGAAAGGCTATAAACTAGTTGAAGATTTTCCGATCTTACATCCTTGCTACAACGTCTCCGGCACCGATAAACTGGAATTTCCAGAATTC
GGAATTCAGTTCGCCGATGGCGCCGTGTGGAACTTTCCGGTGGAGAATTACTTCATAAGAATCGAGCAATTGGATATCGTTTGCTTAGCGATGTTAGGAACTCCA
AAATCAGCACTGTCGATCATGGGAAATTACCAGCAGCAGAATTTCCACATATTGTACGATACGAAGAATTCAAGACTGGGCTACGCGCCGATGAGATGTGCTGAA
GTTTAA
Protein sequenceShow/hide protein sequence
MDFLGNQTGSSRGFQNSKVFLTLILLLLFSGLFDSIVEAHVPQKFNNSNHSGIFGIELPENLSSGIASSSASAPCSFGNEGEEDERESLMTDSVKQSVKLHLKKR
STSRANEPKESITESAVRDLARIQTLHTRIAERKNQDTTSRLKKSNVEQRKAAVGEAVSPTASPESHTDYFSGQLMATLESGVSLGSGEYFIDVFVGSPPKHFSL
ILDTGSDLNWIQCVPCYDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPSQPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENV
MFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTSVSSKLIFGEDRDLLTHPELNFTSLIGGKENPVDTFYYLQIKSIFVGREKLQIP
EENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDKLEFPEFGIQFADGAVWNFPVENYFIRIEQLDIVCLAMLGTP
KSALSIMGNYQQQNFHILYDTKNSRLGYAPMRCAEV