; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G10940 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G10940
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAspartic proteinase CDR1-like
Genome locationClcChr07:25564830..25569267
RNA-Seq ExpressionClc07G10940
SyntenyClc07G10940
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044968.1 putative aspartic protease [Cucumis melo var. makuwa]9.7e-20081.94Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        M  ISIFFYFLLFF S+AT +GG  +GFTTSLFHRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTP 
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDTCRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV NKRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
         DMSAMT +GNIIIDSGTTLT LPR+LY GVVSTLA VIKAKRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA VA+NV CLTLAP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ
        A+++AIFGNLAQINF VGYDL  KRLSFKPT+
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ

KAA3468560.1 aspartic proteinase CDR1-like [Gossypium australe]2.5e-18745.66Show/hide
Query:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN
        GF+  LFHRD++ SP +NP  +  DR+TNA RRSF+R    FK   +V T   +S +  DSGE+LM +S+GTP  D +AIADTGSDL WTQC PC +CF 
Subjt:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN

Query:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSF-----KLSKTLIGCGHQNGGTFGGVTSGIIGLGGG
        Q    F+P +SS+YR +SC++  C  L+   C  D  +C Y  SYGD SF+ GDLA+D +T+ S         KT+IGCG  NGGTF   TSGIIGLGGG
Subjt:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSF-----KLSKTLIGCGHQNGGTFGGVTSGIIGLGGG

Query:  ALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLT
         +SL+SQ+ T  A K  FSYCL    S A  + KI+FG NA+VSGP V+STPLV KSPDTFYFLTLEAI+V  KR + T   S  ++ GNIIIDSGTTLT
Subjt:  ALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLT

Query:  FLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDL
         LP + Y+ V S + S I AKR++ P G L LCY A   ++  IP +  HF   AD+KL PLNTF  V++   C + +   D+AI+GNL+Q++F++GYD 
Subjt:  FLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDL

Query:  EQKRLSFKPTQPLMVTMALPLLFSTAIPFFKSHLSPITTISPMPSAAHRA----------ATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSD
                 TQ   V++ L    S   PF+       TT   + +A  R+          + TT      I   +GEYL+++S+GTP    + +ADTGSD
Subjt:  EQKRLSFKPTQPLMVTMALPLLFSTAIPFFKSHLSPITTISPMPSAAHRA----------ATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSD

Query:  LTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCHALDVANCG--VQGICDYSYTYGDQTYTKGDLGFDKITFGSSS------VNSVIGCGHQSGG
        L WTQC PC +CF Q  P+F+P KSS++R + C+S  C  +    C       C YS TYGD +++KGD+ +D +T GS++       +++IGCG+ + G
Subjt:  LTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCHALDVANCG--VQGICDYSYTYGDQTYTKGDLGFDKITFGSSS------VNSVIGCGHQSGG

Query:  GF-GYASGVIGLGGGELSLVSQMSQNAAVSRRFSYC-LPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNPNTYYYMTLEAISIGN---ECHMVDMSSK
         F G ASG+IGLGGGE+SL++Q+   + ++ +FSYC LP      + K+NFG NA+VSGPG VSTPL+ K+PNT+Y++TL+AIS+G    E     + + 
Subjt:  GF-GYASGVIGLGGGELSLVSQMSQNAAVSRRFSYC-LPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNPNTYYYMTLEAISIGN---ECHMVDMSSK

Query:  QGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVKLLPVNTFMKVAKNVSCLTLTAASPRDGF
        +GN++ID+GTTLT++P + Y  + S++       R + P GF  LC+ A  +      P +T HFA  ADVKL  +NTF+KV    +C    A SP    
Subjt:  QGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVKLLPVNTFMKVAKNVSCLTLTAASPRDGF

Query:  GIWGNLVQTNFLVGYDLEVRRLSFKPTICA
         I+GNL Q NFL+GYD + + +SFKPT C+
Subjt:  GIWGNLVQTNFLVGYDLEVRRLSFKPTICA

XP_004149004.1 probable aspartic protease At2g35615 [Cucumis sativus]6.9e-19881.4Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        MAAISIFFYFLLFF S+ T +GG  +GFTTSLF RD+ LSPLHNPSLSRYD + +AFRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTPP
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+ IAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC SDTCRSL+SYHCGPDLQ+CSYGYSYGDRSFTYGDLASD+ITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM TIA +K +FSYCLPTFFSNANITG ISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV  KRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
        N +SAMT  GNIIIDSGTTLT LPR+LY GV STLA VIKAKRVDDP+GILELCY+AG V+DLNIP+I AHF GGADVKLLP+NTFA VA+NV+CLT AP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKP
        A+ +AIFGNLAQINF VGYDL  KRLSF+P
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKP

XP_008452150.1 PREDICTED: probable aspartic protease At2g35615 [Cucumis melo]2.2e-19981.71Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        M AISIFFYFLLFF S+AT +GG  +GFTTSL+HRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTP 
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDTCRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV NKRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
         DMSAMT +GNIIIDSGTTLT LPR+LY GVVSTLA VIK KRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA VA+NV CLTLAP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ
        A+++AIFGNLAQINF VGYDL  KRLSFKPT+
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ

XP_038889220.1 aspartic proteinase CDR1-like [Benincasa hispida]4.0e-21488.63Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        MAAISIFFYFLLF F+EATTN G GNGFTTSLFHRD+LLSPLHN SLS +DR TNAFRRSFSRSATL  HV  VSTACI SPIIP+SGEFLMSVSIGTPP
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        VDFIAIADTGSDLTWTQCLPCQ+CFNQS  +FNPRRSSSYRNVSCTSDTCRSLDSYHCG DLQTCSYGYSYGDRSFTYGDLASDKITI SFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFS+ NITGKISFGQNAVVSGPKVISTPLV++SPDTFYFLTLEAISVANKR +A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
        +D SA+T+RGNIIIDSGTTLTFLPRNLY  +VSTL SVIKAKRVDDP+GILELCY AG  +DL+IPVI AHF GGADVKLLPLNTFALVAENV+CLTLAP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPT
        ASDLAIFGNLAQINF+VGYDLE KRLSFKPT
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPT

TrEMBL top hitse value%identityAlignment
A0A0A0KZZ3 Peptidase A1 domain-containing protein3.4e-19881.4Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        MAAISIFFYFLLFF S+ T +GG  +GFTTSLF RD+ LSPLHNPSLSRYD + +AFRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTPP
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+ IAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC SDTCRSL+SYHCGPDLQ+CSYGYSYGDRSFTYGDLASD+ITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM TIA +K +FSYCLPTFFSNANITG ISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV  KRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
        N +SAMT  GNIIIDSGTTLT LPR+LY GV STLA VIKAKRVDDP+GILELCY+AG V+DLNIP+I AHF GGADVKLLP+NTFA VA+NV+CLT AP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKP
        A+ +AIFGNLAQINF VGYDL  KRLSF+P
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKP

A0A1S3BT75 probable aspartic protease At2g356151.0e-19981.71Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        M AISIFFYFLLFF S+AT +GG  +GFTTSL+HRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTP 
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDTCRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV NKRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
         DMSAMT +GNIIIDSGTTLT LPR+LY GVVSTLA VIK KRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA VA+NV CLTLAP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ
        A+++AIFGNLAQINF VGYDL  KRLSFKPT+
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ

A0A5A7TPZ5 Putative aspartic protease4.7e-20081.94Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        M  ISIFFYFLLFF S+AT +GG  +GFTTSLFHRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTP 
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDTCRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV NKRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
         DMSAMT +GNIIIDSGTTLT LPR+LY GVVSTLA VIKAKRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA VA+NV CLTLAP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ
        A+++AIFGNLAQINF VGYDL  KRLSFKPT+
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ

A0A5B6VH54 Aspartic proteinase CDR1-like1.2e-18745.66Show/hide
Query:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN
        GF+  LFHRD++ SP +NP  +  DR+TNA RRSF+R    FK   +V T   +S +  DSGE+LM +S+GTP  D +AIADTGSDL WTQC PC +CF 
Subjt:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN

Query:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSF-----KLSKTLIGCGHQNGGTFGGVTSGIIGLGGG
        Q    F+P +SS+YR +SC++  C  L+   C  D  +C Y  SYGD SF+ GDLA+D +T+ S         KT+IGCG  NGGTF   TSGIIGLGGG
Subjt:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSF-----KLSKTLIGCGHQNGGTFGGVTSGIIGLGGG

Query:  ALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLT
         +SL+SQ+ T  A K  FSYCL    S A  + KI+FG NA+VSGP V+STPLV KSPDTFYFLTLEAI+V  KR + T   S  ++ GNIIIDSGTTLT
Subjt:  ALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLT

Query:  FLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDL
         LP + Y+ V S + S I AKR++ P G L LCY A   ++  IP +  HF   AD+KL PLNTF  V++   C + +   D+AI+GNL+Q++F++GYD 
Subjt:  FLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDL

Query:  EQKRLSFKPTQPLMVTMALPLLFSTAIPFFKSHLSPITTISPMPSAAHRA----------ATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSD
                 TQ   V++ L    S   PF+       TT   + +A  R+          + TT      I   +GEYL+++S+GTP    + +ADTGSD
Subjt:  EQKRLSFKPTQPLMVTMALPLLFSTAIPFFKSHLSPITTISPMPSAAHRA----------ATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSD

Query:  LTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCHALDVANCG--VQGICDYSYTYGDQTYTKGDLGFDKITFGSSS------VNSVIGCGHQSGG
        L WTQC PC +CF Q  P+F+P KSS++R + C+S  C  +    C       C YS TYGD +++KGD+ +D +T GS++       +++IGCG+ + G
Subjt:  LTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCHALDVANCG--VQGICDYSYTYGDQTYTKGDLGFDKITFGSSS------VNSVIGCGHQSGG

Query:  GF-GYASGVIGLGGGELSLVSQMSQNAAVSRRFSYC-LPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNPNTYYYMTLEAISIGN---ECHMVDMSSK
         F G ASG+IGLGGGE+SL++Q+   + ++ +FSYC LP      + K+NFG NA+VSGPG VSTPL+ K+PNT+Y++TL+AIS+G    E     + + 
Subjt:  GF-GYASGVIGLGGGELSLVSQMSQNAAVSRRFSYC-LPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNPNTYYYMTLEAISIGN---ECHMVDMSSK

Query:  QGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVKLLPVNTFMKVAKNVSCLTLTAASPRDGF
        +GN++ID+GTTLT++P + Y  + S++       R + P GF  LC+ A  +      P +T HFA  ADVKL  +NTF+KV    +C    A SP    
Subjt:  QGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVKLLPVNTFMKVAKNVSCLTLTAASPRDGF

Query:  GIWGNLVQTNFLVGYDLEVRRLSFKPTICA
         I+GNL Q NFL+GYD + + +SFKPT C+
Subjt:  GIWGNLVQTNFLVGYDLEVRRLSFKPTICA

A0A5D3D1Z7 Putative aspartic protease1.0e-19981.71Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        M AISIFFYFLLFF S+AT +GG  +GFTTSL+HRD+LLSPLHNPSLSRYD +  +FRRSFSRSATL  H+T+VSTACI+SPIIPDSGEFLMS+ IGTP 
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC
        V+FIAIADTGSDLTWTQCLPC+ECFNQS+ IFNPRRSSSYR VSC+SDTCRSL+S HCG DL++CSYGYSYGDRSFTYGDLASDKITIGSFKL KT+IGC
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGC

Query:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT
        GHQNGGTFGGVTSGIIGLGGG+LSLVSQM+TIA +K QFSYCLPTFFSN NITGKISFG+ AVVSG +V+STPLV +SPDTFYFLTLEAISV NKRF+A 
Subjt:  GHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEAT

Query:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP
         DMSAMT +GNIIIDSGTTLT LPR+LY GVVSTLA VIK KRVDDP+GILELCY+AG +EDLNIP+I AHF G ADVKLLP+NTFA VA+NV CLTLAP
Subjt:  NDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAP

Query:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ
        A+++AIFGNLAQINF VGYDL  KRLSFKPT+
Subjt:  ASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQ

SwissProt top hitse value%identityAlignment
Q3EBM5 Probable aspartic protease At2g356158.2e-9345.09Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        MA   +  +FL  FFS   ++ G    F+  L HRD+ LSP++NP ++  DR+  AF RS SRS   F H   +S   +QS +I   GEF MS++IGTPP
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK---
        +   AIADTGSDLTW QC PCQ+C+ ++  IF+ ++SS+Y++  C S  C++L S    C      C Y YSYGD+SF+ GD+A++ ++I S   S    
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK---

Query:  --TLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG----PKVISTPLVAKSPDTFYFLTLEA
          T+ GCG+ NGGTF    SGIIGLGGG LSL+SQ+   ++I ++FSYCL    +  N T  I+ G N++ S       V+STPLV K P T+Y+LTLEA
Subjt:  --TLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG----PKVISTPLVAKSPDTFYFLTLEA

Query:  ISVANKRFEAT------NDMSAMTK-RGNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLL
        ISV  K+   T      ND   +++  GNIIIDSGTTLT L    +    S +  SV  AKRV DP G+L  C+ +GS E + +P I  HF  GADV+L 
Subjt:  ISVANKRFEAT------NDMSAMTK-RGNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLL

Query:  PLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFK
        P+N F  ++E++ CL++ P +++AI+GN AQ++F+VGYDLE + +SF+
Subjt:  PLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFK

Q6XBF8 Aspartic proteinase CDR13.0e-9547.33Show/hide
Query:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN
        GFT  L HRD+  SP +NP  +   R+ NA  RS +R   +F      +T   Q  +  +SGE+LM+VSIGTPP   +AIADTGSDL WTQC PC +C+ 
Subjt:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN

Query:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSY-HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQNGGTFGGVTSGIIGLGG
        Q   +F+P+ SS+Y++VSC+S  C +L++   C  +  TCSY  SYGD S+T G++A D +T+GS      +L   +IGCGH N GTF    SGI+GLGG
Subjt:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSY-HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQNGGTFGGVTSGIIGLGG

Query:  GALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAK-SPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTT
        G +SL+ Q+    +I  +FSYCL    S  + T KI+FG NA+VSG  V+STPL+AK S +TFY+LTL++ISV +K+ + +   S  +  GNIIIDSGTT
Subjt:  GALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAK-SPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTT

Query:  LTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGY
        LT LP   Y+ +   +AS I A++  DP   L LCY+A    DL +PVI  HF  GADVKL   N F  V+E++ C     +   +I+GN+AQ+NF+VGY
Subjt:  LTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGY

Query:  DLEQKRLSFKPT
        D   K +SFKPT
Subjt:  DLEQKRLSFKPT

Q766C2 Aspartic proteinase nepenthesin-23.0e-5536.39Show/hide
Query:  SLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSC
        +L++Y+ I  A +R   R  ++  +    S++ I++P+    GE+LM+V+IGTP   F AI DTGSDL WTQC PC +CF+Q   IFNP+ SSS+  + C
Subjt:  SLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSC

Query:  TSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPT
         S  C+ L S  C  +   C Y Y YGD S T G +A++  T  +  +     GCG  N G   G  +G+IG+G G LSL SQ+        QFSYC+ +
Subjt:  TSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPT

Query:  FFSNANITGKISFGQNAVVSGPKVISTPLVAKSPD-TFYFLTLEAISVANKRFEATNDMSAMTK--RGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAK
        + S++  T  +    + V  G    ST L+  S + T+Y++TL+ I+V        +    +     G +IIDSGTTLT+LP++ Y  V       I   
Subjt:  FFSNANITGKISFGQNAVVSGPKVISTPLVAKSPD-TFYFLTLEAISVANKRFEATNDMSAMTK--RGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAK

Query:  RVDDPAGILELCYTAGS-VEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDL--AIFGNLAQINFVVGYDLEQKRLSFKPTQ
         VD+ +  L  C+   S    + +P I+  F GG  + L   N     AE V CL +  +S L  +IFGN+ Q    V YDL+   +SF PTQ
Subjt:  RVDDPAGILELCYTAGS-VEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDL--AIFGNLAQINFVVGYDLEQKRLSFKPTQ

Q766C3 Aspartic proteinase nepenthesin-12.9e-5838.33Show/hide
Query:  VQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSDLTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCHALDVANCGVQGICDYSYTYGDQTYTKG
        V++ +  G GEYL+++SIGTP   +  I DTGSDL WTQC PC +CFNQS PIFNP  SSSF  LPC+SQ C AL    C     C Y+Y YGD + T+G
Subjt:  VQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSDLTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCHALDVANCGVQGICDYSYTYGDQTYTKG

Query:  DLGFDKITFGSSSV-NSVIGCGHQSGG-GFGYASGVIGLGGGELSLVSQMSQNAAVSRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNP-N
         +G + +TFGS S+ N   GCG  + G G G  +G++G+G G LSL SQ+        +FSYC+  + S     +  G  A     G  +T L+  +   
Subjt:  DLGFDKITFGSSSV-NSVIGCGHQSGG-GFGYASGVIGLGGGELSLVSQMSQNAAVSRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNP-N

Query:  TYYYMTLEAISIGNECHMVDMS-------SKQGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGA
        T+YY+TL  +S+G+    +D S       +  G +IID+GTTLT      Y  V    +  +    V        LCF        L IP    HF GG 
Subjt:  TYYYMTLEAISIGNECHMVDMS-------SKQGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGA

Query:  DVKLLPVNTFMKVAKNVSCLTLTAASPRDGFGIWGNLVQTNFLVGYDLEVRRLSFKPTIC
        D++L   N F+  +  + CL + ++S   G  I+GN+ Q N LV YD     +SF    C
Subjt:  DVKLLPVNTFMKVAKNVSCLTLTAASPRDGFGIWGNLVQTNFLVGYDLEVRRLSFKPTIC

Q8S9J6 Aspartyl protease family protein At5g107709.5e-4938.53Show/hide
Query:  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPC-QECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSY--HCGP-DLQTCSYGYSYGDRSFTYGDLA
        SG ++++V +GTP  D   I DTGSDLTWTQC PC + C++Q   IFNP +S+SY NVSC+S  C SL S   + G      C YG  YGD+SF+ G LA
Subjt:  SGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPC-QECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSY--HCGP-DLQTCSYGYSYGDRSFTYGDLA

Query:  SDKITI-GSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDT
         +K T+  S        GCG  N G F GV +G++GLG   LS  SQ  T  A  + FSYCLP   S+A+ TG ++FG   +    K   TP+   +  T
Subjt:  SDKITI-GSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDT

Query:  -FYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKL
         FY L + AI+V  ++       S +      +IDSGT +T LP   YA + S+  + +          IL+ C+     + + IP +A  F GGA V+L
Subjt:  -FYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKL

Query:  LPLNTFALVAENVSCLTLAPASD---LAIFGNLAQINFVVGYDLEQKRLSFKP
             F +   +  CL  A  SD    AIFGN+ Q    V YD    R+ F P
Subjt:  LPLNTFALVAENVSCLTLAPASD---LAIFGNLAQINFVVGYDLEQKRLSFKP

Arabidopsis top hitse value%identityAlignment
AT1G31450.1 Eukaryotic aspartyl protease family protein1.6e-9144.47Show/hide
Query:  FFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSD
        FFF  A+ +  +    T  L HRD+  SPL+NP  +  DR+  AF RS SRS    +  TT +   +QS +I + GE+ MS+SIGTPP    AIADTGSD
Subjt:  FFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSD

Query:  LTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK-----TLIGCGHQNG
        LTW QC PCQ+C+ Q+  +F+ ++SS+Y+  SC S TC++L  +   C      C Y YSYGD SFT GD+A++ I+I S   S      T+ GCG+ NG
Subjt:  LTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK-----TLIGCGHQNG

Query:  GTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPK----VISTPLVAKSPDTFYFLTLEAISVANKRFEATN
        GTF    SGIIGLGGG LSLVSQ+   ++I ++FSYCL    +  N T  I+ G N++ S P      ++TPL+ K P+T+YFLTLEA++V   +   T 
Subjt:  GTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPK----VISTPLVAKSPDTFYFLTLEAISVANKRFEATN

Query:  -----DMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSC
             +  +  + GNIIIDSGTTLT L    Y    + +  SV  AKRV DP G+L  C+ +G  +++ +P I  HF   ADVKL P+N F  + E+  C
Subjt:  -----DMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSC

Query:  LTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFK
        L++ P +++AI+GN+ Q++F+VGYDLE K +SF+
Subjt:  LTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFK

AT1G64830.1 Eukaryotic aspartyl protease family protein8.7e-10650.23Show/hide
Query:  SIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFI
        S+ F  LL     +  N    +GFT  L HRD+  SP +N + +   R+ NA RRS +RS   F +    S    QS I  + GE+LM++SIGTPPV  +
Subjt:  SIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFI

Query:  AIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIG
        AIADTGSDL WTQC PC++C+ Q+  +F+P+ SS+YR VSC+S  CR+L+   C  D  TCSY  +YGD S+T GD+A D +T+GS       L   +IG
Subjt:  AIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIG

Query:  CGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEA
        CGH+N GTF    SGIIGLGGG+ SLVSQ+    +I  +FSYCL  F S   +T KI+FG N +VSG  V+ST +V K P T+YFL LEAISV +K+ + 
Subjt:  CGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEA

Query:  TNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLA
        T+ +   T  GNI+IDSGTTLT LP N Y  + S +AS IKA+RV DP GIL LCY   S     +P I  HF GG DVKL  LNTF  V+E+VSC   A
Subjt:  TNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLA

Query:  PASDLAIFGNLAQINFVVGYDLEQKRLSFKPT
            L IFGNLAQ+NF+VGYD     +SFK T
Subjt:  PASDLAIFGNLAQINFVVGYDLEQKRLSFKPT

AT2G28220.1 Eukaryotic aspartyl protease family protein8.1e-10434.04Show/hide
Query:  RRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH
        RRS S S  L K+   +  A   +  + D   +LM + +GTPP +  A  DTGSDL WTQC+PC +C++Q   IF+P +SS++    C            
Subjt:  RRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH

Query:  CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQN----GGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFS
             ++C Y   Y D +++ G LA++ +TI S     F +++T IGCG  N       F   +SGI+GL  G  SL+SQM+         SYC    FS
Subjt:  CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQN----GGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFS

Query:  NANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPA
            T KI+FG NA+V+G   ++  +  K  + FY+L L+A+SV + R E T       + GNI+IDSG+T+T+ P +    V   +  V+ A RV DP+
Subjt:  NANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAGVVSTLASVIKAKRVDDPA

Query:  GILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTF-ALVAENVSCLTL---APASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQPLMVTMALPLLF
        G   LCY + +++    PVI  HF GGAD+ L   N +    +  + CL +   +P  + AIFGN AQ NF+VGYD                  +  LL 
Subjt:  GILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTF-ALVAENVSCLTL---APASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQPLMVTMALPLLF

Query:  STAIPFFKSHLSPITTISPMPSAAHRAATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSDLTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPC
          A P+                        T+   S        YL+ + +GTPP + +   DTGSD+ WTQC+PC  C++Q  PIF+P KSS+FR   C
Subjt:  STAIPFFKSHLSPITTISPMPSAAHRAATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSDLTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPC

Query:  TSQFCHALDVANCGVQGICDYSYTYGDQTYTKGDLGFDKITFGSSS------VNSVIGCG-----HQSGGGFGYASGVIGLGGGELSLVSQMSQNAAVSR
            CH              Y   Y D+TY+KG L  + +T  S+S        + IGCG      Q  G    +SG++GL  G LSL+SQM  +     
Subjt:  TSQFCHALDVANCGVQGICDYSYTYGDQTYTKGDLGFDKITFGSSS------VNSVIGCG-----HQSGGGFGYASGVIGLGGGELSLVSQMSQNAAVSR

Query:  RFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNPNTYYYMTLEAISIGNECHM---VDMSSKQGNMIIDTGTTLTILPKELYDGVVSSLLKIVK
          SYC          KINFG NA+V+G G V+  +  K  N +YY+ L+A+S+ +           ++ GN+ ID+GTTLT  P    + V  ++ ++V 
Subjt:  RFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLVPKNPNTYYYMTLEAISIGNECHM---VDMSSKQGNMIIDTGTTLTILPKELYDGVVSSLLKIVK

Query:  ARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVKLLPVNTFMK-VAKNVSCLTLTAASPRDGFGIWGNLVQTNFLVGYDLEVRRLSFKPTICA
        A +V D G    LC+  Y  +  +  P+IT HF+GGAD+ L   N +++ +   + CL +    P     ++GN  Q NFLVGYD     +SF PT C+
Subjt:  ARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVKLLPVNTFMK-VAKNVSCLTLTAASPRDGFGIWGNLVQTNFLVGYDLEVRRLSFKPTICA

AT2G35615.1 Eukaryotic aspartyl protease family protein5.8e-9445.09Show/hide
Query:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP
        MA   +  +FL  FFS   ++ G    F+  L HRD+ LSP++NP ++  DR+  AF RS SRS   F H   +S   +QS +I   GEF MS++IGTPP
Subjt:  MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPP

Query:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK---
        +   AIADTGSDLTW QC PCQ+C+ ++  IF+ ++SS+Y++  C S  C++L S    C      C Y YSYGD+SF+ GD+A++ ++I S   S    
Subjt:  VDFIAIADTGSDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYH--CGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSK---

Query:  --TLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG----PKVISTPLVAKSPDTFYFLTLEA
          T+ GCG+ NGGTF    SGIIGLGGG LSL+SQ+   ++I ++FSYCL    +  N T  I+ G N++ S       V+STPLV K P T+Y+LTLEA
Subjt:  --TLIGCGHQNGGTFGGVTSGIIGLGGGALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSG----PKVISTPLVAKSPDTFYFLTLEA

Query:  ISVANKRFEAT------NDMSAMTK-RGNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLL
        ISV  K+   T      ND   +++  GNIIIDSGTTLT L    +    S +  SV  AKRV DP G+L  C+ +GS E + +P I  HF  GADV+L 
Subjt:  ISVANKRFEAT------NDMSAMTK-RGNIIIDSGTTLTFLPRNLYAGVVSTL-ASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLL

Query:  PLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFK
        P+N F  ++E++ CL++ P +++AI+GN AQ++F+VGYDLE + +SF+
Subjt:  PLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFK

AT5G33340.1 Eukaryotic aspartyl protease family protein2.1e-9647.33Show/hide
Query:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN
        GFT  L HRD+  SP +NP  +   R+ NA  RS +R   +F      +T   Q  +  +SGE+LM+VSIGTPP   +AIADTGSDL WTQC PC +C+ 
Subjt:  GFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTGSDLTWTQCLPCQECFN

Query:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSY-HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQNGGTFGGVTSGIIGLGG
        Q   +F+P+ SS+Y++VSC+S  C +L++   C  +  TCSY  SYGD S+T G++A D +T+GS      +L   +IGCGH N GTF    SGI+GLGG
Subjt:  QSRRIFNPRRSSSYRNVSCTSDTCRSLDSY-HCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGS-----FKLSKTLIGCGHQNGGTFGGVTSGIIGLGG

Query:  GALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAK-SPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTT
        G +SL+ Q+    +I  +FSYCL    S  + T KI+FG NA+VSG  V+STPL+AK S +TFY+LTL++ISV +K+ + +   S  +  GNIIIDSGTT
Subjt:  GALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAK-SPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTT

Query:  LTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGY
        LT LP   Y+ +   +AS I A++  DP   L LCY+A    DL +PVI  HF  GADVKL   N F  V+E++ C     +   +I+GN+AQ+NF+VGY
Subjt:  LTFLPRNLYAGVVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGY

Query:  DLEQKRLSFKPT
        D   K +SFKPT
Subjt:  DLEQKRLSFKPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCATTTCAATCTTCTTCTATTTCCTCCTCTTCTTCTTCTCGGAAGCAACCACCAATGGCGGTAGCGGCAATGGCTTCACCACCTCTCTTTTCCACCGCGATAC
CCTTCTCTCTCCTCTCCACAACCCATCTCTCTCCCGCTACGACCGCATTACCAATGCCTTCCGTCGCTCCTTCTCCCGCTCCGCCACCCTCTTCAAGCATGTCACTACCG
TCTCCACTGCCTGCATCCAATCTCCGATCATCCCCGACAGCGGTGAGTTCCTAATGTCTGTCTCTATTGGGACCCCGCCGGTTGATTTCATAGCCATCGCGGATACTGGC
AGCGATCTGACGTGGACCCAATGCTTGCCATGTCAGGAATGCTTCAACCAATCACGTCGCATTTTTAATCCACGTCGATCATCTTCCTACCGTAACGTGTCTTGCACGTC
TGATACTTGTCGCTCCCTCGACAGTTACCATTGTGGGCCCGACCTCCAAACCTGTAGCTATGGCTACAGCTATGGAGACCGATCCTTTACGTATGGTGACCTAGCATCTG
ATAAAATTACCATCGGGTCCTTCAAACTCTCCAAGACCCTCATTGGATGCGGCCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCGGGGATCATCGGACTTGGCGGT
GGCGCTCTCTCTTTGGTGTCGCAAATGAACACAATCGCTGCCATCAAACGGCAATTCTCATATTGTTTGCCAACTTTCTTCAGTAACGCAAATATCACAGGTAAGATAAG
CTTTGGCCAAAATGCCGTCGTTTCAGGGCCTAAAGTCATTTCTACCCCTCTCGTAGCGAAATCTCCCGATACCTTCTATTTCTTAACTCTTGAAGCAATCTCTGTCGCAA
ACAAGCGGTTTGAAGCTACAAACGACATGTCGGCCATGACCAAACGAGGGAATATTATTATCGATTCTGGTACGACATTGACGTTTCTGCCTCGAAATCTATACGCCGGT
GTTGTTTCGACTTTGGCGAGTGTTATTAAAGCAAAGCGAGTGGATGATCCAGCTGGGATTTTGGAACTCTGTTACACTGCGGGCAGCGTTGAGGATTTGAATATTCCAGT
CATTGCGGCACATTTTGGCGGTGGCGCCGACGTCAAATTGCTACCGTTGAACACATTTGCGTTGGTGGCTGAGAATGTGAGTTGTTTGACTTTGGCGCCGGCATCGGATT
TGGCCATTTTTGGGAACTTGGCGCAAATTAACTTTGTAGTCGGATATGATCTTGAGCAGAAGAGATTGTCGTTTAAACCTACTCAACCACTAATGGTCACAATGGCTTTA
CCACTTCTTTTTTCCACCGCGATTCCCTTCTTCAAATCTCATCTCTCTCCTATTACGACCATCTCACCAATGCCTTCCGCCGCTCATCGCGCCGCCACTACCACTGTTGT
CGTCCAATCCCCGATCGCCCCTGGAAGTGGCGAGTATCTAATATCTGTCTCCATCGGAACCCCGCCGGTGGATTACATTGGCATTGCCGACACGGGCAGCGATCTGACGT
GGACACAATGCTTGCCATGTCAGAAATGCTTCAACCAATCTCGTCCCATTTTCAACCCTCTCAAATCCTCCTCCTTTCGTCACTTGCCTTGCACGTCCCAATTCTGTCAT
GCCCTTGATGTTGCCAATTGTGGGGTCCAGGGAATTTGTGATTATAGTTACACATACGGAGATCAAACTTACACGAAGGGGGATTTAGGATTTGATAAGATCACTTTCGG
CTCATCATCCGTGAACTCAGTCATTGGATGTGGCCACCAGAGCGGCGGCGGATTCGGCTATGCCTCAGGTGTAATTGGACTCGGCGGTGGTGAACTCTCATTAGTCTCAC
AAATGAGCCAAAACGCCGCCGTTAGCCGGCGATTCTCTTATTGTTTACCAACGTTACTCAGTCACGCAAATGGCAAAATAAACTTCGGCCAAAACGCCGTCGTTTCTGGC
CCTGGAGTTGTTTCAACGCCACTAGTCCCCAAAAACCCCAACACGTATTATTACATGACTTTGGAAGCCATTTCCATTGGCAATGAATGTCACATGGTCGACATGTCGTC
CAAACAAGGCAATATGATTATAGACACCGGAACCACATTGACGATTCTTCCAAAGGAGTTGTACGACGGCGTCGTTTCGTCGCTACTAAAGATCGTTAAAGCAAGGCGAG
TGGAGGATCCCGGCGGCTTTTTGGGACTTTGCTTTGCTGCTTATGACAAGAGTGGTGGCTTGGGTATTCCGATCATCACAGCCCATTTTGCCGGTGGTGCCGACGTGAAG
TTGTTGCCAGTGAATACGTTTATGAAGGTGGCGAAAAATGTTAGTTGCTTGACGTTAACGGCGGCATCGCCGAGAGATGGTTTTGGGATTTGGGGGAATTTGGTGCAGAC
GAATTTCTTGGTCGGATATGATTTGGAGGTCAGGAGATTGTCGTTCAAGCCAACCATTTGTGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCATTTCAATCTTCTTCTATTTCCTCCTCTTCTTCTTCTCGGAAGCAACCACCAATGGCGGTAGCGGCAATGGCTTCACCACCTCTCTTTTCCACCGCGATAC
CCTTCTCTCTCCTCTCCACAACCCATCTCTCTCCCGCTACGACCGCATTACCAATGCCTTCCGTCGCTCCTTCTCCCGCTCCGCCACCCTCTTCAAGCATGTCACTACCG
TCTCCACTGCCTGCATCCAATCTCCGATCATCCCCGACAGCGGTGAGTTCCTAATGTCTGTCTCTATTGGGACCCCGCCGGTTGATTTCATAGCCATCGCGGATACTGGC
AGCGATCTGACGTGGACCCAATGCTTGCCATGTCAGGAATGCTTCAACCAATCACGTCGCATTTTTAATCCACGTCGATCATCTTCCTACCGTAACGTGTCTTGCACGTC
TGATACTTGTCGCTCCCTCGACAGTTACCATTGTGGGCCCGACCTCCAAACCTGTAGCTATGGCTACAGCTATGGAGACCGATCCTTTACGTATGGTGACCTAGCATCTG
ATAAAATTACCATCGGGTCCTTCAAACTCTCCAAGACCCTCATTGGATGCGGCCACCAAAATGGTGGCACTTTCGGCGGAGTTACCTCGGGGATCATCGGACTTGGCGGT
GGCGCTCTCTCTTTGGTGTCGCAAATGAACACAATCGCTGCCATCAAACGGCAATTCTCATATTGTTTGCCAACTTTCTTCAGTAACGCAAATATCACAGGTAAGATAAG
CTTTGGCCAAAATGCCGTCGTTTCAGGGCCTAAAGTCATTTCTACCCCTCTCGTAGCGAAATCTCCCGATACCTTCTATTTCTTAACTCTTGAAGCAATCTCTGTCGCAA
ACAAGCGGTTTGAAGCTACAAACGACATGTCGGCCATGACCAAACGAGGGAATATTATTATCGATTCTGGTACGACATTGACGTTTCTGCCTCGAAATCTATACGCCGGT
GTTGTTTCGACTTTGGCGAGTGTTATTAAAGCAAAGCGAGTGGATGATCCAGCTGGGATTTTGGAACTCTGTTACACTGCGGGCAGCGTTGAGGATTTGAATATTCCAGT
CATTGCGGCACATTTTGGCGGTGGCGCCGACGTCAAATTGCTACCGTTGAACACATTTGCGTTGGTGGCTGAGAATGTGAGTTGTTTGACTTTGGCGCCGGCATCGGATT
TGGCCATTTTTGGGAACTTGGCGCAAATTAACTTTGTAGTCGGATATGATCTTGAGCAGAAGAGATTGTCGTTTAAACCTACTCAACCACTAATGGTCACAATGGCTTTA
CCACTTCTTTTTTCCACCGCGATTCCCTTCTTCAAATCTCATCTCTCTCCTATTACGACCATCTCACCAATGCCTTCCGCCGCTCATCGCGCCGCCACTACCACTGTTGT
CGTCCAATCCCCGATCGCCCCTGGAAGTGGCGAGTATCTAATATCTGTCTCCATCGGAACCCCGCCGGTGGATTACATTGGCATTGCCGACACGGGCAGCGATCTGACGT
GGACACAATGCTTGCCATGTCAGAAATGCTTCAACCAATCTCGTCCCATTTTCAACCCTCTCAAATCCTCCTCCTTTCGTCACTTGCCTTGCACGTCCCAATTCTGTCAT
GCCCTTGATGTTGCCAATTGTGGGGTCCAGGGAATTTGTGATTATAGTTACACATACGGAGATCAAACTTACACGAAGGGGGATTTAGGATTTGATAAGATCACTTTCGG
CTCATCATCCGTGAACTCAGTCATTGGATGTGGCCACCAGAGCGGCGGCGGATTCGGCTATGCCTCAGGTGTAATTGGACTCGGCGGTGGTGAACTCTCATTAGTCTCAC
AAATGAGCCAAAACGCCGCCGTTAGCCGGCGATTCTCTTATTGTTTACCAACGTTACTCAGTCACGCAAATGGCAAAATAAACTTCGGCCAAAACGCCGTCGTTTCTGGC
CCTGGAGTTGTTTCAACGCCACTAGTCCCCAAAAACCCCAACACGTATTATTACATGACTTTGGAAGCCATTTCCATTGGCAATGAATGTCACATGGTCGACATGTCGTC
CAAACAAGGCAATATGATTATAGACACCGGAACCACATTGACGATTCTTCCAAAGGAGTTGTACGACGGCGTCGTTTCGTCGCTACTAAAGATCGTTAAAGCAAGGCGAG
TGGAGGATCCCGGCGGCTTTTTGGGACTTTGCTTTGCTGCTTATGACAAGAGTGGTGGCTTGGGTATTCCGATCATCACAGCCCATTTTGCCGGTGGTGCCGACGTGAAG
TTGTTGCCAGTGAATACGTTTATGAAGGTGGCGAAAAATGTTAGTTGCTTGACGTTAACGGCGGCATCGCCGAGAGATGGTTTTGGGATTTGGGGGAATTTGGTGCAGAC
GAATTTCTTGGTCGGATATGATTTGGAGGTCAGGAGATTGTCGTTCAAGCCAACCATTTGTGCTTAG
Protein sequenceShow/hide protein sequence
MAAISIFFYFLLFFFSEATTNGGSGNGFTTSLFHRDTLLSPLHNPSLSRYDRITNAFRRSFSRSATLFKHVTTVSTACIQSPIIPDSGEFLMSVSIGTPPVDFIAIADTG
SDLTWTQCLPCQECFNQSRRIFNPRRSSSYRNVSCTSDTCRSLDSYHCGPDLQTCSYGYSYGDRSFTYGDLASDKITIGSFKLSKTLIGCGHQNGGTFGGVTSGIIGLGG
GALSLVSQMNTIAAIKRQFSYCLPTFFSNANITGKISFGQNAVVSGPKVISTPLVAKSPDTFYFLTLEAISVANKRFEATNDMSAMTKRGNIIIDSGTTLTFLPRNLYAG
VVSTLASVIKAKRVDDPAGILELCYTAGSVEDLNIPVIAAHFGGGADVKLLPLNTFALVAENVSCLTLAPASDLAIFGNLAQINFVVGYDLEQKRLSFKPTQPLMVTMAL
PLLFSTAIPFFKSHLSPITTISPMPSAAHRAATTTVVVQSPIAPGSGEYLISVSIGTPPVDYIGIADTGSDLTWTQCLPCQKCFNQSRPIFNPLKSSSFRHLPCTSQFCH
ALDVANCGVQGICDYSYTYGDQTYTKGDLGFDKITFGSSSVNSVIGCGHQSGGGFGYASGVIGLGGGELSLVSQMSQNAAVSRRFSYCLPTLLSHANGKINFGQNAVVSG
PGVVSTPLVPKNPNTYYYMTLEAISIGNECHMVDMSSKQGNMIIDTGTTLTILPKELYDGVVSSLLKIVKARRVEDPGGFLGLCFAAYDKSGGLGIPIITAHFAGGADVK
LLPVNTFMKVAKNVSCLTLTAASPRDGFGIWGNLVQTNFLVGYDLEVRRLSFKPTICA