; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G00680 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G00680
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAspartic proteinase CDR1
Genome locationClcChr06:588499..595362
RNA-Seq ExpressionClc06G00680
SyntenyClc06G00680
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3468560.1 aspartic proteinase CDR1-like [Gossypium australe]6.2e-21048.24Show/hide
Query:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR-----NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSI
        MA I   I  +S      A  G   GF+VEL  RD   SP YN  +T   R+ +ALRRS +R       +  T  AE+ + ++ GEYLM++S+GTP F I
Subjt:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR-----NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSI

Query:  LAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI
        +A+ADTGSD+IWTQCKPC  C++Q+AP F PSKS+TY+K+SCS+  C+   E  SCS+   C Y++SYGD S S GD A DT+T+ S +GR V FP+  I
Subjt:  LAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI

Query:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEF
        GCG  N GTFD   SGI+GLG G  SL++Q+  S  GKFSYCL PI S    SSK+NFGSNA+VSG   VSTP+ +     +FY+L LE ++VG K+ +F
Subjt:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEF

Query:  PVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQ
          SS+   E N+IIDSGTTLT LP   Y+   + +++ I+ +R   P + L  C+    D++K P VT+HF  AD+ L   N F+RVSD  +C +F    
Subjt:  PVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQ

Query:  DNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR-----NTAALTDTAEAPIYNYRGQ
        D  + IYGN++Q +FL+GYD    +VS               VELIHRDS KSP YNP ET + R+ NA RRS SR       +  T  A   I    G+
Subjt:  DNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR-----NTAALTDTAEAPIYNYRGQ

Query:  YLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPIC-SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTM
        YLM ISLGTP FS++A+ADTGSD++WTQC PC  C++Q AP+F+P+KS+TY+ ++CSS  C +  G   +    + C+YS+TYGD+S S+GD+A DT+T+
Subjt:  YLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPIC-SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTM

Query:  GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYS
        GST+G+ VA P   IGCG++NAGTF    SGI+GLG G  SL++QLG    GKFSYCL P+     +SSK+NFGSNAIVSG   VST +       TFY 
Subjt:  GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYS

Query:  LKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENV
        L L+A+SVG  + +F    S LG  E NI+IDSGTTLTL+P+D Y+   + +    N  R   P Q  + CY     ++EAP VT+HF  ADV L+  N 
Subjt:  LKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENV

Query:  FIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC
        F++V D   C AF  A     NI IYGN++Q NFL+GYDTK+ +VSFKP DC
Subjt:  FIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC

KAF4377251.1 hypothetical protein F8388_012352 [Cannabis sativa]1.6e-19444.37Show/hide
Query:  IFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAAL------TDTAEAPIYSNRGEYLMELSVGTPPFSILA
        I  L+ L+S +  S + +    GF+VE+I RD   SP+YN SQTH  R+A+A RRSI+R +         T + E+ +Y++ GEYLM +S+GTPPF ILA
Subjt:  IFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAAL------TDTAEAPIYSNRGEYLMELSVGTPPFSILA

Query:  VADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFA-GESASCSS-QSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI
        +ADTGSD+ WTQC PCK CY+Q AP+F P+ S TY+  +C S +C  A G   SCSS    C YS+SYGD+S S G+ A D +T+ STSGR V FP   I
Subjt:  VADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFA-GESASCSS-QSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI

Query:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPI---GSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVG---
        GC H++ GTFD   SGIVGLG G  SL +Q+  S GGKFSYCL P    G+ T  SS L+FGSNAVVSG+  VSTPI +     +FY+L LEG++VG   
Subjt:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPI---GSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVG---

Query:  ---EKKF-EFPVSS---ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTT-DDYKAPPVTMHFEGADVPLPQENVFV
            KKF  F  SS       + N+IIDSGTTLT +P   Y++F + +++ + N +R  DP+  L  C+  ++  D+ +P +TM+F+GADV L Q N FV
Subjt:  ---EKKF-EFPVSS---ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTT-DDYKAPPVTMHFEGADVPLPQENVFV

Query:  RVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAI------RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR---
        +VSD VVCL+F   Q   I IYGN+AQ NFL    +  L ++F     +++       D    +ELIHRDSPKSP Y+ S+TH+ RL+ AL RS  R   
Subjt:  RVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAI------RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR---

Query:  ---------------NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE
                        T   T   ++ ++  RG+YL+ IS+GTPPF ILA+ADTGSD++WTQC PCP+C+ Q  P+F P  S+T+  + C S  C +  +
Subjt:  ---------------NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE

Query:  ERSCSAQSE-------CLYSITYGDSSHSQGDLAVDTVTM------------------GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASL
        + +    S+       C Y+ +YGDSS++ G LA++T+T                    S+S    +FP    GCG  N G F    SGI+GLG G  SL
Subjt:  ERSCSAQSE-------CLYSITYGDSSHSQGDLAVDTVTM------------------GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASL

Query:  VSQLGPATGGKFSYCLAP--IGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP
        +SQ+G +  GKFSYCL P  +   +  SSKL FG +++V G +  ST +  +   + +Y L L+AVSVG  KFD    SS+ GG  N+IIDSGT LT LP
Subjt:  VSQLGPATGGKFSYCLAP--IGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP

Query:  TDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDY--EAPPVTMHF-EGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVG
        T LY    T +  SI NL+   DPN Y+  CY T +DD    A  +T+HF EG D+ L+  N F+RV++D +C AF     DED++ I+GN++Q NFLV 
Subjt:  TDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDY--EAPPVTMHF-EGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVG

Query:  YDTKNMSVSFKPADC
        YD     +SFK  DC
Subjt:  YDTKNMSVSFKPADC

RDY01103.1 Aspartic proteinase CDR1, partial [Mucuna pruriens]1.1e-21147.52Show/hide
Query:  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ
        GF+VELI RD PKSP YN  +T + ++ +A  RS SR       + A   T ++ I SN+GEYL++ S+GTPPF ++ +ADTGSD+IW+QCKPC  CY Q
Subjt:  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ

Query:  NAPMFTPSKSATYKKLSCSSPICLFAGES--ASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGL
          P+F PSKS+TY+ +SC S +C   G++   S +    C Y++SYGD SHSQG  A DT T+ ST+G  VAF +++IGCG +NAGTFD+  SGIVGLG 
Subjt:  NAPMFTPSKSATYKKLSCSSPICLFAGES--ASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGL

Query:  GPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT
        G  SL++Q+GPS   KFSYCL P+  +    SKLNFG NAVV+G   VSTPI I     +FY+LKLEG+SVG K+ EF   S       N+IIDSGTTLT
Subjt:  GPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT

Query:  FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYD
         LP   Y      ++  INL+R N  +Q L  C+ +  ++  + P +T HF GADV L   N F+ VS+ V C AF P   N   I+GN+AQ N+LVGYD
Subjt:  FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYD

Query:  INTLSVSFKPADCIAIRDY-------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR--------NTAALTDTAEAPIYNYRGQYLMEISLGTP
        +   +VSFKP DC  I          GF+V+LIHRDS KSP+YNPSE+ + +L +A +RS +R          +  T T ++ I    G+YL++ S+GTP
Subjt:  INTLSVSFKPADCIAIRDY-------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR--------NTAALTDTAEAPIYNYRGQYLMEISLGTP

Query:  PFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSE--CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVA
        PF ++ V DTGSD++W+QC+PC  CY Q+ P+F+ SKS+TY+ + C S +C   G E +C + S+  C Y++ YGD SHS+G LA DT+T+ ST+   +A
Subjt:  PFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSE--CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVA

Query:  FPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVG
        FP+I  GCG +N G FD   SGIVG+G G  SL+SQ+GP+   KFSYCL P+ +++  +SKLNFG NA+V+G   VST I       TFY LKL+ +SVG
Subjt:  FPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVG

Query:  ESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFEGADVPLQRENVFIRVSDDAV
          + +    S    G+ NIIIDSGTTLT LP  LY    +E++  I L+R + P   L  CY + +++  EAP +T HF GADV L   N F+ VSD+  
Subjt:  ESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFEGADVPLQRENVFIRVSDDAV

Query:  CLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM
        C AF    + +    I+GNI+Q N LVGYD +  +VSFKP DC  M
Subjt:  CLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM

TKY49535.1 Aspartic proteinase CDR1 [Spatholobus suberectus]5.2e-20947.22Show/hide
Query:  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSIS------RNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ
        GF+V+LI RD PKSP YN ++T + ++ +A  RS +      R +     T ++ I SN+GEYL++ S+GTPPF ++ +ADTGSD+IW QCKPC  CY Q
Subjt:  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSIS------RNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ

Query:  NAPMFTPSKSATYKKLSCSSPICLFAGESASCS-SQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLG
          P+F PSKS TY+ +SC S +C   G++   S S   C Y+ SYGD SHSQG+ A DT+T+GST+G  VAFP++ IGCG +NAGTFD+  SGIVGLG G
Subjt:  NAPMFTPSKSATYKKLSCSSPICLFAGESASCS-SQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLG

Query:  PASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLTF
          SL +Q+GPS   KFSYCL P+  +   +SKLNFG NAVV+GS  VSTPI I     +FY+LKLEG+SVG K+ EF   S     E N+IIDSGTTLTF
Subjt:  PASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLTF

Query:  LPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDI
        LP  +Y    + ++  I+L+R N   + L  C+ +  ++  +AP +T+HF GADV L   N FV VSDDV C AF P       ++GN+AQ N LVGYD+
Subjt:  LPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDI

Query:  NTLSVSFKPADCIAIRDY--------------------------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR------NTAALTDTAEAPI
           +V+F     +A                              GF+V+LIHRDSPKSP YNP+ET + +L NA  RS +R       +    +T ++ I
Subjt:  NTLSVSFKPADCIAIRDY--------------------------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR------NTAALTDTAEAPI

Query:  YNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEE--RSCSAQSECLYSITYGDSSHSQGDL
         +  G+YL++ S+GTPPF ++ +ADTGSD+VWTQC+PC  CY Q+ P+F+PSKS TY+ V+C S +C    +    S +    C Y+++YGD SHS+G+L
Subjt:  YNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEE--RSCSAQSECLYSITYGDSSHSQGDL

Query:  AVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSD
        A +T+T+GST+G  VA P+I IGCG +NAG FD+  SGIVGLG G  SL++QLG A   KFSYCL P+  ++  +SKLNFG NA+V+G   VST I    
Subjt:  AVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSD

Query:  TYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFEGADV
           TFY LKLE +SVG  + +F   S+      N IIDSGTTLT LP   Y    +E++  INL+R   P+Q L  CY +  ++  +AP +T HF GADV
Subjt:  TYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFEGADV

Query:  PLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM
         L   N F+ VSD+  C AF +    E    I+GNI+Q N LVGYD     VSFKP DC  M
Subjt:  PLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM

XP_038876324.1 aspartic proteinase CDR1-like [Benincasa hispida]2.3e-20485.92Show/hide
Query:  RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAP
        R++GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAA+TDTA APIYNYRGQYLM+ISLGTPPFSI+AVADTGSD++WTQCEPCPNCYEQSAP
Subjt:  RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAP

Query:  MFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASL
        MFNPSKS TYKNV CSSPICS+AGE+ SCSA SECLYSI+YGD SHSQGD AVDTVTMGSTSG  V FP +AIGCGHDNAGTFDA+VSGIVGLGQG ASL
Subjt:  MFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASL

Query:  VSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTD
        VSQ+GPATGGKFSYCLAPIGN + ESSKLNFGSNA VSGS+AVST IYTS  YKTFYSLKLEAVSVGE+KFDFP+VSSRLGGE NIIIDSGTTLT LP D
Subjt:  VSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTD

Query:  LYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNM
        LYNNFAT IS SINLQRT+DPNQ+LD C+ATTTDDYEAP VTMHFEGADVPL RENVFIR+SDD VCLAFKA+  D++ IFIYGNISQNNFLVGYD KNM
Subjt:  LYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNM

Query:  SVSFKPADCVSM
         VSFK ADCV+M
Subjt:  SVSFKPADCVSM

TrEMBL top hitse value%identityAlignment
A0A371HE86 Aspartic proteinase CDR1 (Fragment)5.4e-21247.52Show/hide
Query:  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ
        GF+VELI RD PKSP YN  +T + ++ +A  RS SR       + A   T ++ I SN+GEYL++ S+GTPPF ++ +ADTGSD+IW+QCKPC  CY Q
Subjt:  GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQ

Query:  NAPMFTPSKSATYKKLSCSSPICLFAGES--ASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGL
          P+F PSKS+TY+ +SC S +C   G++   S +    C Y++SYGD SHSQG  A DT T+ ST+G  VAF +++IGCG +NAGTFD+  SGIVGLG 
Subjt:  NAPMFTPSKSATYKKLSCSSPICLFAGES--ASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGL

Query:  GPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT
        G  SL++Q+GPS   KFSYCL P+  +    SKLNFG NAVV+G   VSTPI I     +FY+LKLEG+SVG K+ EF   S       N+IIDSGTTLT
Subjt:  GPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILG-GEANMIIDSGTTLT

Query:  FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYD
         LP   Y      ++  INL+R N  +Q L  C+ +  ++  + P +T HF GADV L   N F+ VS+ V C AF P   N   I+GN+AQ N+LVGYD
Subjt:  FLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDD-YKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYD

Query:  INTLSVSFKPADCIAIRDY-------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR--------NTAALTDTAEAPIYNYRGQYLMEISLGTP
        +   +VSFKP DC  I          GF+V+LIHRDS KSP+YNPSE+ + +L +A +RS +R          +  T T ++ I    G+YL++ S+GTP
Subjt:  INTLSVSFKPADCIAIRDY-------GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR--------NTAALTDTAEAPIYNYRGQYLMEISLGTP

Query:  PFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSE--CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVA
        PF ++ V DTGSD++W+QC+PC  CY Q+ P+F+ SKS+TY+ + C S +C   G E +C + S+  C Y++ YGD SHS+G LA DT+T+ ST+   +A
Subjt:  PFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSE--CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVA

Query:  FPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVG
        FP+I  GCG +N G FD   SGIVG+G G  SL+SQ+GP+   KFSYCL P+ +++  +SKLNFG NA+V+G   VST I       TFY LKL+ +SVG
Subjt:  FPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVG

Query:  ESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFEGADVPLQRENVFIRVSDDAV
          + +    S    G+ NIIIDSGTTLT LP  LY    +E++  I L+R + P   L  CY + +++  EAP +T HF GADV L   N F+ VSD+  
Subjt:  ESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDD-YEAPPVTMHFEGADVPLQRENVFIRVSDDAV

Query:  CLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM
        C AF    + +    I+GNI+Q N LVGYD +  +VSFKP DC  M
Subjt:  CLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM

A0A3Q7HJU2 Uncharacterized protein1.1e-19143.47Show/hide
Query:  FFLIFLISYAVVSAATTGRDY----GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFS
        F   F+    +VS   T  D+    GFT+ LI RD P SP+YN S T  +R+ +A  RS SR      ++    +T  + I    GEY+M+LS+GTPP  
Subjt:  FFLIFLISYAVVSAATTGRDY----GFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR------NTAALTDTAEAPIYSNRGEYLMELSVGTPPFS

Query:  ILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGST-SGRRVAFPRM
        I+A+ADTGSD+ WTQC+PC NC++Q++P+F   KS++YK   C +  C   G S+SC   + C Y +SYGD+S++ GD A D  T  ST S   VA P +
Subjt:  ILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGST-SGRRVAFPRM

Query:  AIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIES---SKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGE
        A GCGH N GTF+ + SGI+GLG G  S++ Q+     GKFSYCL  I   +  S   S +NFGS+A VSG   VSTP+ I     +FY+L LEGVSVG 
Subjt:  AIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIES---SKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGE

Query:  KKFEFPVSSILGG--EANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVC
        +  +F  S +  G  E N+IIDSGTTLT LP   Y++  +T+ +SI+  R  DP+     C+ +      AP +T HF  AD+ L   + F ++ + +VC
Subjt:  KKFEFPVSSILGG--EANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVC

Query:  LAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNT---AALTDTAEAPIY
        L   P  +  I I+GN+AQ NFL+GYD+    +SFKPADC       FT++LIHRDSP SP +NPS T Y RL +AL RS SR +       +  E+ + 
Subjt:  LAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNT---AALTDTAEAPIY

Query:  NYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVD
           G+YLM+IS+GTPP   L +ADTGSD+ WTQC+PC NC++Q  P+FNP KS++YK + C++ +C     + S    S C Y ++YGD SH+ GDL+++
Subjt:  NYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVD

Query:  TVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIG---NDTIESSKLNFGSNAIVSGSKAVSTLIYTSD
        T T  STS + V+ P I  GCGHDN GTF    SGI+GLG G  S+V+Q+     GKFSYCL P+    +++  +S +NFG+ A VSG   VST +   +
Subjt:  TVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIG---NDTIESSKLNFGSNAIVSGSKAVSTLIYTSD

Query:  TYKTFYSLKLEAVSVGESKFD---FPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTD-DYEAPPVTMHFEG
           TFY L LE +S+G    +   FPVV        NIIIDSGTTLT +P   Y N  + +  SIN  + +DP+     CY +  +   + P +  HF  
Subjt:  TYKTFYSLKLEAVSVGESKFD---FPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTD-DYEAPPVTMHFEG

Query:  ADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM
        AD+ L   N+F +V +  VCL     G   + I I+GN++Q NFL+GYD K   VSFKP DC ++
Subjt:  ADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM

A0A5B6VH54 Aspartic proteinase CDR1-like3.0e-21048.24Show/hide
Query:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR-----NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSI
        MA I   I  +S      A  G   GF+VEL  RD   SP YN  +T   R+ +ALRRS +R       +  T  AE+ + ++ GEYLM++S+GTP F I
Subjt:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISR-----NTAALTDTAEAPIYSNRGEYLMELSVGTPPFSI

Query:  LAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI
        +A+ADTGSD+IWTQCKPC  C++Q+AP F PSKS+TY+K+SCS+  C+   E  SCS+   C Y++SYGD S S GD A DT+T+ S +GR V FP+  I
Subjt:  LAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI

Query:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEF
        GCG  N GTFD   SGI+GLG G  SL++Q+  S  GKFSYCL PI S    SSK+NFGSNA+VSG   VSTP+ +     +FY+L LE ++VG K+ +F
Subjt:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEF

Query:  PVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQ
          SS+   E N+IIDSGTTLT LP   Y+   + +++ I+ +R   P + L  C+    D++K P VT+HF  AD+ L   N F+RVSD  +C +F    
Subjt:  PVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQ

Query:  DNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR-----NTAALTDTAEAPIYNYRGQ
        D  + IYGN++Q +FL+GYD    +VS               VELIHRDS KSP YNP ET + R+ NA RRS SR       +  T  A   I    G+
Subjt:  DNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR-----NTAALTDTAEAPIYNYRGQ

Query:  YLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPIC-SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTM
        YLM ISLGTP FS++A+ADTGSD++WTQC PC  C++Q AP+F+P+KS+TY+ ++CSS  C +  G   +    + C+YS+TYGD+S S+GD+A DT+T+
Subjt:  YLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPIC-SFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTM

Query:  GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYS
        GST+G+ VA P   IGCG++NAGTF    SGI+GLG G  SL++QLG    GKFSYCL P+     +SSK+NFGSNAIVSG   VST +       TFY 
Subjt:  GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYS

Query:  LKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENV
        L L+A+SVG  + +F    S LG  E NI+IDSGTTLTL+P+D Y+   + +    N  R   P Q  + CY     ++EAP VT+HF  ADV L+  N 
Subjt:  LKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENV

Query:  FIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC
        F++V D   C AF  A     NI IYGN++Q NFL+GYDTK+ +VSFKP DC
Subjt:  FIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC

A0A7J6G2M2 Uncharacterized protein7.9e-19544.37Show/hide
Query:  IFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAAL------TDTAEAPIYSNRGEYLMELSVGTPPFSILA
        I  L+ L+S +  S + +    GF+VE+I RD   SP+YN SQTH  R+A+A RRSI+R +         T + E+ +Y++ GEYLM +S+GTPPF ILA
Subjt:  IFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAAL------TDTAEAPIYSNRGEYLMELSVGTPPFSILA

Query:  VADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFA-GESASCSS-QSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI
        +ADTGSD+ WTQC PCK CY+Q AP+F P+ S TY+  +C S +C  A G   SCSS    C YS+SYGD+S S G+ A D +T+ STSGR V FP   I
Subjt:  VADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFA-GESASCSS-QSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAI

Query:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPI---GSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVG---
        GC H++ GTFD   SGIVGLG G  SL +Q+  S GGKFSYCL P    G+ T  SS L+FGSNAVVSG+  VSTPI +     +FY+L LEG++VG   
Subjt:  GCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPI---GSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVG---

Query:  ---EKKF-EFPVSS---ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTT-DDYKAPPVTMHFEGADVPLPQENVFV
            KKF  F  SS       + N+IIDSGTTLT +P   Y++F + +++ + N +R  DP+  L  C+  ++  D+ +P +TM+F+GADV L Q N FV
Subjt:  ---EKKF-EFPVSS---ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTT-DDYKAPPVTMHFEGADVPLPQENVFV

Query:  RVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAI------RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR---
        +VSD VVCL+F   Q   I IYGN+AQ NFL    +  L ++F     +++       D    +ELIHRDSPKSP Y+ S+TH+ RL+ AL RS  R   
Subjt:  RVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAI------RDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR---

Query:  ---------------NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE
                        T   T   ++ ++  RG+YL+ IS+GTPPF ILA+ADTGSD++WTQC PCP+C+ Q  P+F P  S+T+  + C S  C +  +
Subjt:  ---------------NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE

Query:  ERSCSAQSE-------CLYSITYGDSSHSQGDLAVDTVTM------------------GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASL
        + +    S+       C Y+ +YGDSS++ G LA++T+T                    S+S    +FP    GCG  N G F    SGI+GLG G  SL
Subjt:  ERSCSAQSE-------CLYSITYGDSSHSQGDLAVDTVTM------------------GSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASL

Query:  VSQLGPATGGKFSYCLAP--IGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP
        +SQ+G +  GKFSYCL P  +   +  SSKL FG +++V G +  ST +  +   + +Y L L+AVSVG  KFD    SS+ GG  N+IIDSGT LT LP
Subjt:  VSQLGPATGGKFSYCLAP--IGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP

Query:  TDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDY--EAPPVTMHF-EGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVG
        T LY    T +  SI NL+   DPN Y+  CY T +DD    A  +T+HF EG D+ L+  N F+RV++D +C AF     DED++ I+GN++Q NFLV 
Subjt:  TDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDY--EAPPVTMHF-EGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVG

Query:  YDTKNMSVSFKPADC
        YD     +SFK  DC
Subjt:  YDTKNMSVSFKPADC

F6HJ51 Uncharacterized protein7.6e-19041.38Show/hide
Query:  IFFLIFLISYAV-VSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSIS-----RNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILA
        IFF + ++ +   +      R  GF+V+LI RD P SP ++ S+T   R+ DA RRS+S     R TA  +D  ++ I  + GEYLM L +GTPP  ++A
Subjt:  IFFLIFLISYAV-VSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSIS-----RNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILA

Query:  VADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGC
        + DTGSD+ WTQC+PC +CY+Q  P+F P  S+TY+  SC +  CL  G+  SCS + +C +  SY D S + G+ A +T+T+ ST+G+ V+FP  A GC
Subjt:  VADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGC

Query:  GHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPV
        GH + G FD + SGIVGLG G  SL++Q+  +  G FSYCL P+ +++  SS++NFG++  VSG   VSTP+ +     +FY+L LEG+SVG+K+  +  
Subjt:  GHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPV

Query:  SS--ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQ
         S      E N+I+DSGTT TFLP   Y+    +++NSI  +R  DPN     C+  TT +  AP +T HF+ A+V L   N F+R+ +D+VC    P  
Subjt:  SS--ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQ

Query:  DNHIMIYGNIAQNNFLVGYDINTLSVS-------------------FKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSIS-----R
        D  I + GN+AQ NFLVG+D+    +S                   F   +       GF+V+LIHRDSP SP ++PS+T   RL +A  RS S     R
Subjt:  DNHIMIYGNIAQNNFLVGYDINTLSVS-------------------FKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSIS-----R

Query:  NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSIT
         +A  +D  ++ +    G+Y+M +S+GTPP  ++A+ DTGSD+ WTQC PC +CY+Q  P F+P  S+TY++ +C +  C   G +RSC    +C +  +
Subjt:  NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSIT

Query:  YGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGS
        Y D S + G+LAV+T+T+ ST+G+ V+FP  A GC H + G FD + SGIVGLG    S++SQL     G+FSYCL P+  D+  SS++NFG + IVSG+
Subjt:  YGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGS

Query:  KAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAP
          VST +        +Y + LE  SVG+ +  +   S +    E NII+DSGTT T LP + Y      ++ SI  +R  DPN     CY TT D  +AP
Subjt:  KAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLG-GEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAP

Query:  PVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC
         +T HF+ A+V LQ  N F+R+ +D VC           +I I GN++Q NFLVG+D +   VSFK ADC
Subjt:  PVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC

SwissProt top hitse value%identityAlignment
Q3EBM5 Probable aspartic protease At2g356153.6e-8841.43Show/hide
Query:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAPIYSN----RGEYLMELSVGTPPFSIL
        MA    L F + ++ V+ +++G    F+VELI RD P SP+YN   T   R+  A  RS+SR+       ++  + S      GE+ M +++GTPP  + 
Subjt:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAPIYSN----RGEYLMELSVGTPPFSIL

Query:  AVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPIC--LFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMA
        A+ADTGSD+ W QCKPC+ CY++N P+F   KS+TYK   C S  C  L + E     S + C Y  SYGD+S S+GD A +TV++ S SG  V+FP   
Subjt:  AVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPIC--LFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMA

Query:  IGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSG----SSAVSTPIYISDRFKSFYWLKLEGVSVGE
         GCG++N GTFD   SGI+GLG G  SL++Q+G S   KFSYCL+   + T  +S +N G+N++ S     S  VSTP+ +     ++Y+L LE +SVG+
Subjt:  IGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSG----SSAVSTPIYISDRFKSFYWLKLEGVSVGE

Query:  KKFEFPVSS--------ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVR
        KK  +  SS        +     N+IIDSGTTLT L    ++ FS+ +  S+   +R +DP   L +CF + + +   P +T+HF GADV L   N FV+
Subjt:  KKFEFPVSS--------ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVR

Query:  VSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIA
        +S+D+VCL+  P     + IYGN AQ +FLVGYD+ T +VSF+  DC A
Subjt:  VSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIA

Q6XBF8 Aspartic proteinase CDR11.5e-11350Show/hide
Query:  GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR----NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA
        GFT +LIHRDSPKSP YNP ET   RL NA+ RS++R         T   +  + +  G+YLM +S+GTPPF I+A+ADTGSD++WTQC PC +CY Q  
Subjt:  GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR----NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA

Query:  PMFNPSKSATYKNVACSSPICSFAGEERSCSA-QSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPA
        P+F+P  S+TYK+V+CSS  C+    + SCS   + C YS++YGD+S+++G++AVDT+T+GS+  R +    I IGCGH+NAGTF+   SGIVGLG GP 
Subjt:  PMFNPSKSATYKNVACSSPICSFAGEERSCSA-QSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPA

Query:  SLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP
        SL+ QLG +  GKFSYCL P+ +   ++SK+NFG+NAIVSGS  VST +    + +TFY L L+++SVG  +  +    S    E NIIIDSGTTLTLLP
Subjt:  SLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP

Query:  TDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTK
        T+ Y+     ++ SI+ ++  DP   L  CY + T D + P +TMHF+GADV L   N F++VS+D VC AF+ +     +  IYGN++Q NFLVGYDT 
Subjt:  TDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTK

Query:  NMSVSFKPADCVSM
        + +VSFKP DC  M
Subjt:  NMSVSFKPADCVSM

Q766C2 Aspartic proteinase nepenthesin-24.8e-6436.71Show/hide
Query:  RSQTHYHRIADALRRSISRN---TAALTDTA--EAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCS
        ++ T Y  I  A++R   R     A L  ++  E P+Y+  GEYLM +++GTP  S  A+ DTGSD+IWTQC+PC  C+ Q  P+F P  S+++  L C 
Subjt:  RSQTHYHRIADALRRSISRN---TAALTDTA--EAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCS

Query:  SPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCL
        S  C     S +C++ +EC Y+  YGD S +QG  A +T T  ++S      P +A GCG DN G    N +G++G+G GP SL +Q+G    G+FSYC+
Subjt:  SPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCL

Query:  TPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSIL---GGEANMIIDSGTTLTFLPMHLYNNFSTTISNSIN
        T  GS++  +  L   ++ V  GS   ST +  S    ++Y++ L+G++VG      P S+      G   MIIDSGTTLT+LP   YN  +   ++ IN
Subjt:  TPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSIL---GGEANMIIDSGTTLTFLPMHLYNNFSTTISNSIN

Query:  LQRTNDPNQFLDYCFATTTD--DYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADC
        L   ++ +  L  CF   +D    + P ++M F+G  + L ++N+ +  ++ V+CLA        I I+GNI Q    V YD+  L+VSF P  C
Subjt:  LQRTNDPNQFLDYCFATTTD--DYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADC

Q766C3 Aspartic proteinase nepenthesin-19.3e-6838.46Show/hide
Query:  GFTVELIHRDSPKSPMYNPSETHYHRLANALRRS---ISRNTAALTDTA--EAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQS
        GF + L H DS K      + T +  L  A+ R    + R  A L   +  E  +Y   G+YLM +S+GTP     A+ DTGSD++WTQC+PC  C+ QS
Subjt:  GFTVELIHRDSPKSPMYNPSETHYHRLANALRRS---ISRNTAALTDTA--EAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQS

Query:  APMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPA
         P+FNP  S+++  + CSS +C  A    +CS  + C Y+  YGD S +QG +  +T+T GS     V+ P I  GCG +N G    N +G+VG+G+GP 
Subjt:  APMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPA

Query:  SLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRL---GGEANIIIDSGTTLT
        SL SQL      KFSYC+ PIG+ T  +  L   +N++ +GS   +T +  S    TFY + L  +SVG ++      +  L    G   IIIDSGTTLT
Subjt:  SLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRL---GGEANIIIDSGTTLT

Query:  LLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTD--DYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLV
            + Y +   E    INL   N  +   D C+ T +D  + + P   MHF+G D+ L  EN FI  S+  +CLA    G     + I+GNI Q N LV
Subjt:  LLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTD--DYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLV

Query:  GYDTKNMSVSFKPADC
         YDT N  VSF  A C
Subjt:  GYDTKNMSVSFKPADC

Q9LNJ3 Aspartyl protease family protein 21.7e-5338.95Show/hide
Query:  GEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSE-CLYSISYGDRSHSQGDFAVDTV
        GEY   L VGTP   +  V DTGSDI+W QC PC+ CY Q+ P+F P KS TY  + CSSP C    +SA C+++ + CLY +SYGD S + GDF+ +T+
Subjt:  GEYLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSE-CLYSISYGDRSHSQGDFAVDTV

Query:  TMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVS-----TPIYISD
        T      RR     +A+GCGHDN G F    +G++GLG G  S   Q G     KFSYCL    +++  SS        VV G++AVS     TP+  + 
Subjt:  TMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVS-----TPIYISD

Query:  RFKSFYWLKLEGVSVGEKKFEFPVSSILG----GEANMIIDSGTTLTFLPMHLYNNFSTTIS-NSINLQRTNDPNQFLDYCF-ATTTDDYKAPPVTMHFE
        +  +FY++ L G+SVG  +     +S+      G   +IIDSGT++T L    Y          +  L+R  D + F D CF  +  ++ K P V +HF 
Subjt:  RFKSFYWLKLEGVSVGEKKFEFPVSSILG----GEANMIIDSGTTLTFLPMHLYNNFSTTIS-NSINLQRTNDPNQFLDYCF-ATTTDDYKAPPVTMHFE

Query:  GADVPLPQENVFVRV-SDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADC
        GADV LP  N  + V ++   C AF  G    + I GNI Q  F V YD+ +  V F P  C
Subjt:  GADVPLPQENVFVRV-SDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADC

Arabidopsis top hitse value%identityAlignment
AT1G31450.1 Eukaryotic aspartyl protease family protein1.5e-9243.09Show/hide
Query:  LSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDT-AEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWT
        L++SF  A   +      TVELIHRDSP SP+YNP  T   RL  A  RSISR+    T T  ++ + +  G+Y M IS+GTPP  + A+ADTGSD+ W 
Subjt:  LSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDT-AEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWT

Query:  QCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE-ERSCSAQSE-CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFD
        QC+PC  CY+Q++P+F+  KS+TYK  +C S  C    E E  C    + C Y  +YGD+S ++GD+A +T+++ S+SG  V+FP    GCG++N GTF+
Subjt:  QCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGE-ERSCSAQSE-CLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFD

Query:  ANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYK---TFYSLKLEAVSVGESKFDFPVVSSRLG
           SGI+GLG GP SLVSQLG + G KFSYCL+     T  +S +N G+N+I S     S  + T    K   T+Y L LEAV+VG++K  +      L 
Subjt:  ANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYK---TFYSLKLEAVSVGESKFDFPVVSSRLG

Query:  GEA-----NIIIDSGTTLTLLPTDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQ
        G++     NIIIDSGTTLTLL +  Y++F T +  S+   +R +DP   L  C+ +   +   P +TMHF  ADV L   N F+++++D VCL+     +
Subjt:  GEA-----NIIIDSGTTLTLLPTDLYNNFATEISGSI-NLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQ

Query:  DEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC
            + IYGN+ Q +FLVGYD +  +VSF+  DC
Subjt:  DEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADC

AT1G64830.1 Eukaryotic aspartyl protease family protein2.7e-11049.54Show/hide
Query:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAP------IYSNRGEYLMELSVGTPPFS
        MA + F   L+S  ++S        GFT++LI RD PKSP YN ++T   R+ +A+RRS +R+T   ++   +P      I SNRGEYLM +S+GTPP  
Subjt:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAP------IYSNRGEYLMELSVGTPPFS

Query:  ILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSS-QSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRM
        ILA+ADTGSD+IWTQC PC++CYQQ +P+F P +S+TY+K+SCSS  C  A E ASCS+ ++ C Y+I+YGD S+++GD AVDTVTMGS+  R V+   M
Subjt:  ILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSS-QSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRM

Query:  AIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKF
         IGCGH+N GTFD   SGI+GLG G  SLV+Q+  S  GKFSYCL P  S T  +SK+NFG+N +VSG   VST +   D   ++Y+L LE +SVG KK 
Subjt:  AIGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKF

Query:  EFPVSSILG-GEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFC
        +F  S+I G GE N++IDSGTTLT LP + Y    + ++++I  +R  DP+  L  C+  ++  +K P +T+HF+G DV L   N FV VS+DV C AF 
Subjt:  EFPVSSILG-GEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFC

Query:  PGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADC
          +   + I+GN+AQ NFLVGYD  + +VSFK  DC
Subjt:  PGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADC

AT2G28220.1 Eukaryotic aspartyl protease family protein4.1e-11937.19Show/hide
Query:  YLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMG
        YLM+L VGTPPF I A  DTGSD+IWTQC PC +CY Q  P+F PSKS+T+ +  C                   C Y I Y D ++S+G  A +TVT+ 
Subjt:  YLMELSVGTPPFSILAVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMG

Query:  STSGRRVAFPRMAIGCG-----HDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFK
        STSG         IGCG      DN+G F ++ SGIVGL +GP SL++QM     G  SYC +  G     +SK+NFG+NA+V+G   V+  ++I  +  
Subjt:  STSGRRVAFPRMAIGCG-----HDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFK

Query:  SFYWLKLEGVSVGEKKFEFPVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDP--NQFLDYCFATTTDDYKAPPVTMHFE-GADVPL
         FY+L L+ VSV + + E   +     + N++IDSG+T+T+ P+   N     +   +   R  DP  N  L Y F+ T D +  P +TMHF  GAD+ L
Subjt:  SFYWLKLEGVSVGEKKFEFPVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSINLQRTNDP--NQFLDYCFATTTDDYKAPPVTMHFE-GADVPL

Query:  PQENVFVRV-SDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRN
         + N+++   S  + CLA          I+GN AQNNFLVGYD ++L                    L+   SP                          
Subjt:  PQENVFVRV-SDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISRN

Query:  TAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITY
             DT    +Y+Y   YLM++ +GTPPF I+A  DTGSDI+WTQC PCPNCY Q AP+F+PSKS+T++   C+                + C Y I Y
Subjt:  TAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPICSFAGEERSCSAQSECLYSITY

Query:  GDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGT----FDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIV
         D ++S+G LA +TVT+ STSG         IGCG DN       F ++ SGIVGL  GP SL+SQ+     G  SYC +  G     +SK+NFG+NAIV
Subjt:  GDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGT----FDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIV

Query:  SGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYE
        +G   V+  ++       FY L L+AVSV E      + +     + NI IDSGTTLT  P    N     +   +   +  D       CY + T D  
Subjt:  SGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYE

Query:  APPVTMHFE-GADVPLQRENVFIR-VSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM
         P +TMHF  GAD+ L + N+++  ++    CLA      D     ++GN +QNNFLVGYD  +  +SF P +C ++
Subjt:  APPVTMHFE-GADVPLQRENVFIR-VSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM

AT2G35615.1 Eukaryotic aspartyl protease family protein2.6e-8941.43Show/hide
Query:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAPIYSN----RGEYLMELSVGTPPFSIL
        MA    L F + ++ V+ +++G    F+VELI RD P SP+YN   T   R+  A  RS+SR+       ++  + S      GE+ M +++GTPP  + 
Subjt:  MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAPIYSN----RGEYLMELSVGTPPFSIL

Query:  AVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPIC--LFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMA
        A+ADTGSD+ W QCKPC+ CY++N P+F   KS+TYK   C S  C  L + E     S + C Y  SYGD+S S+GD A +TV++ S SG  V+FP   
Subjt:  AVADTGSDIIWTQCKPCKNCYQQNAPMFTPSKSATYKKLSCSSPIC--LFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMA

Query:  IGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSG----SSAVSTPIYISDRFKSFYWLKLEGVSVGE
         GCG++N GTFD   SGI+GLG G  SL++Q+G S   KFSYCL+   + T  +S +N G+N++ S     S  VSTP+ +     ++Y+L LE +SVG+
Subjt:  IGCGHDNAGTFDANVSGIVGLGLGPASLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSG----SSAVSTPIYISDRFKSFYWLKLEGVSVGE

Query:  KKFEFPVSS--------ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVR
        KK  +  SS        +     N+IIDSGTTLT L    ++ FS+ +  S+   +R +DP   L +CF + + +   P +T+HF GADV L   N FV+
Subjt:  KKFEFPVSS--------ILGGEANMIIDSGTTLTFLPMHLYNNFSTTISNSI-NLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVR

Query:  VSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIA
        +S+D+VCL+  P     + IYGN AQ +FLVGYD+ T +VSF+  DC A
Subjt:  VSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIA

AT5G33340.1 Eukaryotic aspartyl protease family protein1.0e-11450Show/hide
Query:  GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR----NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA
        GFT +LIHRDSPKSP YNP ET   RL NA+ RS++R         T   +  + +  G+YLM +S+GTPPF I+A+ADTGSD++WTQC PC +CY Q  
Subjt:  GFTVELIHRDSPKSPMYNPSETHYHRLANALRRSISR----NTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSA

Query:  PMFNPSKSATYKNVACSSPICSFAGEERSCSA-QSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPA
        P+F+P  S+TYK+V+CSS  C+    + SCS   + C YS++YGD+S+++G++AVDT+T+GS+  R +    I IGCGH+NAGTF+   SGIVGLG GP 
Subjt:  PMFNPSKSATYKNVACSSPICSFAGEERSCSA-QSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPA

Query:  SLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP
        SL+ QLG +  GKFSYCL P+ +   ++SK+NFG+NAIVSGS  VST +    + +TFY L L+++SVG  +  +    S    E NIIIDSGTTLTLLP
Subjt:  SLVSQLGPATGGKFSYCLAPIGNDTIESSKLNFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLP

Query:  TDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTK
        T+ Y+     ++ SI+ ++  DP   L  CY + T D + P +TMHF+GADV L   N F++VS+D VC AF+ +     +  IYGN++Q NFLVGYDT 
Subjt:  TDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAPPVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTK

Query:  NMSVSFKPADCVSM
        + +VSFKP DC  M
Subjt:  NMSVSFKPADCVSM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACCCATTTTCTTTCTCATTTTCTTAATCTCCTATGCCGTCGTCTCAGCCGCCACCACAGGCCGTGACTATGGCTTCACCGTCGAACTCATCCGCCGTGACTACCC
CAAGTCCCCTATGTACAACCGATCGCAGACTCACTACCATCGCATCGCCGACGCCCTCCGCCGCTCCATCAGCCGTAACACGGCGGCGCTGACAGACACGGCGGAGGCCC
CTATTTACAGCAACAGAGGCGAATACCTCATGGAATTATCCGTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGGAGTGACATCATTTGGACCCAATGC
AAACCATGCAAAAATTGCTACCAACAAAACGCGCCAATGTTTACCCCGAGTAAATCGGCGACATACAAAAAACTGTCGTGCTCCTCTCCGATTTGCTTGTTTGCTGGTGA
GAGTGCTTCATGTTCCTCTCAGTCTGAGTGCTTGTACTCGATTTCTTACGGCGATAGGTCCCACAGCCAAGGAGATTTTGCCGTTGATACGGTTACTATGGGGTCTACCT
CTGGCCGCCGCGTGGCGTTTCCTCGTATGGCCATTGGTTGTGGTCATGACAACGCTGGCACTTTCGATGCTAATGTTTCTGGCATTGTCGGCCTTGGGCTAGGTCCGGCT
TCCCTCGTCACGCAAATGGGACCCTCCGCTGGCGGAAAGTTCTCTTACTGTTTAACTCCGATTGGAAGCAATACTATCGAGTCCAGCAAACTTAACTTTGGCTCTAATGC
CGTCGTCTCCGGCTCTAGCGCCGTCTCAACTCCTATATATATTAGTGATAGATTCAAAAGTTTCTACTGGCTCAAGTTAGAAGGCGTGAGCGTAGGGGAGAAAAAATTTG
AATTTCCAGTCTCTTCAATATTAGGCGGAGAAGCAAACATGATCATTGACTCAGGCACGACGCTTACTTTCCTCCCCATGCATTTATACAACAACTTCTCCACCACAATT
TCCAACTCGATAAACCTCCAGCGGACGAATGACCCAAATCAATTCTTAGATTACTGCTTCGCAACTACCACCGATGACTACAAAGCGCCGCCCGTCACGATGCACTTTGA
GGGTGCCGATGTGCCCCTTCCCCAAGAAAACGTGTTCGTTAGGGTGTCGGACGACGTTGTTTGCTTGGCCTTCTGTCCCGGCCAGGACAACCACATTATGATCTATGGCA
ACATTGCCCAGAACAACTTCTTGGTTGGTTATGATATTAACACCCTGTCTGTTTCTTTCAAGCCGGCAGATTGCATTGCCATCCGTGACTATGGCTTCACTGTCGAACTC
ATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCTGAGACTCACTACCACCGCCTCGCCAATGCCCTCCGCCGTTCCATCAGCCGTAACACAGCGGCATTGAC
AGACACAGCGGAGGCTCCTATTTACAACTATAGAGGCCAATACCTCATGGAAATATCCCTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGAAGCGACA
TCGTTTGGACTCAATGTGAACCATGCCCAAATTGCTACGAACAAAGCGCGCCAATGTTTAACCCGAGTAAATCGGCGACTTACAAAAATGTGGCGTGTTCCTCGCCGATT
TGCTCGTTTGCTGGTGAGGAACGTTCTTGTTCCGCTCAGTCCGAGTGTTTGTACTCGATTACTTACGGCGATAGTTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGT
TACTATGGGGTCCACCTCTGGTCGCCAGGTGGCGTTTCCTCGTATTGCCATTGGTTGTGGTCATGACAATGCTGGAACATTTGATGCTAATGTCTCAGGCATTGTTGGGC
TCGGGCAAGGTCCAGCTTCGCTTGTCTCACAATTGGGACCGGCTACTGGCGGAAAATTCTCTTACTGTTTAGCTCCAATCGGAAATGACACTATTGAGTCCAGCAAACTT
AACTTTGGCTCTAACGCGATCGTTTCCGGCTCTAAAGCTGTATCGACTCTTATTTATACTAGTGATACCTACAAAACCTTCTACTCACTCAAGCTAGAAGCTGTGAGCGT
AGGGGAGAGCAAATTTGATTTTCCAGTAGTCTCTTCAAGATTAGGCGGAGAAGCAAACATCATCATCGACTCTGGCACGACGCTTACTTTACTCCCAACGGATTTATACA
ACAACTTCGCCACTGAAATTTCCGGCTCGATAAACCTCCAGCGCACGAATGATCCGAATCAATATTTAGATGATTGCTACGCGACTACCACTGATGACTATGAAGCGCCA
CCCGTAACCATGCACTTTGAAGGCGCTGATGTACCCCTCCAACGAGAAAACGTGTTCATTAGAGTGTCGGATGACGCTGTTTGCTTGGCTTTTAAAGCAGCTGGGCAGGA
TGAGGACAATATTTTTATCTATGGCAACATTTCCCAGAACAACTTCTTGGTTGGTTATGATACTAAGAACATGTCTGTTTCTTTCAAGCCCGCGGATTGCGTTTCCATGT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGCACCCATTTTCTTTCTCATTTTCTTAATCTCCTATGCCGTCGTCTCAGCCGCCACCACAGGCCGTGACTATGGCTTCACCGTCGAACTCATCCGCCGTGACTACCC
CAAGTCCCCTATGTACAACCGATCGCAGACTCACTACCATCGCATCGCCGACGCCCTCCGCCGCTCCATCAGCCGTAACACGGCGGCGCTGACAGACACGGCGGAGGCCC
CTATTTACAGCAACAGAGGCGAATACCTCATGGAATTATCCGTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGGAGTGACATCATTTGGACCCAATGC
AAACCATGCAAAAATTGCTACCAACAAAACGCGCCAATGTTTACCCCGAGTAAATCGGCGACATACAAAAAACTGTCGTGCTCCTCTCCGATTTGCTTGTTTGCTGGTGA
GAGTGCTTCATGTTCCTCTCAGTCTGAGTGCTTGTACTCGATTTCTTACGGCGATAGGTCCCACAGCCAAGGAGATTTTGCCGTTGATACGGTTACTATGGGGTCTACCT
CTGGCCGCCGCGTGGCGTTTCCTCGTATGGCCATTGGTTGTGGTCATGACAACGCTGGCACTTTCGATGCTAATGTTTCTGGCATTGTCGGCCTTGGGCTAGGTCCGGCT
TCCCTCGTCACGCAAATGGGACCCTCCGCTGGCGGAAAGTTCTCTTACTGTTTAACTCCGATTGGAAGCAATACTATCGAGTCCAGCAAACTTAACTTTGGCTCTAATGC
CGTCGTCTCCGGCTCTAGCGCCGTCTCAACTCCTATATATATTAGTGATAGATTCAAAAGTTTCTACTGGCTCAAGTTAGAAGGCGTGAGCGTAGGGGAGAAAAAATTTG
AATTTCCAGTCTCTTCAATATTAGGCGGAGAAGCAAACATGATCATTGACTCAGGCACGACGCTTACTTTCCTCCCCATGCATTTATACAACAACTTCTCCACCACAATT
TCCAACTCGATAAACCTCCAGCGGACGAATGACCCAAATCAATTCTTAGATTACTGCTTCGCAACTACCACCGATGACTACAAAGCGCCGCCCGTCACGATGCACTTTGA
GGGTGCCGATGTGCCCCTTCCCCAAGAAAACGTGTTCGTTAGGGTGTCGGACGACGTTGTTTGCTTGGCCTTCTGTCCCGGCCAGGACAACCACATTATGATCTATGGCA
ACATTGCCCAGAACAACTTCTTGGTTGGTTATGATATTAACACCCTGTCTGTTTCTTTCAAGCCGGCAGATTGCATTGCCATCCGTGACTATGGCTTCACTGTCGAACTC
ATCCACCGTGACTCCCCCAAGTCCCCTATGTACAACCCATCTGAGACTCACTACCACCGCCTCGCCAATGCCCTCCGCCGTTCCATCAGCCGTAACACAGCGGCATTGAC
AGACACAGCGGAGGCTCCTATTTACAACTATAGAGGCCAATACCTCATGGAAATATCCCTCGGAACGCCGCCGTTTTCGATTCTAGCTGTTGCTGACACAGGAAGCGACA
TCGTTTGGACTCAATGTGAACCATGCCCAAATTGCTACGAACAAAGCGCGCCAATGTTTAACCCGAGTAAATCGGCGACTTACAAAAATGTGGCGTGTTCCTCGCCGATT
TGCTCGTTTGCTGGTGAGGAACGTTCTTGTTCCGCTCAGTCCGAGTGTTTGTACTCGATTACTTACGGCGATAGTTCCCACAGCCAAGGAGATCTTGCCGTTGATACCGT
TACTATGGGGTCCACCTCTGGTCGCCAGGTGGCGTTTCCTCGTATTGCCATTGGTTGTGGTCATGACAATGCTGGAACATTTGATGCTAATGTCTCAGGCATTGTTGGGC
TCGGGCAAGGTCCAGCTTCGCTTGTCTCACAATTGGGACCGGCTACTGGCGGAAAATTCTCTTACTGTTTAGCTCCAATCGGAAATGACACTATTGAGTCCAGCAAACTT
AACTTTGGCTCTAACGCGATCGTTTCCGGCTCTAAAGCTGTATCGACTCTTATTTATACTAGTGATACCTACAAAACCTTCTACTCACTCAAGCTAGAAGCTGTGAGCGT
AGGGGAGAGCAAATTTGATTTTCCAGTAGTCTCTTCAAGATTAGGCGGAGAAGCAAACATCATCATCGACTCTGGCACGACGCTTACTTTACTCCCAACGGATTTATACA
ACAACTTCGCCACTGAAATTTCCGGCTCGATAAACCTCCAGCGCACGAATGATCCGAATCAATATTTAGATGATTGCTACGCGACTACCACTGATGACTATGAAGCGCCA
CCCGTAACCATGCACTTTGAAGGCGCTGATGTACCCCTCCAACGAGAAAACGTGTTCATTAGAGTGTCGGATGACGCTGTTTGCTTGGCTTTTAAAGCAGCTGGGCAGGA
TGAGGACAATATTTTTATCTATGGCAACATTTCCCAGAACAACTTCTTGGTTGGTTATGATACTAAGAACATGTCTGTTTCTTTCAAGCCCGCGGATTGCGTTTCCATGT
GA
Protein sequenceShow/hide protein sequence
MAPIFFLIFLISYAVVSAATTGRDYGFTVELIRRDYPKSPMYNRSQTHYHRIADALRRSISRNTAALTDTAEAPIYSNRGEYLMELSVGTPPFSILAVADTGSDIIWTQC
KPCKNCYQQNAPMFTPSKSATYKKLSCSSPICLFAGESASCSSQSECLYSISYGDRSHSQGDFAVDTVTMGSTSGRRVAFPRMAIGCGHDNAGTFDANVSGIVGLGLGPA
SLVTQMGPSAGGKFSYCLTPIGSNTIESSKLNFGSNAVVSGSSAVSTPIYISDRFKSFYWLKLEGVSVGEKKFEFPVSSILGGEANMIIDSGTTLTFLPMHLYNNFSTTI
SNSINLQRTNDPNQFLDYCFATTTDDYKAPPVTMHFEGADVPLPQENVFVRVSDDVVCLAFCPGQDNHIMIYGNIAQNNFLVGYDINTLSVSFKPADCIAIRDYGFTVEL
IHRDSPKSPMYNPSETHYHRLANALRRSISRNTAALTDTAEAPIYNYRGQYLMEISLGTPPFSILAVADTGSDIVWTQCEPCPNCYEQSAPMFNPSKSATYKNVACSSPI
CSFAGEERSCSAQSECLYSITYGDSSHSQGDLAVDTVTMGSTSGRQVAFPRIAIGCGHDNAGTFDANVSGIVGLGQGPASLVSQLGPATGGKFSYCLAPIGNDTIESSKL
NFGSNAIVSGSKAVSTLIYTSDTYKTFYSLKLEAVSVGESKFDFPVVSSRLGGEANIIIDSGTTLTLLPTDLYNNFATEISGSINLQRTNDPNQYLDDCYATTTDDYEAP
PVTMHFEGADVPLQRENVFIRVSDDAVCLAFKAAGQDEDNIFIYGNISQNNFLVGYDTKNMSVSFKPADCVSM