CuGenDBv2

MS012862 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS012862
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: aspartic proteinase CDR1-like
Genome location: scaffold63:4073757..4080774
RNA-Seq Expression: MS012862
Synteny: MS012862
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0005576 - extracellular region (cellular component)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
  IPR001969 - Aspartic peptidase, active site
  IPR021109 - Aspartic peptidase domain superfamily
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR033121 - Peptidase family A1 domain
  IPR034161 - Pepsin-like domain, plant


Homology
GenBank top hits (e value, % identity, alignment)
KAA3468560.1 aspartic proteinase CDR1-like [Gossypium australe] (e value: 5.4e-164, % identity: 43.82)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        +LM++SLGTP    V IADTGSDL WTQC PC+QCF Q  P F+P  SSTYR++SC +  C  L+ + C  D  +C Y  +YGD S++ G+L  D +T+ 
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  SFRLR-----NTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYY
        S   R      TVIGCG  NGGTF    +SGIIGLGGG +SL+SQL    +V  +FSYCL  I +  N +  INFG +A+VSG G VSTPLV K+PDT+Y
Subjt:  SFRLR-----NTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYY

Query:  YLTLEAVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPL
        +LTLEA++VG T+      S++    GNIIIDSGTTLT LP + Y  V S +T  I AKR E P G L LCY  N   +  IP +T HF   A +KL PL
Subjt:  YLTLEAVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPL

Query:  NTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS--------------------------------------------------------
        NTF  V++   C +F+   + AI+GNL+QM+FL+GYD   + +S                                                        
Subjt:  NTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS--------------------------------------------------------

Query:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGA--HQICDYSYTYGDQSYTKGELGFDKI
        GEYL+ +S+GTP    +A+ADTGSDL+WTQC PC +CF Q  P+FDP +SS++R ++C+S+ C  +    C +     C YS TYGD S++KG++ +D +
Subjt:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGA--HQICDYSYTYGDQSYTKGELGFDKI

Query:  TLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVT
        TLGS       L  T++GCG+ + G F   ASG+IGLGGG +SL++QL   S +  +FSYCL  + +Q   ++K+NFG NA+VSG G VSTP + K P T
Subjt:  TLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVT

Query:  YYYITLEAVSVGNQRHE-AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLP
        +Y++TL+A+SVG QR E   + +    GN++IDSGTTLT +P + Y  + S++       R + P G   LC+ A       + P +T HF+  ADV+L 
Subjt:  YYYITLEAVSVGNQRHE-AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLP

Query:  AVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
         +NTF +V D  +C   + + +  I GNLAQMNFLIGYD  +  +SFK T C+
Subjt:  AVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA

KAG5589183.1 hypothetical protein H5410_039697 [Solanum commersonii] (e value: 2.1e-163, % identity: 43.21)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        +LM++S+GTPPV+ V IADTGSDLTWTQC+PC  CF QS P+F+ + SSTY+ V C    C SL+ S C   G  C Y  +YGDQS+T G+L  DK T  
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  SFRLRNTVI-----GCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNI----FRNANLTGTINFGGHAVVSGRGAVSTPLVPKNP
        S    N VI     GCGH+NGGTF +  +SGIIGLGGG +S+++QL+K   +  +FSYCL  I      N+N+T  INFG  A+VSG   VSTPL+ K P
Subjt:  SFRLRNTVI-----GCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNI----FRNANLTGTINFGGHAVVSGRGAVSTPLVPKNP

Query:  DTYYYLTLEAVSVGN-------TRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHI--PTITA
         TYYYL LE VSVGN       ++ ++D +   +G  GNIIIDSGTTLT LP + Y  + STL   IRA R +DP+G   LCY   E ++G I  PTI  
Subjt:  DTYYYLTLEAVSVGN-------TRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHI--PTITA

Query:  HFVGGAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS-------------------------------------------
        HF   A ++L P +TF E+ + + CLT  P+ E AIFGNLAQ NFL+ YDL A ++S                                           
Subjt:  HFVGGAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS-------------------------------------------

Query:  ---------------GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGD
                       GE+L+++SIGTPP+D   IADTGSDL WTQC PC  CF Q  PIF+P+ SSS++ + C +  C+    S+C  +  C+Y   YGD
Subjt:  ---------------GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGD

Query:  QSYTKGELGFDKITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYC---LPNVYSQAKLTAKINFGGNAVVS
         S+T+G+L  +     S       +   V GCGH++ G F ++ SG+ GLGGG +S+V+Q+  +  +  +FSYC   L  +   +  T+ INFG NA+VS
Subjt:  QSYTKGELGFDKITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYC---LPNVYSQAKLTAKINFGGNAVVS

Query:  GSGVVSTPFVPKLPVTYYYITLEAVSVGNQ----RHEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGG
        G G++STP       T+YY+ L+ +S+GN+    +    ++     GN+IIDSGTTLT++P   Y ++ S L+R + A R +DP   L LC+ + E +G 
Subjt:  GSGVVSTPFVPKLPVTYYYITLEAVSVGNQ----RHEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGG

Query:  VDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTM--ASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC
        +D P I AHF+  AD+ L   N F +V + + CLT+    S+   ILGNLAQ NFLIGYD+ A ++SFK T C
Subjt:  VDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTM--ASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC

KYP35128.1 Aspartic proteinase nepenthesin-1 [Cajanus cajan] (e value: 2.4e-151, % identity: 44.4)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDG-RTCGYGYTYGDQSYTYGELGRDKITI
        +L++ S+GTPP + +GIADTGSDL W+QC PC+QC+NQ  P+F+P  SSTY+ VSC S  C  +  + C  D   +C Y  +YGD S++ G L  D +T+
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDG-RTCGYGYTYGDQSYTYGELGRDKITI

Query:  GS-----FRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTY
         S            IGCG  N GTF S   SGI+GLGGG +SL++Q+    ++  +FSYCL  +   +  T  +NFG +AVV+G G VSTP++  + +T+
Subjt:  GS-----FRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTY

Query:  YYLTLEAVSVGNTR-HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLL
        YYL LE +SVG  R    D S+      GNIIIDSGTTLT LPQ+ Y  + S +   I  +R      IL LCY +       +P +TAHF  GA V L 
Subjt:  YYLTLEAVSVGNTR-HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLL

Query:  PLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFR
         LNTF  V+E VSC  FAP +  +IFGN+AQ+N LVGYD   + + GEYL++ SIGTPP + + IADTGSDLVW QC PC KC+NQ+ P+FDP +SS+++
Subjt:  PLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFR

Query:  HVACTSDPCRILDVS-----QCGAHQICDYSYTYGDQSYTKGELGFDKITLGS-------FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLN
         + C S     L  +       G    C+YS  YGD SY+ G L F+ +TL S       FP  K  +GCG  + G F    SG++GLG G +SL++++ 
Subjt:  HVACTSDPCRILDVS-----QCGAHQICDYSYTYGDQSYTKGELGFDKITLGS-------FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLN

Query:  KYSNVTRRFSYC-LPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLEAVSVGNQRHEAA--AKMSFAAGNMIIDSGTTLTFLPQELYD
           ++  +FSYC LPN   ++K T+K+NFG NAVV+G G VSTP       T+Y + LE +SVG++R E    +  +   GN+IID+GTTLTFLP + Y 
Subjt:  KYSNVTRRFSYC-LPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLEAVSVGNQRHEAA--AKMSFAAGNMIIDSGTTLTFLPQELYD

Query:  SVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSF
         + S +   ++  RV +P  +L LC+ +      +  P ITAHF+ GADV L  +NTF  V+ DV C   A      I GN+AQMN+LIGYD+V   +SF
Subjt:  SVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSF

Query:  KSTVC
        K   C
Subjt:  KSTVC

PHU08739.1 Aspartic proteinase CDR1 [Capsicum chinense] (e value: 1.0e-154, % identity: 43.25)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSL-QASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITI
        +LM++S+GTPPV+ V IADTGSDLTWTQC PC  CF QS P+F+ + SSTY+ + C +  C S+  +S C      C Y   YGD S+T G+L  D  T 
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSL-QASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITI

Query:  GSFRLRNTVI-----GCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRN----ANLTGTINFGGHAVVSGRGAVSTPLVPKN
         S    N  I     GCGH+NGGTF +  +SGIIGLGGG +S++ QL+K   +  +FSYCL  I  +    +N+T  INFG +A+VSG   VSTPL+   
Subjt:  GSFRLRNTVI-----GCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRN----ANLTGTINFGGHAVVSGRGAVSTPLVPKN

Query:  PDTYYYLTLEAVSVGNT----RHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVG
          T+YYLTLE VSVGN     + +    S   G  GNIIIDSGTTLT +P   Y+ + S L   I A R ED +G   LCY ++E      PTI AHF  
Subjt:  PDTYYYLTLEAVSVGNT----RHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVG

Query:  GAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVG------------------YDLHA---------------------------------
         A ++L P +TF +V + + CLT  P+ E AIFGNLAQ NFL+G                  Y+L +                                 
Subjt:  GAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVG------------------YDLHA---------------------------------

Query:  RRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPC-RILDVSQCGAHQICDYSYTYGDQSYTKGELGF
          + GEYL+++SIGTPPV+ +AIADTGSDL WTQC PC  CF QS P+FD  +SS++   +C    C  I   S CG   IC+Y  +Y  QS T G+L F
Subjt:  RRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPC-RILDVSQCGAHQICDYSYTYGDQSYTKGELGF

Query:  DKITLGSFPLSKTVV-----GCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCL----PNVYSQAKLTAKINFGGNAVVSGSGVVSTPF
        DK TL S      V+     GCGHE+ G F    SG+IGLGGG +S++ QL+K+ N   +FSYCL        S + +T+ INFG  A++ G  VVSTP 
Subjt:  DKITLGSFPLSKTVV-----GCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCL----PNVYSQAKLTAKINFGGNAVVSGSGVVSTPF

Query:  VPKLPVTYYYITLEAVSVGNQRHE-----AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITA
        +   P T+YY+ L+ VSVGN + E       +      GN+IIDSGTT T LP +            + A R +DP G  GL + +++G+  +D P I +
Subjt:  VPKLPVTYYYITLEAVSVGNQRHE-----AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITA

Query:  HFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC
        HF+  AD+ L   NTF +V   + CLT+A + D  I GNLAQ NFLIGYD+VA ++SFK T C
Subjt:  HFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC

XP_022136655.1 aspartic proteinase CDR1-like [Momordica charantia] (e value: 4.7e-192, % identity: 98.82)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLE
        SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSL SQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLE
Subjt:  SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLE

Query:  AVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPLNTFGE
        AVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHF G AAVKLLPLNTFGE
Subjt:  AVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPLNTFGE

Query:  VAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS
        VAENVSCLTFAPSSE AIFGNLAQMNFLVGYDLHARRLS
Subjt:  VAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS

TrEMBL top hits (e value, % identity, alignment)
A0A2G3BQJ5 Aspartic proteinase CDR1 (e value: 5.0e-155, % identity: 43.25)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSL-QASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITI
        +LM++S+GTPPV+ V IADTGSDLTWTQC PC  CF QS P+F+ + SSTY+ + C +  C S+  +S C      C Y   YGD S+T G+L  D  T 
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSL-QASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITI

Query:  GSFRLRNTVI-----GCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRN----ANLTGTINFGGHAVVSGRGAVSTPLVPKN
         S    N  I     GCGH+NGGTF +  +SGIIGLGGG +S++ QL+K   +  +FSYCL  I  +    +N+T  INFG +A+VSG   VSTPL+   
Subjt:  GSFRLRNTVI-----GCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRN----ANLTGTINFGGHAVVSGRGAVSTPLVPKN

Query:  PDTYYYLTLEAVSVGNT----RHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVG
          T+YYLTLE VSVGN     + +    S   G  GNIIIDSGTTLT +P   Y+ + S L   I A R ED +G   LCY ++E      PTI AHF  
Subjt:  PDTYYYLTLEAVSVGNT----RHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVG

Query:  GAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVG------------------YDLHA---------------------------------
         A ++L P +TF +V + + CLT  P+ E AIFGNLAQ NFL+G                  Y+L +                                 
Subjt:  GAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVG------------------YDLHA---------------------------------

Query:  RRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPC-RILDVSQCGAHQICDYSYTYGDQSYTKGELGF
          + GEYL+++SIGTPPV+ +AIADTGSDL WTQC PC  CF QS P+FD  +SS++   +C    C  I   S CG   IC+Y  +Y  QS T G+L F
Subjt:  RRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPC-RILDVSQCGAHQICDYSYTYGDQSYTKGELGF

Query:  DKITLGSFPLSKTVV-----GCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCL----PNVYSQAKLTAKINFGGNAVVSGSGVVSTPF
        DK TL S      V+     GCGHE+ G F    SG+IGLGGG +S++ QL+K+ N   +FSYCL        S + +T+ INFG  A++ G  VVSTP 
Subjt:  DKITLGSFPLSKTVV-----GCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCL----PNVYSQAKLTAKINFGGNAVVSGSGVVSTPF

Query:  VPKLPVTYYYITLEAVSVGNQRHE-----AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITA
        +   P T+YY+ L+ VSVGN + E       +      GN+IIDSGTT T LP +            + A R +DP G  GL + +++G+  +D P I +
Subjt:  VPKLPVTYYYITLEAVSVGNQRHE-----AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITA

Query:  HFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC
        HF+  AD+ L   NTF +V   + CLT+A + D  I GNLAQ NFLIGYD+VA ++SFK T C
Subjt:  HFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC

A0A3Q7HJU2 Uncharacterized protein (e value: 7.9e-161, % identity: 42.95)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        ++M++S+GTPPV+ V IADTGSDLTWTQC PC  CF QS P+F+ + SS+Y+   C +  C S+ +S C   G  C Y  +YGDQSYT G+L  D  T  
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  S------FRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRN---ANLTGTINFGGHAVVSGRGAVSTPLVPKNP
        S        + N   GCGH NGGTF +  +SGIIGLGGG++S+++QL+K   +  +FSYCL +I      +N+T  INFG  A VSG   VSTPL+ K P
Subjt:  S------FRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRN---ANLTGTINFGGHAVVSGRGAVSTPLVPKNP

Query:  DTYYYLTLEAVSVGNTRHAADMSSAVEGG-MGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHI--PTITAHFVGGA
         T+YYL LE VSVGN       S    GG  GNIIIDSGTTLT LP   Y  + STL   I A R EDP+G   LCY   E ++G I  PTIT HF   A
Subjt:  DTYYYLTLEAVSVGNTRHAADMSSAVEGG-MGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHI--PTITAHFVGGA

Query:  AVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS-------------------------------------------------
         ++L P +TF ++ E + CLT  P+ E AIFGNLAQ NFL+GYDL A ++S                                                 
Subjt:  AVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS-------------------------------------------------

Query:  ---------------GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGD
                       GEYL+++SIGTPP+D L IADTGSDL WTQC PC  CF Q  PIF+P++SSS++ + C +  C+    S C  +  C+Y  +YGD
Subjt:  ---------------GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGD

Query:  QSYTKGELGFDKITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYC---LPNVYSQAKLTAKINFGGNAVVS
        QS+T G+L  +  T  S       +   V GCGH++ G F ++ SG+IGLGGG +S+V+Q+  +  +  +FSYC   L ++   +  T+ INFG  A VS
Subjt:  QSYTKGELGFDKITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYC---LPNVYSQAKLTAKINFGGNAVVS

Query:  GSGVVSTPFVPKLPVTYYYITLEAVSVGNQRHE-----AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSG
        G  VVSTP + K P T+YY+ LE +S+GN+  E              GN+IIDSGTTLT++P   Y ++ S L+  + A + +DP     LC+ + + +G
Subjt:  GSGVVSTPFVPKLPVTYYYITLEAVSVGNQRHE-----AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSG

Query:  GVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMA-SSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC
         +D P I AHF+  AD+ L   N F +V + + CLT+    +   I GNLAQ NFLIGYD+ A ++SFK T C
Subjt:  GVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMA-SSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC

A0A5B6VH54 Aspartic proteinase CDR1-like (e value: 2.6e-164, % identity: 43.82)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        +LM++SLGTP    V IADTGSDL WTQC PC+QCF Q  P F+P  SSTYR++SC +  C  L+ + C  D  +C Y  +YGD S++ G+L  D +T+ 
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  SFRLR-----NTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYY
        S   R      TVIGCG  NGGTF    +SGIIGLGGG +SL+SQL    +V  +FSYCL  I +  N +  INFG +A+VSG G VSTPLV K+PDT+Y
Subjt:  SFRLR-----NTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYY

Query:  YLTLEAVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPL
        +LTLEA++VG T+      S++    GNIIIDSGTTLT LP + Y  V S +T  I AKR E P G L LCY  N   +  IP +T HF   A +KL PL
Subjt:  YLTLEAVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPL

Query:  NTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS--------------------------------------------------------
        NTF  V++   C +F+   + AI+GNL+QM+FL+GYD   + +S                                                        
Subjt:  NTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS--------------------------------------------------------

Query:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGA--HQICDYSYTYGDQSYTKGELGFDKI
        GEYL+ +S+GTP    +A+ADTGSDL+WTQC PC +CF Q  P+FDP +SS++R ++C+S+ C  +    C +     C YS TYGD S++KG++ +D +
Subjt:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGA--HQICDYSYTYGDQSYTKGELGFDKI

Query:  TLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVT
        TLGS       L  T++GCG+ + G F   ASG+IGLGGG +SL++QL   S +  +FSYCL  + +Q   ++K+NFG NA+VSG G VSTP + K P T
Subjt:  TLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVT

Query:  YYYITLEAVSVGNQRHE-AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLP
        +Y++TL+A+SVG QR E   + +    GN++IDSGTTLT +P + Y  + S++       R + P G   LC+ A       + P +T HF+  ADV+L 
Subjt:  YYYITLEAVSVGNQRHE-AAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLP

Query:  AVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
         +NTF +V D  +C   + + +  I GNLAQMNFLIGYD  +  +SFK T C+
Subjt:  AVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA

A0A6J1C4J6 aspartic proteinase CDR1-like (e value: 2.3e-192, % identity: 98.82)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLE
        SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSL SQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLE
Subjt:  SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLE

Query:  AVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPLNTFGE
        AVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHF G AAVKLLPLNTFGE
Subjt:  AVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPLNTFGE

Query:  VAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS
        VAENVSCLTFAPSSE AIFGNLAQMNFLVGYDLHARRLS
Subjt:  VAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS

F6HJ51 Uncharacterized protein (e value: 1.6e-153, % identity: 40.68)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        +LM + +GTPPV  + I DTGSDLTWTQC PC  C+ Q  P+F+P+ SSTYR  SC +  C +L         + C + Y+Y D S+T G L  + +T+ 
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  S-----FRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYY
        S             GCGH +GG F   +SSGI+GLGGG LSL+SQL     +   FSYCL  +  +++++  INFG    VSG G VSTPLV K+PDT+Y
Subjt:  S-----FRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYY

Query:  YLTLEAVSVGNTR-HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLP
        YLTLE +SVG  R      S   E   GNII+DSGTT TFLPQ  Y  +  ++   I+ KR  DP GI  LCY  N   + + P ITAHF   A V+L P
Subjt:  YLTLEAVSVGNTR-HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLP

Query:  LNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS-------------------------------------------------------
        LNTF  + E++ C T AP+S+  + GNLAQ+NFLVG+DL  +R+S                                                       
Subjt:  LNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS-------------------------------------------------------

Query:  -----------------------------------GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRI
                                           GEY++ +SIGTPPV  +AI DTGSDL WTQC PC  C+ Q +P FDP+ SS++R  +C +  C  
Subjt:  -----------------------------------GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRI

Query:  L-DVSQCGAHQICDYSYTYGDQSYTKGELGFDKITLGSF---PLS--KTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVY
        L +   C   + C + Y+Y D S+T G L  + +T+ S    P+S      GC H S G F + +SG++GLG   LS++SQL   S +  RFSYCL  V+
Subjt:  L-DVSQCGAHQICDYSYTYGDQSYTKGELGFDKITLGSF---PLS--KTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVY

Query:  SQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYY-ITLEAVSVGNQR---HEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVE
        + + ++++INFG + +VSG+G VSTP V K P TYYY ITLE  SVG +R      + K     GN+I+DSGTT T+LP E Y  +  S+   ++ +RV 
Subjt:  SQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYY-ITLEAVSVGNQR---HEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVE

Query:  DPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC
        DP G+  LC+        +D P ITAHF   A+V L   NTF R+ +D+ C T+  +SD GILGNLAQ+NFL+G+D+   R+SFK+  C
Subjt:  DPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVC

SwissProt top hits (e value, % identity, alignment)
Q3EBM5 Probable aspartic protease At2g35615 (e value: 1.5e-79, % identity: 43.32)
Query:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCG---AHQICDYSYTYGDQSYTKGELGFDK
        GE+ + ++IGTPP+   AIADTGSDL W QC PC++C+ ++ PIFD ++SS+++   C S  C+ L  ++ G   ++ IC Y Y+YGDQS++KG++  + 
Subjt:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCG---AHQICDYSYTYGDQSYTKGELGFDK

Query:  ITLGS---FPLS--KTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSG----SGVVSTPFVP
        +++ S    P+S   TV GCG+ + G F +  SG+IGLGGG LSL+SQL   S+++++FSYCL +  +    T+ IN G N++ S     SGVVSTP V 
Subjt:  ITLGS---FPLS--KTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSG----SGVVSTPFVP

Query:  KLPVTYYYITLEAVSVGNQRHEAAAK---------MSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVR-VVRARRVEDPGGVLGLCFAAEEGSGGVDFPT
        K P+TYYY+TLEA+SVG ++               +S  +GN+IIDSGTTLT L    +D   S++   V  A+RV DP G+L  CF  + GS  +  P 
Subjt:  KLPVTYYYITLEAVSVGNQRHEAAAK---------MSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVR-VVRARRVEDPGGVLGLCFAAEEGSGGVDFPT

Query:  ITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
        IT HF+ GADVRL  +N F ++++D+ CL+M  +++  I GN AQM+FL+GYD+    +SF+   C+
Subjt:  ITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA

Q6XBF8 Aspartic proteinase CDR1 (e value: 5.0e-88, % identity: 47.61)
Query:  SGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILD-VSQCGAH-QICDYSYTYGDQSYTKGELGFDK
        SGEYL+ VSIGTPP   +AIADTGSDL+WTQC PC  C+ Q  P+FDP+ SS+++ V+C+S  C  L+  + C  +   C YS +YGD SYTKG +  D 
Subjt:  SGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILD-VSQCGAH-QICDYSYTYGDQSYTKGELGFDK

Query:  ITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKL-P
        +TLGS       L   ++GCGH + G F    SG++GLGGGP+SL+ QL    ++  +FSYCL  + S+   T+KINFG NA+VSGSGVVSTP + K   
Subjt:  ITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKL-P

Query:  VTYYYITLEAVSVGNQRHEAAAKMSFAA-GNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVR
         T+YY+TL+++SVG+++ + +   S ++ GN+IIDSGTTLT LP E Y  +  ++   + A + +DP   L LC++A   +G +  P IT HF  GADV+
Subjt:  VTYYYITLEAVSVGNQRHEAAAKMSFAA-GNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVR

Query:  LPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
        L + N F +V++D+ C     S  F I GN+AQMNFL+GYD V+  +SFK T CA
Subjt:  LPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA

Q766C2 Aspartic proteinase nepenthesin-2 (e value: 4.6e-57, % identity: 38)
Query:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGDQSYTKGELGFDKITL
        GEYL+ V+IGTP   F AI DTGSDL+WTQC PC +CF+Q  PIF+P+ SSSF  + C S  C+ L    C  ++ C Y+Y YGD S T+G +  +  T 
Subjt:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGDQSYTKGELGFDKITL

Query:  GSFPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLE
         +  +     GCG ++ G      +G+IG+G GPLSL SQL        +FSYC+ +  S +  T  +    + V  GS   +T     L  TYYYITL+
Subjt:  GSFPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLE

Query:  AVSVGNQR----HEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNT
         ++VG                  G MIIDSGTTLT+LPQ+ Y++V  +    +    V++    L  CF        V  P I+  F GG  + L   N 
Subjt:  AVSVGNQR----HEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNT

Query:  FERVADDVSCLTMASSSDFG--ILGNLAQMNFLIGYDMVAMRLSFKSTVC
            A+ V CL M SSS  G  I GN+ Q    + YD+  + +SF  T C
Subjt:  FERVADDVSCLTMASSSDFG--ILGNLAQMNFLIGYDMVAMRLSFKSTVC

Q766C3 Aspartic proteinase nepenthesin-1 (e value: 4.7e-62, % identity: 38.86)
Query:  AQMNFLVGYDLHARRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTY
        A +N   G +       GEYL+ +SIGTP   F AI DTGSDL+WTQC PC +CFNQS PIF+P+ SSSF  + C+S  C+ L    C ++  C Y+Y Y
Subjt:  AQMNFLVGYDLHARRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTY

Query:  GDQSYTKGELGFDKITLGSFPLSKTVVGCGHESDGGFGD-IASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVV
        GD S T+G +G + +T GS  +     GCG E++ GFG    +G++G+G GPLSL SQL+       +FSYC+  + S       +    N+V +GS   
Subjt:  GDQSYTKGELGFDKITLGSFPLSKTVVGCGHESDGGFGD-IASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVV

Query:  STPFVPKLPVTYYYITLEAVSVGNQR-----HEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFP
        +     ++P T+YYITL  +SVG+ R        A   +   G +IIDSGTTLT+     Y SV    +  +    V        LCF        +  P
Subjt:  STPFVPKLPVTYYYITLEAVSVGNQR-----HEAAAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFP

Query:  TITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSD-FGILGNLAQMNFLIGYDMVAMRLSFKSTVC
        T   HF GG D+ LP+ N F   ++ + CL M SSS    I GN+ Q N L+ YD     +SF S  C
Subjt:  TITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSD-FGILGNLAQMNFLIGYDMVAMRLSFKSTVC

Q9LNJ3 Aspartyl protease family protein 2 (e value: 4.3e-47, % identity: 34.67)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        +   + +GTP      + DTGSD+ W QC PC +C++QS PIF+PR S TY  + C S  C  L ++GC    +TC Y  +YGD S+T G+   + +T  
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNP--DTYYYLT
          R++   +GCGH+N G F   A  G++GLG G LS   Q        ++FSYCL  + R+A+   +    G+A VS R A  TPL+  NP  DT+YY+ 
Subjt:  SFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNP--DTYYYLT

Query:  LEAVSVGNTR---HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGG-----AAV
        L  +SVG TR     A +    + G G +IIDSGT++T L +  Y  +        +  +      + + C+  + + +  +PT+  HF G      A  
Subjt:  LEAVSVGNTR---HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGG-----AAV

Query:  KLLPLNTFGEVAENVSCLTFAPSSES-AIFGNLAQMNFLVGYDLHARRL
         L+P++T G+      C  FA +    +I GN+ Q  F V YDL + R+
Subjt:  KLLPLNTFGEVAENVSCLTFAPSSES-AIFGNLAQMNFLVGYDLHARRL

Arabidopsis top hits (e value, % identity, alignment)
AT1G31450.1 Eukaryotic aspartyl protease family protein (e value: 1.8e-77, % identity: 44.94)
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSL--QASGCGPDGRTCGYGYTYGDQSYTYGELGRDKIT
        + M +S+GTPP K   IADTGSDLTW QC PC QC+ Q+ P+F+ + SSTY+  SC S TC +L     GC      C Y Y+YGD S+T G++  + I+
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSL--QASGCGPDGRTCGYGYTYGDQSYTYGELGRDKIT

Query:  I-----GSFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSG----RGAVSTPLVPK
        I      S     TV GCG+ NGGTF     SGIIGLGGG LSLVSQL    ++ ++FSYCL +     N T  IN G +++ S        ++TPL+ K
Subjt:  I-----GSFRLRNTVIGCGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSG----RGAVSTPLVPK

Query:  NPDTYYYLTLEAVSVGNTR-----HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTK-VIRAKRAEDPAGILELCYATNEIRDGHIPTITAH
        +P+TYY+LTLEAV+VG T+         ++       GNIIIDSGTTLT L    YD   + + + V  AKR  DP G+L  C+ + +   G +P IT H
Subjt:  NPDTYYYLTLEAVSVGNTR-----HAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTK-VIRAKRAEDPAGILELCYATNEIRDGHIPTITAH

Query:  FVGGAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS
        F   A VKL P+N F ++ E+  CL+  P++E AI+GN+ QM+FLVGYDL  + +S
Subjt:  FVGGAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVGYDLHARRLS

AT1G64830.1 Eukaryotic aspartyl protease family protein (e value: 6.1e-97, % identity: 50.28)
Query:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQ-ICDYSYTYGDQSYTKGELGFDKIT
        GEYL+ +SIGTPPV  LAIADTGSDL+WTQC PC  C+ Q+ P+FDP+ SS++R V+C+S  CR L+ + C   +  C Y+ TYGD SYTKG++  D +T
Subjt:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQ-ICDYSYTYGDQSYTKGELGFDKIT

Query:  LGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTY
        +GS       L   ++GCGHE+ G F    SG+IGLGGG  SLVSQL K  ++  +FSYCL    S+  LT+KINFG N +VSG GVVST  V K P TY
Subjt:  LGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTY

Query:  YYITLEAVSVGNQRHEAAAKM-SFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPA
        Y++ LEA+SVG+++ +  + +     GN++IDSGTTLT LP   Y  + S +   ++A RV+DP G+L LC+     S     P IT HF GG DV+L  
Subjt:  YYITLEAVSVGNQRHEAAAKM-SFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPA

Query:  VNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
        +NTF  V++DVSC   A++    I GNLAQMNFL+GYD V+  +SFK T C+
Subjt:  VNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA

AT2G28220.1 Eukaryotic aspartyl protease family protein | 2.9e-115 | 37.12
Query:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG
        +LM++ +GTPP +     DTGSDL WTQC+PC  C++Q  PIF+P  SST+ +  C                G++C Y   Y D +Y+ G L  + +TI 
Subjt:  FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIG

Query:  S-----FRLRNTVIGCGHENGGTFGSG---ASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPD
        S     F +  T IGCG  N     SG   +SSGI+GL  G  SL+SQ++         SYC      +   T  INFG +A+V+G G V+  +  K  +
Subjt:  S-----FRLRNTVIGCGHENGGTFGSG---ASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPD

Query:  TYYYLTLEAVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKL
         +YYL L+AVSV + R    + +      GNI+IDSG+T+T+ P +  + V   + +V+ A R  DP+G   LCY +  I     P IT HF GGA + L
Subjt:  TYYYLTLEAVSVGNTRHAADMSSAVEGGMGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKL

Query:  LPLNTFGEV-AENVSCLTFAPSS--ESAIFGNLAQMNFLVGYDLHARRLSGE------------YLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCF
           N + E  +  + CL    +S  + AIFGN AQ NFLVGYD  +  L G             YL+++ +GTPP + +A  DTGSD++WTQC+PC  C+
Subjt:  LPLNTFGEV-AENVSCLTFAPSS--ESAIFGNLAQMNFLVGYDLHARRLSGE------------YLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCF

Query:  NQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGDQSYTKGELGFDKITLGS-----FPLSKTVVGCGHESD----GGFGDIASGVIG
        +Q  PIFDP +SS+FR   C  + C               Y   Y D++Y+KG L  + +T+ S     F +++T +GCG ++      GF   +SG++G
Subjt:  NQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGDQSYTKGELGFDKITLGS-----FPLSKTVVGCGHESD----GGFGDIASGVIG

Query:  LGGGPLSLVSQLN-KYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLEAVSV-GNQRHEAAAKMSFAAGNMIIDSG
        L  GPLSL+SQ++  Y  +    SYC        + T+KINFG NA+V+G G V+     K    +YY+ L+AVSV  N             GN+ IDSG
Subjt:  LGGGPLSLVSQLN-KYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLEAVSV-GNQRHEAAAKMSFAAGNMIIDSG

Query:  TTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVD-FPTITAHFSGGADVRLPAVNTF-ERVADDVSCLTMASS--SDFGILGNLAQ
        TTLT+ P    + V  ++ +VV A +V D G    LC+ ++     +D FP IT HFSGGAD+ L   N + E +   + CL +  +  S   + GN AQ
Subjt:  TTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVD-FPTITAHFSGGADVRLPAVNTF-ERVADDVSCLTMASS--SDFGILGNLAQ

Query:  MNFLIGYDMVAMRLSFKSTVCA
         NFL+GYD  +  +SF  T C+
Subjt:  MNFLIGYDMVAMRLSFKSTVCA

AT2G35615.1 Eukaryotic aspartyl protease family protein | 1.0e-80 | 43.32
Query:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCG---AHQICDYSYTYGDQSYTKGELGFDK
        GE+ + ++IGTPP+   AIADTGSDL W QC PC++C+ ++ PIFD ++SS+++   C S  C+ L  ++ G   ++ IC Y Y+YGDQS++KG++  + 
Subjt:  GEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCG---AHQICDYSYTYGDQSYTKGELGFDK

Query:  ITLGS---FPLS--KTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSG----SGVVSTPFVP
        +++ S    P+S   TV GCG+ + G F +  SG+IGLGGG LSL+SQL   S+++++FSYCL +  +    T+ IN G N++ S     SGVVSTP V 
Subjt:  ITLGS---FPLS--KTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSG----SGVVSTPFVP

Query:  KLPVTYYYITLEAVSVGNQRHEAAAK---------MSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVR-VVRARRVEDPGGVLGLCFAAEEGSGGVDFPT
        K P+TYYY+TLEA+SVG ++               +S  +GN+IIDSGTTLT L    +D   S++   V  A+RV DP G+L  CF  + GS  +  P 
Subjt:  KLPVTYYYITLEAVSVGNQRHEAAAK---------MSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVR-VVRARRVEDPGGVLGLCFAAEEGSGGVDFPT

Query:  ITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
        IT HF+ GADVRL  +N F ++++D+ CL+M  +++  I GN AQM+FL+GYD+    +SF+   C+
Subjt:  ITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA

AT5G33340.1 Eukaryotic aspartyl protease family protein | 3.6e-89 | 47.61
Query:  SGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILD-VSQCGAH-QICDYSYTYGDQSYTKGELGFDK
        SGEYL+ VSIGTPP   +AIADTGSDL+WTQC PC  C+ Q  P+FDP+ SS+++ V+C+S  C  L+  + C  +   C YS +YGD SYTKG +  D 
Subjt:  SGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILD-VSQCGAH-QICDYSYTYGDQSYTKGELGFDK

Query:  ITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKL-P
        +TLGS       L   ++GCGH + G F    SG++GLGGGP+SL+ QL    ++  +FSYCL  + S+   T+KINFG NA+VSGSGVVSTP + K   
Subjt:  ITLGS-----FPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKL-P

Query:  VTYYYITLEAVSVGNQRHEAAAKMSFAA-GNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVR
         T+YY+TL+++SVG+++ + +   S ++ GN+IIDSGTTLT LP E Y  +  ++   + A + +DP   L LC++A   +G +  P IT HF  GADV+
Subjt:  VTYYYITLEAVSVGNQRHEAAAKMSFAA-GNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVR

Query:  LPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA
        L + N F +V++D+ C     S  F I GN+AQMNFL+GYD V+  +SFK T CA
Subjt:  LPAVNTFERVADDVSCLTMASSSDFGILGNLAQMNFLIGYDMVAMRLSFKSTVCA


Sequences
CDS sequence
TTTCTGATGGAAGTCTCCCTCGGAACCCCGCCGGTAAAGTTCGTCGGAATCGCCGATACTGGAAGCGACCTGACGTGGACGCAGTGCCTGCCATGTAACCAATGCTTCAA
CCAATCAGGCCCCATTTTTAATCCACGTGGATCCTCCACCTACCGCCAAGTGTCGTGCAGGTCCGGTACTTGCCACTCCCTCCAGGCCTCCGGGTGTGGGCCCGACGGCC
GAACCTGCGGCTACGGTTACACGTACGGAGACCAGTCCTACACGTACGGAGAGCTAGGGCGTGACAAAATTACCATCGGGTCCTTCCGCCTCCGCAACACGGTAATCGGA
TGCGGCCATGAGAACGGGGGCACTTTCGGCAGCGGAGCTTCGTCGGGGATCATCGGACTCGGCGGCGGCAGCCTCTCGTTGGTCTCTCAGTTGAACAAAATCGGCGCCGT
CCGCCGGAGATTCTCCTATTGCTTGCCCAATATCTTCAGAAACGCGAATCTCACCGGCACAATAAACTTCGGAGGACACGCCGTCGTTTCGGGCCGTGGAGCAGTTTCAA
CGCCGCTCGTCCCGAAAAATCCGGACACATACTATTACCTGACTCTCGAAGCGGTCTCCGTGGGAAACACGCGTCACGCGGCCGACATGTCGTCTGCCGTGGAGGGAGGA
ATGGGGAACATAATCATCGACTCCGGGACGACACTGACGTTCCTTCCTCAGAACTTATACGACGGCGTCGTTTCGACTTTGACCAAAGTCATCCGAGCAAAGCGTGCGGA
GGATCCGGCTGGGATTCTGGAACTGTGCTACGCCACAAACGAGATCCGCGATGGGCATATTCCGACCATCACCGCCCATTTCGTCGGCGGTGCGGCCGTGAAGTTGCTGC
CGTTGAACACGTTTGGGGAGGTCGCCGAAAATGTGAGTTGCTTGACATTTGCGCCGTCGTCGGAGTCGGCCATTTTTGGGAATTTGGCTCAGATGAACTTTTTGGTCGGA
TATGATCTCCATGCGAGGAGGCTATCCGGCGAGTATCTAATCGAAGTCTCCATCGGAACGCCGCCGGTGGACTTCCTCGCCATCGCCGACACCGGAAGCGACCTGGTGTG
GACCCAGTGCCTGCCATGTCGGAAATGCTTCAACCAATCACTCCCCATTTTCGATCCACGTCGGTCTTCCTCCTTCCGCCACGTGGCATGCACGTCCGACCCCTGCCGTA
TCCTCGATGTCTCCCAATGTGGGGCCCACCAGATTTGCGACTACAGCTACACCTACGGAGATCAGTCCTACACCAAGGGGGAGCTGGGGTTTGATAAGATCACCCTCGGG
TCATTCCCGCTGAGTAAGACAGTCGTCGGATGCGGCCACGAGAGCGACGGCGGGTTCGGTGACATTGCTTCGGGGGTCATCGGACTCGGCGGTGGGCCCCTCTCATTGGT
CTCTCAGTTGAACAAATATTCTAACGTTACCCGGCGGTTCTCCTATTGCCTACCCAACGTCTACAGTCAAGCGAAACTCACCGCCAAAATAAACTTCGGTGGAAACGCTG
TCGTTTCGGGGAGTGGAGTCGTTTCGACGCCGTTTGTCCCCAAACTCCCCGTCACCTACTACTACATAACTCTGGAAGCCGTCTCCGTCGGGAACCAGCGTCACGAGGCG
GCGGCCAAAATGTCGTTCGCCGCGGGAAACATGATTATAGACTCCGGGACGACATTGACGTTTCTGCCCCAGGAATTATACGACAGCGTCGTTTCGAGTTTGGTGAGAGT
GGTCCGAGCGAGGCGGGTGGAGGATCCGGGCGGAGTTCTTGGACTGTGCTTCGCGGCGGAAGAAGGGAGCGGCGGAGTGGATTTTCCGACGATCACGGCCCATTTTTCCG
GCGGCGCCGACGTTAGATTGCCGGCTGTGAACACGTTCGAGAGGGTGGCGGATGATGTGAGTTGTTTAACGATGGCTTCGTCGTCGGATTTTGGGATTTTGGGGAATTTG
GCGCAGATGAACTTTTTGATCGGATATGATATGGTGGCGATGAGATTGTCGTTCAAGTCAACGGTGTGTGCT
mRNA sequence
TTTCTGATGGAAGTCTCCCTCGGAACCCCGCCGGTAAAGTTCGTCGGAATCGCCGATACTGGAAGCGACCTGACGTGGACGCAGTGCCTGCCATGTAACCAATGCTTCAA
CCAATCAGGCCCCATTTTTAATCCACGTGGATCCTCCACCTACCGCCAAGTGTCGTGCAGGTCCGGTACTTGCCACTCCCTCCAGGCCTCCGGGTGTGGGCCCGACGGCC
GAACCTGCGGCTACGGTTACACGTACGGAGACCAGTCCTACACGTACGGAGAGCTAGGGCGTGACAAAATTACCATCGGGTCCTTCCGCCTCCGCAACACGGTAATCGGA
TGCGGCCATGAGAACGGGGGCACTTTCGGCAGCGGAGCTTCGTCGGGGATCATCGGACTCGGCGGCGGCAGCCTCTCGTTGGTCTCTCAGTTGAACAAAATCGGCGCCGT
CCGCCGGAGATTCTCCTATTGCTTGCCCAATATCTTCAGAAACGCGAATCTCACCGGCACAATAAACTTCGGAGGACACGCCGTCGTTTCGGGCCGTGGAGCAGTTTCAA
CGCCGCTCGTCCCGAAAAATCCGGACACATACTATTACCTGACTCTCGAAGCGGTCTCCGTGGGAAACACGCGTCACGCGGCCGACATGTCGTCTGCCGTGGAGGGAGGA
ATGGGGAACATAATCATCGACTCCGGGACGACACTGACGTTCCTTCCTCAGAACTTATACGACGGCGTCGTTTCGACTTTGACCAAAGTCATCCGAGCAAAGCGTGCGGA
GGATCCGGCTGGGATTCTGGAACTGTGCTACGCCACAAACGAGATCCGCGATGGGCATATTCCGACCATCACCGCCCATTTCGTCGGCGGTGCGGCCGTGAAGTTGCTGC
CGTTGAACACGTTTGGGGAGGTCGCCGAAAATGTGAGTTGCTTGACATTTGCGCCGTCGTCGGAGTCGGCCATTTTTGGGAATTTGGCTCAGATGAACTTTTTGGTCGGA
TATGATCTCCATGCGAGGAGGCTATCCGGCGAGTATCTAATCGAAGTCTCCATCGGAACGCCGCCGGTGGACTTCCTCGCCATCGCCGACACCGGAAGCGACCTGGTGTG
GACCCAGTGCCTGCCATGTCGGAAATGCTTCAACCAATCACTCCCCATTTTCGATCCACGTCGGTCTTCCTCCTTCCGCCACGTGGCATGCACGTCCGACCCCTGCCGTA
TCCTCGATGTCTCCCAATGTGGGGCCCACCAGATTTGCGACTACAGCTACACCTACGGAGATCAGTCCTACACCAAGGGGGAGCTGGGGTTTGATAAGATCACCCTCGGG
TCATTCCCGCTGAGTAAGACAGTCGTCGGATGCGGCCACGAGAGCGACGGCGGGTTCGGTGACATTGCTTCGGGGGTCATCGGACTCGGCGGTGGGCCCCTCTCATTGGT
CTCTCAGTTGAACAAATATTCTAACGTTACCCGGCGGTTCTCCTATTGCCTACCCAACGTCTACAGTCAAGCGAAACTCACCGCCAAAATAAACTTCGGTGGAAACGCTG
TCGTTTCGGGGAGTGGAGTCGTTTCGACGCCGTTTGTCCCCAAACTCCCCGTCACCTACTACTACATAACTCTGGAAGCCGTCTCCGTCGGGAACCAGCGTCACGAGGCG
GCGGCCAAAATGTCGTTCGCCGCGGGAAACATGATTATAGACTCCGGGACGACATTGACGTTTCTGCCCCAGGAATTATACGACAGCGTCGTTTCGAGTTTGGTGAGAGT
GGTCCGAGCGAGGCGGGTGGAGGATCCGGGCGGAGTTCTTGGACTGTGCTTCGCGGCGGAAGAAGGGAGCGGCGGAGTGGATTTTCCGACGATCACGGCCCATTTTTCCG
GCGGCGCCGACGTTAGATTGCCGGCTGTGAACACGTTCGAGAGGGTGGCGGATGATGTGAGTTGTTTAACGATGGCTTCGTCGTCGGATTTTGGGATTTTGGGGAATTTG
GCGCAGATGAACTTTTTGATCGGATATGATATGGTGGCGATGAGATTGTCGTTCAAGTCAACGGTGTGTGCT
Protein sequence
FLMEVSLGTPPVKFVGIADTGSDLTWTQCLPCNQCFNQSGPIFNPRGSSTYRQVSCRSGTCHSLQASGCGPDGRTCGYGYTYGDQSYTYGELGRDKITIGSFRLRNTVIG
CGHENGGTFGSGASSGIIGLGGGSLSLVSQLNKIGAVRRRFSYCLPNIFRNANLTGTINFGGHAVVSGRGAVSTPLVPKNPDTYYYLTLEAVSVGNTRHAADMSSAVEGG
MGNIIIDSGTTLTFLPQNLYDGVVSTLTKVIRAKRAEDPAGILELCYATNEIRDGHIPTITAHFVGGAAVKLLPLNTFGEVAENVSCLTFAPSSESAIFGNLAQMNFLVG
YDLHARRLSGEYLIEVSIGTPPVDFLAIADTGSDLVWTQCLPCRKCFNQSLPIFDPRRSSSFRHVACTSDPCRILDVSQCGAHQICDYSYTYGDQSYTKGELGFDKITLG
SFPLSKTVVGCGHESDGGFGDIASGVIGLGGGPLSLVSQLNKYSNVTRRFSYCLPNVYSQAKLTAKINFGGNAVVSGSGVVSTPFVPKLPVTYYYITLEAVSVGNQRHEA
AAKMSFAAGNMIIDSGTTLTFLPQELYDSVVSSLVRVVRARRVEDPGGVLGLCFAAEEGSGGVDFPTITAHFSGGADVRLPAVNTFERVADDVSCLTMASSSDFGILGNL
AQMNFLIGYDMVAMRLSFKSTVCA
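
The protein sequence above can be checked against the CDS by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` is illustrative and is not part of CuGenDB. Only the first 30 nucleotides of the CDS are used here.

```python
# Build the standard genetic code table (assumption: standard nuclear code).
# Codons are enumerated with bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'.

    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS sequence listed above.
cds_prefix = "TTTCTGATGGAAGTCTCCCTCGGAACCCCG"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

Applying the same function to the full CDS with its line breaks removed gives a sequence that can be compared with the listed protein sequence.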