CuGenDBv2

Spg004989 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg004989
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: scaffold4:10795268..10799792
RNA-Seq Expression: Spg004989
Synteny: Spg004989
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR001461 - Aspartic peptidase A1 family
  IPR001969 - Aspartic peptidase, active site
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR021109 - Aspartic peptidase domain superfamily
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR033121 - Peptidase family A1 domain
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
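
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'scaffold:start..end' location."""
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# scaffold name, a colon, start, two dots, end
_LOCATION_RE = re.compile(r"^(?P<scaffold>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold4:10795268..10799792'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["scaffold"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold4:10795268..10799792")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 4525 bp on scaffold4.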


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6592908.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 4.9e-145; % identity: 97
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASS+PSSMVFGDAAI
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SRLARFTPLI NPKLETFYYVELIGISVGGVRVRG+SASLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLKRGPEFSLFDTCYDLSGQS
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        AVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

KAG7025313.1 Aspartyl protease family protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 4.9e-145; % identity: 97
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASS+PSSMVFGDAAI
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SRLARFTPLI NPKLETFYYVELIGISVGGVRVRG+SASLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLKRGPEFSLFDTCYDLSGQS
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        AVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

XP_022959948.1 aspartyl protease family protein 2 [Cucurbita moschata]; e-value: 4.9e-145; % identity: 97
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASS+PSSMVFGDAAI
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SRLARFTPLI NPKLETFYYVELIGISVGGVRVRG+SASLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLKRGPEFSLFDTCYDLSGQS
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        AVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

XP_023005015.1 aspartyl protease family protein 2 [Cucurbita maxima]; e-value: 1.4e-144; % identity: 96.63
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASS+PSSMVFGDAAI
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SRLARFTPLI NPKLETFYYVELIG SVGGVRVRG+SASLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLKRGPEFSLFDTCYDLSGQS
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        AVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

XP_038896166.1 aspartyl protease family protein 2 [Benincasa hispida]; e-value: 4.9e-145; % identity: 96.64
Query:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA
        +TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASS+PSSMVFGDAA
Subjt:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA

Query:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ
        ISR ARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLK+GPEFSLFDTCYDLSGQ
Subjt:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ

Query:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        S VKVPTVVLHFRGADMSLPATNYLIPVDD+GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDL+GSRIG
Subjt:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A0A0K4G2 Peptidase A1 domain-containing protein; e-value: 4.2e-142; % identity: 94.78
Query:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA
        +TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASS+PSSMVFGDAA
Subjt:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA

Query:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ
        ISRLARFTPLIRNPKL+TFYYV LIGISVGGVRVRGVS SLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFR GA HLKRGPEFSLFDTCYDLSGQ
Subjt:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ

Query:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        S+VKVPTVVLHFRGADM+LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

A0A1S3CHC4 aspartyl protease family protein 2; e-value: 1.6e-141; % identity: 94.78
Query:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA
        +TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGRLSFPSQTGIRFN KFSYCLVDRSASS+PSSMVFGDAA
Subjt:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA

Query:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ
        ISRLARFTPLIRNPKL+TFYYVELIGISVGGVRVRGV  SLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA HLKRGPEFSLFDTCYDLSGQ
Subjt:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ

Query:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        S+VKVPTVVLHFRGADM LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

A0A5A7U8Z2 Aspartyl protease family protein 2; e-value: 1.6e-141; % identity: 94.78
Query:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA
        +TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGH NEGLFVGAAGLLGLGRGRLSFPSQTGIRFN KFSYCLVDRSASS+PSSMVFGDAA
Subjt:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAA

Query:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ
        ISRLARFTPLIRNPKL+TFYYVELIGISVGGVRVRGV  SLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA HLKRGPEFSLFDTCYDLSGQ
Subjt:  ISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQ

Query:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        S+VKVPTVVLHFRGADM LPATNYLIPVD++GSFCFAFAGT+SGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  SAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

A0A6J1H7E0 aspartyl protease family protein 2; e-value: 2.4e-145; % identity: 97
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASS+PSSMVFGDAAI
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SRLARFTPLI NPKLETFYYVELIGISVGGVRVRG+SASLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLKRGPEFSLFDTCYDLSGQS
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        AVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

A0A6J1KW81 aspartyl protease family protein 2; e-value: 6.9e-145; % identity: 96.63
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGR SFPSQTG+RFNHKFSYCLVDRSASS+PSSMVFGDAAI
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SRLARFTPLI NPKLETFYYVELIG SVGGVRVRG+SASLFKLD AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGA+HLKRGPEFSLFDTCYDLSGQS
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        AVKVPTVVLHFRGADM+LPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

SwissProt top hits (hit; e-value; % identity; alignment)
Q766C3 Aspartic proteinase nepenthesin-1; e-value: 3.5e-45; % identity: 41.85
Query:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVG-AAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDA
        T   + C Y   YGDGS T G   TETLTF    I  +  GCG +N+G   G  AGL+G+GRG LS PSQ  +    KFSYC+     SS PS+++ G  
Subjt:  TTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVG-AAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDA

Query:  AISRLARF--TPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYD
        A S  A    T LI++ ++ TFYY+ L G+SVG  R+  +  S F L+   G GG+IIDSGT++T     AY ++R  F +           S FD C+ 
Subjt:  AISRLARF--TPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDP-AGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYD

Query:  L-SGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGS
          S  S +++PT V+HF G D+ LP+ NY I    +G  C A   +  G+SI GNIQQQ   VVYD   S
Subjt:  L-SGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGS

Q8S9J6 Aspartyl protease family protein At5g10770; e-value: 1.1e-46; % identity: 41.13
Query:  CLYQVSYGDGSFTTGDFATETLTFRGNKIAK-VALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAISRLA
        C+Y + YGD SF+ G  A E  T   + +   V  GCG +N+GLF G AGLLGLGR +LSFPSQT   +N  FSYCL   S++S    + FG A ISR  
Subjt:  CLYQVSYGDGSFTTGDFATETLTFRGNKIAK-VALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAISRLA

Query:  RFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSAVKV
        +FTP+       +FY + ++ I+VGG ++  + +++F        G +IDSGT +TRL   AY ALR +F+A  +        S+ DTC+DLSG   V +
Subjt:  RFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSAVKV

Query:  PTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAGSRIG
        P V   F G  +    +  +  V      C AFAG    S  +I GN+QQQ   VVYD AG R+G
Subjt:  PTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTM--SGLSIIGNIQQQGFRVVYDLAGSRIG

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 2; e-value: 2.4e-70; % identity: 51.71
Query:  CLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAISRLAR
        C Y+V YGDGS+T G  A ETLTF    +  VA+GCGH N G+F+GAAGLLG+G G +SF  Q   +    F YCLV R   S   S+VFG  A+   A 
Subjt:  CLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAISRLAR

Query:  FTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSAVKVP
        + PL+RNP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+VTRL   AY A RD F++  A+L R    S+FDTCYDLSG  +V+VP
Subjt:  FTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSAVKVP

Query:  TVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        TV  +F  G  ++LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +G
Subjt:  TVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

Q9LNJ3 Aspartyl protease family protein 2; e-value: 1.2e-125; % identity: 81.18
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRR TCLYQVSYGDGSFT GDF+TETLTFR N++  VALGCGHDNEGLFVGAAGLLGLG+G+LSFP QTG RFN KFSYCLVDRSASS+PSS+VFG+AA+
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV GV+ASLFKLD  GNGGVIIDSGTSVTRL RPAY A+RDAFR GA  LKR P+FSLFDTC+DLS  +
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGNRAG
         VKVPTVVLHFRGAD+SLPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+G   G
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGNRAG

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1; e-value: 1.7e-76; % identity: 58.36
Query:  RRHTCLYQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        R + CLYQVSYGDGSFT G+ AT+T+TF    KI  VALGCGHDNEGLF GAAGLLGLG G LS  +Q        FSYCLVDR  S + SS+ F    +
Subjt:  RRHTCLYQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRG-PEFSLFDTCYDLSGQ
               PL+RN K++TFYYV L G SVGG +V  +  ++F +D +G+GGVI+D GT+VTRL   AY +LRDAF     +LK+G    SLFDTCYD S  
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRG-PEFSLFDTCYDLSGQ

Query:  SAVKVPTVVLHFRGA-DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        S VKVPTV  HF G   + LPA NYLIPVDDSG+FCFAFA T S LSIIGN+QQQG R+ YDL+ + IG
Subjt:  SAVKVPTVVLHFRGA-DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

Arabidopsis top hits (hit; e-value; % identity; alignment)
AT1G01300.1 Eukaryotic aspartyl protease family protein; e-value: 8.3e-127; % identity: 81.18
Query:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        TRR TCLYQVSYGDGSFT GDF+TETLTFR N++  VALGCGHDNEGLFVGAAGLLGLG+G+LSFP QTG RFN KFSYCLVDRSASS+PSS+VFG+AA+
Subjt:  TRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS
        SR+ARFTPL+ NPKL+TFYYV L+GISVGG RV GV+ASLFKLD  GNGGVIIDSGTSVTRL RPAY A+RDAFR GA  LKR P+FSLFDTC+DLS  +
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQS

Query:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGNRAG
         VKVPTVVLHFRGAD+SLPATNYLIPVD +G FCFAFAGTM GLSIIGNIQQQGFRVVYDLA SR+G   G
Subjt:  AVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGNRAG

AT1G25510.1 Eukaryotic aspartyl protease family protein; e-value: 3.6e-77; % identity: 56.55
Query:  RRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAIS
        R  TCLY+VSYGDGS+T GDFATETLT     +  VA+GCGH NEGLFVGAAGLLGLG G L+ PSQ        FSYCLVDR + S  S++ FG  ++S
Subjt:  RRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAIS

Query:  RLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSA
          A   PL+RN +L+TFYY+ L GISVGG  ++ +  S F++D +G+GG+IIDSGT+VTRL    Y +LRD+F  G   L++    ++FDTCY+LS ++ 
Subjt:  RLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSA

Query:  VKVPTVVLHFRGADM-SLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        V+VPTV  HF G  M +LPA NY+IPVD  G+FC AFA T S L+IIGN+QQQG RV +DLA S IG
Subjt:  VKVPTVVLHFRGADM-SLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

AT3G18490.1 Eukaryotic aspartyl protease family protein; e-value: 1.2e-77; % identity: 58.36
Query:  RRHTCLYQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI
        R + CLYQVSYGDGSFT G+ AT+T+TF    KI  VALGCGHDNEGLF GAAGLLGLG G LS  +Q        FSYCLVDR  S + SS+ F    +
Subjt:  RRHTCLYQVSYGDGSFTTGDFATETLTF-RGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAI

Query:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRG-PEFSLFDTCYDLSGQ
               PL+RN K++TFYYV L G SVGG +V  +  ++F +D +G+GGVI+D GT+VTRL   AY +LRDAF     +LK+G    SLFDTCYD S  
Subjt:  SRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRG-PEFSLFDTCYDLSGQ

Query:  SAVKVPTVVLHFRGA-DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        S VKVPTV  HF G   + LPA NYLIPVDDSG+FCFAFA T S LSIIGN+QQQG R+ YDL+ + IG
Subjt:  SAVKVPTVVLHFRGA-DMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

AT3G20015.1 Eukaryotic aspartyl protease family protein; e-value: 1.7e-71; % identity: 51.71
Query:  CLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAISRLAR
        C Y+V YGDGS+T G  A ETLTF    +  VA+GCGH N G+F+GAAGLLG+G G +SF  Q   +    F YCLV R   S   S+VFG  A+   A 
Subjt:  CLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSRPSSMVFGDAAISRLAR

Query:  FTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSAVKVP
        + PL+RNP+  +FYYV L G+ VGGVR+  +   +F L   G+GGV++D+GT+VTRL   AY A RD F++  A+L R    S+FDTCYDLSG  +V+VP
Subjt:  FTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEFSLFDTCYDLSGQSAVKVP

Query:  TVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        TV  +F  G  ++LPA N+L+PVDDSG++CFAFA + +GLSIIGNIQQ+G +V +D A   +G
Subjt:  TVVLHF-RGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG

AT3G61820.1 Eukaryotic aspartyl protease family protein; e-value: 1.2e-114; % identity: 72.14
Query:  LEDCPQWLTTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDR----SAS
        L+D  + +T R  TCLYQVSYGDGSFT GDF+TETLTF G ++  V LGCGHDNEGLFVGAAGLLGLGRG LSFPSQT  R+N KFSYCLVDR    S+S
Subjt:  LEDCPQWLTTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDR----SAS

Query:  SRPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEF
          PS++VFG+AA+ + + FTPL+ NPKL+TFYY++L+GISVGG RV GVS S FKLD  GNGGVIIDSGTSVTRLT+PAY ALRDAFR GA  LKR P +
Subjt:  SRPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLKRGPEF

Query:  SLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
        SLFDTC+DLSG + VKVPTVV HF G ++SLPA+NYLIPV+  G FCFAFAGTM  LSIIGNIQQQGFRV YDL GSR+G
Subjt:  SLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIG
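
In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below shows one way the % identity column could be recomputed from such a row. The function name `midline_stats` is hypothetical, the example strings are a toy case rather than data from this record, and the code assumes the leading label column (`Query:`, `Subjt:`) has already been stripped.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Summarise a BLAST-style midline over one aligned region.

    midline        -- the middle row of an alignment block, label column removed
    aligned_length -- number of alignment columns (query row length, gaps included)
    """
    # Trailing blanks are often lost in extraction, so pad back to full width.
    padded = midline.ljust(aligned_length)
    identical = sum(ch.isalpha() for ch in padded)   # letters = identities
    positive = identical + padded.count("+")         # identities + conservative
    return {
        "identical": identical,
        "positive": positive,
        "length": aligned_length,
        "percent_identity": round(100 * identical / aligned_length, 2),
    }


# Toy example (not taken from this record): 10 aligned columns.
query = "ACDEFGHIKL"
midline = "ACD+F HIK"
print(midline_stats(midline, len(query)))
```

For a hit split across several blocks, the counts from each block would be summed before the percentage is taken.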


Sequences
CDS sequence
ATGTTCGTATGGAAGGCCTATCAGAAGTGTTTGCCCACAAACTTTGGTTTATGGAGGCGGGGGTTAGATGTGTCGCCTATGTGCATCATGTGCAATTCAAAAATGGAGAC
CATTGATCATGCTTTATGTGGATGTAAACGTGCCAAACGAATTTGTGATGCAGTTTTTACAAGAGTGGATAATGGGATCCAAATTGCAGACAATTTTTCTGATCGTATAA
TATGTGGACTCCAACGATGTAATTGGATACATGATTATTGGGAGGAAACTCGAGTGAATGGGCAGAGTATTCCATTACGTGTTCAGTTTCAGGATCGAGCATCACAACCA
CAGGAGGACATTGTTCGGCTTCATACGGATGTAGCAATTGATCCTCGACGGGGCGGAGCTGGATACGAAGCAGTTATCACCAATTTGGATGGTATAATTTGTGGAGCTTT
GGTGTTTAAGGACCCTACGCATCTTTCTCCTCTGGCAGCAGAAGTTAATGCAATTATTCATGGTATACGGCTCCTAAAGCGTATGAATATTTCTTCTGCTTGTGTTCTTT
TTGACTCCCTCTCAGCGATAAAAATGATCCAAGGGGAATTGGAATTAACAACTGATGTGCATCATTGGATTACCCAAATCCAAAGGATGATACCTTCTTTTCAGATTATA
TCATTCAGTCATATTTCTAGGGAAGGGAATATGAGAGCTGATTATCTAGCTAAGGATGTTTTAGCTAATTGTCGATCTATGCTTTGGCTAGAGGATTGTCCACAGTGGCT
TACCACCCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGACGGCTCCTTCACCACCGGCGACTTCGCCACCGAAACGCTCACGTTTCGCGGCAATAAAATCGCCA
AAGTCGCCCTCGGCTGCGGCCACGACAACGAAGGCCTCTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGGCCGTGGCCGCTTGTCTTTCCCTTCCCAAACTGGAATCCGG
TTCAATCACAAATTCTCCTATTGCCTCGTCGACCGGTCCGCTTCCTCCAGACCGTCGTCCATGGTTTTCGGCGATGCAGCGATTTCCCGGCTCGCCCGGTTCACTCCTCT
GATTAGAAACCCAAAACTGGAAACGTTTTATTATGTCGAACTCATCGGAATCAGCGTCGGCGGAGTCCGCGTCCGCGGCGTCTCCGCTTCACTCTTCAAGCTCGATCCGG
CCGGCAACGGCGGCGTCATCATCGATTCAGGTACGTCGGTAACCCGGTTGACCCGACCCGCTTACACGGCTCTTCGCGACGCGTTCAGGGCCGGAGCGGCCCATTTGAAA
AGGGGTCCAGAGTTTTCGCTGTTCGATACTTGTTACGACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGACGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCC
GGCGACGAATTATTTGATTCCGGTGGACGACAGTGGGAGCTTTTGCTTTGCGTTTGCGGGTACAATGTCCGGTTTGTCGATTATTGGGAATATTCAACAGCAGGGGTTCC
GGGTTGTGTACGATTTGGCGGGTTCTCGGATCGGAAATAGAGCTGGAAGATGGACCCCAGAGGCGAAACGGGCCAATGGGTCGGGCCAAGACCGCAGGGGTCGGGCTACC
CTGCTTAGCCTCGGCCATGGGCCGAGGCCGAGCTTGTCCGGCTCCGTTCGGTCCCTGCTGCCTCTGGCCGCCCCGGTTCCACCTGATTCGTCCCGAAACGCCTCCGAATT
TCTAAAAACCCTAGGAGCATGA
mRNA sequence
ATGTTCGTATGGAAGGCCTATCAGAAGTGTTTGCCCACAAACTTTGGTTTATGGAGGCGGGGGTTAGATGTGTCGCCTATGTGCATCATGTGCAATTCAAAAATGGAGAC
CATTGATCATGCTTTATGTGGATGTAAACGTGCCAAACGAATTTGTGATGCAGTTTTTACAAGAGTGGATAATGGGATCCAAATTGCAGACAATTTTTCTGATCGTATAA
TATGTGGACTCCAACGATGTAATTGGATACATGATTATTGGGAGGAAACTCGAGTGAATGGGCAGAGTATTCCATTACGTGTTCAGTTTCAGGATCGAGCATCACAACCA
CAGGAGGACATTGTTCGGCTTCATACGGATGTAGCAATTGATCCTCGACGGGGCGGAGCTGGATACGAAGCAGTTATCACCAATTTGGATGGTATAATTTGTGGAGCTTT
GGTGTTTAAGGACCCTACGCATCTTTCTCCTCTGGCAGCAGAAGTTAATGCAATTATTCATGGTATACGGCTCCTAAAGCGTATGAATATTTCTTCTGCTTGTGTTCTTT
TTGACTCCCTCTCAGCGATAAAAATGATCCAAGGGGAATTGGAATTAACAACTGATGTGCATCATTGGATTACCCAAATCCAAAGGATGATACCTTCTTTTCAGATTATA
TCATTCAGTCATATTTCTAGGGAAGGGAATATGAGAGCTGATTATCTAGCTAAGGATGTTTTAGCTAATTGTCGATCTATGCTTTGGCTAGAGGATTGTCCACAGTGGCT
TACCACCCGCCGCCACACCTGCCTCTACCAAGTCTCCTACGGCGACGGCTCCTTCACCACCGGCGACTTCGCCACCGAAACGCTCACGTTTCGCGGCAATAAAATCGCCA
AAGTCGCCCTCGGCTGCGGCCACGACAACGAAGGCCTCTTCGTCGGCGCCGCCGGTTTGTTGGGCCTCGGCCGTGGCCGCTTGTCTTTCCCTTCCCAAACTGGAATCCGG
TTCAATCACAAATTCTCCTATTGCCTCGTCGACCGGTCCGCTTCCTCCAGACCGTCGTCCATGGTTTTCGGCGATGCAGCGATTTCCCGGCTCGCCCGGTTCACTCCTCT
GATTAGAAACCCAAAACTGGAAACGTTTTATTATGTCGAACTCATCGGAATCAGCGTCGGCGGAGTCCGCGTCCGCGGCGTCTCCGCTTCACTCTTCAAGCTCGATCCGG
CCGGCAACGGCGGCGTCATCATCGATTCAGGTACGTCGGTAACCCGGTTGACCCGACCCGCTTACACGGCTCTTCGCGACGCGTTCAGGGCCGGAGCGGCCCATTTGAAA
AGGGGTCCAGAGTTTTCGCTGTTCGATACTTGTTACGACTTGTCGGGTCAGTCCGCCGTGAAGGTTCCGACGGTGGTGCTGCATTTCCGGGGAGCCGACATGTCGTTGCC
GGCGACGAATTATTTGATTCCGGTGGACGACAGTGGGAGCTTTTGCTTTGCGTTTGCGGGTACAATGTCCGGTTTGTCGATTATTGGGAATATTCAACAGCAGGGGTTCC
GGGTTGTGTACGATTTGGCGGGTTCTCGGATCGGAAATAGAGCTGGAAGATGGACCCCAGAGGCGAAACGGGCCAATGGGTCGGGCCAAGACCGCAGGGGTCGGGCTACC
CTGCTTAGCCTCGGCCATGGGCCGAGGCCGAGCTTGTCCGGCTCCGTTCGGTCCCTGCTGCCTCTGGCCGCCCCGGTTCCACCTGATTCGTCCCGAAACGCCTCCGAATT
TCTAAAAACCCTAGGAGCATGA
Protein sequence
MFVWKAYQKCLPTNFGLWRRGLDVSPMCIMCNSKMETIDHALCGCKRAKRICDAVFTRVDNGIQIADNFSDRIICGLQRCNWIHDYWEETRVNGQSIPLRVQFQDRASQP
QEDIVRLHTDVAIDPRRGGAGYEAVITNLDGIICGALVFKDPTHLSPLAAEVNAIIHGIRLLKRMNISSACVLFDSLSAIKMIQGELELTTDVHHWITQIQRMIPSFQII
SFSHISREGNMRADYLAKDVLANCRSMLWLEDCPQWLTTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGIR
FNHKFSYCLVDRSASSRPSSMVFGDAAISRLARFTPLIRNPKLETFYYVELIGISVGGVRVRGVSASLFKLDPAGNGGVIIDSGTSVTRLTRPAYTALRDAFRAGAAHLK
RGPEFSLFDTCYDLSGQSAVKVPTVVLHFRGADMSLPATNYLIPVDDSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRIGNRAGRWTPEAKRANGSGQDRRGRAT
LLSLGHGPRPSLSGSVRSLLPLAAPVPPDSSRNASEFLKTLGA
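
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch, assuming the standard nuclear genetic code applies, the Python below translates a CDS string and checks the first 30 nucleotides of the CDS against the first ten residues of the protein. `translate` and the table names are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()      # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS and the last 9 nt (which end in the TGA stop codon).
print(translate("ATGTTCGTATGGAAGGCCTATCAGAAGTGT"))
print(translate("GGAGCATGA"))
```

Joining the CDS lines into one string and passing it to `translate` allows the full product to be compared with the protein sequence listed above.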