; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G005400 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G005400
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionBasic 7S globulin-like
Genome locationCG_Chr05:5259201..5267385
RNA-Seq ExpressionClCG05G005400
SyntenyClCG05G005400
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR033868 - Xylanase inhibitor I-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4378871.1 hypothetical protein G4B88_008341 [Cannabis sativa]4.5e-23253.69Show/hide
Query:  STSLSFFSSILFLLFSISIA-ATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCG
        S  L  F+S+L L+ S +I+   SFRPK+L+LPVTK  +  QYI HI QRTP V +K+ +DLGG+F+WVDC++G+ SST KP  CRSAQC+LA KS +C 
Subjt:  STSLSFFSSILFLLFSISIA-ATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCG

Query:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF
           +P     + + CG FP N     STSG+++ D++ + STNG  P + VSVP FLF C  TF+L+GL+ G  G+AG GR  I+LPS F+AAFSF +K 
Subjt:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF

Query:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL
        AVCLS +    GV+F GNGPY   P +D++KSLTYTPL +NPV+  G    G+ SSEYFIGVKSI ++ K VP+NT+LL ID NG+GGTKIS+V PYT L
Subjt:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL

Query:  ESSIYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGG---------VEPRT
        E+SIYN+VV  F   L++  IP+VAAVAPF  C+++K    ++ G  VP ID +LQ++KV+WR+ GANSMV ++++VLCLGFVDGG             T
Subjt:  ESSIYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGG---------VEPRT

Query:  AIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARC
        AIVIG  QIED LL+FDLASSRL  +           TSFRPKSL+LPV K  S LQY T I Q TP + +K+ +DLGG+F+W  CD+   SSTYK  RC
Subjt:  AIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARC

Query:  RSAQCHLASKSSSCGECFSPPRPGCNN-NTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGI
        RSAQC+L    S            C+  N C   P NT+I L +S   + D +S+ S++G  P R VS+P FLF C  T LL+GLA GV G+A  G   +
Subjt:  RSAQCHLASKSSSCGECFSPPRPGCNN-NTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGI

Query:  SLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVS-TAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDK
        SLP QF+AAFSF RKFA+CLS ST + GV+  G+GPY   P ID++K LTYTPL  NP++   G F   + SAEYFIGVKSI I  KTVP+NTTLL I+K
Subjt:  SLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVS-TAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDK

Query:  NGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDG
         G GGTKISTV PY+ LE+SI+ AVV  F   L  V RVAAVAPF  C+++KSF STR GPG+P I+ +LQ++KV+WRI GANSMV VN +V+CLGFVDG
Subjt:  NGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDG

Query:  G---------VEPRTAIVIGAHQIEDNLLEFDLASSRLG----FSSTLLGRMT
        G            RTAIVIG HQIEDNL++ DLA+SRL     F   L+ R+T
Subjt:  G---------VEPRTAIVIGAHQIEDNLLEFDLASSRLG----FSSTLLGRMT

KAG5391261.1 hypothetical protein IGI04_032802 [Brassica rapa subsp. trilocularis]2.2e-24755.75Show/hide
Query:  MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTK-HPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTT
        MA+S  +S    ++  L S S A  SFRPK+L+LPVTK   +LQY T I+QRTPLVP  +  DLGG+ +WVDCDRGYVSSTY+  RCRSA C  A  S  
Subjt:  MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTK-HPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTT

Query:  CGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNR
        C +CFSPPRPGCNNNTC   P NT+   +TSGE+A+DVVS+ STNG  P R V +P+ +FVCG+TFLL GLA G  GMAG GR  I LPSQFAAAFSFNR
Subjt:  CGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNR

Query:  KFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPY
        KFAVCL   T   GV F GNGPY FLP + ++   T TPL INPVSTA    +G+KSSEYFIGV +I I+ KTVP+N TLLKID   G GGTKISTVNPY
Subjt:  KFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPY

Query:  TVLESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGA
        TVLE+SIYN     +  E +   I RVA+VAPFG C+S+++   TRLG  VP I L+L +  V+WRIFGANSMVS++D+V+CLGFVDGGV  RT++VIG 
Subjt:  TVLESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGA

Query:  HQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTK-HPSLQYITHIHQ-------RTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPAR
        +Q+EDNL+EFDLA+  L   S L    +    S RP +L+LP+TK   +LQY T I+         TPLV   L  DLGG+ +WV+ D  Y S++++ + 
Subjt:  HQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTK-HPSLQYITHIHQ-------RTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPAR

Query:  CRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGR-TG
        C S  C+ A  S  C  C+S  RPGCN + C L   N + G+  +G++AS+ V + STNG  P R V +P+ +F  G+  LL+GLA G  GMAG GR + 
Subjt:  CRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGR-TG

Query:  ISLPSQFAAAFSFNRKFAVCLSGSTRSPG----VIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAG-----VFSAGDKSAEYFIGVKSIVINSKTVPL
        I LPSQFA AF FNRKFAVCL  S+ S G      F GNGPY FLP I +++ LT TPL +NPV  A       F  G+KS EYFIGV +I I  KTVP+
Subjt:  ISLPSQFAAAFSFNRKFAVCLSGSTRSPG----VIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAG-----VFSAGDKSAEYFIGVKSIVINSKTVPL

Query:  NTTLLKID-KNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQV
        NT LLKI+   G GGTK+STVNPYTVLE+SI+ AV   F  E +   + RV +V PFG C ++K+   TRLG  +P I L+LQ+K V+WRIFGANSMV V
Subjt:  NTTLLKID-KNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQV

Query:  NDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        + +V+CLGFVDGGV   T+++IG  Q+EDNL+E DLA + LGFSSTLLGR T CANFNF S
Subjt:  NDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

KAG5574128.1 hypothetical protein H5410_063894 [Solanum commersonii]5.1e-25255.2Show/hide
Query:  MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTT
        M   T+  FF   L +LFS+S A T  RP++ LLPVTK  S  Q++T I+QRTPLVP+KLT+DLG +F+WVDC++GYVSS+YKP  C S  C  +     
Subjt:  MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTT

Query:  CGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNR
           C  PP PGCNNNTC   P N  I  ST GE+A DVVS+ ST+G  P + +S    +F C    LL+GLA GV G+ G G   +  P+Q A AFS  R
Subjt:  CGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNR

Query:  KFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYT
        KFA+CL+ ST S GVIF G+ PY FLP +D++K L YTPL  NPVST+G    G+ S +YFIGV SI IN   VP+NTTLL I ++G GGTKISTV+PYT
Subjt:  KFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYT

Query:  VLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI--WRIFGANSMVSINDEVLCLGFVDGGV--EPRTAIVIG
         LE+SIYNA+ K F   L+K+PRV  VAPF VCY+  S  STR+G GVP I+L+L NK     W I+GANSMV++N++VLCLGFVDGG+  EP T+IVIG
Subjt:  VLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVI--WRIFGANSMVSINDEVLCLGFVDGGV--EPRTAIVIG

Query:  AHQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCH
        AHQIEDNLL+FD+A+ +LGF+S+LL  M + +T                 Q++T I+QRTPLVP+KLT+DLG +F+WVDC++GYVSS+YKP  C S  C 
Subjt:  AHQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCH

Query:  LASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFA
         +   +    C  PP PGCNNNTC   P N  I  ST GE+A DVVS+ ST+G  P + +S    +F C    LL+GLA GV G+ GFG   +  P+Q A
Subjt:  LASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFA

Query:  AAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKI
         AFS  RKFA+CL+ ST S GVIF G+ PY FLP +D +K L YTPL  N VST+G + AG+ S +YFIGV SI I+S  VP+NTTLL I K+G GGTKI
Subjt:  AAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKI

Query:  STVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVI----WRIFGANSMVQVNDNVLCLGFVDG-GVE
        STV PYT LE+SIYNA+ K F   L  VP V  VAPF VCYN  S GS ++GPG+P I+L+L+NK       W I+GANSMV VN++V+C GFVDG   E
Subjt:  STVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVI----WRIFGANSMVQVNDNVLCLGFVDG-GVE

Query:  PRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        P T+IVIG HQIEDNLL+FD+A+ RLGF+S+LL    TCANFNFT+
Subjt:  PRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

OWM87145.1 hypothetical protein CDL15_Pgr010177 [Punica granatum]3.0e-24455.46Show/hide
Query:  STSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKH-PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDR-GYVSSTYKPARCRSAQCHLASKSTTCG
        S ++ F  S+LF+    S+A  SFRPK+L+LPV K   + QY+T IHQRTP VPL L VDLGG+  WV CD+  Y+SST++P+RC SAQC L + S  CG
Subjt:  STSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKH-PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDR-GYVSSTYKPARCRSAQCHLASKSTTCG

Query:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF
         C   P PGCNNNTC L    T+  +  +G++ SDV+S+ ST+G  P R VSVP FLFVCG    L+ LA GVTGM   G T I+LP+Q ++AF F RKF
Subjt:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF

Query:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL
        A+CL  S  S GVIF G+GPYN LPN+D ++SLTYTPL  NP+ST+  +  G+ S EYFIGVKSI +  K + LN+TLL I  NG GGTK+STVN YTVL
Subjt:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL

Query:  ESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQI
        E+SI+ A  K F +E +   I RV AV PF VC+SSK+   TR+G  VP I L LQNK  IW I G+NSMV IN++VLCLG VDGG   +TAIVIG +Q+
Subjt:  ESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQI

Query:  EDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHP-SLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCD-RGYVSSTYKPARCRSAQCHLA
        E+NL++FDLA SR+GFS+TLLGR T CA       L LPV K   + QY+T IHQR P VPL L VDLGG   WV CD   Y+SST++P+RC SAQ    
Subjt:  EDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHP-SLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCD-RGYVSSTYKPARCRSAQCHLA

Query:  SKSSSCGECF-SPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAA
        ++      C  SP  PGCNNNTC L   NT+  +   G + SD +S+ ST+G  P   V VP FLFVCGS   L  LA G TGMA  GRT I+LP   ++
Subjt:  SKSSSCGECF-SPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAA

Query:  AFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKIS
        AFSF+RKFA+CL  S  S GVIF G+GPYN LPN+D ++SLTYTPL  NPV T      G+ S EYFIGVKSI + +K + LN+ LL I  NG+GGT++S
Subjt:  AFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKIS

Query:  TVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTA
        TVN YTVLE+SI+ A  K F +E +   + RV AV PF VC++SK+   TR+GP +P I L LQNK  IW I G+NSMVQ+N++ LCLG VDGG   +TA
Subjt:  TVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTA

Query:  IVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        ++I  +Q+E+NL++FDLA SR+GFS+TLLGR TTC NFNFTS
Subjt:  IVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

RHN80197.1 putative nepenthesin [Medicago truncatula]5.5e-24647.19Show/hide
Query:  ILFLLFSISIAATSFRPKSLLLPVTKHPSL--QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPG
        +LF   S + +  SFRPK+L+LPVTK  +   QY   I+QRTPLVPL + VDLGG F+WVDC+  Y+SSTY+PARCRSAQC LA K   CG CFS P+PG
Subjt:  ILFLLFSISIAATSFRPKSLLLPVTKHPSL--QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPG

Query:  CNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTR
        CNNNTC + PGN++   + SGE+A D++S+ S+NGF P + V V  FLF C  TFLL+GLA G +GMAG GR  ++LPSQ A+AFSF +KFA+CLS    
Subjt:  CNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTR

Query:  SPGVIFSGNGPYNFLPNVDL-TKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPYTVLESSIYNA
        S GV+  G+GPY FLPNV   +KSLTYTPL INP STA  + + + S+EYFIGVK+I I+ K V L+T+LL ID  NG GGTKISTV+PYTVLE+SIY A
Subjt:  SPGVIFSGNGPYNFLPNVDL-TKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPYTVLESSIYNA

Query:  VVKTF--TTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEF
        V   F   +    I RV +VAPF  CY+  + + TRLGA VP+I+L LQN  VIWRIFGANSMV+INDEVLCLGFV GG     +IVIG +Q+E+NLL+F
Subjt:  VVKTF--TTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEF

Query:  DLASSRLGFSSTLLGRMTNCA-------------------------------------------------------------------------------
        DLA+S+LGFSS L GR T C                                                                                
Subjt:  DLASSRLGFSSTLLGRMTNCA-------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------TSFRPKSLLLPVTKHPSL--QYITHIHQR
                                                                                SFRPK+L+LP+TK  +   QY   I+QR
Subjt:  -----------------------------------------------------------------------TSFRPKSLLLPVTKHPSL--QYITHIHQR

Query:  TPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRA
        TPLVPL + VDLGGQF+WVDC+  Y+SSTY+PARCRSAQC LA+ S  CG+CFS P+PGCNNNTCG+ P N+I   +TSGE+A DV+S+ S+NGF P + 
Subjt:  TPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRA

Query:  VSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDL-TKSLTYTPLFINPVSTAGVF
        V V  FLF C  TFLL GLA G +GMAG GRT I+LPSQ A+AFSF RKFA+CLS    S GV+  G+GPY FLPN+   + SLTYTPL INPVSTA  F
Subjt:  VSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDL-TKSLTYTPLFINPVSTAGVF

Query:  SAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTF--TTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLP
        S G  SAEYFIGVK+I I+ K V LNT+LL ID NG GGTKISTV+PYTVLE+SIY AV   F   +    + RV +VAPF  CY + +   TRLG  +P
Subjt:  SAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTF--TTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLP

Query:  SIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        +I+L LQN+ V+WRIFGANSMV +ND VLCLGFV+GG   RT+IVIG +Q+E+NLL+FDLA+S+LGFSS L GR TTC+NFNFTS
Subjt:  SIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

TrEMBL top hitse value%identityAlignment
A0A0A0L515 Xyloglucan-specific endoglucanase inhibitor protein3.9e-21388.52Show/hide
Query:  MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTC
        MA+STS SFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYIT IHQRTPLVP+KLTVDLGGQFMWVDCDRGYVSS+YKPARCRSAQC LASKS+ C
Subjt:  MAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTC

Query:  GECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRK
        G+CFSPPRPGCNNNTC LFPGNTII LSTSGEVASDVVSVSSTNGF PTRAVS+PNFLFVCGSTFLL+GLA GVTGMAGFGR GISLPSQFAAAFSFNRK
Subjt:  GECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRK

Query:  FAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV
        FAVCLSGST SPGVIFSGNGPY+FLPN+DLT S TYTPLFINPVSTAGVSSAG+KS+EYFIGV SIV+NSK VPLNTTLLKID NGNGGTKISTVNP+TV
Subjt:  FAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTV

Query:  LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIE
        LESSIY A+VK FTTE+SK+PRV AVAPF VCYSSKSF STRLGAGVP+IDL+LQNKKVIW +FGANSMV +NDEVLCLGFVDGGV+ RTAIVIGAHQIE
Subjt:  LESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIE

Query:  DNLLEFDLASSRLGFSSTLLGRMTNCA
        D LLEFDLA+SRLGF+ TLLGRMT CA
Subjt:  DNLLEFDLASSRLGFSSTLLGRMTNCA

A0A0D9ZYY9 Uncharacterized protein6.4e-22449.29Show/hide
Query:  ILFLLFSISIAATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGC
        +LF+  S   AA+   P ++LLPV+K  +  QY+T   QRTP  PLK  +DL G  +WVDC+ GYVSS+Y    C S QC LA  +     C   P P C
Subjt:  ILFLLFSISIAATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPGC

Query:  NNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRS
         N+TCG FP NT+  +STSG V +DV+S+ +T    P    + P FLF CG+TFL +GLA G TGM    R   + P+Q AA F F+RKFA+CL  +  +
Subjt:  NNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRS

Query:  PGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESSIYNAVVK
         GV+  G+ PY F P VDL+KSL YTPL +NPVST GVS+ GD+S+EYF+G+  I +N + VPLNTTLL I++ G GGTK+STV PYTVLE+SI+ AV  
Subjt:  PGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESSIYNAVVK

Query:  TFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASS
         F  E S IPRV AVAPF +CY     + TR+G  VP+++L+ Q++   W +FGANSMV+     LCLG VDGGV P T++VIG H +EDNLLEFDL  S
Subjt:  TFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASS

Query:  RLGFSST---------------LLGRMTNCATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCH
        RLG  +                +L      A   RP++++LPV+K  +  QY+T   QRTP VP+K  +DL G  +WVDCD GYVS +Y    C S  C 
Subjt:  RLGFSST---------------LLGRMTNCATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCH

Query:  LASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFA
        L         CF  P P C N TC  FP NT+  ++  G + +DV+S+ +T    P    +VP FLF CG TFL +GLA G TGM    R   + P+Q A
Subjt:  LASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFA

Query:  AAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTK-SLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTK
          F F+R+FA+CL  ++ + GV+  G+ PY F P +DL+K SL YTPL +N V TAG ++ G+ S EY IG+  I +N + VPLN TLL IDKNG GGT 
Subjt:  AAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTK-SLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTK

Query:  ISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGG--VEPR
        +ST +PYTVLE+SIY AV+  F  E + +PRV AVAPF +CY+ +  GSTR GP +P+I+L+LQ + V W ++GANSMV      LCLG VDGG  + P 
Subjt:  ISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGG--VEPR

Query:  TAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANF
        +++VIG H +EDNLLEFDL  SRLGFSS L  R TTC NF
Subjt:  TAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANF

A0A218XPJ7 Uncharacterized protein1.5e-24455.46Show/hide
Query:  STSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKH-PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDR-GYVSSTYKPARCRSAQCHLASKSTTCG
        S ++ F  S+LF+    S+A  SFRPK+L+LPV K   + QY+T IHQRTP VPL L VDLGG+  WV CD+  Y+SST++P+RC SAQC L + S  CG
Subjt:  STSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKH-PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDR-GYVSSTYKPARCRSAQCHLASKSTTCG

Query:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF
         C   P PGCNNNTC L    T+  +  +G++ SDV+S+ ST+G  P R VSVP FLFVCG    L+ LA GVTGM   G T I+LP+Q ++AF F RKF
Subjt:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF

Query:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL
        A+CL  S  S GVIF G+GPYN LPN+D ++SLTYTPL  NP+ST+  +  G+ S EYFIGVKSI +  K + LN+TLL I  NG GGTK+STVN YTVL
Subjt:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL

Query:  ESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQI
        E+SI+ A  K F +E +   I RV AV PF VC+SSK+   TR+G  VP I L LQNK  IW I G+NSMV IN++VLCLG VDGG   +TAIVIG +Q+
Subjt:  ESSIYNAVVKTFTTELS--KIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQI

Query:  EDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHP-SLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCD-RGYVSSTYKPARCRSAQCHLA
        E+NL++FDLA SR+GFS+TLLGR T CA       L LPV K   + QY+T IHQR P VPL L VDLGG   WV CD   Y+SST++P+RC SAQ    
Subjt:  EDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHP-SLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCD-RGYVSSTYKPARCRSAQCHLA

Query:  SKSSSCGECF-SPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAA
        ++      C  SP  PGCNNNTC L   NT+  +   G + SD +S+ ST+G  P   V VP FLFVCGS   L  LA G TGMA  GRT I+LP   ++
Subjt:  SKSSSCGECF-SPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAA

Query:  AFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKIS
        AFSF+RKFA+CL  S  S GVIF G+GPYN LPN+D ++SLTYTPL  NPV T      G+ S EYFIGVKSI + +K + LN+ LL I  NG+GGT++S
Subjt:  AFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKIS

Query:  TVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTA
        TVN YTVLE+SI+ A  K F +E +   + RV AV PF VC++SK+   TR+GP +P I L LQNK  IW I G+NSMVQ+N++ LCLG VDGG   +TA
Subjt:  TVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTA

Query:  IVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        ++I  +Q+E+NL++FDLA SR+GFS+TLLGR TTC NFNFTS
Subjt:  IVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

A0A396K0C6 Putative nepenthesin2.7e-24647.19Show/hide
Query:  ILFLLFSISIAATSFRPKSLLLPVTKHPSL--QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPG
        +LF   S + +  SFRPK+L+LPVTK  +   QY   I+QRTPLVPL + VDLGG F+WVDC+  Y+SSTY+PARCRSAQC LA K   CG CFS P+PG
Subjt:  ILFLLFSISIAATSFRPKSLLLPVTKHPSL--QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSPPRPG

Query:  CNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTR
        CNNNTC + PGN++   + SGE+A D++S+ S+NGF P + V V  FLF C  TFLL+GLA G +GMAG GR  ++LPSQ A+AFSF +KFA+CLS    
Subjt:  CNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTR

Query:  SPGVIFSGNGPYNFLPNVDL-TKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPYTVLESSIYNA
        S GV+  G+GPY FLPNV   +KSLTYTPL INP STA  + + + S+EYFIGVK+I I+ K V L+T+LL ID  NG GGTKISTV+PYTVLE+SIY A
Subjt:  SPGVIFSGNGPYNFLPNVDL-TKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKID-ENGNGGTKISTVNPYTVLESSIYNA

Query:  VVKTF--TTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEF
        V   F   +    I RV +VAPF  CY+  + + TRLGA VP+I+L LQN  VIWRIFGANSMV+INDEVLCLGFV GG     +IVIG +Q+E+NLL+F
Subjt:  VVKTF--TTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEF

Query:  DLASSRLGFSSTLLGRMTNCA-------------------------------------------------------------------------------
        DLA+S+LGFSS L GR T C                                                                                
Subjt:  DLASSRLGFSSTLLGRMTNCA-------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------TSFRPKSLLLPVTKHPSL--QYITHIHQR
                                                                                SFRPK+L+LP+TK  +   QY   I+QR
Subjt:  -----------------------------------------------------------------------TSFRPKSLLLPVTKHPSL--QYITHIHQR

Query:  TPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRA
        TPLVPL + VDLGGQF+WVDC+  Y+SSTY+PARCRSAQC LA+ S  CG+CFS P+PGCNNNTCG+ P N+I   +TSGE+A DV+S+ S+NGF P + 
Subjt:  TPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRA

Query:  VSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDL-TKSLTYTPLFINPVSTAGVF
        V V  FLF C  TFLL GLA G +GMAG GRT I+LPSQ A+AFSF RKFA+CLS    S GV+  G+GPY FLPN+   + SLTYTPL INPVSTA  F
Subjt:  VSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDL-TKSLTYTPLFINPVSTAGVF

Query:  SAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTF--TTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLP
        S G  SAEYFIGVK+I I+ K V LNT+LL ID NG GGTKISTV+PYTVLE+SIY AV   F   +    + RV +VAPF  CY + +   TRLG  +P
Subjt:  SAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTF--TTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLP

Query:  SIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        +I+L LQN+ V+WRIFGANSMV +ND VLCLGFV+GG   RT+IVIG +Q+E+NLL+FDLA+S+LGFSS L GR TTC+NFNFTS
Subjt:  SIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

A0A7J6G7A7 Uncharacterized protein2.2e-23253.69Show/hide
Query:  STSLSFFSSILFLLFSISIA-ATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCG
        S  L  F+S+L L+ S +I+   SFRPK+L+LPVTK  +  QYI HI QRTP V +K+ +DLGG+F+WVDC++G+ SST KP  CRSAQC+LA KS +C 
Subjt:  STSLSFFSSILFLLFSISIA-ATSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCG

Query:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF
           +P     + + CG FP N     STSG+++ D++ + STNG  P + VSVP FLF C  TF+L+GL+ G  G+AG GR  I+LPS F+AAFSF +K 
Subjt:  ECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKF

Query:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL
        AVCLS +    GV+F GNGPY   P +D++KSLTYTPL +NPV+  G    G+ SSEYFIGVKSI ++ K VP+NT+LL ID NG+GGTKIS+V PYT L
Subjt:  AVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVL

Query:  ESSIYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGG---------VEPRT
        E+SIYN+VV  F   L++  IP+VAAVAPF  C+++K    ++ G  VP ID +LQ++KV+WR+ GANSMV ++++VLCLGFVDGG             T
Subjt:  ESSIYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGG---------VEPRT

Query:  AIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARC
        AIVIG  QIED LL+FDLASSRL  +           TSFRPKSL+LPV K  S LQY T I Q TP + +K+ +DLGG+F+W  CD+   SSTYK  RC
Subjt:  AIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARC

Query:  RSAQCHLASKSSSCGECFSPPRPGCNN-NTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGI
        RSAQC+L    S            C+  N C   P NT+I L +S   + D +S+ S++G  P R VS+P FLF C  T LL+GLA GV G+A  G   +
Subjt:  RSAQCHLASKSSSCGECFSPPRPGCNN-NTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGI

Query:  SLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVS-TAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDK
        SLP QF+AAFSF RKFA+CLS ST + GV+  G+GPY   P ID++K LTYTPL  NP++   G F   + SAEYFIGVKSI I  KTVP+NTTLL I+K
Subjt:  SLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVS-TAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDK

Query:  NGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDG
         G GGTKISTV PY+ LE+SI+ AVV  F   L  V RVAAVAPF  C+++KSF STR GPG+P I+ +LQ++KV+WRI GANSMV VN +V+CLGFVDG
Subjt:  NGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDG

Query:  G---------VEPRTAIVIGAHQIEDNLLEFDLASSRLG----FSSTLLGRMT
        G            RTAIVIG HQIEDNL++ DLA+SRL     F   L+ R+T
Subjt:  G---------VEPRTAIVIGAHQIEDNLLEFDLASSRLG----FSSTLLGRMT

SwissProt top hitse value%identityAlignment
I1JNS6 Probable aspartic proteinase GIP12.2e-7239.75Show/hide
Query:  LLLPVTKHPSLQ-YITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNT--CGLFPGNTIIGLS
        L+ P++K  + Q Y   +  +TPL P KL + LG    WV CD  Y SS+     C +  C+            S P   C+NN+  C LFP N +   +
Subjt:  LLLPVTKHPSLQ-YITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNT--CGLFPGNTIIGLS

Query:  TSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNI
               D +++ + +    +  V + +F+F C +  LL GLA    G+A  GR+  SLP+Q + + +  R F +CL  S+ + G     +   +FL + 
Subjt:  TSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNI

Query:  DLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAV
         +   LTYT L +NPV+   V      S EYFI + SI IN K + +N+++L +D+ G GGTKIST  PYTVLE+SIY   V+ F  E S   +    AV
Subjt:  DLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELS--KVPRVAAV

Query:  APFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQV---NDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRM
         PFGVCY +     TR+GP +P++DL++ ++ V WRIFG NSMV+V     +V CLGFVDGG   RT IVIG HQ+EDNL++FDL S+R GF+STLL + 
Subjt:  APFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQV---NDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRM

Query:  TTCAN
          C+N
Subjt:  TTCAN

P0DO21 Probable aspartic proteinase GIP28.4e-16570.07Show/hide
Query:  TSFRPKSLLLPVTKHP-SLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNT
        TSFRPK L+LP+TK   +LQY+T I QRTPLVP+ LT+DLGGQF+WVDCD+GYVSSTY+PARCRSAQC LA   S CG+CFSPP+PGCNNNTCGL P NT
Subjt:  TSFRPKSLLLPVTKHP-SLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNT

Query:  IIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYN
        I   +TSGE+ASD V V S+NG  P R VS  +FLFVCGSTFLL+GLA GV GMAG GRT ISLPSQF+A FSF RKFAVCLS ST S GV+  G+GPY 
Subjt:  IIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYN

Query:  FLPNIDLTKS-LTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPR
        FLPN +   +  +YTPLFINPVSTA  FS+ + S+EYFIGVKSI IN K VP+NTTLL ID  G GGTKISTVNPYT+LE+SIYNAV   F  EL  + R
Subjt:  FLPNIDLTKS-LTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPR

Query:  VAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGR
        VA+VAPF  C++S++  STR+GP +PSIDL+LQN+ V WRIFGANSMVQV++NVLCLGFVDGGV PRT+IV+G + IEDNLL+FDLA SRLGF+S++L R
Subjt:  VAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGR

Query:  MTTCANFNFTS
         TTCANFNFTS
Subjt:  MTTCANFNFTS

P82952 Gamma conglutin 11.3e-9345.37Show/hide
Query:  LGFSSTLLGRMTNCATS------FRPKSLLLPVTKHPSLQ-YITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGE
        L F S  L  +T+ AT        RP  L+L V K  +   ++  IH+RTPLV     +DL G+F+ V+C+  Y SSTYK   C S+QC  A+ S +C  
Subjt:  LGFSSTLLGRMTNCATS------FRPKSLLLPVTKHPSLQ-YITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGE

Query:  C-FSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCG-STFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRK
        C  S  RPGC+ N CGL   N +   S  GE+A DV+ + ST G  P   V+ P+FLF C  S  L  GL   V G+AG G + ISLP Q A+ F F  K
Subjt:  C-FSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCG-STFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRK

Query:  FAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTV
        FAVCL+ S    G +F G GPY   P ID+++ LTY P  I             +  EY+I V+S  IN+  +P       I K G GG  IST  PYT 
Subjt:  FAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTV

Query:  LESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKK-VIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQI
        L++ I+ A+ + F  +L  VP V  VAPFG C+++    ++++GP +PSIDL+L NKK ++WRIFGAN+M+Q    V+CL FVDGG+ P+  IVIG  Q+
Subjt:  LESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKK-VIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQI

Query:  EDNLLEFDLASSRLGFSSTLLGRMTTCANFNF
        EDNLL+FDL +SRLGFSS+LL R T CANFNF
Subjt:  EDNLLEFDLASSRLGFSSTLLGRMTTCANFNF

Q42369 Gamma conglutin 14.9e-7240.65Show/hide
Query:  TSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGEC--FSPPRPGCNNNTCGLFPG
        TS +P  L+LPV +  S   +  +IH+RTPL+ + L +DL G+ +WV C + Y SSTY+   C S QC  A+ +  C  C   +  RPGC+NNTCGL   
Subjt:  TSFRPKSLLLPVTKHPSL-QYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGEC--FSPPRPGCNNNTCGLFPG

Query:  NTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLD-GLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSG--
        N +   S  GE+A DV+++ ST+G      V VP FLF C  +FL   GL   V G  G G+  ISL +Q  + F   R+F+VCLS  + S G I  G  
Subjt:  NTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLD-GLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSG--

Query:  NGP--YNFLPN-IDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGV------KSIVINSKTVPLNTTLLKIDKNGN-GGTKISTVNPYTVLESSIYNA
        N P   N++ N +D+   L YTPL I+            K  EYFI V      K +VI +K   ++ +      +G  GG  I+T +PYTVL  SI+  
Subjt:  NGP--YNFLPN-IDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGV------KSIVINSKTVPLNTTLLKIDKNGN-GGTKISTVNPYTVLESSIYNA

Query:  VVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDL
          + F   + K  +V AV PFG+CY+S+     ++  G PS+DLIL     +WRI   N MVQ  D V CLGFVDGGV  R  I +GAH +E+NL+ FDL
Subjt:  VVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDL

Query:  ASSRLGF-SSTLLGRMTTCANFNFTSNP
          SR+GF S++L     TC+N    +NP
Subjt:  ASSRLGF-SSTLLGRMTTCANFNFTSNP

Q9FSH9 Gamma conglutin 11.3e-7239.3Show/hide
Query:  TSFRPKSLLLPVTKHPSLQ-YITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGEC--FSPPRPGCNNNTCGLFPG
        +S +P  L+LP+ +  S + +  +I +RTPL+ + + +DL G+ +WV C + Y SSTY+   C S QC  A+ +  C  C   +  RPGC+NNTCGL   
Subjt:  TSFRPKSLLLPVTKHPSLQ-YITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGEC--FSPPRPGCNNNTCGLFPG

Query:  NTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLD-GLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSG--
        N +   S  GE+A DV+++ ST+G      V +P FLF C  TFL   GL   V G  G G   ISLP+Q  + F   R+F +CLS    S G I  G  
Subjt:  NTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLD-GLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSG--

Query:  NGP--YNFLPN-IDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTV---------PLNTTLLKIDKNGNGGTKISTVNPYTVLESSIY
        N P   N++ N +D+   + YTPL I+            K  EYFI V +I +N   V         P +++    + +  GG  I+T NPYTVL  SI+
Subjt:  NGP--YNFLPN-IDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTV---------PLNTTLLKIDKNGNGGTKISTVNPYTVLESSIY

Query:  NAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEF
            + F   + K  +V AV PFG+CY++K     ++  G+PS+DLI+    V+WRI G N MVQ  D V CLGFVDGGV  R  I +G HQ+E+NL+ F
Subjt:  NAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEF

Query:  DLASSRLGF-SSTLLGRMTTCANFNFTSNP
        DLA SR+GF +++L     +C+N    +NP
Subjt:  DLASSRLGF-SSTLLGRMTTCANFNFTSNP

Arabidopsis top hitse value%identityAlignment
AT1G03220.1 Eukaryotic aspartyl protease family protein1.4e-14663.07Show/hide
Query:  LASSRLGFSSTLL---GRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSC
        +A S + FS  LL      ++  T FRPK+LLLPVTK  S LQY T I+QRTPLVP  +  DLGG+ +WVDCD+GYVSSTY+  RC SA C  A  S+SC
Subjt:  LASSRLGFSSTLL---GRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSC

Query:  GECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRK
        G CFSPPRPGC+NNTCG  P NT+ G +TSGE A DVVS+ STNG  P R V +PN +F CG+TFLL GLA G  GMAG GR  I LPSQFAAAFSF+RK
Subjt:  GECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRK

Query:  FAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKID-KNGNGGTKISTVNPYT
        FAVCL   T   GV F GNGPY FLP I ++ SL  TPL INPVSTA  FS G+KS+EYFIGV +I I  KTVP+N TLLKI+   G GGTKIS+VNPYT
Subjt:  FAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKID-KNGNGGTKISTVNPYT

Query:  VLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAH
        VLESSIYNA    F  + +   + RVA+V PFG C+++K+ G TRLG  +P I+L+L +K V+WRIFGANSMV V+D+V+CLGFVDGGV  RT++VIG  
Subjt:  VLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAH

Query:  QIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
        Q+EDNL+EFDLAS++ GFSSTLLGR T CANFNFTS
Subjt:  QIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

AT1G03230.1 Eukaryotic aspartyl protease family protein1.8e-14161.33Show/hide
Query:  LASSRLGFSSTLL----GRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSS
        +ASSR+   S LL       ++   SFRPK+LLLPVTK PS LQY T I+QRTPLVP  +  DLGG+  WVDCD+GYVS+TY+  RC SA C  A  S +
Subjt:  LASSRLGFSSTLL----GRMTNCATSFRPKSLLLPVTKHPS-LQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSS

Query:  CGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNR
        CG CFSPPRPGC+NNTCG FP N+I G +TSGE A DVVS+ STNG  P R V +PN +F CGST LL GLA G  GMAG GR  I LP QFAAAFSFNR
Subjt:  CGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNR

Query:  KFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKID-KNGNGGTKISTVNPY
        KFAVCL   T   GV F GNGPY FLP I +++ L  TPL INP +T   FS G+KS EYFIGV +I I  KT+P++ TLLKI+   G GGTKIS+VNPY
Subjt:  KFAVCLSGSTRSPGVIFSGNGPYNFLPNIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKID-KNGNGGTKISTVNPY

Query:  TVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGA
        TVLESSIY A    F  + +   + RVA+V PFG C+++K+ G TRLG  +P I L+L +K V+WRIFGANSMV V+D+V+CLGFVDGGV P  ++VIG 
Subjt:  TVLESSIYNAVVKTFTTELS--KVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGA

Query:  HQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS
         Q+EDNL+EFDLAS++ GFSSTLLGR T CANFNFTS
Subjt:  HQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTS

AT5G19100.1 Eukaryotic aspartyl protease family protein9.3e-5036.8Show/hide
Query:  KLTVDLGGQF-MWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPN
        K  +DL G   +  +C     S+TY P RC S +C  A+           P   C NN   +    T+   S +  +  D V +  T   + TR   + +
Subjt:  KLTVDLGGQF-MWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPN

Query:  FLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSP---GVIFSGNGPYNFLP-NIDLTKSLTYTPLFINPVSTAGVFSA
         L    +    DG         G   T +S+PSQ  + +    K A+CL  + RS    G ++ G G Y +LP + D++K    TPL  N          
Subjt:  FLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSP---GVIFSGNGPYNFLP-NIDLTKSLTYTPLFINPVSTAGVFSA

Query:  GDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDL
          KS EY I VKSI I +KTVP+            G TKIST+ PYTV ++S+Y A++  FT  + K+ +  AV PFG C+ S        G G+P IDL
Subjt:  GDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSKSFGSTRLGPGLPSIDL

Query:  ILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCA
        +L      WRI+G+NS+V+VN NV+CLGFVDGGV+P+  IVIG  Q+EDNL+EFDL +S+  FSS+LL   T+C+
Subjt:  ILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCA

AT5G19110.1 Eukaryotic aspartyl protease family protein2.4e-4533.1Show/hide
Query:  SSILFLLFSISI-AATSFRPKS-LLLPVTKH--PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSP
        SS+  LL  +SI AA + +  S  LLP+TKH   +L Y T         P+ L +DLG    W+DC +    S+ +   C+S+ C             S 
Subjt:  SSILFLLFSISI-AATSFRPKS-LLLPVTKH--PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSTTCGECFSP

Query:  PRPGCNNNTCGLFPGNTIIGLS--TSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVC
        P  GC   +C L+     +G +   +G V  D  S+ +T+G      VSV +F F C     L GL   V G+        S   Q  +AF+   KF++C
Subjt:  PRPGCNNNTCGLFPGNTIIGLS--TSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVC

Query:  LSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESS
        L  S       F   G + F+P  + +          NP+        G  S +Y I VKSI +    + LN  LL       GG K+STV  YTVL++ 
Subjt:  LSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYTVLESS

Query:  IYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSF-SSTRLGAGVPSIDLILQNK--KVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQI
        IYNA+ ++FT +     I +V +VAPF  C+ S++   +   G  VP I++ L  +  +V W  +GAN++V + + V+CL F+DGG  P+  +VIG HQ+
Subjt:  IYNAVVKTFTTELSK--IPRVAAVAPFGVCYSSKSF-SSTRLGAGVPSIDLILQNK--KVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQI

Query:  EDNLLEFDLASSRLGFSSTLLGRMTNCAT
        +D++LEFD + + L FS +LL   T+C+T
Subjt:  EDNLLEFDLASSRLGFSSTLLGRMTNCAT

AT5G19120.1 Eukaryotic aspartyl protease family protein2.8e-4635.06Show/hide
Query:  LLLPVTKH-PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTS
        ++ PV K  P+ QY+  I       P+KL VDL G  +W DC   +VSS+       S+ C  A   +      S  R    N  C L   N   G++  
Subjt:  LLLPVTKH-PSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTCGLFPGNTIIGLSTS

Query:  GEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDL
        GE+ SDV+SV S        +    + LF C   +LL GLA G  G+ G GR  ISLPSQ AA  +  R+  V LS      GV+ + +    F   +  
Subjt:  GEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNIDL

Query:  TKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFG
        ++SL YTPL                S  Y I VKSI +N +         K+   G    ++STV PYT+LESSIY    + +     +   V  VAPFG
Subjt:  TKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFG

Query:  VCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGF
        +C+ S            P++DL LQ++ V WRI G N MV V   V C G VDGG      IV+G  Q+E  +L+FDL +S +GF
Subjt:  VCYNSKSFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGGTCAAACTCCTTCCTCCGCCTATATAACCCATTGTCCCTTCCATTCCACACCAACGAGCACCTCCAAACATCCTCTTCAGGCTTCAATGGCGGCCTCAACTTC
CCTCTCTTTCTTCTCCTCTATTCTCTTCCTCCTCTTCTCCATTTCCATTGCTGCGACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCCGTCACCAAACACCCATCTCTCC
AATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCGCTCAAGCTCACGGTTGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCC
ACCTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCACTACCTGCGGCGAGTGCTTTTCGCCCCCGCGCCCCGGTTGCAACAATAACACGTG
CGGCCTCTTCCCCGGCAATACCATTATCGGCCTCTCCACTAGCGGAGAAGTCGCTTCCGATGTCGTCTCCGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGT
CGGTGCCCAATTTCCTCTTCGTCTGTGGCTCGACGTTTCTCCTCGACGGTCTTGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCA
CAATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTTTCCGGGAATGGCCCTTACAATTTCTTACC
CAACGTCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTCCTCCGCCGGAGATAAATCCTCCGAGTATTTCATCGGCG
TTAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAATACCACTCTCCTCAAAATCGACGAAAACGGAAACGGCGGTACAAAAATCAGCACAGTGAATCCATACACC
GTACTCGAATCGTCGATCTACAACGCGGTGGTGAAAACGTTCACGACGGAGCTGTCGAAAATTCCGAGAGTGGCAGCGGTGGCGCCGTTTGGGGTTTGTTATAGTTCAAA
GAGCTTTTCGAGTACTCGATTGGGGGCGGGCGTGCCGTCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATATTCGGTGCGAACTCAATGGTGTCTATAA
ACGACGAAGTTTTGTGCTTGGGATTTGTTGACGGTGGAGTTGAACCAAGAACGGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCC
TCTTCCAGACTTGGATTTAGCTCCACTCTTCTAGGTCGGATGACTAATTGTGCTACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCTGTCACCAAACACCCATCTCTCCA
ATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCACTCAAGCTCACGGTCGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCCA
CTTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCAGTTCCTGCGGCGAGTGCTTTTCGCCGCCGCGCCCTGGCTGCAACAACAACACGTGC
GGCCTCTTCCCCGGAAACACCATTATCGGCCTCTCCACCAGCGGAGAAGTCGCTTCCGACGTCGTCTCTGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGTC
GGTGCCCAATTTCCTCTTCGTCTGCGGGTCAACGTTTCTCCTCGACGGCCTCGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCAC
AATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTCTCCGGGAACGGCCCTTACAATTTCTTACCC
AACATCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTTCTCCGCCGGAGATAAATCCGCCGAGTATTTCATCGGCGT
TAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAACACCACTCTCCTCAAAATCGACAAAAACGGAAACGGCGGCACCAAAATCAGCACCGTGAATCCATACACCG
TACTCGAATCGTCGATTTACAACGCGGTGGTGAAAACGTTCACGACGGAGTTGTCGAAAGTTCCGAGAGTGGCAGCAGTTGCGCCGTTTGGGGTTTGTTATAATTCAAAG
AGCTTCGGGAGTACCCGATTGGGGCCGGGGTTGCCATCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATCTTCGGAGCGAATTCGATGGTACAGGTAAA
CGATAACGTTTTGTGCTTGGGATTTGTTGACGGAGGAGTGGAACCAAGAACAGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCCT
CTTCCAGACTTGGATTTAGCTCCACTTTATTGGGTCGAATGACTACTTGTGCTAATTTCAACTTTACTTCTAATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGGTCAAACTCCTTCCTCCGCCTATATAACCCATTGTCCCTTCCATTCCACACCAACGAGCACCTCCAAACATCCTCTTCAGGCTTCAATGGCGGCCTCAACTTC
CCTCTCTTTCTTCTCCTCTATTCTCTTCCTCCTCTTCTCCATTTCCATTGCTGCGACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCCGTCACCAAACACCCATCTCTCC
AATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCGCTCAAGCTCACGGTTGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCC
ACCTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCACTACCTGCGGCGAGTGCTTTTCGCCCCCGCGCCCCGGTTGCAACAATAACACGTG
CGGCCTCTTCCCCGGCAATACCATTATCGGCCTCTCCACTAGCGGAGAAGTCGCTTCCGATGTCGTCTCCGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGT
CGGTGCCCAATTTCCTCTTCGTCTGTGGCTCGACGTTTCTCCTCGACGGTCTTGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCA
CAATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTTTCCGGGAATGGCCCTTACAATTTCTTACC
CAACGTCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTCCTCCGCCGGAGATAAATCCTCCGAGTATTTCATCGGCG
TTAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAATACCACTCTCCTCAAAATCGACGAAAACGGAAACGGCGGTACAAAAATCAGCACAGTGAATCCATACACC
GTACTCGAATCGTCGATCTACAACGCGGTGGTGAAAACGTTCACGACGGAGCTGTCGAAAATTCCGAGAGTGGCAGCGGTGGCGCCGTTTGGGGTTTGTTATAGTTCAAA
GAGCTTTTCGAGTACTCGATTGGGGGCGGGCGTGCCGTCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATATTCGGTGCGAACTCAATGGTGTCTATAA
ACGACGAAGTTTTGTGCTTGGGATTTGTTGACGGTGGAGTTGAACCAAGAACGGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCC
TCTTCCAGACTTGGATTTAGCTCCACTCTTCTAGGTCGGATGACTAATTGTGCTACCTCCTTCCGCCCCAAATCCCTCCTTCTCCCTGTCACCAAACACCCATCTCTCCA
ATACATCACCCACATCCACCAACGAACCCCTCTCGTTCCACTCAAGCTCACGGTCGACCTCGGCGGTCAGTTCATGTGGGTCGACTGTGACCGTGGCTACGTTTCTTCCA
CTTACAAGCCTGCCCGTTGCCGCTCCGCCCAATGCCACCTCGCCTCTAAATCCAGTTCCTGCGGCGAGTGCTTTTCGCCGCCGCGCCCTGGCTGCAACAACAACACGTGC
GGCCTCTTCCCCGGAAACACCATTATCGGCCTCTCCACCAGCGGAGAAGTCGCTTCCGACGTCGTCTCTGTTTCCTCCACCAACGGCTTTATTCCAACCAGAGCCGTGTC
GGTGCCCAATTTCCTCTTCGTCTGCGGGTCAACGTTTCTCCTCGACGGCCTCGCCGGCGGCGTAACTGGAATGGCCGGATTCGGAAGAACCGGAATCTCTCTGCCTTCAC
AATTCGCAGCCGCGTTCAGCTTTAACCGGAAATTCGCCGTTTGCTTGAGCGGCTCCACCAGATCCCCTGGCGTCATCTTCTCCGGGAACGGCCCTTACAATTTCTTACCC
AACATCGACTTAACAAAATCCCTCACTTATACCCCACTCTTCATCAACCCCGTCAGCACCGCCGGCGTCTTCTCCGCCGGAGATAAATCCGCCGAGTATTTCATCGGCGT
TAAATCCATCGTCATCAACTCCAAAACCGTCCCACTCAACACCACTCTCCTCAAAATCGACAAAAACGGAAACGGCGGCACCAAAATCAGCACCGTGAATCCATACACCG
TACTCGAATCGTCGATTTACAACGCGGTGGTGAAAACGTTCACGACGGAGTTGTCGAAAGTTCCGAGAGTGGCAGCAGTTGCGCCGTTTGGGGTTTGTTATAATTCAAAG
AGCTTCGGGAGTACCCGATTGGGGCCGGGGTTGCCATCGATCGATTTGATTTTGCAGAACAAGAAAGTGATTTGGAGAATCTTCGGAGCGAATTCGATGGTACAGGTAAA
CGATAACGTTTTGTGCTTGGGATTTGTTGACGGAGGAGTGGAACCAAGAACAGCGATTGTTATTGGGGCCCACCAAATTGAGGATAATTTGCTTGAATTTGATTTGGCCT
CTTCCAGACTTGGATTTAGCTCCACTTTATTGGGTCGAATGACTACTTGTGCTAATTTCAACTTTACTTCTAATCCTTGA
Protein sequenceShow/hide protein sequence
MTGQTPSSAYITHCPFHSTPTSTSKHPLQASMAASTSLSFFSSILFLLFSISIAATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSS
TYKPARCRSAQCHLASKSTTCGECFSPPRPGCNNNTCGLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPS
QFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLPNVDLTKSLTYTPLFINPVSTAGVSSAGDKSSEYFIGVKSIVINSKTVPLNTTLLKIDENGNGGTKISTVNPYT
VLESSIYNAVVKTFTTELSKIPRVAAVAPFGVCYSSKSFSSTRLGAGVPSIDLILQNKKVIWRIFGANSMVSINDEVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLA
SSRLGFSSTLLGRMTNCATSFRPKSLLLPVTKHPSLQYITHIHQRTPLVPLKLTVDLGGQFMWVDCDRGYVSSTYKPARCRSAQCHLASKSSSCGECFSPPRPGCNNNTC
GLFPGNTIIGLSTSGEVASDVVSVSSTNGFIPTRAVSVPNFLFVCGSTFLLDGLAGGVTGMAGFGRTGISLPSQFAAAFSFNRKFAVCLSGSTRSPGVIFSGNGPYNFLP
NIDLTKSLTYTPLFINPVSTAGVFSAGDKSAEYFIGVKSIVINSKTVPLNTTLLKIDKNGNGGTKISTVNPYTVLESSIYNAVVKTFTTELSKVPRVAAVAPFGVCYNSK
SFGSTRLGPGLPSIDLILQNKKVIWRIFGANSMVQVNDNVLCLGFVDGGVEPRTAIVIGAHQIEDNLLEFDLASSRLGFSSTLLGRMTTCANFNFTSNP