; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012497 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012497
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold1:4895870..4899806
RNA-Seq ExpressionSpg012497
SyntenySpg012497
Gene Ontology termsGO:0006032 - chitin catabolic process (biological process)
GO:0016998 - cell wall macromolecule catabolic process (biological process)
GO:0004568 - chitinase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008061 - chitin binding (molecular function)
InterPro domainsIPR000726 - Glycoside hydrolase, family 19, catalytic
IPR001002 - Chitin-binding, type 1
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR018371 - Chitin-binding, type 1, conserved site
IPR023346 - Lysozyme-like domain superfamily
IPR036861 - Endochitinase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7129869.1 hypothetical protein RHSIM_Rhsim10G0203000 [Rhododendron simsii]4.8e-24456.15Show/hide
Query:  TPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQD
        TP+  G GV S+I+++++ QMLK+ ND  C   GFYTY AFI AA +F GFGTTG+  T+KRE+AAF  QTSHETTGGW+TAP+GPY WGYCF+ E+   
Subjt:  TPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQD

Query:  V--YCTPS--QQWP---CAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGR
        +  YC  S   Q+P   CA+G+KYYGRGP Q+T+NYNYGPAG AI +NLL NPDLVAT+  ISFKTA+WFWM PQ  KPSCH+VITG W PS  + AAG 
Subjt:  V--YCTPS--QQWP---CAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGR

Query:  LPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKPGPFTFPPKSQNSAKTLTS-------------------------------KSL
        +PGYGVITNIINGG+ECG G   +  DRIGFYK      G+            P   Q+ +  L++                               + L
Subjt:  LPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKPGPFTFPPKSQNSAKTLTS-------------------------------KSL

Query:  SIDSIGAVQG---IGKMG---------------KLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKK--QPPKHSRKFLSAQSSGHPEK-PKLPTL
         +D +   +G   +  +G               +L       L    V   P P  A  LL      +  +   P     +F +     +P K PK PT+
Subjt:  SIDSIGAVQG---IGKMG---------------KLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKK--QPPKHSRKFLSAQSSGHPEK-PKLPTL

Query:  -FKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPD
         F S +L+DAK L++S I++T  P D++F+NSLLQSYAS+++L+D+ S LRHM+K  P+FSPD ST+ +LL+ S    D +LA+VR+ L  M   GF PD
Subjt:  -FKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPD

Query:  KVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAE
        K TTD+AVRSLC+ G ++ AVELV+E   K+SPPDS TYN L+K LC++R +S+V GFI EMR S   KPDLV+YTILIDNVCN KNLREATRL+ +L+E
Subjt:  KVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAE

Query:  EGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLL
        +GFKPDC+VYN +MKGYCML +GSE + VYKKMKEEG+EPD+VT+NTLIFGLSKSGRVKEARKFL +MAEMG  PD VTYTSLMNGMCREG+A GAL LL
Subjt:  EGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLL

Query:  EEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKARE
        EEMEAKGCSPNSCTYNTLLHGL K++LL + IELYG+MK+GD+KLET SY TF+R LCR+GR+AEAYEVFDYAVESKSL DV AYSTLES LK LKKA+E
Subjt:  EEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKARE

Query:  QGQAI
        QG A+
Subjt:  QGQAI

KAF7130787.1 hypothetical protein RHSIM_Rhsim10G0201800 [Rhododendron simsii]6.7e-24659.63Show/hide
Query:  TPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQD
        T +  G GV S+I+++ + QMLK+ ND  C  NGFYTY AFI AA++F GFGTTGD  T+KRE+AAFF QTSHETTGG+STAPDGPYAWGYC + E+   
Subjt:  TPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQD

Query:  V-YCTPS-QQWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYG
          YC    QQ+PC   +KY+GRGP Q+T+NYNYGPAG AI  +LL  PDLVAT+ IISFKTA+WFWMTPQ  KPSCH+VITG+W PS+   AAG +PGYG
Subjt:  V-YCTPS-QQWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYG

Query:  VITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKPGPFTFPPKSQNSAKTLTSKSLSIDSIGAVQGIGKMGKLSPSFRSALSTTIVNKPPHP
        VITNIINGG+ECG G +    DRIGFYK  + T    P         FP  S                            L  + ++ + + I N  P P
Subjt:  VITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKPGPFTFPPKSQNSAKTLTSKSLSIDSIGAVQGIGKMGKLSPSFRSALSTTIVNKPPHP

Query:  PAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTL-FKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQSYASIATLNDSISFLRHMSKVQ
                  P    KK PP+ S               KLPT+ F S +L+DAK L++S I++T  P D++F+NSLLQSYA++++L+D+ S LRHM+K  
Subjt:  PAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTL-FKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQSYASIATLNDSISFLRHMSKVQ

Query:  PSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYG
        P+FSPD ST+ +LL+ S    D +L +VR+ LN M   GF PDK TTD+AVRSLC+ G ++ AVELV+E   K+SPPDS TYN L+K LC++R +S+V G
Subjt:  PSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYG

Query:  FIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKEEGLEPDVVTFNTLIFGLSKSGR
        FI EMR S   KPDLV+YTILIDNVCN KNLREATRL+ +L+E+GFKPDC+VYN +MKGYCML +GSE + VYKKMKEEG+EPD+VT+NTLIFGLSKSGR
Subjt:  FIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKEEGLEPDVVTFNTLIFGLSKSGR

Query:  VKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRAL
         KEARKFL +MAEMG  PD VTYTSLMNGMCREG+A GAL LLEEMEAKGCSPNSCTYNTLLHGL K++LL + IELYG+MK+GD+KLET SY TF+R L
Subjt:  VKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRAL

Query:  CRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        CR+GR+AEAYEVFDYAVESKSL DV AYSTLES LK LKKA+EQG A+
Subjt:  CRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

KAG7026711.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.4e-24091.28Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ
        MGKLSPSFRSA+ST IVNKPPHPPAAP LLS E RSLSKK+PPKHSR+  SAQSS H EKPKLPTLFKSA+LADAKKLYSSFI+TTKAPLD+RFYNSLLQ
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ

Query:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD
        SYASIA+LNDSISFLRHMSKVQPSFSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT GFNPD+ TTDIAVRSLCS GLIDEAVELVRE SQKHSPPD
Subjt:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVY FI+EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLA+EGFKPDCFVYN IMKGYCMLGRG EAIGVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCR+GDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE ASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDV AYSTLE TLK+LKKA E+G  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

XP_022926982.1 pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata]6.4e-24191.7Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ
        MGKLSPSFRSA+ST IVNKPPHPPAAP LLS E RSLSKK+PPKHSR+  SAQSS H EKPKLPTLFKSA+LADAKKLYSSFI+TTKAPLD+RFYNSLLQ
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ

Query:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD
        SYASIA+LNDSISFLRHMSKVQPSFSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT GFNPDK TTDIAVRSLCS GLIDEAVELVRE SQKHSPPD
Subjt:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVY FI+EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLA+EGFKPDCFVYN IMKGYCMLGRG EAIGVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE ASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDV AYSTLE TLK+LKKA E+G  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

XP_023003882.1 pentatricopeptide repeat-containing protein At2g17670-like [Cucurbita maxima]4.5e-24291.49Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ
        MGKLSPSFRSA+ST IVNKPP+PPAAP LLS E RSLSKK+PPKHSR+  SAQSSGH EKPKLPTLFKSA+LA+AKKLYSSFI+TTKAPLD+RFYNSLL 
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ

Query:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD
        SY SIA+LNDSISFLRHMSKVQP+FSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT+GFNPDK T DIAVRSLCS GLIDEAVELVRE SQKHSPPD
Subjt:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVYGFI EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPD VTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSC+YNTLLHGLSKSRLLD+GIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKSGDMKLE ASYATFVRALCRSGRIAEAYEVFDYAVES+SLTDVAAYSTLE+TLK+LKKAREQG  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

TrEMBL top hitse value%identityAlignment
A0A0A0KHF8 Uncharacterized protein5.3e-22586.41Show/hide
Query:  MGKLSPSFRSALST-TIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLL
        MGKLSPSFRS LS+ T++NKPPH PAAPPL        SKK  PK SRK  S QSSGHPEKPKLPT+FKSA+LADAKKLYSSF+S TKAP ++R +NSLL
Subjt:  MGKLSPSFRSALST-TIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLL

Query:  QSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPP
        QSYASIATLNDSISFLRHMSKVQPSFSPD+STFHILLSTSGN  DS+LASV+QILNFMVTNGFNPDKVT D+AVRSLCSVGL+DEAVELV+ELSQKH+PP
Subjt:  QSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMK
        D +TYNHLVKQLCKSRALSTVY FI EMRSSCGAKPDLVTYTILIDNVCN  NLREA RLVS+L +EGFKPDCFVYNTIMKGYCM+GRG+EAIGVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIEL
        E GLEPDVVTFNTLIFGLSKSGRVKEAR FLDIMAEMGHFPDAVTYTSLMNGMCREG+ALGALSLL+EMEAKGC+PNSCTYNTLLHGLSKSRLLDRGIEL
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIEL

Query:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        YGLMKS DMKLETASY+TFVRALCRSGRIAEAYEVFDYAVESKSLTDV+AY +LESTLKSLK AREQ  AI
Subjt:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

A0A1S4DT84 pentatricopeptide repeat-containing protein At2g176706.3e-21884.47Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ
        MGKLSPSFRS LST++++KP   PAAPPL        SKK  PK SRK  S QSSGHP KPKLPT+FKSA+LADAKKLYSSFIST+KAP ++R +NSLLQ
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ

Query:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD
        SYASIATLNDSISFLRHMSKVQPSFSPD+STFHILLSTS N  DSSLASVR+ILNFMVTNGFNPDKVT D+AVRSLCSVGL+DEAVELV+ELSQKH+P D
Subjt:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
         +TYNHLVKQLCKSRALSTVY FI EMRSSCGAKPDLVTYTILIDNVCN  NLREA RLVS+L +EGFKPDCFVYNTIMKGYCM+GRG+EAIGVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
         GLEPD+VTFNTLIFGLSKSGRVKEA  FLDIMAEMGHFPD VTYTSLMNGMCREG+ALGALSLL+EMEAKGC+PNS TYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE+ASY+TFVRALCRSGRIAEAYEVFDYAVESKSLTDV+AY +LESTLKSLK AREQ  A+
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

A0A6J1BW58 pentatricopeptide repeat-containing protein At2g176709.4e-23891.3Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHP-PAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLL
        MGKLSPSFRSA+STTI+NKPP P  AAPPLLS EPRSLSKK PPK SRK  SAQSSG  EKPK  TLFKS++LADAKKLYSSFI+TT+APLD+RFYNSLL
Subjt:  MGKLSPSFRSALSTTIVNKPPHP-PAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLL

Query:  QSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPP
        QSYASIATLNDSISFLR+MSKVQPSFSPDRSTFH+LLSTSGNG+ SSLASV+QILNFMV+NGFNPDKVTTDIAVRSLCS GLIDEAVELV+ELS+K SPP
Subjt:  QSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMK
        DSFTYNHLVKQLCKSRALSTVYGFIDEMRSS G+KPDLVTYTILIDNVCNGKNLREATRL+SVL EEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIEL
        EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLE MEAKGCSPNSCTYNTLLHGL+KSRLLDRGIEL
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIEL

Query:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        YGLMKSG MKLETASYAT VRALCRS RIAEAYEVFDYAVESKS+TDVAAYSTLESTLKSLKK REQG AI
Subjt:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

A0A6J1EGP9 pentatricopeptide repeat-containing protein At2g176703.1e-24191.7Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ
        MGKLSPSFRSA+ST IVNKPPHPPAAP LLS E RSLSKK+PPKHSR+  SAQSS H EKPKLPTLFKSA+LADAKKLYSSFI+TTKAPLD+RFYNSLLQ
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ

Query:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD
        SYASIA+LNDSISFLRHMSKVQPSFSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT GFNPDK TTDIAVRSLCS GLIDEAVELVRE SQKHSPPD
Subjt:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVY FI+EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLA+EGFKPDCFVYN IMKGYCMLGRG EAIGVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE ASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDV AYSTLE TLK+LKKA E+G  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

A0A6J1KT16 pentatricopeptide repeat-containing protein At2g17670-like2.2e-24291.49Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ
        MGKLSPSFRSA+ST IVNKPP+PPAAP LLS E RSLSKK+PPKHSR+  SAQSSGH EKPKLPTLFKSA+LA+AKKLYSSFI+TTKAPLD+RFYNSLL 
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQ

Query:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD
        SY SIA+LNDSISFLRHMSKVQP+FSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT+GFNPDK T DIAVRSLCS GLIDEAVELVRE SQKHSPPD
Subjt:  SYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVYGFI EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPD VTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSC+YNTLLHGLSKSRLLD+GIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKSGDMKLE ASYATFVRALCRSGRIAEAYEVFDYAVES+SLTDVAAYSTLE+TLK+LKKAREQG  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI

SwissProt top hitse value%identityAlignment
P36907 Endochitinase1.8e-12165.69Show/hide
Query:  MKAHTLLILAFAFVLGASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQCGGS------TPTPSGSGVGSIITESLYNQMLKYHNDPRCPSN
        +K    ++      LG+ AEQCG QAGGA+CPNGLCCS+FGFCGSTD YC  GCQSQC  S      TP+  G  VG ++  SL++QMLKY ND RC  +
Subjt:  MKAHTLLILAFAFVLGASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQCGGS------TPTPSGSGVGSIITESLYNQMLKYHNDPRCPSN

Query:  GFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYG
        GFYTY+AFI AA SF GFGTTGD  T+K+E+AAF  QTSHETTGGW TAPDGPYAWGYCF+ E+N Q+VYC+P + WPCA G+KYYGRGPIQLTHNYNYG
Subjt:  GFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYG

Query:  PAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTA
         AG AI  +L++NPDL++TN  +SFKTAIWFWMTPQ NKPS H+VITG+W PS+ D++AGR+PGYGVITNIINGG+ECGHG D RV DR+GFYK      
Subjt:  PAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTA

Query:  GLHPVG
        G+ P G
Subjt:  GLHPVG

P93680 Endochitinase9.7e-12371.48Show/hide
Query:  LLILAFAFVLG-ASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQCGGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFIT
        LL+L    + G A AEQCGRQAGGALCP GLCCSQFG+CGST DYC   CQSQCGG TP+P G GV S+I++S++NQMLK+ ND  C + GFYTYNAFI 
Subjt:  LLILAFAFVLG-ASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQCGGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFIT

Query:  AANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRER-NQDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNL
        AANSF GF + GD ATRKRE+AAF  QTSHETTGGW+TAPDGPYAWGYCF++E+ N   YC P+ QWPCA G+KYYGRGPIQ+++NYNYGPAG AI  +L
Subjt:  AANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRER-NQDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNL

Query:  LSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYK
        ++NPD VAT+ +ISFKTA+WFWMTPQ  KPSCHNVITG+W PS+ D AAGRLPGYGVITNIINGG+ECG G + +VADRIGFYK
Subjt:  LSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYK

Q09023 Endochitinase CH254.3e-12368.21Show/hide
Query:  MKAHTLLILAFAFVLGAS-AEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCK-TGCQSQCGGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYT
        MK+  LL L F+F+L  S AEQCGRQAGGALCPNGLCCS+FG+CG T+ YCK  GCQSQCGG+ P P+G  +  II+ S ++ MLK+ ND  CP+ GFYT
Subjt:  MKAHTLLILAFAFVLGAS-AEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCK-TGCQSQCGGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYT

Query:  YNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPAGN
        Y+AFI AA SFPGFGTTGD ATRK+E+AAFFGQTSHETTGGW+TAPDGPY+WGYCF +E+N    YC+PS +WPCA+G+ YYGRGP+QL+ NYNYG  G 
Subjt:  YNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPAGN

Query:  AISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHP
        AI ++LL+NPDLV+ + +I+FK AIWFWMTPQ  KPSCH VI GQWQPS  D AAGR+PGYGVITNIINGGLECG G DARVADRIGFY+      G++P
Subjt:  AISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHP

Query:  VG
         G
Subjt:  VG

Q42993 Chitinase 13.7e-12268.23Show/hide
Query:  MKAHTLLILAFAF-VLGASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQ----CGGSTPTP----SGSGVGSIITESLYNQMLKYHNDPRC
        M+A  ++++A AF V+    EQCG QAGGALCPN LCCSQ+G+CGST  YC +GCQSQ    CGG  PTP     GSGV SI++ SL++QML + ND  C
Subjt:  MKAHTLLILAFAF-VLGASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQ----CGGSTPTP----SGSGVGSIITESLYNQMLKYHNDPRC

Query:  PSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQDV---YCTPSQQWPCAAGQKYYGRGPIQLTH
        P+  FYTY+AF+ AAN+FP F TTGDAATRKRE+AAF  QTSHETTGGW+TAPDGPY+WGYCF  E N +V   YC  S QWPCAAG+KYYGRGPIQ+++
Subjt:  PSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQDV---YCTPSQQWPCAAGQKYYGRGPIQLTH

Query:  NYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYK
        NYNYGPAG AI +NLLSNPDLVA++A +SFKTA WFWMTPQ  KPSCH V+TGQW P+  D AAGR+PGYGV+TNIINGG+ECGHG D+RVADRIGFYK
Subjt:  NYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYK

Q84J71 Pentatricopeptide repeat-containing protein At2g176701.9e-15860Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSL---SKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNS
        MGK+  SFRS  +  +V K    P APP       ++   S K P          Q++  P +P L   FKS NL+DAK L++S  +T++ PLD++F+NS
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSL---SKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNS

Query:  LLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHS
        +LQSY SIA +ND++   +H+ K QP+F P RSTF ILLS +    DSS+++V ++LN MV NG  PD+VTTDIAVRSLC  G +DEA +L++EL++KHS
Subjt:  LLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHS

Query:  PPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKK
        PPD++TYN L+K LCK + L  VY F+DEMR     KPDLV++TILIDNVCN KNLREA  LVS L   GFKPDCF+YNTIMKG+C L +GSEA+GVYKK
Subjt:  PPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKK

Query:  MKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGI
        MKEEG+EPD +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMCR+G++LGALSLLEEMEA+GC+PN CTYNTLLHGL K+RL+D+G+
Subjt:  MKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGI

Query:  ELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQG
        ELY +MKS  +KLE+  YAT VR+L +SG++AEAYEVFDYAV+SKSL+D +AYSTLE+TLK LKKA+EQG
Subjt:  ELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQG

Arabidopsis top hitse value%identityAlignment
AT1G02360.1 Chitinase family protein6.1e-7251.23Show/hide
Query:  STPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-
        S+     + +  ++   LYN++  + ++  CP+NGFYTY +F+ A   FP FG+ G   T++ E+AAF  Q SHETTGGW+TAPDGPYAWG CF  E + 
Subjt:  STPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-

Query:  QDVYCTPSQ-QWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGY
        Q  YC  S  QWPC   + Y GRGPIQL+ NYNYGPAG A+  + L NP+ V+ N++I+F+TA+WFWMTPQ  KPSCH+V+ G+++P++ D AA R  G+
Subjt:  QDVYCTPSQ-QWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGY

Query:  GVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKP
        G+ TNIINGGLECG   D RV DRIGF+   Q   GL  V   P
Subjt:  GVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKP

AT2G17670.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-15960Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSL---SKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNS
        MGK+  SFRS  +  +V K    P APP       ++   S K P          Q++  P +P L   FKS NL+DAK L++S  +T++ PLD++F+NS
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSL---SKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNS

Query:  LLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHS
        +LQSY SIA +ND++   +H+ K QP+F P RSTF ILLS +    DSS+++V ++LN MV NG  PD+VTTDIAVRSLC  G +DEA +L++EL++KHS
Subjt:  LLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHS

Query:  PPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKK
        PPD++TYN L+K LCK + L  VY F+DEMR     KPDLV++TILIDNVCN KNLREA  LVS L   GFKPDCF+YNTIMKG+C L +GSEA+GVYKK
Subjt:  PPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKK

Query:  MKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGI
        MKEEG+EPD +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMCR+G++LGALSLLEEMEA+GC+PN CTYNTLLHGL K+RL+D+G+
Subjt:  MKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGI

Query:  ELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQG
        ELY +MKS  +KLE+  YAT VR+L +SG++AEAYEVFDYAV+SKSL+D +AYSTLE+TLK LKKA+EQG
Subjt:  ELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQG

AT2G17670.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.9e-11156.27Show/hide
Query:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSL---SKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNS
        MGK+  SFRS  +  +V K    P APP       ++   S K P          Q++  P +P L   FKS NL+DAK L++S  +T++ PLD++F+NS
Subjt:  MGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSL---SKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNS

Query:  LLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHS
        +LQSY SIA +ND++   +H+ K QP+F P RSTF ILLS +    DSS+++V ++LN MV NG  PD+VTTDIAVRSLC  G +DEA +L++EL++KHS
Subjt:  LLQSYASIATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHS

Query:  PPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKK
        PPD++TYN L+K LCK + L  VY F+DEMR     KPDLV++TILIDNVCN KNLREA  LVS L   GFKPDCF+YNTIMKG+C L +GSEA+GVYKK
Subjt:  PPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKK

Query:  MKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREG
        MKEEG+EPD +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMCR+G
Subjt:  MKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREG

AT3G12500.1 basic chitinase1.7e-12268.09Show/hide
Query:  MKAHTLLILAFAFVLG-ASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCK-TGCQSQC--GGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGF
        MK +  L L F+ +L  +SAEQCGRQAGGALCPNGLCCS+FG+CG+T+ YCK  GCQSQC  GG+ P P+G  +  II+ S ++ MLK+ ND  CP+ GF
Subjt:  MKAHTLLILAFAFVLG-ASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCK-TGCQSQC--GGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGF

Query:  YTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPA
        YTYNAFITAA SFPGFGTTGD ATRK+E+AAFFGQTSHETTGGW+TAPDGPY+WGYCF +E+N    YC PS  WPCA+G++YYGRGP+QL+ NYNYG  
Subjt:  YTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPA

Query:  GNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGL
        G AI  +LL+NPDLVA +A+I+FK AIWFWMT Q  KPSCH VI GQWQPS  D AAGRLPGYGVITNIINGGLECG G D RVADRIGFY+      G+
Subjt:  GNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGL

Query:  HPVG
        +P G
Subjt:  HPVG

AT4G01700.1 Chitinase family protein1.2e-7255.5Show/hide
Query:  SIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPS-QQ
        S++  +LY+Q+  + ++  CP+ GFY Y AF+ A  SFP FG+ G+  TR+RE+AAF  Q SHETTGGW+TAPDGPYAWG CF  E + Q  YC  S + 
Subjt:  SIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFPGFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERN-QDVYCTPS-QQ

Query:  WPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGL
        WPC +G+ Y GRGPIQL+ NYNYG AG A+  + L NP+LVA N++++FKTA+WFWMT Q  KPSCHNV+  +++P+  D AA R  GYG++TNIINGGL
Subjt:  WPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKTAIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGL

Query:  ECGHGPDARVADRIGFYK
        ECG   D RV DR+G+++
Subjt:  ECGHGPDARVADRIGFYK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCCCATACGCTTCTAATCTTAGCCTTTGCCTTTGTTTTGGGAGCCTCAGCCGAGCAATGTGGGCGGCAGGCCGGTGGCGCTCTGTGCCCCAACGGCCTCTGCTG
CAGCCAGTTCGGGTTCTGTGGCAGCACCGACGACTACTGTAAAACTGGCTGCCAGAGCCAATGTGGCGGCTCAACTCCTACGCCCTCCGGCAGCGGCGTCGGAAGCATCA
TCACCGAGAGCCTTTACAATCAAATGCTCAAGTATCACAACGATCCTCGATGCCCTAGCAATGGATTCTATACTTACAATGCTTTCATTACTGCAGCTAATTCCTTCCCT
GGCTTCGGTACCACGGGAGATGCTGCTACTCGTAAGAGGGAGATGGCGGCTTTCTTTGGCCAAACTTCTCACGAGACTACTGGAGGATGGTCTACCGCACCAGACGGTCC
ATATGCATGGGGATACTGTTTCATACGTGAGAGAAATCAAGACGTATATTGCACACCTAGTCAACAATGGCCATGTGCTGCTGGCCAGAAATATTACGGACGTGGACCAA
TCCAACTAACTCACAACTACAACTACGGGCCAGCAGGCAACGCAATAAGTACGAATTTGTTGAGCAACCCTGATTTGGTGGCCACAAATGCAATCATATCATTCAAGACA
GCCATTTGGTTTTGGATGACACCACAAGGAAACAAACCATCATGTCATAATGTTATTACTGGCCAGTGGCAACCTTCGAGCACTGATACTGCTGCAGGAAGATTACCTGG
CTATGGTGTCATCACCAACATCATTAACGGTGGACTCGAGTGTGGGCATGGCCCCGATGCAAGAGTTGCTGATAGAATTGGATTTTACAAAAGTACTCAGACAACGGCGG
GCCTTCATCCAGTTGGGCCGAAGCCCGGCCCATTTACTTTCCCACCCAAATCTCAAAATTCTGCAAAAACCCTAACATCGAAATCCCTTTCAATTGATTCGATTGGAGCA
GTACAGGGGATTGGGAAGATGGGCAAATTATCGCCATCATTTCGGTCAGCGCTCTCCACCACCATCGTTAACAAACCACCTCATCCTCCGGCGGCGCCACCGCTCTTGTC
CGCCGAGCCCCGCTCTTTGTCCAAGAAACAACCTCCAAAACATTCCCGGAAATTTCTATCGGCGCAGAGCTCCGGCCACCCAGAAAAGCCTAAACTTCCGACGCTATTCA
AATCGGCCAATCTCGCAGATGCCAAGAAGCTCTACAGCTCCTTCATCTCCACCACAAAAGCCCCTCTCGACATTCGATTCTATAACTCTCTCCTTCAGTCTTACGCTTCA
ATCGCCACACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCACCCGATCGATCGACCTTCCATATCTTGCTCTCTACCTCTGGGAA
TGGCACTGATTCCTCTCTCGCCTCGGTTCGGCAAATCCTCAATTTCATGGTCACCAATGGCTTCAATCCTGACAAGGTAACCACTGATATTGCTGTGCGATCGCTTTGTT
CGGTAGGTCTGATTGATGAAGCTGTAGAATTAGTTAGAGAATTATCACAAAAACACTCGCCTCCTGATTCTTTTACATACAATCATCTCGTTAAGCAACTTTGCAAGTCC
AGAGCTCTGTCTACGGTTTATGGTTTTATCGATGAAATGCGTAGTAGCTGTGGTGCAAAGCCCGATCTTGTTACTTATACAATCTTGATAGATAATGTATGCAATGGCAA
GAATCTCCGCGAAGCGACGCGGTTGGTAAGTGTGCTGGCCGAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGCTTGGCAGGG
GCAGCGAGGCAATTGGAGTCTATAAGAAAATGAAGGAAGAGGGATTGGAGCCTGATGTTGTAACTTTTAATACGTTGATTTTTGGGTTATCGAAGTCGGGACGAGTTAAG
GAAGCCAGAAAATTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCAGTTACCTACACTTCTTTGATGAATGGAATGTGTCGTGAGGGTGATGCATTGGGAGC
TCTGTCATTGCTTGAGGAGATGGAGGCAAAGGGGTGCAGCCCCAATTCGTGCACATATAATACTTTACTCCATGGATTATCTAAGTCTAGGCTTTTGGATAGAGGGATTG
AATTGTATGGTTTGATGAAATCTGGTGATATGAAGCTTGAAACAGCTTCCTATGCTACTTTTGTGAGGGCACTTTGCAGGAGCGGTAGGATCGCTGAGGCTTATGAAGTG
TTTGATTATGCAGTTGAGAGTAAAAGTCTGACTGATGTTGCTGCGTATTCAACGTTAGAGAGTACATTGAAGTCTCTGAAGAAAGCAAGGGAGCAAGGCCAAGCTATATA
A
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCCCATACGCTTCTAATCTTAGCCTTTGCCTTTGTTTTGGGAGCCTCAGCCGAGCAATGTGGGCGGCAGGCCGGTGGCGCTCTGTGCCCCAACGGCCTCTGCTG
CAGCCAGTTCGGGTTCTGTGGCAGCACCGACGACTACTGTAAAACTGGCTGCCAGAGCCAATGTGGCGGCTCAACTCCTACGCCCTCCGGCAGCGGCGTCGGAAGCATCA
TCACCGAGAGCCTTTACAATCAAATGCTCAAGTATCACAACGATCCTCGATGCCCTAGCAATGGATTCTATACTTACAATGCTTTCATTACTGCAGCTAATTCCTTCCCT
GGCTTCGGTACCACGGGAGATGCTGCTACTCGTAAGAGGGAGATGGCGGCTTTCTTTGGCCAAACTTCTCACGAGACTACTGGAGGATGGTCTACCGCACCAGACGGTCC
ATATGCATGGGGATACTGTTTCATACGTGAGAGAAATCAAGACGTATATTGCACACCTAGTCAACAATGGCCATGTGCTGCTGGCCAGAAATATTACGGACGTGGACCAA
TCCAACTAACTCACAACTACAACTACGGGCCAGCAGGCAACGCAATAAGTACGAATTTGTTGAGCAACCCTGATTTGGTGGCCACAAATGCAATCATATCATTCAAGACA
GCCATTTGGTTTTGGATGACACCACAAGGAAACAAACCATCATGTCATAATGTTATTACTGGCCAGTGGCAACCTTCGAGCACTGATACTGCTGCAGGAAGATTACCTGG
CTATGGTGTCATCACCAACATCATTAACGGTGGACTCGAGTGTGGGCATGGCCCCGATGCAAGAGTTGCTGATAGAATTGGATTTTACAAAAGTACTCAGACAACGGCGG
GCCTTCATCCAGTTGGGCCGAAGCCCGGCCCATTTACTTTCCCACCCAAATCTCAAAATTCTGCAAAAACCCTAACATCGAAATCCCTTTCAATTGATTCGATTGGAGCA
GTACAGGGGATTGGGAAGATGGGCAAATTATCGCCATCATTTCGGTCAGCGCTCTCCACCACCATCGTTAACAAACCACCTCATCCTCCGGCGGCGCCACCGCTCTTGTC
CGCCGAGCCCCGCTCTTTGTCCAAGAAACAACCTCCAAAACATTCCCGGAAATTTCTATCGGCGCAGAGCTCCGGCCACCCAGAAAAGCCTAAACTTCCGACGCTATTCA
AATCGGCCAATCTCGCAGATGCCAAGAAGCTCTACAGCTCCTTCATCTCCACCACAAAAGCCCCTCTCGACATTCGATTCTATAACTCTCTCCTTCAGTCTTACGCTTCA
ATCGCCACACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCACCCGATCGATCGACCTTCCATATCTTGCTCTCTACCTCTGGGAA
TGGCACTGATTCCTCTCTCGCCTCGGTTCGGCAAATCCTCAATTTCATGGTCACCAATGGCTTCAATCCTGACAAGGTAACCACTGATATTGCTGTGCGATCGCTTTGTT
CGGTAGGTCTGATTGATGAAGCTGTAGAATTAGTTAGAGAATTATCACAAAAACACTCGCCTCCTGATTCTTTTACATACAATCATCTCGTTAAGCAACTTTGCAAGTCC
AGAGCTCTGTCTACGGTTTATGGTTTTATCGATGAAATGCGTAGTAGCTGTGGTGCAAAGCCCGATCTTGTTACTTATACAATCTTGATAGATAATGTATGCAATGGCAA
GAATCTCCGCGAAGCGACGCGGTTGGTAAGTGTGCTGGCCGAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGCTTGGCAGGG
GCAGCGAGGCAATTGGAGTCTATAAGAAAATGAAGGAAGAGGGATTGGAGCCTGATGTTGTAACTTTTAATACGTTGATTTTTGGGTTATCGAAGTCGGGACGAGTTAAG
GAAGCCAGAAAATTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCAGTTACCTACACTTCTTTGATGAATGGAATGTGTCGTGAGGGTGATGCATTGGGAGC
TCTGTCATTGCTTGAGGAGATGGAGGCAAAGGGGTGCAGCCCCAATTCGTGCACATATAATACTTTACTCCATGGATTATCTAAGTCTAGGCTTTTGGATAGAGGGATTG
AATTGTATGGTTTGATGAAATCTGGTGATATGAAGCTTGAAACAGCTTCCTATGCTACTTTTGTGAGGGCACTTTGCAGGAGCGGTAGGATCGCTGAGGCTTATGAAGTG
TTTGATTATGCAGTTGAGAGTAAAAGTCTGACTGATGTTGCTGCGTATTCAACGTTAGAGAGTACATTGAAGTCTCTGAAGAAAGCAAGGGAGCAAGGCCAAGCTATATA
A
Protein sequenceShow/hide protein sequence
MKAHTLLILAFAFVLGASAEQCGRQAGGALCPNGLCCSQFGFCGSTDDYCKTGCQSQCGGSTPTPSGSGVGSIITESLYNQMLKYHNDPRCPSNGFYTYNAFITAANSFP
GFGTTGDAATRKREMAAFFGQTSHETTGGWSTAPDGPYAWGYCFIRERNQDVYCTPSQQWPCAAGQKYYGRGPIQLTHNYNYGPAGNAISTNLLSNPDLVATNAIISFKT
AIWFWMTPQGNKPSCHNVITGQWQPSSTDTAAGRLPGYGVITNIINGGLECGHGPDARVADRIGFYKSTQTTAGLHPVGPKPGPFTFPPKSQNSAKTLTSKSLSIDSIGA
VQGIGKMGKLSPSFRSALSTTIVNKPPHPPAAPPLLSAEPRSLSKKQPPKHSRKFLSAQSSGHPEKPKLPTLFKSANLADAKKLYSSFISTTKAPLDIRFYNSLLQSYAS
IATLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSVGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKS
RALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRVK
EARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEV
FDYAVESKSLTDVAAYSTLESTLKSLKKAREQGQAI