; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002186 (gene) of Snake gourd v1 genome

Gene IDTan0002186
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG03:76198512..76206872
RNA-Seq ExpressionTan0002186
SyntenyTan0002186
Gene Ontology termsGO:0006032 - chitin catabolic process (biological process)
GO:0016998 - cell wall macromolecule catabolic process (biological process)
GO:0004568 - chitinase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008061 - chitin binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7129869.1 hypothetical protein RHSIM_Rhsim10G0203000 [Rhododendron simsii]5.3e-24356.3Show/hide
Query:  SGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERNQNV-
        S  G GV ++IS++ + QMLK+ ND  C  KGFY Y AF+ AA++F GFGTTG+  T++RE+AAFL QTSHETTGGW TAP+GPY WGYCF+ E+   + 
Subjt:  SGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERNQNV-

Query:  -YC--TPNQQWP---CVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVS
         YC  +   Q+P   C +G+KYYGRGP Q+T+NYNYGPAG AIGS LL NPD+VATDP +SFKTA+WFWM PQ  KPSCHDVI G W PS  + AAG V 
Subjt:  -YC--TPNQQWP---CVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVS

Query:  GYGVITNIINGGLECGHGPDSRVADRIGFYKRY-----YNGGRLSSWAEARPIYFPTQNLKSAKTL-----------------KSEKPFRLIREARRV--
        GYGVITNIINGG+ECG G   +  DRIGFYKRY        G         P +  +++ + +  L                  +    RL+R+   +  
Subjt:  GYGVITNIINGGLECGHGPDSRVADRIGFYKRY-----YNGGRLSSWAEARPIYFPTQNLKSAKTL-----------------KSEKPFRLIREARRV--

Query:  ------QGNWKMGKL--SPSF---------------RSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPP--KHSRKILSAQSSGHPEKPKLPTL-FKS
              +G   +  L   PS                R  +   +  +    PA++ L + +    S+ P P  K  R           + PK PT+ F S
Subjt:  ------QGNWKMGKL--SPSF---------------RSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPP--KHSRKILSAQSSGHPEKPKLPTL-FKS

Query:  ASLADAKKLYSSFIATTKAPLDLRFYNSLLHSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTT
         SL+DAK L++S I +T  P DL+F+NSLL SYAS++SL+D+ S LRHM+K  P+FSPD ST+ +LL+ S    D +LA+VR+ L  M   GF PDK TT
Subjt:  ASLADAKKLYSSFIATTKAPLDLRFYNSLLHSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTT

Query:  DIAVRSLCSAGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFK
        D+AVRSLC+AG ++ AVELV+E   K+SPPDS TYN L+K LC++R +S+V GFI EMR S   KPDLV+YTILIDNVCN KNLREATRL+ +L+E+GFK
Subjt:  DIAVRSLCSAGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFK

Query:  PDCFVYNTIMKGYCMLGRGSEAVGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEME
        PDC+VYN +MKGYCML +GSE + VYKKMKEEG+EPD+VT+NTLIFGLSKSGRVKEARKFL +MAEMG  PD VTYTSLMNGMCREG+A GAL LLEEME
Subjt:  PDCFVYNTIMKGYCMLGRGSEAVGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEME

Query:  EKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQA
         KGCSPNSCTYNTLLHGL K++LL + IELYG+MK+GD+KLET SY TF+R LCR+GR+AEAYEVFDYAVESKS+ DV AYSTLES LK LKKA+EQG A
Subjt:  EKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQA

Query:  I
        +
Subjt:  I

KAF7130787.1 hypothetical protein RHSIM_Rhsim10G0201800 [Rhododendron simsii]3.9e-24659.3Show/hide
Query:  SGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERNQNV-
        S  G GV ++IS++ + QMLK+ ND  C   GFY Y AF+ AA +F GFGTTGD +T++RE+AAF  QTSHETTGG+ TAPDGPYAWGYC + E+     
Subjt:  SGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERNQNV-

Query:  YCTPN-QQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVI
        YC  + QQ+PC   +KY+GRGP Q+T+NYNYGPAG AIG  LL  PD+VATDP++SFKTA+WFWMTPQ  KPSCHDVI G+W PS+   AAG V GYGVI
Subjt:  YCTPN-QQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVI

Query:  TNIINGGLECGHGPDSRVADRIGFYKRYYNGGRLSSWAEARPIYFPTQNLKSAKTLKSEKPFRLIREARRVQGNWKMGKLSPSFRSALSTTIVSKPPHPP
        TNIINGG+ECG G +    DRIGFYKRY            R + FP  +L     LK+ +                               + S+ P+P 
Subjt:  TNIINGGLECGHGPDSRVADRIGFYKRYYNGGRLSSWAEARPIYFPTQNLKSAKTLKSEKPFRLIREARRVQGNWKMGKLSPSFRSALSTTIVSKPPHPP

Query:  AAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTL-FKSASLADAKKLYSSFIATTKAPLDLRFYNSLLHSYASIASLNDSISFLRHMSKVQP
           P     P    KK PP+ S               KLPT+ F S SL+DAK L++S I +T  P DL+F+NSLL SYA+++SL+D+ S LRHM+K  P
Subjt:  AAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTL-FKSASLADAKKLYSSFIATTKAPLDLRFYNSLLHSYASIASLNDSISFLRHMSKVQP

Query:  SFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYGF
        +FSPD ST+ +LL+ S    D +L +VR+ LN M   GF PDK TTD+AVRSLC+AG ++ AVELV+E   K+SPPDS TYN L+K LC++R +S+V GF
Subjt:  SFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSRALSTVYGF

Query:  IDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRV
        I EMR S   KPDLV+YTILIDNVCN KNLREATRL+ +L+E+GFKPDC+VYN +MKGYCML +GSE + VYKKMKEEG+EPD+VT+NTLIFGLSKSGR 
Subjt:  IDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRV

Query:  KEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALC
        KEARKFL +MAEMG  PD VTYTSLMNGMCREG+A GAL LLEEME KGCSPNSCTYNTLLHGL K++LL + IELYG+MK+GD+KLET SY TF+R LC
Subjt:  KEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALC

Query:  RSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        R+GR+AEAYEVFDYAVESKS+ DV AYSTLES LK LKKA+EQG A+
Subjt:  RSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

KAG7026711.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-23890.85Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH
        MGKLSPSFRSA+ST IV+KPPHPPAA  LLSGE RSLSKK PPKHSR+  SAQSS H EKPKLPTLFKSASLADAKKLYSSFI TTKAPLD+RFYNSLL 
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH

Query:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD
        SYASIASLNDSISFLRHMSKVQPSFSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT GFNPD+ TTDIAVRSLCSAGLIDEAVELVRE SQKHSPPD
Subjt:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVY FI+EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLA+EGFKPDCFVYN IMKGYCMLGRG EA+GVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCR+GDALGALSLLEEME KGCSPNSCTYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE ASYATFVRALCRSGRIAEAYEVFDYAVESKS+TDV AYSTLE TLK+LKKA E+G  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

XP_022926982.1 pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata]1.3e-23891.28Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH
        MGKLSPSFRSA+ST IV+KPPHPPAA  LLSGE RSLSKK PPKHSR+  SAQSS H EKPKLPTLFKSASLADAKKLYSSFI TTKAPLD+RFYNSLL 
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH

Query:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD
        SYASIASLNDSISFLRHMSKVQPSFSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT GFNPDK TTDIAVRSLCSAGLIDEAVELVRE SQKHSPPD
Subjt:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVY FI+EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLA+EGFKPDCFVYN IMKGYCMLGRG EA+GVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEME KGCSPNSCTYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE ASYATFVRALCRSGRIAEAYEVFDYAVESKS+TDV AYSTLE TLK+LKKA E+G  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

XP_023003882.1 pentatricopeptide repeat-containing protein At2g17670-like [Cucurbita maxima]2.9e-24191.49Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH
        MGKLSPSFRSA+ST IV+KPP+PPAA  LLSGE RSLSKK PPKHSR+  SAQSSGH EKPKLPTLFKSASLA+AKKLYSSFI TTKAPLD+RFYNSLLH
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH

Query:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD
        SY SIASLNDSISFLRHMSKVQP+FSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT+GFNPDK T DIAVRSLCSAGLIDEAVELVRE SQKHSPPD
Subjt:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVYGFI EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEA+GVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPD VTYTSLMNGMCREGDALGALSLLEEME KGCSPNSC+YNTLLHGLSKSRLLD+GIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKSGDMKLE ASYATFVRALCRSGRIAEAYEVFDYAVES+S+TDVAAYSTLE+TLK+LKKAREQG  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

TrEMBL top hitse value%identityAlignment
A0A0A0KHF8 Uncharacterized protein4.7e-22184.93Show/hide
Query:  MGKLSPSFRSALST-TIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL
        MGKLSPSFRS LS+ T+++KPPH PAA PL        SKK  PK SRK  S QSSGHPEKPKLPT+FKSASLADAKKLYSSF++ TKAP +LR +NSLL
Subjt:  MGKLSPSFRSALST-TIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL

Query:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP
         SYASIA+LNDSISFLRHMSKVQPSFSPD+STFHILLSTSGN  DS+LASV+QILNFMVTNGFNPDKVT D+AVRSLCS GL+DEAVELV+ELSQKH+PP
Subjt:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK
        D +TYNHLVKQLCKSRALSTVY FI EMRSSCGAKPDLVTYTILIDNVCN  NLREA RLVS+L +EGFKPDCFVYNTIMKGYCM+GRG+EA+GVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL
        E GLEPDVVTFNTLIFGLSKSGRVKEAR FLDIMAEMGHFPDAVTYTSLMNGMCREG+ALGALSLL+EME KGC+PNSCTYNTLLHGLSKSRLLDRGIEL
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL

Query:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        YGLMKS DMKLETASY+TFVRALCRSGRIAEAYEVFDYAVESKS+TDV+AY +LESTLKSLK AREQ  AI
Subjt:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

A0A1S4DT84 pentatricopeptide repeat-containing protein At2g176702.9e-21583.4Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH
        MGKLSPSFRS LST+++ KP   PAA PL        SKKP PK SRK  S QSSGHP KPKLPT+FKSASLADAKKLYSSFI+T+KAP +LR +NSLL 
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH

Query:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD
        SYASIA+LNDSISFLRHMSKVQPSFSPD+STFHILLSTS N  DSSLASVR+ILNFMVTNGFNPDKVT D+AVRSLCS GL+DEAVELV+ELSQKH+P D
Subjt:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE
         +TYNHLVKQLCKSRALSTVY FI EMRSSCGAKPDLVTYTILIDNVCN  NLREA RLVS+L +EGFKPDCFVYNTIMKGYCM+GRG+EA+GVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
         GLEPD+VTFNTLIFGLSKSGRVKEA  FLDIMAEMGHFPD VTYTSLMNGMCREG+ALGALSLL+EME KGC+PNS TYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE+ASY+TFVRALCRSGRIAEAYEVFDYAVESKS+TDV+AY +LESTLKSLK AREQ  A+
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

A0A6J1BW58 pentatricopeptide repeat-containing protein At2g176702.1e-23791.72Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAA-PLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL
        MGKLSPSFRSA+STTI++KPP P AAA PLLSGEPRSLSKK PPK SRKI SAQSSG  EKPK  TLFKS+SLADAKKLYSSFIATT+APLDLRFYNSLL
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAA-PLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL

Query:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP
         SYASIA+LNDSISFLR+MSKVQPSFSPDRSTFH+LLSTSGNG+ SSLASV+QILNFMV+NGFNPDKVTTDIAVRSLCSAGLIDEAVELV+ELS+K SPP
Subjt:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK
        DSFTYNHLVKQLCKSRALSTVYGFIDEMRSS G+KPDLVTYTILIDNVCNGKNLREATRL+SVL EEGFKPDCFVYNTIMKGYCMLGRGSEA+GVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL
        EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLE ME KGCSPNSCTYNTLLHGL+KSRLLDRGIEL
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL

Query:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        YGLMKSG MKLETASYAT VRALCRS RIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKK REQG AI
Subjt:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

A0A6J1EGP9 pentatricopeptide repeat-containing protein At2g176706.5e-23991.28Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH
        MGKLSPSFRSA+ST IV+KPPHPPAA  LLSGE RSLSKK PPKHSR+  SAQSS H EKPKLPTLFKSASLADAKKLYSSFI TTKAPLD+RFYNSLL 
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH

Query:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD
        SYASIASLNDSISFLRHMSKVQPSFSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT GFNPDK TTDIAVRSLCSAGLIDEAVELVRE SQKHSPPD
Subjt:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVY FI+EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLA+EGFKPDCFVYN IMKGYCMLGRG EA+GVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEME KGCSPNSCTYNTLLHGLSKSRLLDRGIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKS DMKLE ASYATFVRALCRSGRIAEAYEVFDYAVESKS+TDV AYSTLE TLK+LKKA E+G  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

A0A6J1KT16 pentatricopeptide repeat-containing protein At2g17670-like1.4e-24191.49Show/hide
Query:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH
        MGKLSPSFRSA+ST IV+KPP+PPAA  LLSGE RSLSKK PPKHSR+  SAQSSGH EKPKLPTLFKSASLA+AKKLYSSFI TTKAPLD+RFYNSLLH
Subjt:  MGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLH

Query:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD
        SY SIASLNDSISFLRHMSKVQP+FSP+RSTFH+LLSTSGNGTDSSLASVRQILNFMVT+GFNPDK T DIAVRSLCSAGLIDEAVELVRE SQKHSPPD
Subjt:  SYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPD

Query:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE
        S+TYNHLVKQLCKSR+LSTVYGFI EMRSSCGA PDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEA+GVYKKMKE
Subjt:  SFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKE

Query:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY
        EGLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPD VTYTSLMNGMCREGDALGALSLLEEME KGCSPNSC+YNTLLHGLSKSRLLD+GIELY
Subjt:  EGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELY

Query:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI
        GLMKSGDMKLE ASYATFVRALCRSGRIAEAYEVFDYAVES+S+TDVAAYSTLE+TLK+LKKAREQG  I
Subjt:  GLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI

SwissProt top hitse value%identityAlignment
P52403 Endochitinase 1 (Fragment)4.3e-12369.93Show/hide
Query:  TLLILAFAFVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTG-CQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSKG-FYNYNAFVT
        TL +L    +L  SAEQCG QA GALC  GLCCS+FGWCG+T+DYC  G CQSQC G  G    +G +IS S ++QML + ND  C  KG FY+YNAF++
Subjt:  TLLILAFAFVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTG-CQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSKG-FYNYNAFVT

Query:  AAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRER-NQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTL
        AA SFPGFGTTGDI  R+RE+AAF  QTSHETTGGWPTAPDGPYAWGYCF+RE+ +   YCTP+ QWPC  G+KY+GRGPIQ++HNYNYGP G+AIG  L
Subjt:  AAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRER-NQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTL

Query:  LNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRY
        LNNPD+VATD V+SFK+AIWFWMTPQ  KPSCHDVI G+WQPS VD AA RV G+GVITNIINGGLECGHG DSRV DRIGFY+RY
Subjt:  LNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRY

P52405 Endochitinase 3 (Fragment)1.8e-12169.34Show/hide
Query:  TLLILAFA-FVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTG-CQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSK-GFYNYNAFV
        T+  L F+  +L  SAEQCG QA GALC  GLCCS+FGWCGNT+DYC  G CQSQC G  G    +G +IS S ++QML + ND  C  K  FY+YNAF+
Subjt:  TLLILAFA-FVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTG-CQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSK-GFYNYNAFV

Query:  TAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRER-NQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGST
        +AA SFPGFGTTGDI  R+RE+AAFL QTSHETTGGWP+APDGPYAWGYCF+RE+ +   YCTP+ QWPC  G+KY+GRGPIQ++HNYNYGP G+AIG  
Subjt:  TAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRER-NQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGST

Query:  LLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRY
        LLNNPD+VATD V+SFK+AIWFWMTPQ  KPSCHDVI G+WQPS  D AA RV G+GVITNIINGGLECGHG DSRV DRIGFY+RY
Subjt:  LLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRY

Q05538 Basic 30 kDa endochitinase5.1e-12470.63Show/hide
Query:  TLLILAFAFVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTG-CQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSK-GFYNYNAFVT
        TL +L    +L  SAEQCG QA GALC  GLCCS+FGWCGNT++YC  G CQSQC G  G    +G +IS S ++QML + ND  C  K  FY+YNAFVT
Subjt:  TLLILAFAFVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTG-CQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSK-GFYNYNAFVT

Query:  AAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRER-NQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTL
        AA SFPGFGTTGDI  R+RE+AAFL QTSHETTGGWPTAPDGPYAWGYCF+RE+ +   YCTP+ QWPC  G+KY+GRGPIQ++HNYNYGP G+AIG  L
Subjt:  AAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRER-NQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTL

Query:  LNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRY
        LNNPD+VATDPV+SFK+AIWFWMTPQ  KPSCHDVI G+WQPS  D AA RV G+GVITNIINGGLECGHG DSRV DRIGFY+RY
Subjt:  LNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRY

Q09023 Endochitinase CH258.2e-12268.26Show/hide
Query:  MKAHTLLILAFAFVLGGS-AEQCGRQANGALCPIGLCCSQFGWCGNTDDYCK-TGCQSQCSGSSGSGTG-VGAIISESTYNQMLKYHNDGRCPSKGFYNY
        MK+  LL L F+F+L  S AEQCGRQA GALCP GLCCS+FGWCG+T+ YCK  GCQSQC G+    TG +  IIS S ++ MLK+ ND  CP++GFY Y
Subjt:  MKAHTLLILAFAFVLGGS-AEQCGRQANGALCPIGLCCSQFGWCGNTDDYCK-TGCQSQCSGSSGSGTG-VGAIISESTYNQMLKYHNDGRCPSKGFYNY

Query:  NAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-QNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKA
        +AF+ AAKSFPGFGTTGD  TR++E+AAF GQTSHETTGGW TAPDGPY+WGYCF +E+N  + YC+P+ +WPC +G+ YYGRGP+QL+ NYNYG  G+A
Subjt:  NAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-QNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKA

Query:  IGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRYYN
        IGS LLNNPD+V+ DPV++FK AIWFWMTPQ  KPSCH VI+G+WQPS  D AAGRV GYGVITNIINGGLECG G D+RVADRIGFY+RY N
Subjt:  IGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRYYN

Q84J71 Pentatricopeptide repeat-containing protein At2g176706.6e-15660.04Show/hide
Query:  MGKLSPSFRSALSTTIVSK-PPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL
        MGK+  SFRS  +  +V K  P PPA        PR    +         L  Q++  P +P L   FKS +L+DAK L++S  AT++ PLDL+F+NS+L
Subjt:  MGKLSPSFRSALSTTIVSK-PPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL

Query:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP
         SY SIA +ND++   +H+ K QP+F P RSTF ILLS +    DSS+++V ++LN MV NG  PD+VTTDIAVRSLC  G +DEA +L++EL++KHSPP
Subjt:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK
        D++TYN L+K LCK + L  VY F+DEMR     KPDLV++TILIDNVCN KNLREA  LVS L   GFKPDCF+YNTIMKG+C L +GSEAVGVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL
        EEG+EPD +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMCR+G++LGALSLLEEME +GC+PN CTYNTLLHGL K+RL+D+G+EL
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL

Query:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQG
        Y +MKS  +KLE+  YAT VR+L +SG++AEAYEVFDYAV+SKS++D +AYSTLE+TLK LKKA+EQG
Subjt:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQG

Arabidopsis top hitse value%identityAlignment
AT1G02360.1 Chitinase family protein1.0e-7454.11Show/hide
Query:  SGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-
        S +    T +  ++    YN++  + ++  CP+ GFY Y +FV A + FP FG+ G   T+R E+AAFL Q SHETTGGW TAPDGPYAWG CF  E + 
Subjt:  SGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-

Query:  QNVYC-TPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGY
        Q+ YC + + QWPC   + Y GRGPIQL+ NYNYGPAG+A+G   L NP+ V+ + V++F+TA+WFWMTPQ  KPSCHDV+IGK++P++ D AA R  G+
Subjt:  QNVYC-TPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGY

Query:  GVITNIINGGLECGHGPDSRVADRIGFYKRY
        G+ TNIINGGLECG   D RV DRIGF++RY
Subjt:  GVITNIINGGLECGHGPDSRVADRIGFYKRY

AT2G17670.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.7e-15760.04Show/hide
Query:  MGKLSPSFRSALSTTIVSK-PPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL
        MGK+  SFRS  +  +V K  P PPA        PR    +         L  Q++  P +P L   FKS +L+DAK L++S  AT++ PLDL+F+NS+L
Subjt:  MGKLSPSFRSALSTTIVSK-PPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL

Query:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP
         SY SIA +ND++   +H+ K QP+F P RSTF ILLS +    DSS+++V ++LN MV NG  PD+VTTDIAVRSLC  G +DEA +L++EL++KHSPP
Subjt:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK
        D++TYN L+K LCK + L  VY F+DEMR     KPDLV++TILIDNVCN KNLREA  LVS L   GFKPDCF+YNTIMKG+C L +GSEAVGVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL
        EEG+EPD +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMCR+G++LGALSLLEEME +GC+PN CTYNTLLHGL K+RL+D+G+EL
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIEL

Query:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQG
        Y +MKS  +KLE+  YAT VR+L +SG++AEAYEVFDYAV+SKS++D +AYSTLE+TLK LKKA+EQG
Subjt:  YGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKAREQG

AT2G17670.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-10956.86Show/hide
Query:  MGKLSPSFRSALSTTIVSK-PPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL
        MGK+  SFRS  +  +V K  P PPA        PR    +         L  Q++  P +P L   FKS +L+DAK L++S  AT++ PLDL+F+NS+L
Subjt:  MGKLSPSFRSALSTTIVSK-PPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLL

Query:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP
         SY SIA +ND++   +H+ K QP+F P RSTF ILLS +    DSS+++V ++LN MV NG  PD+VTTDIAVRSLC  G +DEA +L++EL++KHSPP
Subjt:  HSYASIASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPP

Query:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK
        D++TYN L+K LCK + L  VY F+DEMR     KPDLV++TILIDNVCN KNLREA  LVS L   GFKPDCF+YNTIMKG+C L +GSEAVGVYKKMK
Subjt:  DSFTYNHLVKQLCKSRALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMK

Query:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREG
        EEG+EPD +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMCR+G
Subjt:  EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREG

AT3G12500.1 basic chitinase8.7e-11967.12Show/hide
Query:  MKAHTLLILAFAFVLG-GSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCK-TGCQSQCS--GSSGSGTG-VGAIISESTYNQMLKYHNDGRCPSKGFY
        MK +  L L F+ +L   SAEQCGRQA GALCP GLCCS+FGWCGNT+ YCK  GCQSQC+  G+    TG +  IIS S ++ MLK+ ND  CP++GFY
Subjt:  MKAHTLLILAFAFVLG-GSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCK-TGCQSQCS--GSSGSGTG-VGAIISESTYNQMLKYHNDGRCPSKGFY

Query:  NYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-QNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAG
         YNAF+TAAKSFPGFGTTGD  TR++E+AAF GQTSHETTGGW TAPDGPY+WGYCF +E+N  + YC P+  WPC +G++YYGRGP+QL+ NYNYG  G
Subjt:  NYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-QNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAG

Query:  KAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRYYN
        +AIG  LLNNPD+VA D V++FK AIWFWMT Q  KPSCH VI G+WQPS  D AAGR+ GYGVITNIINGGLECG G D RVADRIGFY+RY N
Subjt:  KAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRYYN

AT4G01700.1 Chitinase family protein3.1e-7656.82Show/hide
Query:  AIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-QNVYC-TPNQQ
        +++  + Y+Q+  + ++  CP+KGFY Y AFV A +SFP FG+ G+  TRRRE+AAFL Q SHETTGGW TAPDGPYAWG CF  E + Q+ YC   N+ 
Subjt:  AIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGFGTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERN-QNVYC-TPNQQ

Query:  WPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGL
        WPCV+G+ Y GRGPIQL+ NYNYG AG+A+G   L NP++VA + V++FKTA+WFWMT Q  KPSCH+V++ +++P+  D AA R  GYG++TNIINGGL
Subjt:  WPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAIWFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGL

Query:  ECGHGPDSRVADRIGFYKRY
        ECG   D RV DR+G+++RY
Subjt:  ECGHGPDSRVADRIGFYKRY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCCCATACGCTCCTAATCTTAGCATTTGCCTTTGTTTTGGGAGGTTCCGCGGAGCAATGTGGGCGGCAGGCCAATGGCGCTCTGTGCCCCATTGGCCTCTGCTG
CAGCCAGTTCGGGTGGTGCGGCAACACCGACGACTACTGTAAAACTGGTTGCCAGAGCCAGTGTAGCGGCTCTTCCGGTTCCGGTACCGGGGTCGGAGCGATCATCTCCG
AATCCACTTATAATCAAATGCTCAAGTATCACAACGATGGGCGATGCCCTAGTAAAGGGTTCTATAACTATAATGCTTTCGTTACTGCTGCTAAATCCTTCCCTGGCTTC
GGTACCACTGGAGATATCAATACTCGTAGAAGGGAGTTGGCTGCTTTCCTTGGCCAAACTTCTCATGAAACTACTGGAGGATGGCCTACCGCACCAGATGGTCCATATGC
ATGGGGATACTGTTTCATACGGGAGAGAAATCAAAACGTATATTGTACACCTAATCAACAATGGCCCTGTGTTGCTGGCCAAAAATATTACGGTCGTGGACCAATCCAAC
TAACCCATAACTACAACTATGGGCCAGCGGGCAAAGCAATAGGATCGACTTTGTTGAACAACCCTGATGTGGTAGCCACTGATCCTGTTGTATCATTCAAGACAGCTATT
TGGTTTTGGATGACACCACAAGGAAATAAACCGTCATGTCACGATGTTATTATCGGCAAGTGGCAACCTTCGAGCGTCGATACTGCTGCAGGAAGAGTTTCTGGCTATGG
TGTCATCACCAATATCATTAATGGTGGACTCGAGTGTGGGCATGGTCCTGATAGCAGAGTTGCCGATAGAATTGGATTTTACAAACGATATTACAATGGTGGCCGTTTAT
CCAGTTGGGCCGAAGCCCGGCCCATTTACTTTCCCACCCAAAATCTAAAATCTGCAAAAACCCTAAAATCGGAAAAACCCTTTCGATTGATTCGAGAGGCGAGGCGAGTA
CAGGGTAATTGGAAGATGGGCAAATTATCGCCCTCATTTCGGTCAGCTCTCTCCACCACGATAGTTAGCAAACCACCTCATCCTCCGGCGGCGGCGCCGCTCTTGTCCGG
CGAGCCCCGCTCCTTGTCCAAGAAACCACCTCCTAAACATTCCCGGAAAATCCTATCAGCGCAGAGCTCCGGCCACCCGGAAAAGCCCAAACTCCCAACGCTGTTCAAAT
CGGCCAGTCTCGCAGATGCCAAGAAGCTCTACAGCTCCTTCATCGCCACCACAAAAGCCCCTCTCGACCTTCGATTCTATAACTCTCTTCTTCATTCTTACGCTTCAATC
GCCTCACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCGCCCGATCGATCGACCTTCCATATCTTACTCTCTACCTCCGGGAATGG
TACTGATTCCTCTCTCGCCTCGGTTCGGCAAATCCTCAATTTCATGGTCACCAATGGCTTCAATCCTGACAAGGTGACTACTGATATTGCTGTGCGATCGCTTTGTTCGG
CAGGTCTGATTGATGAAGCTGTAGAATTAGTTAGAGAATTATCGCAAAAACACTCGCCTCCTGATTCTTTTACATATAATCATCTCGTTAAGCAACTTTGTAAGTCTAGA
GCTCTGTCTACGGTTTATGGTTTTATTGATGAAATGCGTAGTAGTTGTGGTGCGAAGCCCGATCTTGTTACTTATACAATCTTGATAGATAATGTGTGTAATGGCAAGAA
TCTACGTGAGGCGACGCGGTTGGTAAGTGTGCTGGCTGAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGCTTGGCAGGGGCA
GCGAGGCAGTTGGAGTCTATAAGAAAATGAAGGAGGAGGGATTGGAGCCTGATGTTGTAACATTTAACACGTTGATTTTTGGGTTATCAAAGTCGGGGCGAGTTAAGGAA
GCCAGAAAGTTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCTGTTACTTACACTTCATTGATGAATGGAATGTGTCGTGAGGGTGATGCATTGGGAGCATT
GTCATTACTTGAGGAGATGGAGGAAAAGGGTTGCAGCCCCAATTCATGCACATATAATACGTTGCTCCATGGATTGTCAAAGTCTAGGCTTTTGGATAGAGGGATTGAAT
TGTATGGTTTGATGAAATCTGGTGATATGAAGCTTGAAACAGCTTCCTATGCTACTTTTGTGAGGGCGCTTTGCAGGAGCGGTAGGATTGCTGAGGCCTATGAAGTATTT
GATTATGCAGTTGAGAGTAAAAGTATGACTGATGTTGCTGCGTATTCAACATTAGAGAGTACATTGAAGTCTCTGAAGAAAGCGAGGGAGCAAGGCCAAGCTATATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCCCATACGCTCCTAATCTTAGCATTTGCCTTTGTTTTGGGAGGTTCCGCGGAGCAATGTGGGCGGCAGGCCAATGGCGCTCTGTGCCCCATTGGCCTCTGCTG
CAGCCAGTTCGGGTGGTGCGGCAACACCGACGACTACTGTAAAACTGGTTGCCAGAGCCAGTGTAGCGGCTCTTCCGGTTCCGGTACCGGGGTCGGAGCGATCATCTCCG
AATCCACTTATAATCAAATGCTCAAGTATCACAACGATGGGCGATGCCCTAGTAAAGGGTTCTATAACTATAATGCTTTCGTTACTGCTGCTAAATCCTTCCCTGGCTTC
GGTACCACTGGAGATATCAATACTCGTAGAAGGGAGTTGGCTGCTTTCCTTGGCCAAACTTCTCATGAAACTACTGGAGGATGGCCTACCGCACCAGATGGTCCATATGC
ATGGGGATACTGTTTCATACGGGAGAGAAATCAAAACGTATATTGTACACCTAATCAACAATGGCCCTGTGTTGCTGGCCAAAAATATTACGGTCGTGGACCAATCCAAC
TAACCCATAACTACAACTATGGGCCAGCGGGCAAAGCAATAGGATCGACTTTGTTGAACAACCCTGATGTGGTAGCCACTGATCCTGTTGTATCATTCAAGACAGCTATT
TGGTTTTGGATGACACCACAAGGAAATAAACCGTCATGTCACGATGTTATTATCGGCAAGTGGCAACCTTCGAGCGTCGATACTGCTGCAGGAAGAGTTTCTGGCTATGG
TGTCATCACCAATATCATTAATGGTGGACTCGAGTGTGGGCATGGTCCTGATAGCAGAGTTGCCGATAGAATTGGATTTTACAAACGATATTACAATGGTGGCCGTTTAT
CCAGTTGGGCCGAAGCCCGGCCCATTTACTTTCCCACCCAAAATCTAAAATCTGCAAAAACCCTAAAATCGGAAAAACCCTTTCGATTGATTCGAGAGGCGAGGCGAGTA
CAGGGTAATTGGAAGATGGGCAAATTATCGCCCTCATTTCGGTCAGCTCTCTCCACCACGATAGTTAGCAAACCACCTCATCCTCCGGCGGCGGCGCCGCTCTTGTCCGG
CGAGCCCCGCTCCTTGTCCAAGAAACCACCTCCTAAACATTCCCGGAAAATCCTATCAGCGCAGAGCTCCGGCCACCCGGAAAAGCCCAAACTCCCAACGCTGTTCAAAT
CGGCCAGTCTCGCAGATGCCAAGAAGCTCTACAGCTCCTTCATCGCCACCACAAAAGCCCCTCTCGACCTTCGATTCTATAACTCTCTTCTTCATTCTTACGCTTCAATC
GCCTCACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCGCCCGATCGATCGACCTTCCATATCTTACTCTCTACCTCCGGGAATGG
TACTGATTCCTCTCTCGCCTCGGTTCGGCAAATCCTCAATTTCATGGTCACCAATGGCTTCAATCCTGACAAGGTGACTACTGATATTGCTGTGCGATCGCTTTGTTCGG
CAGGTCTGATTGATGAAGCTGTAGAATTAGTTAGAGAATTATCGCAAAAACACTCGCCTCCTGATTCTTTTACATATAATCATCTCGTTAAGCAACTTTGTAAGTCTAGA
GCTCTGTCTACGGTTTATGGTTTTATTGATGAAATGCGTAGTAGTTGTGGTGCGAAGCCCGATCTTGTTACTTATACAATCTTGATAGATAATGTGTGTAATGGCAAGAA
TCTACGTGAGGCGACGCGGTTGGTAAGTGTGCTGGCTGAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGCTTGGCAGGGGCA
GCGAGGCAGTTGGAGTCTATAAGAAAATGAAGGAGGAGGGATTGGAGCCTGATGTTGTAACATTTAACACGTTGATTTTTGGGTTATCAAAGTCGGGGCGAGTTAAGGAA
GCCAGAAAGTTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCTGTTACTTACACTTCATTGATGAATGGAATGTGTCGTGAGGGTGATGCATTGGGAGCATT
GTCATTACTTGAGGAGATGGAGGAAAAGGGTTGCAGCCCCAATTCATGCACATATAATACGTTGCTCCATGGATTGTCAAAGTCTAGGCTTTTGGATAGAGGGATTGAAT
TGTATGGTTTGATGAAATCTGGTGATATGAAGCTTGAAACAGCTTCCTATGCTACTTTTGTGAGGGCGCTTTGCAGGAGCGGTAGGATTGCTGAGGCCTATGAAGTATTT
GATTATGCAGTTGAGAGTAAAAGTATGACTGATGTTGCTGCGTATTCAACATTAGAGAGTACATTGAAGTCTCTGAAGAAAGCGAGGGAGCAAGGCCAAGCTATATAACC
GGCCTTGATCAACTTCAACAAGCATTTGGCTAGCCAATGAAAAAAGTGGTTCAGGGAAAACATGGTGAAGCTTGAGCGACGGTGAATTTCCATGTCTCAAGCATTGGCAG
AAGAGCACGGATCCAAATAGTCATCGAGAGCTTCCTACTGCTTGATGCAATTTTGAAATTGGAATCATTCAACATTATACCTCACCACCATTGATTTATTATGCAAAGTT
AAAGTTCTGCTGATTGTACAGAATCTTGTTTGTTTTTAGTGTATTTCAGAAGCGATTGCGCCGAGAAGCTATACGATAAAGCCTCTATTATGTATTCGATGGATGGAGAT
GGGAAACTACACCATAGCTTGGAGATATTGTTTTCTATGAGAGAGAAATCATTATGGCAGACCCAACCTCGGATGGCCTTGCGTTGGCAGAAATATTATGATCTTTTATT
CTTCTAGGATTTTTTTCTTATAAAATTTAAAATCAAATTTATGAGCTTTTAACCCTAGTTCATAAAGTTGTCACTTTTTTTCTTGGCAACTACAACTATGACTTAGTTGA
AAGAGCACTGCTTCGTTTGATAACTATTTTATTTTTTAAAATCTTGCTAGCACAATTTCTTTGCAATATTTTCATCTTTTTTAATAAAAATTT
Protein sequenceShow/hide protein sequence
MKAHTLLILAFAFVLGGSAEQCGRQANGALCPIGLCCSQFGWCGNTDDYCKTGCQSQCSGSSGSGTGVGAIISESTYNQMLKYHNDGRCPSKGFYNYNAFVTAAKSFPGF
GTTGDINTRRRELAAFLGQTSHETTGGWPTAPDGPYAWGYCFIRERNQNVYCTPNQQWPCVAGQKYYGRGPIQLTHNYNYGPAGKAIGSTLLNNPDVVATDPVVSFKTAI
WFWMTPQGNKPSCHDVIIGKWQPSSVDTAAGRVSGYGVITNIINGGLECGHGPDSRVADRIGFYKRYYNGGRLSSWAEARPIYFPTQNLKSAKTLKSEKPFRLIREARRV
QGNWKMGKLSPSFRSALSTTIVSKPPHPPAAAPLLSGEPRSLSKKPPPKHSRKILSAQSSGHPEKPKLPTLFKSASLADAKKLYSSFIATTKAPLDLRFYNSLLHSYASI
ASLNDSISFLRHMSKVQPSFSPDRSTFHILLSTSGNGTDSSLASVRQILNFMVTNGFNPDKVTTDIAVRSLCSAGLIDEAVELVRELSQKHSPPDSFTYNHLVKQLCKSR
ALSTVYGFIDEMRSSCGAKPDLVTYTILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAVGVYKKMKEEGLEPDVVTFNTLIFGLSKSGRVKE
ARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALGALSLLEEMEEKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSGDMKLETASYATFVRALCRSGRIAEAYEVF
DYAVESKSMTDVAAYSTLESTLKSLKKAREQGQAI