; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0005603 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0005603
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptioncarboxypeptidase SOL1 isoform X1
Genome locationchr03:8523216..8532563
RNA-Seq ExpressionPI0005603
SyntenyPI0005603
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006518 - peptide metabolic process (biological process)
GO:0010008 - endosome membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004181 - metallocarboxypeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000834 - Peptidase M14, carboxypeptidase A
IPR015567 - Peptidase M14B, caboxypeptidase D


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140953.3 carboxypeptidase SOL1 isoform X1 [Cucumis sativus]1.5e-20595.12Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM
        MKFLLFF LLSLTSPALFH+ALARGASNISISP DFDSHAYG SARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCS+ISRIYSIGDSVQGFPLWVM
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM

Query:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
        EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
Subjt:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR

Query:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
        QPETKAIMKWMRE HFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMAS+YSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
Subjt:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF

Query:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        ELTLEITDNKWPPAN      EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDY V+++
Subjt:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

XP_008456738.1 PREDICTED: carboxypeptidase SOL1 isoform X1 [Cucumis melo]3.0e-20695.12Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM
        MKFLLFFLLLSL+SPALFH+ALARGASNISISPVDFDSHAYGGSAR LLEDNKS+GSIMQGYMTNKDLEEAIKAFGKKCS+ISRIYSIGDSVQGFPLWVM
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM

Query:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
        EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
Subjt:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR

Query:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
        QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMAS+YSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
Subjt:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF

Query:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        ELTLEITDNKWPPA+      EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDY V+++
Subjt:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

XP_022957364.1 carboxypeptidase SOL1 isoform X1 [Cucurbita moschata]7.5e-18987.6Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW
        MKFL  FLLLS+ SPALFH ALARG  N SISPVDF S AYGGSARFLLEDN S+ S  IMQGYMTNK+LE A+KAFGKKCSKIS+IYSIGDSVQGFPLW
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW

Query:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY
        V+EISDKPGQEEA+PAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLR RNNANNVDLNRDFPDQFF INDDEY
Subjt:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY

Query:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG
         RQPETKAIMKWMR+IHFTASA+LHGGALVANYPWDGTADKRKDYYACPDD+TFRFMASVYSRSHHNMS SQEF+GGITNGA+WYPIYGGMQDWNYIHGG
Subjt:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG

Query:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        CFELTLEITDNKWPPAN      EYNKLSML LVASL QTGIHGRIFSS+SG PLP TITLKGIDY V+++
Subjt:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

XP_022990333.1 carboxypeptidase SOL1 isoform X1 [Cucurbita maxima]1.7e-18887.33Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW
        MKFL  FLLLS+ SPALFH ALARG  N SISPVDF S AYGGSARFLLEDN S+ S  IMQGYMTNK+LE A+KAFGKKCSKIS+IYSIGDSVQGFPLW
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW

Query:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY
        V+EISDKPGQEE++PAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLR RNNANNVDLNRDFPDQFF INDDEY
Subjt:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY

Query:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG
         RQPETKAIMKWMR+IHFTASA+LHGGALVANYPWDGTADKRKDYYACPDD+TFRFMASVYSRSHHNMS SQEF+GGITNGA+WYPIYGGMQDWNYIHGG
Subjt:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG

Query:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        CFELTLEITDNKWPPAN      EYNKLSML LVASL QTGIHGRIFSS+SG PLP TITLKGIDY V+++
Subjt:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

XP_038885322.1 carboxypeptidase SOL1 isoform X1 [Benincasa hispida]2.5e-20092.68Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM
        MKFLL FLL SLTSPALFHVALARGA N  ISPVDFDS AYGGSARFLLEDNKSQGSIMQGYMTNK+LEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM

Query:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
        EISDKPGQEEA+PAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSL+RRNNAN+VDLNRDFPDQFFVINDDEY R
Subjt:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR

Query:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
        QPETKAIMKWMR++HFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSR+HHNMSFSQEFQGGITNGA+WYPIYGGMQDWNYIHGGCF
Subjt:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF

Query:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        ELTLEITDNKWPPAN      EYNKLSMLKLVASLVQTGIHGRIFSSD G PLPATITLKGIDY V++N
Subjt:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

TrEMBL top hitse value%identityAlignment
A0A0A0KCK0 Uncharacterized protein8.9e-20493.35Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALA-------RGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQ
        MKFLLFF LLSLTSPALFH+ALA       RGASNISISP DFDSHAYG SARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCS+ISRIYSIGDSVQ
Subjt:  MKFLLFFLLLSLTSPALFHVALA-------RGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQ

Query:  GFPLWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVI
        GFPLWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVI
Subjt:  GFPLWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVI

Query:  NDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWN
        NDDEYDRQPETKAIMKWMRE HFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMAS+YSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWN
Subjt:  NDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWN

Query:  YIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        YIHGGCFELTLEITDNKWPPAN      EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDY V+++
Subjt:  YIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

A0A1S3C3Y1 carboxypeptidase SOL1 isoform X11.5e-20695.12Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM
        MKFLLFFLLLSL+SPALFH+ALARGASNISISPVDFDSHAYGGSAR LLEDNKS+GSIMQGYMTNKDLEEAIKAFGKKCS+ISRIYSIGDSVQGFPLWVM
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM

Query:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
        EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
Subjt:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR

Query:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
        QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMAS+YSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
Subjt:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF

Query:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        ELTLEITDNKWPPA+      EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDY V+++
Subjt:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

A0A6J1BXF5 carboxypeptidase SOL1 isoform X34.0e-18887.26Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM
        MKFL  F LLS+TSPALFHVALARG  N SISPV F S  YG SARFLL+D KSQ SI QGYMTN++LEEA+KAFG++CSKISRIYSIG+SVQGFPLWVM
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVM

Query:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR
        EISDKPGQEEA+PAFKYIGNVHGDEPVGRELLLQFANWICDNY KDPLATLIVE+VHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFF INDDEY R
Subjt:  EISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDR

Query:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF
        QPETKAIMKW+R++HFTASASLHGGALVAN+PWDGTADKRK YYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGA+WYPIYGGMQDWNYI GGCF
Subjt:  QPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCF

Query:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        ELTLEITDNKWPPA+      EYNKLSML LVASLVQTGIHGRIFSSDSG PLPATITLKGIDY V+++
Subjt:  ELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

A0A6J1H1Q5 carboxypeptidase SOL1 isoform X13.6e-18987.6Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW
        MKFL  FLLLS+ SPALFH ALARG  N SISPVDF S AYGGSARFLLEDN S+ S  IMQGYMTNK+LE A+KAFGKKCSKIS+IYSIGDSVQGFPLW
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW

Query:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY
        V+EISDKPGQEEA+PAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLR RNNANNVDLNRDFPDQFF INDDEY
Subjt:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY

Query:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG
         RQPETKAIMKWMR+IHFTASA+LHGGALVANYPWDGTADKRKDYYACPDD+TFRFMASVYSRSHHNMS SQEF+GGITNGA+WYPIYGGMQDWNYIHGG
Subjt:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG

Query:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        CFELTLEITDNKWPPAN      EYNKLSML LVASL QTGIHGRIFSS+SG PLP TITLKGIDY V+++
Subjt:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

A0A6J1JSZ4 carboxypeptidase SOL1 isoform X18.1e-18987.33Show/hide
Query:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW
        MKFL  FLLLS+ SPALFH ALARG  N SISPVDF S AYGGSARFLLEDN S+ S  IMQGYMTNK+LE A+KAFGKKCSKIS+IYSIGDSVQGFPLW
Subjt:  MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLW

Query:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY
        V+EISDKPGQEE++PAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVE+VHLHILPSMNPDGFSLR RNNANNVDLNRDFPDQFF INDDEY
Subjt:  VMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEY

Query:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG
         RQPETKAIMKWMR+IHFTASA+LHGGALVANYPWDGTADKRKDYYACPDD+TFRFMASVYSRSHHNMS SQEF+GGITNGA+WYPIYGGMQDWNYIHGG
Subjt:  DRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGG

Query:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        CFELTLEITDNKWPPAN      EYNKLSML LVASL QTGIHGRIFSS+SG PLP TITLKGIDY V+++
Subjt:  CFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

SwissProt top hitse value%identityAlignment
O75976 Carboxypeptidase D2.4e-6542.5Show/hide
Query:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPG-QEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN
        D+E  ++ F  +   I+R+YS+G SV+   L+VMEISD PG  E  +P FKYIGN+HG+E VGRELLL    ++C N+  DP  T +V +  +H++PSMN
Subjt:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPG-QEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN

Query:  PDGF---------SLRRRNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA
        PDG+         S+  RNN+NN DLNR+FPDQF  I D     QPET A+M WM+   F  SA+LHGG+LV NYP+D        Y   PDD  F+ +A
Subjt:  PDGF---------SLRRRNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA

Query:  SVYSRSH---------HNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS
          YS+ +          NM  ++ F  GITNGA+WY + GGMQDWNY+   CFE+T+E+   K+P         E N+ S+++ +   V  G+ G +  +
Subjt:  SVYSRSH---------HNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS

Query:  DSGTP-LPATITLKGIDYYV
          G   L ATI++  I++ V
Subjt:  DSGTP-LPATITLKGIDYYV

O89001 Carboxypeptidase D7.1e-6541.88Show/hide
Query:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPG-QEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN
        D+E  ++ F  +   I+R+YS+G SV+   L+VMEISD PG  E  +P FKYIGN+HG+E VGRELLL    ++C N+  DP  T +V S  +H++PSMN
Subjt:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPG-QEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN

Query:  PDGF---------SLRRRNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA
        PDG+         S+  RNN+NN DLNR+FPDQF  I +     QPET A+M W++   F  SA+LHGG+LV NYP+D        Y   PDD  F+ +A
Subjt:  PDGF---------SLRRRNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA

Query:  SVYSRSH---------HNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS
          YS+ +          +M  ++ F  GITNGA+WY + GGMQDWNY+   CFE+T+E+   K+P  N      E N+ S+++ +   V  G+ G +  +
Subjt:  SVYSRSH---------HNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS

Query:  DSGTP-LPATITLKGIDYYV
          G   L AT+++  I++ V
Subjt:  DSGTP-LPATITLKGIDYYV

P83852 Carboxypeptidase D (Fragment)4.9e-6643.12Show/hide
Query:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPGQEEA-KPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN
        D+E  ++ +  +   I+R+YS+G SV+   L+VMEISD PG  EA +P FKYIGN+HG+E VGRELLL    ++C N+  DP  T +V+S  +HI+PSMN
Subjt:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPGQEEA-KPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN

Query:  PDGFSLRR---------RNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA
        PDG+   +         RNN+NN DLNR+FPDQFF + D     QPET A+M W++   F  SA+LHGG+LV NYP+D        Y   PDD  F+ +A
Subjt:  PDGFSLRR---------RNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA

Query:  SVYSRSHHNMSF---------SQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS
          YS+ +  M           ++ F  GITNGA WY + GGMQDWNY++  CFE+T+E+   K+P A       E N+ S+L+ +   V  GI G +  +
Subjt:  SVYSRSHHNMSF---------SQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS

Query:  DSGTP-LPATITLKGIDYYV
          G   L ATI++  I++ V
Subjt:  DSGTP-LPATITLKGIDYYV

Q90240 Carboxypeptidase D4.9e-6643.12Show/hide
Query:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPGQEEA-KPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN
        D+E  ++ +  +   I+R+YS+G SV+   L+VMEISD PG  EA +P FKYIGN+HG+E VGRELLL    ++C N+  DP  T +V+S  +HI+PSMN
Subjt:  DLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPGQEEA-KPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMN

Query:  PDGFSLRR---------RNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA
        PDG+   +         RNN+NN DLNR+FPDQFF + D     QPET A+M W++   F  SA+LHGG+LV NYP+D        Y   PDD  F+ +A
Subjt:  PDGFSLRR---------RNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMA

Query:  SVYSRSHHNMSF---------SQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS
          YS+ +  M           ++ F  GITNGA WY + GGMQDWNY++  CFE+T+E+   K+P A       E N+ S+L+ +   V  GI G +  +
Subjt:  SVYSRSHHNMSF---------SQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPAN------EYNKLSMLKLVASLVQTGIHGRIFSS

Query:  DSGTP-LPATITLKGIDYYV
          G   L ATI++  I++ V
Subjt:  DSGTP-LPATITLKGIDYYV

Q9M9H7 Carboxypeptidase SOL16.9e-14567.02Show/hide
Query:  MKFLLFFLLLSLTSPALFHVAL--ARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFP
        M  L FF  L +++   F +    ARG  +  I P D  ++++ G  R L    +   S  + +GYMTN DLE+A+K F K+CSKISR+YSIG SV GFP
Subjt:  MKFLLFFLLLSLTSPALFHVAL--ARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFP

Query:  LWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDD
        LWV+EISD+PG+ EA+PAFKYIGNVHGDEPVGRELLL+ ANWICDNY KDPLA +IVE+VHLHI+PS+NPDGFS+R+RNNANNVDLNRDFPDQFF  NDD
Subjt:  LWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDD

Query:  EYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIH
           RQPETKAIM W+R+I FTASA+LHGGALVAN+PWDGT DKRK YYACPDDETFRF+A +YS+SH NMS S+EF+ GITNGA+WYPIYGGMQDWNYI+
Subjt:  EYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIH

Query:  GGCFELTLEITDNKWPPANE------YNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        GGCFELTLEI+DNKWP A+E      YN+ SML LVASLV+TG+HGRIFS D G PLP  + +KGI+Y V+++
Subjt:  GGCFELTLEITDNKWPPANE------YNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

Arabidopsis top hitse value%identityAlignment
AT1G71696.1 carboxypeptidase D, putative2.5e-13475Show/hide
Query:  ISRIYSIGDSVQGFPLWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVD
        + R +SIG SV GFPLWV+EISD+PG+ EA+PAFKYIGNVHGDEPVGRELLL+ ANWICDNY KDPLA +IVE+VHLHI+PS+NPDGFS+R+RNNANNVD
Subjt:  ISRIYSIGDSVQGFPLWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVD

Query:  LNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAA
        LNRDFPDQFF  NDD   RQPETKAIM W+R+I FTASA+LHGGALVAN+PWDGT DKRK YYACPDDETFRF+A +YS+SH NMS S+EF+ GITNGA+
Subjt:  LNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAA

Query:  WYPIYGGMQDWNYIHGGCFELTLEITDNKWPPANE------YNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        WYPIYGGMQDWNYI+GGCFELTLEI+DNKWP A+E      YN+ SML LVASLV+TG+HGRIFS D G PLP  + +KGI+Y V+++
Subjt:  WYPIYGGMQDWNYIHGGCFELTLEITDNKWPPANE------YNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN

AT1G71696.2 carboxypeptidase D, putative4.9e-14667.02Show/hide
Query:  MKFLLFFLLLSLTSPALFHVAL--ARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFP
        M  L FF  L +++   F +    ARG  +  I P D  ++++ G  R L    +   S  + +GYMTN DLE+A+K F K+CSKISR+YSIG SV GFP
Subjt:  MKFLLFFLLLSLTSPALFHVAL--ARGASNISISPVDFDSHAYGGSARFLLEDNKSQGS--IMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFP

Query:  LWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDD
        LWV+EISD+PG+ EA+PAFKYIGNVHGDEPVGRELLL+ ANWICDNY KDPLA +IVE+VHLHI+PS+NPDGFS+R+RNNANNVDLNRDFPDQFF  NDD
Subjt:  LWVMEISDKPGQEEAKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDD

Query:  EYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIH
           RQPETKAIM W+R+I FTASA+LHGGALVAN+PWDGT DKRK YYACPDDETFRF+A +YS+SH NMS S+EF+ GITNGA+WYPIYGGMQDWNYI+
Subjt:  EYDRQPETKAIMKWMREIHFTASASLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIH

Query:  GGCFELTLEITDNKWPPANE------YNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN
        GGCFELTLEI+DNKWP A+E      YN+ SML LVASLV+TG+HGRIFS D G PLP  + +KGI+Y V+++
Subjt:  GGCFELTLEITDNKWPPANE------YNKLSMLKLVASLVQTGIHGRIFSSDSGTPLPATITLKGIDYYVRSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTTCTCCTATTCTTCTTACTTCTCTCTCTTACTTCTCCAGCTTTATTTCACGTTGCACTGGCTCGAGGCGCCTCCAACATCTCCATTTCGCCAGTGGATTTCGA
TAGTCATGCATATGGCGGTTCTGCTAGATTTTTACTCGAGGACAATAAATCTCAAGGAAGCATCATGCAAGGTTATATGACCAATAAAGACCTTGAGGAGGCGATTAAAG
CTTTTGGTAAAAAATGTAGTAAAATTTCTAGGATATATAGTATTGGAGACAGCGTGCAAGGGTTTCCACTGTGGGTAATGGAAATCTCTGACAAGCCAGGCCAAGAAGAA
GCCAAACCTGCATTTAAGTACATAGGAAATGTACATGGAGATGAACCTGTTGGCAGGGAGCTTCTTTTGCAATTTGCTAACTGGATATGTGATAATTACCTGAAGGATCC
GTTGGCTACACTGATTGTGGAGAGTGTTCATCTTCATATACTTCCATCCATGAATCCTGATGGGTTTTCTCTTAGGAGACGCAATAATGCAAACAATGTTGATCTAAACA
GAGATTTCCCCGATCAGTTCTTTGTCATCAATGATGATGAATATGATCGACAACCTGAAACAAAAGCAATTATGAAGTGGATGAGAGAGATACATTTCACTGCCTCTGCC
AGTTTGCATGGGGGTGCACTTGTTGCAAATTACCCATGGGACGGCACTGCAGATAAAAGGAAAGATTACTATGCATGTCCTGATGATGAAACATTCCGATTCATGGCTAG
TGTCTACAGTCGCTCACATCATAACATGTCTTTTAGCCAAGAATTTCAAGGAGGAATTACTAATGGAGCTGCATGGTACCCTATATATGGTGGCATGCAGGACTGGAACT
ATATACATGGTGGCTGTTTTGAGTTGACTTTAGAGATTACAGACAACAAATGGCCTCCTGCTAATGAGTATAACAAGTTGAGCATGCTAAAGCTTGTTGCAAGCCTTGTT
CAGACAGGAATTCATGGAAGAATTTTTTCATCAGATAGTGGGACACCGCTACCTGCAACCATTACACTCAAGGGAATAGATTACTACGTACGTTCTAATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTTCTCCTATTCTTCTTACTTCTCTCTCTTACTTCTCCAGCTTTATTTCACGTTGCACTGGCTCGAGGCGCCTCCAACATCTCCATTTCGCCAGTGGATTTCGA
TAGTCATGCATATGGCGGTTCTGCTAGATTTTTACTCGAGGACAATAAATCTCAAGGAAGCATCATGCAAGGTTATATGACCAATAAAGACCTTGAGGAGGCGATTAAAG
CTTTTGGTAAAAAATGTAGTAAAATTTCTAGGATATATAGTATTGGAGACAGCGTGCAAGGGTTTCCACTGTGGGTAATGGAAATCTCTGACAAGCCAGGCCAAGAAGAA
GCCAAACCTGCATTTAAGTACATAGGAAATGTACATGGAGATGAACCTGTTGGCAGGGAGCTTCTTTTGCAATTTGCTAACTGGATATGTGATAATTACCTGAAGGATCC
GTTGGCTACACTGATTGTGGAGAGTGTTCATCTTCATATACTTCCATCCATGAATCCTGATGGGTTTTCTCTTAGGAGACGCAATAATGCAAACAATGTTGATCTAAACA
GAGATTTCCCCGATCAGTTCTTTGTCATCAATGATGATGAATATGATCGACAACCTGAAACAAAAGCAATTATGAAGTGGATGAGAGAGATACATTTCACTGCCTCTGCC
AGTTTGCATGGGGGTGCACTTGTTGCAAATTACCCATGGGACGGCACTGCAGATAAAAGGAAAGATTACTATGCATGTCCTGATGATGAAACATTCCGATTCATGGCTAG
TGTCTACAGTCGCTCACATCATAACATGTCTTTTAGCCAAGAATTTCAAGGAGGAATTACTAATGGAGCTGCATGGTACCCTATATATGGTGGCATGCAGGACTGGAACT
ATATACATGGTGGCTGTTTTGAGTTGACTTTAGAGATTACAGACAACAAATGGCCTCCTGCTAATGAGTATAACAAGTTGAGCATGCTAAAGCTTGTTGCAAGCCTTGTT
CAGACAGGAATTCATGGAAGAATTTTTTCATCAGATAGTGGGACACCGCTACCTGCAACCATTACACTCAAGGGAATAGATTACTACGTACGTTCTAATCTTTAAGTTCC
CTCTCTTTGTTTTATTATGTTATTAATTATTTGGGGATCCCTTTTCATTCATTTTGTATCTTGTAGTTCAGCTTGTAATACCTGACACACTCTTTCCAAAAATATGCATT
GTATGTGTATTGCATTTAATCATGAATTACAAATGAAAAAAGAAAAAACTTGATTGCCTTTGAATCTCACATCACATGAATCGCATTTTGGAATCATCATAACTCAATTA
GATTTAGACCACATAAGTATTTTTCTCCGTACGAGAAACAATGAGAGAAAAGGATCACGTTGAGGAGGTGGAGGAAATTTCATTTCTTTTTTATTTGGGAAATTATGGAG
TTATTTCAAGTGAGAATAGTGAGAGTTTGGCTTTGCCTCTTTCAGAACAAAATCGGGTAGATTCTTTGGTAATCCTACACCTATACAGTAGAGGAATATAGAGGGGATTA
TAGAATGTGGTATTGATGAGAATGAGAGAGAAACCGAATGGGTTGACTCACAACCTAGTGGGAGGCCTCCTGATTTCCAAGGAGCTCTTCGTGACTCTTTGGGAAGAAGA
AAAGTTGTATACTGGTAGGAGGTGGATGGTAGGATTGAGTTTTGAATGATATTAAAAACATAAGGTGAGAGCAAAGAGTTGGTCGAGGAGATAGTTATAGTGGTAGAGAT
GGTATATAACAAGTGAGAAGATGATTGAATTTTGATGGTCATTGTTCAGGCAAAAAGGGACAGTCGAGAGCGGAGGATAGTGAAGGGTAAAGGAAAATATTTAGTACCAT
GGCAGGTCTTGTAGAAAACAATTAAAAATATCATGCTTATAATATGTTTATTCTTTCTACTGCTTGTTGGGAATGTCATATGGAAAAAACTACTCTATAAAACATAGTAA
AAATATATTAACATAGGTATTTATTGGATGTTATTATCTTATCATAACGAAACCGTGTGGAGAACTCAATCCATCTGAATTGGTTGATGATTTTGTGTGGAAACTTGATC
GATTGCCATTTTTATTCCTTTTCTTAATAAGAAATAATTTCATTGATGCATGAAATTACTGAAGAGCAGTGTTTTTAAATGCCCAAGGCGCACTAAGGCGCAATGGCCCT
CTGGAGCCTAGGCGCAAGGTGCACAAAAAGGCGCGGGCTTTCTATGTGAGGCACACTACAAATAAAAATATTACATTAAAAAGAGAAAGCACAGTGGAAGTGAAAATATG
GAAGAAAAGACTTCAAACATAAGAATTTTTGCATTTAGGCTTAGTAAACTTGATTCTTCTAGTAAATGATTAAGGAAAATTTTTGTGAGGCGCTAGACCTGCTCCTTGAG
CCTAGGCGCACCCCCGAGAGGGCTTTTTAAAACACTGCTAAAGAGGGGAGAGTTACCCTGAAAGAAATTATAGAAGAGAGCAATGAAAATAATGTGTTCTAAGAAAACAA
AATAAGAACCAATGAGGTTTAACCATAAGAGTCTCTCCCTCCCTTTAAAAGGATTACCCTCAAGAGATAAAAGGCAAGGGATGTCGATAGGTAGAACCATGAGGCACGAA
CCACGGAAATAAGAGATGATGTGGCTCCAAAATCTATTTGAATATATGCAGTGGATAAATACGTGGTAGCATAACTGCCATCTGATGAAGGTGACTCTTGCATTTTTATC
TCTTCAATTGTTCTTATTATACAACTTCGCCTCCGGAATAAGAACTTATTAGATCCTCTTTGACGGTTTTTTTTTTTTTTTGTGCTATTTGACATTGCAATTGTAATTTT
TAAATCTCGTTTTTTAAAATCGTAATCATACAATAGAAACAGGGAGACGGTGGGAGGGGAAAAACAATGGAGAGAATTCTCACTCCTTGGTGAACAATCTACCTGATAAA
TATTCTTAAATTCAATCGACAGGTTAAAGCTAGTCAAAAGTTTGCCAACTACCATCGGTTAGCCGCACCAAGACAGAAATATGAAGTAACTGCGTCAATGCCCGGGTACA
AGTCGAAGAATACAAGCATCTGGCTGGAAGAAGGAGCGATGTCTGTGGACTTTGTTCTTGATCCAGACACAACAGCCAAAGGAAAAGTAATACAAAACTGTGATTGCAAC
TGTGGAAATAGGCTGGACTTTGTGGGCTATATTTGGGGACACTACTTTGAGGCGTATATTTTCTTGGCGGTTGTTTTGGTATTTATTTGCTTTTTATTTCAGAGGAGAAT
GAAATCTAGACTTTCCAAGCAAAGATTGGTTGCATTACCTAAGAGGACTGTGGTCTAAAACCCCCTTTTTCTTCTCTTCCTATTTTATTTCCAGTTACTTTTCATTACTG
GAGAAATGATATCACATGTTTGTAGCGTAACAGTTTTTGAGGTACTTTCCCTAAGGGAAAAAAGAAAAAGAAGGAAGAAAGGATCTTTGGGTAATGTTGTCAAAATTTCT
GAGTCGGAATAAATTTTGATTCTGTACCATTCTAATTTTAGTCCTATTCTCCTTGCTCTCTGT
Protein sequenceShow/hide protein sequence
MKFLLFFLLLSLTSPALFHVALARGASNISISPVDFDSHAYGGSARFLLEDNKSQGSIMQGYMTNKDLEEAIKAFGKKCSKISRIYSIGDSVQGFPLWVMEISDKPGQEE
AKPAFKYIGNVHGDEPVGRELLLQFANWICDNYLKDPLATLIVESVHLHILPSMNPDGFSLRRRNNANNVDLNRDFPDQFFVINDDEYDRQPETKAIMKWMREIHFTASA
SLHGGALVANYPWDGTADKRKDYYACPDDETFRFMASVYSRSHHNMSFSQEFQGGITNGAAWYPIYGGMQDWNYIHGGCFELTLEITDNKWPPANEYNKLSMLKLVASLV
QTGIHGRIFSSDSGTPLPATITLKGIDYYVRSNL