; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029317 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029317
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionglyoxysomal processing protease, glyoxysomal
Genome locationtig00153293:1176283..1188342
RNA-Seq ExpressionSgr029317
SyntenySgr029317
Gene Ontology termsGO:0016485 - protein processing (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0016020 - membrane (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR009003 - Peptidase S1, PA clan
IPR039245 - Peroxisomal/glyoxysomal leader peptide-processing protease
IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604457.1 Glyoxysomal processing protease, glyoxysomal, partial [Cucurbita argyrosperma subsp. sororia]8.1e-28672.7Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFMPLQHR+TIH         KG      KPELIPGVQIDIMVE  S ME+D +V     PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPT+A+AL+ V+DASLDS++QRWEVGWSLASY NGSPSF D+L  Q                                        DMPNI 
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        +SPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+NCYPP+S SKSLL+ADMRCLPGMEGCPVFDEHA LIGVLIRPL HYM+GAEIQLL+PW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK
        CS LLL AY  GERI NDNGC SAVGNEAM KE K EG               PS +                    V     GL+LTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
         NVSG RSIENA+L Q++TE    SMHNG FG KKSG+LTQNASKNANILLQ+Q++  KL+FANYGRRNLRVRLNHAEPW+WCDAKV+YICKGPWDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLEQIPEQLS IIMD SWPS+GS I+VIGHGLLGPKSGFSPSV SGVVANVVK KIP SYHQG SLEYFPAMLETTAAVHPG SGGAVVNSEGHMIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRGAIIPHLNFS+PCAALEPIH F +DM+DLSVLKVLDEPDEQLSSIWALM QRSPKPSP PDLPQL G +HETKGKGSRFAKFIAERREVFRK 
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEGLPSEIIRSKV
        TLHN+ E LPS +IRSK+
Subjt:  TLHNKVEGLPSEIIRSKV

XP_022142190.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Momordica charantia]2.5e-29574.9Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPE L DTE A+HLGNHKDQFA+LVLT SSIFEPFMP QHRD I Q                KPELIPGVQIDIMVED S ME+DFEVRN G PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPTSATALQSV+DASLDSI+QRWEVGWSLASYTNG PSF DAL RQ                                        D+PNI+
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIAN YPP S +KSLLMADMRCLPGMEGCPVFDEHAR+IGVLIRPL HYM+GAEIQLLVPW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG--------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGKT
        CS+LL  AYY GE IGNDNGCT+ VGNEAMTKEQK EG              P+ V                    V     GLILTNAHLIEPWRFGKT
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG--------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGKT

Query:  NVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQ
        N S  RSIENAQL QTHTE    SMHNGVFG K SGSL QNAS+NANIL+QDQL+D K SFANYGRRNLRVRLNHA+ W+WCDAKVIYIC+GPWDVALLQ
Subjt:  NVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQ

Query:  LEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTS
        LEQIPEQLSPI MDCS PSSGS IYVIGHGLLGPKSGFSPSV SGVVANVVK KIPSS+HQG SLEYFPA+LETTAAVHPGGSGGAVVNSEGHM+GLVTS
Subjt:  LEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTS

Query:  NARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPT
        NARHGRG+IIPHLNFS+PCAALEPI+RFSKDMEDLSVLKVLDEPDEQLSS+WALMPQRSPK    PDLPQL GE+HETKGKGSRFAKFIAERREVF+KPT
Subjt:  NARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPT

Query:  LHNKVEGLPSEIIRSKV
        +HNK EGLPS+ +RSK+
Subjt:  LHNKVEGLPSEIIRSKV

XP_038881508.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Benincasa hispida]5.4e-29073.99Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFM LQHRDTIH         KG      KPELIPGVQIDIMVE  S ME+D +V      HWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPTSA ALQSV+DASLDS++QRWEVGWSLASYTNGSP F D+   Q                                        DMPNI+
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        ISPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+NCYPPSS  KSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPL HYM+GAEIQLL+PW AI TA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK
        CS LLL AY  GERIGNDNGC S VGNEAM KEQK +G               P  V                    V     GLILTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
        TNVSG RSIENA+L Q HTE    SMH+GVFG KKSG +TQNASKNAN    DQL+D KLSFANYG RNLRVRLNHAEPW+WCDAKV+YICKGPWDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLEQ+PEQLSPIIMDCSWPSSGS I+VIGHGLLGPKSGFSPSV SGVVANVVK KIPSSYHQG SLEYFPAMLETTAAVHPGGSGGAVVNS+G MIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRGAIIPHLNFS+PCAALEPIHRFSKDMEDLSV+KVLDEPDEQLSSIWALM QRSPKPSP PDLPQLLGE+HETKGKGSRFAKFIAE+REV RKP
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEG-LPSEIIRSKV
        TLHN+ E  LPS+IIRSK+
Subjt:  TLHNKVEG-LPSEIIRSKV

XP_038881509.1 glyoxysomal processing protease, glyoxysomal isoform X2 [Benincasa hispida]2.1e-28673.57Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFM LQHRDTIH         KG      KPELIPGVQIDIMVE  S ME+D +V      HWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPTSA ALQSV+DASLDS++QRWEVGWSLASYTNGSP F D+   Q                                        DMPNI+
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        ISPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+NCYPPSS  KSLLMADMRCLP   GCPVFDE ARLIGVLIRPL HYM+GAEIQLL+PW AI TA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK
        CS LLL AY  GERIGNDNGC S VGNEAM KEQK +G               P  V                    V     GLILTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
        TNVSG RSIENA+L Q HTE    SMH+GVFG KKSG +TQNASKNAN    DQL+D KLSFANYG RNLRVRLNHAEPW+WCDAKV+YICKGPWDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLEQ+PEQLSPIIMDCSWPSSGS I+VIGHGLLGPKSGFSPSV SGVVANVVK KIPSSYHQG SLEYFPAMLETTAAVHPGGSGGAVVNS+G MIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRGAIIPHLNFS+PCAALEPIHRFSKDMEDLSV+KVLDEPDEQLSSIWALM QRSPKPSP PDLPQLLGE+HETKGKGSRFAKFIAE+REV RKP
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEG-LPSEIIRSKV
        TLHN+ E  LPS+IIRSK+
Subjt:  TLHNKVEG-LPSEIIRSKV

XP_038881510.1 glyoxysomal processing protease, glyoxysomal isoform X3 [Benincasa hispida]2.2e-29175.25Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFM LQHRDTIH         KG      KPELIPGVQIDIMVE  S ME+D +V      HWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------DMPNINISPSRQRGSFLL
        AHLLALYDIPTSA ALQSV+DASLDS++QRWEVGWSLASYTNGSP F D+   Q                            DMPNI+ISPSRQRGSFLL
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------DMPNINISPSRQRGSFLL

Query:  AVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVG
        AVGSPFGVLSPVHF NSISVGSI+NCYPPSS  KSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPL HYM+GAEIQLL+PW AI TACS LLL AY  G
Subjt:  AVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVG

Query:  ERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENA
        ERIGNDNGC S VGNEAM KEQK +G               P  V                    V     GLILTNAHLIEPWRFGKTNVSG RSIENA
Subjt:  ERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENA

Query:  QLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPI
        +L Q HTE    SMH+GVFG KKSG +TQNASKNAN    DQL+D KLSFANYG RNLRVRLNHAEPW+WCDAKV+YICKGPWDVALLQLEQ+PEQLSPI
Subjt:  QLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPI

Query:  IMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIP
        IMDCSWPSSGS I+VIGHGLLGPKSGFSPSV SGVVANVVK KIPSSYHQG SLEYFPAMLETTAAVHPGGSGGAVVNS+G MIGLVTSNARHGRGAIIP
Subjt:  IMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIP

Query:  HLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEG-LPS
        HLNFS+PCAALEPIHRFSKDMEDLSV+KVLDEPDEQLSSIWALM QRSPKPSP PDLPQLLGE+HETKGKGSRFAKFIAE+REV RKPTLHN+ E  LPS
Subjt:  HLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEG-LPS

Query:  EIIRSKV
        +IIRSK+
Subjt:  EIIRSKV

TrEMBL top hitse value%identityAlignment
A0A0A0KHN7 Uncharacterized protein1.4e-27870.93Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFMPLQHRD IH         KG      KPELIPGVQIDIMVE    + +D +V     PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPTSATALQSV+DAS+DS++QRWEVGWSLASYTNGSPSF D+L  Q                                        DMPNI+
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        ISPSRQRGSFLLAVGSPFGVLSPVHF NS+SVGSI+NCYPPSSLSKSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPL HYM+GAEIQLL+PW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG-------------PSVVVFKKIL----------------------GLILTNAHLIEPWRFGK
        CS LLL    VGERI NDN C  AVGN A+ KEQK EG             P     +K +                      GLILTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG-------------PSVVVFKKIL----------------------GLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
        TNV G +SIENA+L Q+HTE    SM+N VFG ++ G++  NASKN NILL +QL+D KLSF NYGRRNL VRL+HAEPW+WCDAK++YICKG WDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLEQIPEQLSPI MDCS P+SGS I+VIGHGLLGPKSG SPSV SGVV+NVVK KIPSSYH+G SLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRG IIPHLNFS+PCAALEPIHRFSKDMEDLSV+KVLDEP+EQLSSIWALM QRSPKPSP P LPQLLGE+HE+KGKGSRFAKFIAE+REV RKP
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEG-LPSEIIRSKV
        TLHN+ E  LPS+I+RSK+
Subjt:  TLHNKVEG-LPSEIIRSKV

A0A1S3AZ98 glyoxysomal processing protease, glyoxysomal2.1e-27971.91Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL D+   +HLGN+KDQFATLVLTVSSIFEPFMPLQHRDTIH         KG      KPELIPGVQIDIMVE    + +D +V     PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPTSATALQSV+DASLDS++QRWEVGWSLASYTNGSPSF D+L  Q                                        DMPNIN
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        ISPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+NCYPPSSLSKSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPL HYM+GAEIQLL+PW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQK-SEGPSVV----------------------------------VFKKILGLILTNAHLIEPWRFGK
         S LLL     GERI NDNGC SAVGN A+ KEQK  EG S +                                  V     GLILTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQK-SEGPSVV----------------------------------VFKKILGLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
        TNVSG +SIEN++L Q+ TE    SM+NGVFG +KSG++  NASKN NILL +QL+D KLSFANYGRRNLRVRL+HAEPW+WCDAK++YICKGPWDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLE+IPEQLSPIIMDCS PSSGS I+VIGHGLLGPKSG SPSV SGVV+NVVK KIPSSYH+G SLEY PAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRG IIPHLNFS+PCAALEPIHRFSKDMEDLSV+KVLDEP+EQLSSIWALM QRSPKPSP PDLP+LLGE+H +KGKGSRFAKFIAERREV RKP
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEG-LPSEIIRSKV
        TLHN+ E  LPS+I RSK+
Subjt:  TLHNKVEG-LPSEIIRSKV

A0A6J1CK76 glyoxysomal processing protease, glyoxysomal isoform X11.2e-29574.9Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPE L DTE A+HLGNHKDQFA+LVLT SSIFEPFMP QHRD I Q                KPELIPGVQIDIMVED S ME+DFEVRN G PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPTSATALQSV+DASLDSI+QRWEVGWSLASYTNG PSF DAL RQ                                        D+PNI+
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIAN YPP S +KSLLMADMRCLPGMEGCPVFDEHAR+IGVLIRPL HYM+GAEIQLLVPW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG--------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGKT
        CS+LL  AYY GE IGNDNGCT+ VGNEAMTKEQK EG              P+ V                    V     GLILTNAHLIEPWRFGKT
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG--------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGKT

Query:  NVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQ
        N S  RSIENAQL QTHTE    SMHNGVFG K SGSL QNAS+NANIL+QDQL+D K SFANYGRRNLRVRLNHA+ W+WCDAKVIYIC+GPWDVALLQ
Subjt:  NVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQ

Query:  LEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTS
        LEQIPEQLSPI MDCS PSSGS IYVIGHGLLGPKSGFSPSV SGVVANVVK KIPSS+HQG SLEYFPA+LETTAAVHPGGSGGAVVNSEGHM+GLVTS
Subjt:  LEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTS

Query:  NARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPT
        NARHGRG+IIPHLNFS+PCAALEPI+RFSKDMEDLSVLKVLDEPDEQLSS+WALMPQRSPK    PDLPQL GE+HETKGKGSRFAKFIAERREVF+KPT
Subjt:  NARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPT

Query:  LHNKVEGLPSEIIRSKV
        +HNK EGLPS+ +RSK+
Subjt:  LHNKVEGLPSEIIRSKV

A0A6J1EJB5 glyoxysomal processing protease, glyoxysomal isoform X15.7e-28572.56Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFMPLQHR+TIH         KG      KPELIPGVQIDIMVE  S ME+D +V     PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPT+A+AL+ V+DASLDS++QRWEVGWSLASY NGSPSF D+L  Q                                        DMPNI 
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        +SPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+NCYPP+S SKSLL+ADMRCLPGMEGCPVFDEHA LIGVLIRPL HYM+GAEIQLL+PW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK
        CS LLL AY  G+RI NDNGC SAVGNEAM KE K EG               PS +                    V     GL+LTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
         NVSG RSIENA+L Q++TE    SMHNG FG KKSG+LTQNASKNANILLQ+Q++  KL+FANYGRRNLRVRLNHAE W+WCDAKV+YICKGPWDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLEQIPEQLS IIMD SWPS+GS I+VIGHGLLGPKSGFSPSV SGVVANVVK KIP SYHQG SLEYFPAMLETTAAVHPG SGGAVVNSEGHMIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRGAIIPHLNFS+PCAALEPIH F +DM+DLSVLKVLDEPDEQLSSIWALM QRSPKPSP PDLPQL G +HETKGKGSRFAKFIAERREVFRK 
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEGLPSEIIRSKV
        TLHNK E LPS +IRSK+
Subjt:  TLHNKVEGLPSEIIRSKV

A0A6J1INF4 glyoxysomal processing protease, glyoxysomal isoform X14.5e-28271.73Show/hide
Query:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA
        MILPETL DT  A+HLGN+KDQFATLVLTVSSIFEPFMPLQHR+TIH         KG      KPELIPGVQIDIMVE  S ME+D +V     PHWHA
Subjt:  MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHA

Query:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN
        AHLLALYDIPT+  AL+ V+DASLDS++QRWEVGWSLASY NGSPSF D+L  Q                                        D+PNI 
Subjt:  AHLLALYDIPTSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQ----------------------------------------DMPNIN

Query:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA
        +SPSRQRGSFLLAVGSPFGVLSP+HF NSISVGSI+NCYPP+S SKSLL+ADMRCLPGMEGCPVFDEHA L+GVLIRPL HYM+GAEIQLL+PW AIATA
Subjt:  ISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATA

Query:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK
        CS LLL AY  GERI NDNGC +AVGNEAM KE K EG               PS +                    V     GL+LTNAHLIEPWRFGK
Subjt:  CSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEG---------------PSVV--------------------VFKKILGLILTNAHLIEPWRFGK

Query:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL
         NVSG RSIENA+L Q++TE    SMHNGVFG KKSG+LTQNASKNANILLQ+Q++  KL+FANYGRRNLRVRLNHAEPW WCDAKV+YICKGPWDVALL
Subjt:  TNVSGGRSIENAQLWQTHTE----SMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALL

Query:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT
        QLEQIPEQLS IIMD SWPS+GS I+VIGHGLLGPKSGFSPSV SGVVANVVK KIP SYHQG SLEYFPAMLETTAAVHPG SGGAVVNSEGHMIGLVT
Subjt:  QLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVT

Query:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP
        SNARHGRGAIIPHLNFS+PCAALEPIHRF +D +DLSV+K LDEPDEQLSSIWALM QRSPKPSP PDLPQL G +HETKGKGSRFAKFIAERREVFRK 
Subjt:  SNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKP

Query:  TLHNKVEGLPSEIIRSKV
        TLH++ E LPS +IRSK+
Subjt:  TLHNKVEGLPSEIIRSKV

SwissProt top hitse value%identityAlignment
Q2FI55 Serine protease HtrA-like1.7e-0426.4Show/hide
Query:  GSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCA
        G  I V+G+ L      F  +V+ G+++  +   +P  + +    +      +  A+V+PG SGGAVVN EG +IG+V +         + +++F++   
Subjt:  GSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCA

Query:  ALEPIHRFSKDMEDLSVLKVLDEPD
           P++   K ++DL     +D PD
Subjt:  ALEPIHRFSKDMEDLSVLKVLDEPD

Q2T9J0 Peroxisomal leader peptide-processing protease4.8e-2327.38Show/hide
Query:  PNINISP--SRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPW
        P + +SP  +  +G+ LL  GSPFG   P  F N++S G ++N   P      LL+ D RCLPG EG  VF           RP     +GA + L+V  
Subjt:  PNINISP--SRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPW

Query:  EAIATACSNLLLAAYYVGERIGNDNGCTSA----VGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTESMHNG
           A  C       +  GE +G    C +A       +A+ +   S      +    +G+         PW        G    ++  LW      +  G
Subjt:  EAIATACSNLLLAAYYVGERIGNDNGCTSA----VGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTESMHNG

Query:  -VFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEP---WVWCDAKVIYICKG--PWDVALLQLEQIPEQLSPIIMDCSWPSSGS
         V+GS               +  +  +  + +S     R   RV +    P    +W   +V++  +   P+D+A++ LE+  + + PI +       G 
Subjt:  -VFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEP---WVWCDAKVIYICKG--PWDVALLQLEQIPEQLSPIIMDCSWPSSGS

Query:  MIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVV-NSEGHMIGLVTSNAR-HGRGAIIPHLNFSVPCA
         + V+G G+ G   G  PSV+SG+++ VV+            +   P ML+TT AVH G SGG +  N  G+++G++TSN R +  GA  PHLNFS+P  
Subjt:  MIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVV-NSEGHMIGLVTSNAR-HGRGAIIPHLNFSVPCA

Query:  ALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP
         L+P  +     +DL  L+ LD   E +  +W L    +  P
Subjt:  ALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP

Q8VZD4 Glyoxysomal processing protease, glyoxysomal3.7e-14845.39Show/hide
Query:  QFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKP-ELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHAAHLLALYDIPTSATALQSVI
        Q   LVLTV+S+ EPF+ L HR +              ++    P +LIPG  I+IMVE +   EK+       AP W  A LL+L D+P S+ ALQS+I
Subjt:  QFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKP-ELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHAAHLLALYDIPTSATALQSVI

Query:  DASLDSINQRWEVGWSLASYTNGS-PS-------------------------------FGDALHRQDMPNINISPSRQRGSFLLAVGSPFGVLSPVHFFN
        +AS  S +  W++GWSL S  NGS PS                                G  L     P++N + S  +G  L+A+GSPFG+LSPV+FFN
Subjt:  DASLDSINQRWEVGWSLASYTNGS-PS-------------------------------FGDALHRQDMPNINISPSRQRGSFLLAVGSPFGVLSPVHFFN

Query:  SISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVGERIGNDNGCTSAVGNE
        S+S GSIAN YP  SL KSL++AD+RCLPGMEG PVF ++  LIG+LIRPLR   SG EIQL+VPW AI TACS+LLL    V        G  S  G+E
Subjt:  SISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVGERIGNDNGCTSAVGNE

Query:  AMTKEQKSEGPSVVVFKKIL----------------------GLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTE--SMHNGVFGSKKSGSLTQNA
         ++ +  +  P+ V  +K +                      GLILTNAHL+EPWR+GK  V G    E  + +    E  S     F  +KS +L + A
Subjt:  AMTKEQKSEGPSVVVFKKIL----------------------GLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTE--SMHNGVFGSKKSGSLTQNA

Query:  SKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSV
         +N    + + +++ K +F   G R++RVRL H + W WC A V+YICK   D+ALLQLE +P +L PI  + S P  G+  +V+GHGL GP+ G SPS+
Subjt:  SKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSV

Query:  SSGVVANVVKTKIP-SSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVL
         SGVVA VV  K   ++      +  FPAMLETTAAVHPGGSGGAV+NS GHMIGLVTSNARHG G +IPHLNFS+PCA L PI +F++DM++ ++L+ L
Subjt:  SSGVVANVVKTKIP-SSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVL

Query:  DEPDEQLSSIWALMPQRSPKPSPS-PDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEGLPSEI
        D+P E+LSSIWALMP  SPK   S P+LP+LL + +  + KGS+FAKFIAE +++F KPT  ++ + +PS++
Subjt:  DEPDEQLSSIWALMPQRSPKPSPS-PDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEGLPSEI

Q9DBA6 Peroxisomal leader peptide-processing protease5.3e-2227.47Show/hide
Query:  GDALHRQDMPNINISP--SRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSG
        G A   +  P + ++P  +  +G+ LLA GSPFG   P  F N++S G ++N   P      LL+ D RCLPG EG  VF   AR  G L+         
Subjt:  GDALHRQDMPNINISP--SRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSG

Query:  AEIQLLVPWEA-------IATACSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENAQ
        A +   + W+A       +  A + LL  A +   R+   +   S +             P V   +   GL L +  L  PW       +    +E   
Subjt:  AEIQLLVPWEA-------IATACSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENAQ

Query:  LWQTHTESMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEP---WVWCDAKVIYICK--GPWDVALLQLEQIPEQLS--
        +W        +GV  + +     ++ +                      R   RV ++ A P    +W   +V++  +   P+D+A++ LE   E+L+  
Subjt:  LWQTHTESMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEP---WVWCDAKVIYICK--GPWDVALLQLEQIPEQLS--

Query:  PIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNS-EGHMIGLVTSNAR-HGRG
        P  +       G  + V+G G+ G   G  PSV+SG+++ VV+            ++  P ML+TT AVH G SGG + +S  G ++G+V SN R +  G
Subjt:  PIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNS-EGHMIGLVTSNAR-HGRG

Query:  AIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP
        A  PHLNFS+P   L+P  +      DL  L+ LD   E +  +W L    S  P
Subjt:  AIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP

Q9Z4H7 Serine protease Do-like HtrA3.5e-0520.21Show/hide
Query:  AYYVGERIGNDNG------CTSAVGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTESMHNGVFGSKKSGSLT
        +YY  +++ N  G        S+  ++   K  K+ G     +  + G +++  +L       + + S G            T+S++N +FG     S +
Subjt:  AYYVGERIGNDNG------CTSAVGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTESMHNGVFGSKKSGSLT

Query:  QNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKG------------PWDVALLQLEQIPEQLSPIIMDCSWPSSGSMIYVI
        +N        L+   +   + +     +   V  NH       DA  + +  G              D+A+L ++      +    D     +G  +  +
Subjt:  QNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKG------------PWDVALLQLEQIPEQLSPIIMDCSWPSSGSMIYVI

Query:  GHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSN-ARHGRGAIIPHLNFSVP
        G  L    S ++ +V+ G+++   +T   SS +Q         +++T AA++PG SGGA+VNS G +IG+ +   A+   G  +  + F++P
Subjt:  GHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSN-ARHGRGAIIPHLNFSVP

Arabidopsis top hitse value%identityAlignment
AT1G28320.1 protease-related2.6e-14945.39Show/hide
Query:  QFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKP-ELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHAAHLLALYDIPTSATALQSVI
        Q   LVLTV+S+ EPF+ L HR +              ++    P +LIPG  I+IMVE +   EK+       AP W  A LL+L D+P S+ ALQS+I
Subjt:  QFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKP-ELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHAAHLLALYDIPTSATALQSVI

Query:  DASLDSINQRWEVGWSLASYTNGS-PS-------------------------------FGDALHRQDMPNINISPSRQRGSFLLAVGSPFGVLSPVHFFN
        +AS  S +  W++GWSL S  NGS PS                                G  L     P++N + S  +G  L+A+GSPFG+LSPV+FFN
Subjt:  DASLDSINQRWEVGWSLASYTNGS-PS-------------------------------FGDALHRQDMPNINISPSRQRGSFLLAVGSPFGVLSPVHFFN

Query:  SISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVGERIGNDNGCTSAVGNE
        S+S GSIAN YP  SL KSL++AD+RCLPGMEG PVF ++  LIG+LIRPLR   SG EIQL+VPW AI TACS+LLL    V        G  S  G+E
Subjt:  SISVGSIANCYPPSSLSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVGERIGNDNGCTSAVGNE

Query:  AMTKEQKSEGPSVVVFKKIL----------------------GLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTE--SMHNGVFGSKKSGSLTQNA
         ++ +  +  P+ V  +K +                      GLILTNAHL+EPWR+GK  V G    E  + +    E  S     F  +KS +L + A
Subjt:  AMTKEQKSEGPSVVVFKKIL----------------------GLILTNAHLIEPWRFGKTNVSGGRSIENAQLWQTHTE--SMHNGVFGSKKSGSLTQNA

Query:  SKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSV
         +N    + + +++ K +F   G R++RVRL H + W WC A V+YICK   D+ALLQLE +P +L PI  + S P  G+  +V+GHGL GP+ G SPS+
Subjt:  SKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSMIYVIGHGLLGPKSGFSPSV

Query:  SSGVVANVVKTKIP-SSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVL
         SGVVA VV  K   ++      +  FPAMLETTAAVHPGGSGGAV+NS GHMIGLVTSNARHG G +IPHLNFS+PCA L PI +F++DM++ ++L+ L
Subjt:  SSGVVANVVKTKIP-SSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCAALEPIHRFSKDMEDLSVLKVL

Query:  DEPDEQLSSIWALMPQRSPKPSPS-PDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEGLPSEI
        D+P E+LSSIWALMP  SPK   S P+LP+LL + +  + KGS+FAKFIAE +++F KPT  ++ + +PS++
Subjt:  DEPDEQLSSIWALMPQRSPKPSPS-PDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEGLPSEI

AT3G27925.1 DegP protease 13.0e-0426.24Show/hide
Query:  GRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWP-SSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGG
        G  +LRV L     +   DAKV+   +   DVA+L+++    +L PI +  S     G  ++ IG+       G   ++++GV++ +   +  SS   G 
Subjt:  GRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWP-SSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGG

Query:  SLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPC----AALEPIHRFSKDMEDLSVLKVL-DEPDEQLSSIWALMPQR
         ++    +++T AA++PG SGG +++S G +IG+ T  A +        + FS+P       ++ + RF K    +  +K   D+  EQL     L+   
Subjt:  SLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPC----AALEPIHRFSKDMEDLSVLKVL-DEPDEQLSSIWALMPQR

Query:  SP
         P
Subjt:  SP

AT4G18370.1 DEGP protease 51.8e-0427.2Show/hide
Query:  DVALLQLEQIPEQLSPIIMDCSWP-SSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGH
        D+A+L++E    +L+P+++  S     G   + IG+       G+  +++ GVV+ + + +IPS    G S+      ++T A ++ G SGG +++S GH
Subjt:  DVALLQLEQIPEQLSPIIMDCSWP-SSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGH

Query:  MIGLVTSNARHGRGAIIPHLNFSVP
         IG+ T+        +   +NF++P
Subjt:  MIGLVTSNARHGRGAIIPHLNFSVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTACCTGAAACCCTTTGTGACACTGAGTTTGCTAGGCATCTTGGTAATCATAAGGATCAATTTGCAACGTTGGTTCTGACTGTTTCCTCCATCTTTGAGCCTTT
TATGCCACTGCAACACCGAGATACCATTCACCAGGTTAACTTCACTTACTATGCTGTAAAAGGGGGAAATGCATTTGACTGGAAACCTGAGCTAATTCCTGGTGTTCAGA
TTGACATTATGGTTGAGGATAAATCATGGATGGAGAAAGATTTTGAAGTTCGAAATGTAGGAGCTCCTCATTGGCATGCTGCACATTTGTTGGCTTTGTATGATATACCT
ACATCTGCCACTGCTCTTCAATCAGTCATTGATGCTTCTTTAGATTCAATAAATCAGAGATGGGAGGTCGGCTGGTCTTTGGCCTCATATACAAATGGTTCTCCATCCTT
TGGGGATGCTCTTCATAGACAGGATATGCCAAACATCAATATATCTCCCTCAAGGCAGAGAGGATCCTTTCTTCTTGCTGTTGGTTCTCCTTTTGGTGTACTATCACCGG
TGCATTTCTTTAACAGCATATCAGTCGGATCAATTGCCAACTGCTACCCTCCTAGCTCATTGAGCAAGTCATTGCTGATGGCTGACATGCGGTGTCTTCCTGGAATGGAA
GGCTGTCCTGTTTTTGATGAGCATGCACGTCTCATCGGTGTTCTGATTAGACCACTTAGGCATTATATGTCTGGTGCTGAGATTCAGCTGTTGGTTCCATGGGAAGCCAT
CGCAACTGCTTGCAGTAATTTGCTGCTAGCGGCTTATTATGTTGGAGAAAGGATTGGCAATGACAATGGGTGTACGAGTGCTGTGGGGAATGAGGCAATGACTAAGGAAC
AAAAATCTGAAGGACCTTCAGTAGTAGTATTCAAGAAAATTCTCGGCCTAATACTCACAAATGCCCACTTGATAGAGCCATGGAGATTTGGGAAAACAAACGTGAGTGGA
GGAAGATCAATTGAAAATGCCCAGCTGTGGCAGACCCATACTGAGTCAATGCATAATGGTGTTTTTGGCAGCAAAAAGAGCGGAAGTTTAACACAAAATGCCTCGAAGAA
TGCAAATATTCTTCTCCAGGACCAACTTAAAGATAAAAAATTGAGTTTTGCTAACTATGGCCGTAGAAACTTGCGTGTTCGCTTGAATCATGCAGAGCCTTGGGTTTGGT
GTGATGCTAAAGTAATATATATCTGTAAGGGACCTTGGGATGTTGCCCTGTTGCAGCTTGAGCAAATTCCGGAGCAGCTCTCACCTATTATTATGGATTGTTCGTGGCCG
TCCTCAGGATCGATGATATATGTTATTGGACATGGACTGTTGGGACCAAAATCTGGCTTCTCTCCATCTGTTTCCTCTGGCGTGGTAGCGAATGTGGTGAAAACAAAGAT
TCCCTCATCTTATCATCAGGGAGGTTCATTAGAATATTTTCCTGCAATGCTTGAAACAACAGCTGCAGTCCATCCTGGTGGTAGTGGGGGTGCTGTTGTCAATTCAGAAG
GCCATATGATTGGACTTGTTACAAGCAATGCGAGGCACGGGCGAGGAGCTATTATTCCACACTTGAACTTCAGCGTACCATGTGCAGCTTTGGAACCCATTCATAGGTTC
TCCAAAGACATGGAGGACCTCTCAGTCCTAAAAGTTCTGGATGAACCAGATGAGCAACTTTCTTCCATATGGGCATTGATGCCACAACGATCACCCAAGCCCTCTCCATC
GCCTGATCTGCCTCAATTGCTAGGTGAAAACCATGAAACAAAGGGGAAAGGTTCTCGATTTGCAAAGTTCATCGCCGAAAGACGTGAAGTGTTCCGCAAGCCAACTCTTC
ATAACAAGGTGGAGGGGCTTCCATCTGAGATAATCCGTAGCAAGGTTTCATGTTTACCTGCTACACTGCTGGCAAAAGCGCTGCTCCAATCCGGCCAAAATCACCACCGC
GGCCTTGGCATGGAACTCGCAGACCTTGTGCCGACGGTGGTACGGCCTGGCGTCGCTCAGATCAGCATTGCACCATCGACTTGGCAACGCGGCACCATTGACCTCCCTCC
TCCGCCGCCAGATCGACCACTGCTTAAAGCTCTCCTCATCCTCTCATCTTCTCCAAACCCTAGTTCTCCGCCGTTGTCGTCGTCTTCCTCCTCCTCCTCCTCCTCTCTTC
CTCCGCCGTGTCCGGCCAACCTTGAGCTCCGCTCGCTGCCATTCTTTTCCAGAGAAGCCGTTGTGGAGCAACTCGAAAGTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATATTACCTGAAACCCTTTGTGACACTGAGTTTGCTAGGCATCTTGGTAATCATAAGGATCAATTTGCAACGTTGGTTCTGACTGTTTCCTCCATCTTTGAGCCTTT
TATGCCACTGCAACACCGAGATACCATTCACCAGGTTAACTTCACTTACTATGCTGTAAAAGGGGGAAATGCATTTGACTGGAAACCTGAGCTAATTCCTGGTGTTCAGA
TTGACATTATGGTTGAGGATAAATCATGGATGGAGAAAGATTTTGAAGTTCGAAATGTAGGAGCTCCTCATTGGCATGCTGCACATTTGTTGGCTTTGTATGATATACCT
ACATCTGCCACTGCTCTTCAATCAGTCATTGATGCTTCTTTAGATTCAATAAATCAGAGATGGGAGGTCGGCTGGTCTTTGGCCTCATATACAAATGGTTCTCCATCCTT
TGGGGATGCTCTTCATAGACAGGATATGCCAAACATCAATATATCTCCCTCAAGGCAGAGAGGATCCTTTCTTCTTGCTGTTGGTTCTCCTTTTGGTGTACTATCACCGG
TGCATTTCTTTAACAGCATATCAGTCGGATCAATTGCCAACTGCTACCCTCCTAGCTCATTGAGCAAGTCATTGCTGATGGCTGACATGCGGTGTCTTCCTGGAATGGAA
GGCTGTCCTGTTTTTGATGAGCATGCACGTCTCATCGGTGTTCTGATTAGACCACTTAGGCATTATATGTCTGGTGCTGAGATTCAGCTGTTGGTTCCATGGGAAGCCAT
CGCAACTGCTTGCAGTAATTTGCTGCTAGCGGCTTATTATGTTGGAGAAAGGATTGGCAATGACAATGGGTGTACGAGTGCTGTGGGGAATGAGGCAATGACTAAGGAAC
AAAAATCTGAAGGACCTTCAGTAGTAGTATTCAAGAAAATTCTCGGCCTAATACTCACAAATGCCCACTTGATAGAGCCATGGAGATTTGGGAAAACAAACGTGAGTGGA
GGAAGATCAATTGAAAATGCCCAGCTGTGGCAGACCCATACTGAGTCAATGCATAATGGTGTTTTTGGCAGCAAAAAGAGCGGAAGTTTAACACAAAATGCCTCGAAGAA
TGCAAATATTCTTCTCCAGGACCAACTTAAAGATAAAAAATTGAGTTTTGCTAACTATGGCCGTAGAAACTTGCGTGTTCGCTTGAATCATGCAGAGCCTTGGGTTTGGT
GTGATGCTAAAGTAATATATATCTGTAAGGGACCTTGGGATGTTGCCCTGTTGCAGCTTGAGCAAATTCCGGAGCAGCTCTCACCTATTATTATGGATTGTTCGTGGCCG
TCCTCAGGATCGATGATATATGTTATTGGACATGGACTGTTGGGACCAAAATCTGGCTTCTCTCCATCTGTTTCCTCTGGCGTGGTAGCGAATGTGGTGAAAACAAAGAT
TCCCTCATCTTATCATCAGGGAGGTTCATTAGAATATTTTCCTGCAATGCTTGAAACAACAGCTGCAGTCCATCCTGGTGGTAGTGGGGGTGCTGTTGTCAATTCAGAAG
GCCATATGATTGGACTTGTTACAAGCAATGCGAGGCACGGGCGAGGAGCTATTATTCCACACTTGAACTTCAGCGTACCATGTGCAGCTTTGGAACCCATTCATAGGTTC
TCCAAAGACATGGAGGACCTCTCAGTCCTAAAAGTTCTGGATGAACCAGATGAGCAACTTTCTTCCATATGGGCATTGATGCCACAACGATCACCCAAGCCCTCTCCATC
GCCTGATCTGCCTCAATTGCTAGGTGAAAACCATGAAACAAAGGGGAAAGGTTCTCGATTTGCAAAGTTCATCGCCGAAAGACGTGAAGTGTTCCGCAAGCCAACTCTTC
ATAACAAGGTGGAGGGGCTTCCATCTGAGATAATCCGTAGCAAGGTTTCATGTTTACCTGCTACACTGCTGGCAAAAGCGCTGCTCCAATCCGGCCAAAATCACCACCGC
GGCCTTGGCATGGAACTCGCAGACCTTGTGCCGACGGTGGTACGGCCTGGCGTCGCTCAGATCAGCATTGCACCATCGACTTGGCAACGCGGCACCATTGACCTCCCTCC
TCCGCCGCCAGATCGACCACTGCTTAAAGCTCTCCTCATCCTCTCATCTTCTCCAAACCCTAGTTCTCCGCCGTTGTCGTCGTCTTCCTCCTCCTCCTCCTCCTCTCTTC
CTCCGCCGTGTCCGGCCAACCTTGAGCTCCGCTCGCTGCCATTCTTTTCCAGAGAAGCCGTTGTGGAGCAACTCGAAAGTGAATGA
Protein sequenceShow/hide protein sequence
MILPETLCDTEFARHLGNHKDQFATLVLTVSSIFEPFMPLQHRDTIHQVNFTYYAVKGGNAFDWKPELIPGVQIDIMVEDKSWMEKDFEVRNVGAPHWHAAHLLALYDIP
TSATALQSVIDASLDSINQRWEVGWSLASYTNGSPSFGDALHRQDMPNINISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANCYPPSSLSKSLLMADMRCLPGME
GCPVFDEHARLIGVLIRPLRHYMSGAEIQLLVPWEAIATACSNLLLAAYYVGERIGNDNGCTSAVGNEAMTKEQKSEGPSVVVFKKILGLILTNAHLIEPWRFGKTNVSG
GRSIENAQLWQTHTESMHNGVFGSKKSGSLTQNASKNANILLQDQLKDKKLSFANYGRRNLRVRLNHAEPWVWCDAKVIYICKGPWDVALLQLEQIPEQLSPIIMDCSWP
SSGSMIYVIGHGLLGPKSGFSPSVSSGVVANVVKTKIPSSYHQGGSLEYFPAMLETTAAVHPGGSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSVPCAALEPIHRF
SKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLLGENHETKGKGSRFAKFIAERREVFRKPTLHNKVEGLPSEIIRSKVSCLPATLLAKALLQSGQNHHR
GLGMELADLVPTVVRPGVAQISIAPSTWQRGTIDLPPPPPDRPLLKALLILSSSPNPSSPPLSSSSSSSSSSLPPPCPANLELRSLPFFSREAVVEQLESE