CuGenDBv2

Lag0013834 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0013834
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: glyoxysomal processing protease, glyoxysomal
Genome location: chr1:53104996..53115858
RNA-Seq Expression: Lag0013834
Synteny: Lag0013834
Gene Ontology terms:
  GO:0016485 - protein processing (biological process)
  GO:0005777 - peroxisome (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domains:
  IPR009003 - Peptidase S1, PA clan
  IPR039245 - Peroxisomal/glyoxysomal leader peptide-processing protease
  IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold


Homology
GenBank top hits (e value, % identity, alignment)
KAG6604457.1 Glyoxysomal processing protease, glyoxysomal, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | identity: 82.86%
Query:  PQPRLLHLASSQSQYLTAYLPVSWL------CGKLWIMREILPPWSESKAL---------SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTI
        PQ RLLHL S  S YLTA  PV              +M  +  P  +   +         SGRTTLSASGMILPE LYDT VAKHLGNYKDQFATLVLT+
Subjt:  PQPRLLHLASSQSQYLTAYLPVSWL------CGKLWIMREILPPWSESKAL---------SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTI

Query:  SSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLALYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSP
        SSIFEPFMPLQHR+TI KGKPELIPGVQIDIMVE NSLMERD +V   +TP WHAAHLLALYDIPT+A+AL+ VMDASLDSLHQRWEVGWSLASY NGSP
Subjt:  SSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLALYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSP

Query:  SFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWS
        SFRDSL+ QIE D+ TFAGSQR+LD EGS KNNDLTIR+AILGVPS SKDMPNI +SP RQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPP+S S
Subjt:  SFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWS

Query:  KSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQE
        KSLL+ADMRCLPGMEGCPVFDEHA LIGVLIRPLVHYMTGAEIQLLIPWGAIATACS LLL AYNA ERI NDN CISAVGNEAM KE KFEG F SIQE
Subjt:  KSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQE

Query:  NSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASK
        NS C  PFP K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK  VSGERSIENA+LLQ++TE S CS+ NG FG KKSGNLTQNASK
Subjt:  NSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASK

Query:  NASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCS
        NA+ILLQ+Q+E +KL+FANYG RNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLS IIMD SWPS+GSKIHVIGHGLLGPKSGFSPSVCS
Subjt:  NASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCS

Query:  GVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEP
        GVVANVVKAKIP S HQGDSLEYFPAMLETTAAVHPG SGGAVVNSEG MIGLVTSNARHGRGAIIPHLNFSIPCAALEPIH F +DM+DLSVLKVLDEP
Subjt:  GVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEP

Query:  DEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTLHNKEERLPSDTIRSKL
        DEQLSSIWALMSQRSPKPSPLPDLPQL G DHETK   GKGSRFAKFIAE+REVFRK TLHN+EE+LPS+ IRSKL
Subjt:  DEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTLHNKEERLPSDTIRSKL

KAG7034601.1 Glyoxysomal processing protease, glyoxysomal, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | identity: 81.69%
Query:  FRVLSARVCPDEGVLLIATTEGSYPQGPQPRLLHLASSQSQYLTAYLPVSWL------CGKLWIMREILPPWSESKAL---------SGRTTLSASGMIL
        FR L AR  P +   LIA+T+G YPQGPQ RLLHL S  S YLTA  PV              +M  +  P  +   +         SGRTTLSASGMIL
Subjt:  FRVLSARVCPDEGVLLIATTEGSYPQGPQPRLLHLASSQSQYLTAYLPVSWL------CGKLWIMREILPPWSESKAL---------SGRTTLSASGMIL

Query:  PEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLALYDIPTSATALQS
        PE LYDT VAKHLGNYKDQFATLVLT+SSIFEPFMPLQHR+TI KGKPELIPGVQIDIMVE NSLMERD +V   +TP WHAAHLLALYDIPT+A+AL+ 
Subjt:  PEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLALYDIPTSATALQS

Query:  VMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLRQRGSFLLAVGSPF
        VMDASLDSLHQRWEVGWSLASY NGSPSFRDSL+ QIE D+ TFAGSQR+LD EGS KNNDLTIR+AILGVPS SKDMPNI +SP RQRGSFLLAVGSPF
Subjt:  VMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLRQRGSFLLAVGSPF

Query:  GVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLLLEAYNAEERIGND
        GVLSPVHFLNSISVGSISNCYPP+S SKSLL+ADMRCLPGMEGCPVFDEHA LIGVLIRPLVHYMTGAEIQLLIPWGAIATACS LLL AY+A +RI ND
Subjt:  GVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLLLEAYNAEERIGND

Query:  NVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGERSIENARLLQTH
        N CISAVGNEAM KE KFEG F SIQENS C  PFP K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK  VSGERSIENA+LLQ++
Subjt:  NVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGERSIENARLLQTH

Query:  TEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSW
        TE S CS+ NG FG KKSGNLTQNASKNA+ILLQ+Q+E +KL+FANYG RNLRVRLNHAE WIWCDAKVLYICKGPWDVALLQLEQIPEQLS IIMD SW
Subjt:  TEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSW

Query:  PSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSI
        PS+GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIP S HQGDSLEYFPAMLETTAAVHPG SGGAVVNSEG MIGLVTSNARHGRGAIIPHLNFSI
Subjt:  PSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSI

Query:  PCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTLHNKEERLPSDTIR
        PCAALEPIH F +DM+DLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQL G DHETK   GKGSRFAKFIAE+REVFRK TLHNKEE+LPS+ IR
Subjt:  PCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTLHNKEERLPSDTIR

Query:  SKL
        SKL
Subjt:  SKL

XP_038881508.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Benincasa hispida] | e value: 0.0e+00 | identity: 89.54%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDT VAKHLGNYKDQFATLVLT+SSIFEPFM LQHRDTI KGKPELIPGVQIDIMVE NSLMERD +V   +T  WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPTSA ALQSVMDASLDSLHQRWEVGWSLASYTNGSP FRDS + QIE DKKTF G+Q +LDMEGS KNNDLTIRIAILGVPS SKDMPNISISP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSW KSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPLVHYMTGAEIQLLIPWGAI TACS LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L AYN  ERIGNDN C+S VGNEAM KEQKF+G FSSIQ+NSG   PFP +V+KAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKT VSG
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        ERSIENA+LLQ HTE SPCS+ +GVFGGKKSG++TQNASKNA+    DQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQ+
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSS HQGDSLEYFPAMLETTAAVHPG SGGAVVNS+GRMIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRGAIIPHLNFSIPCAALEPIH+FSKDMEDLSV+KVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETK   GKGSRFAKFIAEQREV RKPTL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEER-LPSDTIRSKL
        HN+ ER LPSD IRSKL
Subjt:  HNKEER-LPSDTIRSKL

XP_038881509.1 glyoxysomal processing protease, glyoxysomal isoform X2 [Benincasa hispida] | e value: 0.0e+00 | identity: 89.12%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDT VAKHLGNYKDQFATLVLT+SSIFEPFM LQHRDTI KGKPELIPGVQIDIMVE NSLMERD +V   +T  WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPTSA ALQSVMDASLDSLHQRWEVGWSLASYTNGSP FRDS + QIE DKKTF G+Q +LDMEGS KNNDLTIRIAILGVPS SKDMPNISISP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSW KSLLMADMRCLP   GCPVFDE ARLIGVLIRPLVHYMTGAEIQLLIPWGAI TACS LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L AYN  ERIGNDN C+S VGNEAM KEQKF+G FSSIQ+NSG   PFP +V+KAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKT VSG
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        ERSIENA+LLQ HTE SPCS+ +GVFGGKKSG++TQNASKNA+    DQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQ+
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSS HQGDSLEYFPAMLETTAAVHPG SGGAVVNS+GRMIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRGAIIPHLNFSIPCAALEPIH+FSKDMEDLSV+KVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETK   GKGSRFAKFIAEQREV RKPTL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEER-LPSDTIRSKL
        HN+ ER LPSD IRSKL
Subjt:  HNKEER-LPSDTIRSKL

XP_038881510.1 glyoxysomal processing protease, glyoxysomal isoform X3 [Benincasa hispida] | e value: 0.0e+00 | identity: 88.28%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDT VAKHLGNYKDQFATLVLT+SSIFEPFM LQHRDTI KGKPELIPGVQIDIMVE NSLMERD +V   +T  WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPTSA ALQSVMDASLDSLHQRWEVGWSLASYTNGSP FRDS +             Q +LDMEGS KNNDLTIRIAILGVPS SKDMPNISISP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSW KSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPLVHYMTGAEIQLLIPWGAI TACS LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L AYN  ERIGNDN C+S VGNEAM KEQKF+G FSSIQ+NSG   PFP +V+KAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKT VSG
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        ERSIENA+LLQ HTE SPCS+ +GVFGGKKSG++TQNASKNA+    DQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQ+
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSS HQGDSLEYFPAMLETTAAVHPG SGGAVVNS+GRMIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRGAIIPHLNFSIPCAALEPIH+FSKDMEDLSV+KVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETK   GKGSRFAKFIAEQREV RKPTL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEER-LPSDTIRSKL
        HN+ ER LPSD IRSKL
Subjt:  HNKEER-LPSDTIRSKL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KHN7 Uncharacterized protein | e value: 0.0e+00 | identity: 85.5%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDT  AKHLGNYKDQFATLVLT+SSIFEPFMPLQHRD I KGKPELIPGVQIDIMVE    + RD +V   +TP WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPTSATALQSVMDAS+DSLHQRWEVGWSLASYTNGSPSFRDSL+ QIE +K+T  GSQ+FLD+EGS+KNNDLTIRIAILGVPSLSKDMPNISISP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHFLNS+SVGSISNCYPPSS SKSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACS LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L   N  ERI NDN CI AVGN A+ KEQK EG FSSIQE+SGC  PFPFK+EKA+ASVCLVT+GEGIWASGVLLNSQGLILTNAHLIEPWRFGKT V G
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        E+SIENA+LLQ+HTE SPCS++N VFGG++ GN+  NASKN +ILL +QLEDNKLSF NYG RNL VRL+HAEPWIWCDAK+LYICKG WDVALLQLEQI
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLSPI MDCS P+SGSKIHVIGHGLLGPKSG SPSVCSGVV+NVVKAKIPSS H+GDSLEYFPAMLETTAAVHPG SGGAVVNSEG MIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRG IIPHLNFSIPCAALEPIH+FSKDMEDLSV+KVLDEP+EQLSSIWALMSQRSPKPSP P LPQLLGEDHE+K   GKGSRFAKFIAEQREV RKPTL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEER-LPSDTIRSKL
        HN+ ER LPSD +RSKL
Subjt:  HNKEER-LPSDTIRSKL

A0A1S3AZ98 glyoxysomal processing protease, glyoxysomal | e value: 0.0e+00 | identity: 85.91%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYD+   KHLGNYKDQFATLVLT+SSIFEPFMPLQHRDTI KGKPELIPGVQIDIMVE    + RD +V   +TP WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSL+ QIE +K+T  GSQRFLD+EGS KNNDLTIRIAILGV SLSKDMPNI+ISP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSS SKSLLMADMRCLPGMEGCPVFDE ARLIGVLIRPLVHYMTGAEIQLLIPWGAIATA S LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L   NA ERI NDN CISAVGN A+ KEQKFE  FSSIQE+S C  PFPFK+EKA+ASVCLVT+GEGIWASGVLLNSQGLILTNAHLIEPWRFGKT VSG
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        E+SIEN++LLQ+ TE SPCS++NGVF G+KSGN+  NASKN +ILL +QLEDNKLSFANYG RNLRVRL+HAEPWIWCDAK+LYICKGPWDVALLQLE+I
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLSPIIMDCS PSSGSKIHVIGHGLLGPKSG SPSVCSGVV+NVVKAKIPSS H+GDSLEY PAMLETTAAVHPG SGGAVVNSEG MIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRG IIPHLNFSIPCAALEPIH+FSKDMEDLSV+KVLDEP+EQLSSIWALMSQRSPKPSPLPDLP+LLGEDH   G +GKGSRFAKFIAE+REV RKPTL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEER-LPSDTIRSKL
        HN+ ER LPSD  RSKL
Subjt:  HNKEER-LPSDTIRSKL

A0A6J1CK76 glyoxysomal processing protease, glyoxysomal isoform X1 | e value: 0.0e+00 | identity: 86.85%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDTEVAKHLGN+KDQFA+LVLT SSIFEPFMP QHRD IR+GKPELIPGVQIDIMVEDNSLMERD EVRN  TP WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPTSATALQSVMDASLDS+HQRWEVGWSLASYTNG PSFRD+LQRQIE DK+TF GSQ+ LDMEGS K +DL +RIAILGVPSLSKD+PNISISP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHF NSISVGSI+N YPP SW+KSLLMADMRCLPGMEGCPVFDEHAR+IGVLIRPL+HYMTGAEIQLL+PWGAIATACSDLL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGE
          AY A E IGNDN C + VGNEAM KEQKFEGTFSSI ENS CCPFP KVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKT  S E
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGE

Query:  RSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIP
        RSIENA+LLQTHTEDSPCS+ NGVFGGK SG+L QNAS+NA+IL+QDQL+DNK SFANYG RNLRVRLNHA+ WIWCDAKV+YIC+GPWDVALLQLEQIP
Subjt:  RSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIP

Query:  EQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHG
        EQLSPI MDCS PSSGSKI+VIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSS HQGDSLEYFPA+LETTAAVHPG SGGAVVNSEG M+GLVTSNARHG
Subjt:  EQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHG

Query:  RGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTLH
        RG+IIPHLNFSIPCAALEPI++FSKDMEDLSVLKVLDEPDEQLSS+WALM QRSPK    PDLPQL GEDHETK   GKGSRFAKFIAE+REVF+KPT+H
Subjt:  RGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTLH

Query:  NKEERLPSDTIRSKL
        NK E LPS T+RSKL
Subjt:  NKEERLPSDTIRSKL

A0A6J1EJB5 glyoxysomal processing protease, glyoxysomal isoform X1 | e value: 0.0e+00 | identity: 87.15%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDT VAKHLGNYKDQFATLVLT+SSIFEPFMPLQHR+TI KGKPELIPGVQIDIMVE NSLMERD +V   +TP WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPT+A+AL+ VMDASLDSLHQRWEVGWSLASY NGSPSFRDSL+ QIE D+ TFAGSQR+LD EGS KNNDLTIR+AILGVPS SKDMPNI +SP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPP+S SKSLL+ADMRCLPGMEGCPVFDEHA LIGVLIRPLVHYMTGAEIQLLIPWGAIATACS LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L AY+A +RI NDN CISAVGNEAM KE KFEG F SIQENS C  PFP K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK  VSG
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        ERSIENA+LLQ++TE S CS+ NG FG KKSGNLTQNASKNA+ILLQ+Q+E +KL+FANYG RNLRVRLNHAE WIWCDAKVLYICKGPWDVALLQLEQI
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLS IIMD SWPS+GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIP S HQGDSLEYFPAMLETTAAVHPG SGGAVVNSEG MIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRGAIIPHLNFSIPCAALEPIH F +DM+DLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQL G DHETK   GKGSRFAKFIAE+REVFRK TL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEERLPSDTIRSKL
        HNKEE+LPS+ IRSKL
Subjt:  HNKEERLPSDTIRSKL

A0A6J1INF4 glyoxysomal processing protease, glyoxysomal isoform X1 | e value: 0.0e+00 | identity: 86.59%
Query:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA
        SGRTTLSASGMILPE LYDT VAKHLGNYKDQFATLVLT+SSIFEPFMPLQHR+TI KGKPELIPGVQIDIMVE NSLMERD +V   +TP WHAAHLLA
Subjt:  SGRTTLSASGMILPEALYDTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLA

Query:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR
        LYDIPT+  AL+ VMDASLDSLHQRWEVGWSLASY NGSPSFRDSL+ QIE D+ TFAGSQR+LD EGS KNNDLTIRIAILGVPS SKD+PNI +SP R
Subjt:  LYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLR

Query:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL
        QRGSFLLAVGSPFGVLSP+HFLNSISVGSISNCYPP+S SKSLL+ADMRCLPGMEGCPVFDEHA L+GVLIRPLVHYMTGAEIQLLIPWGAIATACS LL
Subjt:  QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLL

Query:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG
        L AYNA ERI NDN CI+AVGNEAM KE KFEG F SIQENS C  PFP K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK  VSG
Subjt:  LEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCC-PFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSG

Query:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI
        ERSIENA+LLQ++TE SPCS+ NGVFGGKKSGNLTQNASKNA+ILLQ+Q+E +KL+FANYG RNLRVRLNHAEPW WCDAKVLYICKGPWDVALLQLEQI
Subjt:  ERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQI

Query:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH
        PEQLS IIMD SWPS+GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIP S HQGDSLEYFPAMLETTAAVHPG SGGAVVNSEG MIGLVTSNARH
Subjt:  PEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARH

Query:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL
        GRGAIIPHLNFSIPCAALEPIH+F +D +DLSV+K LDEPDEQLSSIWALMSQRSPKPSPLPDLPQL G DHETK   GKGSRFAKFIAE+REVFRK TL
Subjt:  GRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVFRKPTL

Query:  HNKEERLPSDTIRSKL
        H++EE+LPS+ IRSKL
Subjt:  HNKEERLPSDTIRSKL

SwissProt top hits (e value, % identity, alignment)
P39668 Uncharacterized serine protease YyxA | e value: 6.3e-05 | identity: 30.43%
Query:  SGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGL----VTSNARHGRGAIIPHLNF
        SG  +  IG+ L      F+ SV  GV++   +A IP  S+     ++   +L+T AA++PG+SGGA++N +G++IG+    +  +A  G G  IP    
Subjt:  SGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGL----VTSNARHGRGAIIPHLNF

Query:  SIPCAALEPIHKFSK
         +    +E + ++ K
Subjt:  SIPCAALEPIHKFSK

Q2FI55 Serine protease HtrA-like | e value: 1.8e-04 | identity: 27.2%
Query:  GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSIPCA
        G  I V+G+ L      F  +V  G+++  +   +P    + +  +      +  A+V+PG+SGGAVVN EG++IG+V +         + +++F+I   
Subjt:  GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSIPCA

Query:  ALEPIHKFSKDMEDLSVLKVLDEPD
           P+++  K ++DL     +D PD
Subjt:  ALEPIHKFSKDMEDLSVLKVLDEPD

Q2T9J0 Peroxisomal leader peptide-processing protease | e value: 9.3e-25 | identity: 26.88%
Query:  PNISISPLR--QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPW
        P +++SPL    +G+ LL  GSPFG   P  FLN++S G +SN   P      LL+ D RCLPG EG  VF   AR  G L+  +V              
Subjt:  PNISISPLR--QRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPW

Query:  GAIATACSDLLLEAYNAEERIGNDNVCISA----VGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKA----MASVCLVTIGEGIWASGVLLNSQGLIL
           A  C       + A E +G   +C +A       +A+ +        +++       P+   +  +     A+  LV  G  +W SGV + +  L++
Subjt:  GAIATACSDLLLEAYNAEERIGNDNVCISA----VGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKA----MASVCLVTIGEGIWASGVLLNSQGLIL

Query:  TNAHLIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKV
        T  H+                 E AR+L   T  +P S+                                                      IW   +V
Subjt:  TNAHLIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKV

Query:  LYICKG--PWDVALLQLEQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSG
        ++  +   P+D+A++ LE+  + + PI +       G  + V+G G+ G   G  PSV SG+++ VV+            +   P ML+TT AVH GSSG
Subjt:  LYICKG--PWDVALLQLEQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSG

Query:  GAVV-NSEGRMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKP
        G +  N  G ++G++TSN R +  GA  PHLNFSIP   L+P  +     +DL  L+ LD   E +  +W L    +  P
Subjt:  GAVV-NSEGRMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKP

Q8VZD4 Glyoxysomal processing protease, glyoxysomal | e value: 2.6e-168 | identity: 47.02%
Query:  SGRTTLSASGMILPEALY-DTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHR--DTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAH
        SG  TLSASG++LP  ++   EVA  +     Q   LVLT++S+ EPF+ L HR   +I +   +LIPG  I+IMVE     E+       E P W  A 
Subjt:  SGRTTLSASGMILPEALY-DTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHR--DTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAH

Query:  LLALYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGS-PSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISI
        LL+L D+P S+ ALQS+++AS  S    W++GWSL S  NGS PS        IE   K         +     K+     R+AILGVP      P+++ 
Subjt:  LLALYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGS-PSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISI

Query:  SPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATAC
        +    +G  L+A+GSPFG+LSPV+F NS+S GSI+N YP  S  KSL++AD+RCLPGMEG PVF ++  LIG+LIRPL    +G EIQL++PWGAI TAC
Subjt:  SPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATAC

Query:  SDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTK
        S LLLE  + E +       + +V ++A I                   P    +EKAM SVCL+T+ +G+WASG++LN  GLILTNAHL+EPWR+GK  
Subjt:  SDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTK

Query:  VSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQL
        V G    E  +      E+   S     F  +KS  L + A +N    + + + + K +F   GHR++RVRL H + W WC A V+YICK   D+ALLQL
Subjt:  VSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQL

Query:  EQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQ-GDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTS
        E +P +L PI  + S P  G+  HV+GHGL GP+ G SPS+CSGVVA VV AK   ++      +  FPAMLETTAAVHPG SGGAV+NS G MIGLVTS
Subjt:  EQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQ-GDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTS

Query:  NARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPK-PSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVF
        NARHG G +IPHLNFSIPCA L PI KF++DM++ ++L+ LD+P E+LSSIWALM   SPK    LP+LP+LL + +    K+ KGS+FAKFIAE +++F
Subjt:  NARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPK-PSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVF

Query:  RKPTLHNKEERLPSDTIRSKL
         KPT      +L  D I SKL
Subjt:  RKPTLHNKEERLPSDTIRSKL

Q9DBA6 Peroxisomal leader peptide-processing protease | e value: 5.1e-23 | identity: 26.89%
Query:  PNISISPLRQ--RGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPW
        P ++++PL    +G+ LLA GSPFG   P  FLN++S G +SN   P      LL+ D RCLPG EG  VF   AR  G L+  +               
Subjt:  PNISISPLRQ--RGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPW

Query:  GAIATACSDLLLEAYNAEERIGNDNVCISA----VGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAH
           A  C       + A E +G   +C +A    V   A+ +      + S +         P         + L  +G   WA+  +L   G +  +  
Subjt:  GAIATACSDLLLEAYNAEERIGNDNVCISA----VGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAH

Query:  LIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYIC
        ++ P                 RL+ T    +P                     + A +L+      N                      IW         
Subjt:  LIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYIC

Query:  KGPWDVALLQLEQIPEQLS--PIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVV
          P+D+A++ LE   E+L+  P  +       G  + V+G G+ G   G  PSV SG+++ VV+            ++  P ML+TT AVH GSSGG + 
Subjt:  KGPWDVALLQLEQIPEQLS--PIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETTAAVHPGSSGGAVV

Query:  NS-EGRMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKP
        +S  G ++G+V SN R +  GA  PHLNFSIP   L+P  K      DL  L+ LD   E +  +W L    S  P
Subjt:  NS-EGRMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKP

Arabidopsis top hits    e value    %identity    Alignment
AT1G28320.1 protease-related    1.8e-169    47.02
Query:  SGRTTLSASGMILPEALY-DTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHR--DTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAH
        SG  TLSASG++LP  ++   EVA  +     Q   LVLT++S+ EPF+ L HR   +I +   +LIPG  I+IMVE     E+       E P W  A 
Subjt:  SGRTTLSASGMILPEALY-DTEVAKHLGNYKDQFATLVLTISSIFEPFMPLQHR--DTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAH

Query:  LLALYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGS-PSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISI
        LL+L D+P S+ ALQS+++AS  S    W++GWSL S  NGS PS        IE   K         +     K+     R+AILGVP      P+++ 
Subjt:  LLALYDIPTSATALQSVMDASLDSLHQRWEVGWSLASYTNGS-PSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISI

Query:  SPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATAC
        +    +G  L+A+GSPFG+LSPV+F NS+S GSI+N YP  S  KSL++AD+RCLPGMEG PVF ++  LIG+LIRPL    +G EIQL++PWGAI TAC
Subjt:  SPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWSKSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATAC

Query:  SDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTK
        S LLLE  + E +       + +V ++A I                   P    +EKAM SVCL+T+ +G+WASG++LN  GLILTNAHL+EPWR+GK  
Subjt:  SDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTK

Query:  VSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQL
        V G    E  +      E+   S     F  +KS  L + A +N    + + + + K +F   GHR++RVRL H + W WC A V+YICK   D+ALLQL
Subjt:  VSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYGHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQL

Query:  EQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQ-GDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTS
        E +P +L PI  + S P  G+  HV+GHGL GP+ G SPS+CSGVVA VV AK   ++      +  FPAMLETTAAVHPG SGGAV+NS G MIGLVTS
Subjt:  EQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQ-GDSLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVTS

Query:  NARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPK-PSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVF
        NARHG G +IPHLNFSIPCA L PI KF++DM++ ++L+ LD+P E+LSSIWALM   SPK    LP+LP+LL + +    K+ KGS+FAKFIAE +++F
Subjt:  NARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPK-PSPLPDLPQLLGEDHETKGKEGKGSRFAKFIAEQREVF

Query:  RKPTLHNKEERLPSDTIRSKL
         KPT      +L  D I SKL
Subjt:  RKPTLHNKEERLPSDTIRSKL

AT3G27925.1 DegP protease 1    8.4e-05    30.07
Query:  GHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSWP-SSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGD
        G  +LRV L     +   DAKV+   +   DVA+L+++    +L PI +  S     G K+  IG+       G   ++ +GV++ +   +  SS+  G 
Subjt:  GHRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSWP-SSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGD

Query:  SLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVT-----SNARHGRGAIIP
         ++    +++T AA++PG+SGG +++S G +IG+ T     S A  G G  IP
Subjt:  SLEYFPAMLETTAAVHPGSSGGAVVNSEGRMIGLVT-----SNARHGRGAIIP


Sequences
CDS sequence
ATGCACACGCCTCGCGCTTTGCCTAAAATAACATTTTCCCGTCCGACGTCCTATTTTCGCGTACTATCGGCCCGCGTGTGCCCCGACGAAGGTGTACTGCTGATTGCTAC
TACCGAGGGCTCCTATCCTCAAGGACCTCAACCCCGTCTTCTTCATCTTGCTTCCTCACAGTCACAGTACCTCACAGCCTATCTTCCTGTGTCATGGCTATGCGGGAAAT
TGTGGATCATGCGAGAAATTTTGCCACCATGGTCAGAGTCCAAGGCCCTTTCTGGGAGGACAACTCTTTCAGCATCTGGAATGATATTACCTGAAGCCCTTTATGACACT
GAGGTCGCTAAGCATCTTGGTAATTATAAGGATCAATTTGCAACGTTGGTTCTGACTATTTCCTCCATTTTTGAGCCTTTTATGCCACTTCAACACAGAGATACCATTCG
TAAGGGAAAGCCTGAGTTAATTCCTGGTGTTCAGATTGACATTATGGTTGAGGATAACTCATTGATGGAGAGAGATATTGAAGTTCGTAATGTAGAAACTCCTCTTTGGC
ATGCTGCCCACTTGTTGGCTTTGTATGATATACCTACATCTGCCACTGCTCTTCAATCAGTCATGGATGCTTCTTTAGATTCATTACATCAGAGATGGGAGGTCGGCTGG
TCTTTGGCCTCATATACAAATGGTTCCCCATCCTTTCGGGATTCTCTTCAGAGACAGATTGAAAAGGACAAGAAAACCTTTGCTGGTAGCCAGAGATTTTTGGATATGGA
AGGATCTACCAAGAATAATGACTTAACTATAAGAATTGCCATTCTTGGTGTTCCCTCATTGTCAAAGGACATGCCAAACATCAGTATATCTCCCTTAAGGCAGAGAGGAT
CCTTTCTTCTTGCTGTTGGTTCTCCTTTTGGTGTTCTATCACCGGTGCACTTCCTTAACAGCATATCAGTCGGATCGATTTCCAATTGCTACCCTCCTAGCTCATGGAGC
AAGTCATTGCTCATGGCAGACATGCGGTGTCTTCCTGGAATGGAAGGTTGTCCTGTTTTTGATGAACATGCACGTCTCATCGGTGTTCTGATTAGGCCACTTGTGCATTA
TATGACTGGTGCTGAGATTCAGCTGTTGATTCCATGGGGAGCCATCGCAACTGCTTGCAGTGATTTGCTGCTAGAGGCTTATAATGCTGAAGAAAGGATTGGCAACGACA
ATGTGTGTATTAGTGCTGTGGGGAATGAGGCAATGATTAAGGAACAAAAATTTGAGGGAACCTTCAGCAGTATTCAAGAAAATTCTGGTTGTTGTCCTTTCCCATTTAAA
GTTGAGAAGGCAATGGCTTCTGTTTGTCTTGTTACAATTGGTGAAGGAATATGGGCATCTGGCGTTCTGCTCAATAGCCAAGGCCTAATACTCACAAATGCTCACTTGAT
AGAACCATGGAGATTTGGGAAAACAAAAGTTAGTGGAGAAAGATCAATTGAGAATGCCCGGCTGCTGCAGACCCACACTGAGGATTCTCCATGTTCAATTGATAACGGTG
TTTTTGGCGGCAAAAAGAGTGGAAATTTAACACAAAATGCCTCTAAGAATGCAAGTATTCTTCTCCAAGACCAACTTGAGGATAATAAGTTGAGTTTTGCTAACTATGGC
CATAGAAACTTGCGTGTTCGCTTGAATCATGCAGAGCCTTGGATTTGGTGTGATGCTAAAGTATTATACATCTGCAAAGGACCTTGGGATGTTGCCCTGTTGCAGCTTGA
GCAAATTCCGGAGCAGCTCTCACCTATTATTATGGATTGTTCGTGGCCATCCTCAGGATCAAAGATACATGTTATTGGACACGGACTGTTGGGACCAAAATCTGGCTTCT
CCCCATCTGTTTGCTCTGGTGTGGTGGCCAATGTGGTGAAAGCAAAGATTCCCTCATCTTCTCATCAAGGAGATTCATTAGAATATTTTCCTGCAATGCTCGAAACAACA
GCTGCAGTGCATCCTGGTAGTAGTGGGGGTGCTGTTGTCAATTCAGAAGGCCGTATGATTGGACTTGTTACAAGCAACGCGAGGCATGGGCGAGGAGCTATTATTCCACA
CTTGAACTTCAGCATACCATGTGCAGCATTGGAGCCCATTCATAAGTTCTCCAAAGACATGGAGGACCTCTCAGTCCTAAAAGTTCTGGACGAACCAGATGAGCAACTTT
CTTCTATATGGGCATTGATGTCACAGCGATCTCCCAAGCCCTCTCCTCTGCCCGATCTGCCTCAATTGCTAGGTGAAGACCATGAAACAAAGGGGAAAGAGGGGAAAGGT
TCTCGATTTGCAAAGTTCATCGCCGAACAACGTGAAGTATTCCGAAAGCCAACTCTTCATAATAAGGAGGAAAGGCTTCCATCTGATACAATCCGTAGCAAGTTATGA
mRNA sequence
ATGCACACGCCTCGCGCTTTGCCTAAAATAACATTTTCCCGTCCGACGTCCTATTTTCGCGTACTATCGGCCCGCGTGTGCCCCGACGAAGGTGTACTGCTGATTGCTAC
TACCGAGGGCTCCTATCCTCAAGGACCTCAACCCCGTCTTCTTCATCTTGCTTCCTCACAGTCACAGTACCTCACAGCCTATCTTCCTGTGTCATGGCTATGCGGGAAAT
TGTGGATCATGCGAGAAATTTTGCCACCATGGTCAGAGTCCAAGGCCCTTTCTGGGAGGACAACTCTTTCAGCATCTGGAATGATATTACCTGAAGCCCTTTATGACACT
GAGGTCGCTAAGCATCTTGGTAATTATAAGGATCAATTTGCAACGTTGGTTCTGACTATTTCCTCCATTTTTGAGCCTTTTATGCCACTTCAACACAGAGATACCATTCG
TAAGGGAAAGCCTGAGTTAATTCCTGGTGTTCAGATTGACATTATGGTTGAGGATAACTCATTGATGGAGAGAGATATTGAAGTTCGTAATGTAGAAACTCCTCTTTGGC
ATGCTGCCCACTTGTTGGCTTTGTATGATATACCTACATCTGCCACTGCTCTTCAATCAGTCATGGATGCTTCTTTAGATTCATTACATCAGAGATGGGAGGTCGGCTGG
TCTTTGGCCTCATATACAAATGGTTCCCCATCCTTTCGGGATTCTCTTCAGAGACAGATTGAAAAGGACAAGAAAACCTTTGCTGGTAGCCAGAGATTTTTGGATATGGA
AGGATCTACCAAGAATAATGACTTAACTATAAGAATTGCCATTCTTGGTGTTCCCTCATTGTCAAAGGACATGCCAAACATCAGTATATCTCCCTTAAGGCAGAGAGGAT
CCTTTCTTCTTGCTGTTGGTTCTCCTTTTGGTGTTCTATCACCGGTGCACTTCCTTAACAGCATATCAGTCGGATCGATTTCCAATTGCTACCCTCCTAGCTCATGGAGC
AAGTCATTGCTCATGGCAGACATGCGGTGTCTTCCTGGAATGGAAGGTTGTCCTGTTTTTGATGAACATGCACGTCTCATCGGTGTTCTGATTAGGCCACTTGTGCATTA
TATGACTGGTGCTGAGATTCAGCTGTTGATTCCATGGGGAGCCATCGCAACTGCTTGCAGTGATTTGCTGCTAGAGGCTTATAATGCTGAAGAAAGGATTGGCAACGACA
ATGTGTGTATTAGTGCTGTGGGGAATGAGGCAATGATTAAGGAACAAAAATTTGAGGGAACCTTCAGCAGTATTCAAGAAAATTCTGGTTGTTGTCCTTTCCCATTTAAA
GTTGAGAAGGCAATGGCTTCTGTTTGTCTTGTTACAATTGGTGAAGGAATATGGGCATCTGGCGTTCTGCTCAATAGCCAAGGCCTAATACTCACAAATGCTCACTTGAT
AGAACCATGGAGATTTGGGAAAACAAAAGTTAGTGGAGAAAGATCAATTGAGAATGCCCGGCTGCTGCAGACCCACACTGAGGATTCTCCATGTTCAATTGATAACGGTG
TTTTTGGCGGCAAAAAGAGTGGAAATTTAACACAAAATGCCTCTAAGAATGCAAGTATTCTTCTCCAAGACCAACTTGAGGATAATAAGTTGAGTTTTGCTAACTATGGC
CATAGAAACTTGCGTGTTCGCTTGAATCATGCAGAGCCTTGGATTTGGTGTGATGCTAAAGTATTATACATCTGCAAAGGACCTTGGGATGTTGCCCTGTTGCAGCTTGA
GCAAATTCCGGAGCAGCTCTCACCTATTATTATGGATTGTTCGTGGCCATCCTCAGGATCAAAGATACATGTTATTGGACACGGACTGTTGGGACCAAAATCTGGCTTCT
CCCCATCTGTTTGCTCTGGTGTGGTGGCCAATGTGGTGAAAGCAAAGATTCCCTCATCTTCTCATCAAGGAGATTCATTAGAATATTTTCCTGCAATGCTCGAAACAACA
GCTGCAGTGCATCCTGGTAGTAGTGGGGGTGCTGTTGTCAATTCAGAAGGCCGTATGATTGGACTTGTTACAAGCAACGCGAGGCATGGGCGAGGAGCTATTATTCCACA
CTTGAACTTCAGCATACCATGTGCAGCATTGGAGCCCATTCATAAGTTCTCCAAAGACATGGAGGACCTCTCAGTCCTAAAAGTTCTGGACGAACCAGATGAGCAACTTT
CTTCTATATGGGCATTGATGTCACAGCGATCTCCCAAGCCCTCTCCTCTGCCCGATCTGCCTCAATTGCTAGGTGAAGACCATGAAACAAAGGGGAAAGAGGGGAAAGGT
TCTCGATTTGCAAAGTTCATCGCCGAACAACGTGAAGTATTCCGAAAGCCAACTCTTCATAATAAGGAGGAAAGGCTTCCATCTGATACAATCCGTAGCAAGTTATGA
Protein sequence
MHTPRALPKITFSRPTSYFRVLSARVCPDEGVLLIATTEGSYPQGPQPRLLHLASSQSQYLTAYLPVSWLCGKLWIMREILPPWSESKALSGRTTLSASGMILPEALYDT
EVAKHLGNYKDQFATLVLTISSIFEPFMPLQHRDTIRKGKPELIPGVQIDIMVEDNSLMERDIEVRNVETPLWHAAHLLALYDIPTSATALQSVMDASLDSLHQRWEVGW
SLASYTNGSPSFRDSLQRQIEKDKKTFAGSQRFLDMEGSTKNNDLTIRIAILGVPSLSKDMPNISISPLRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPSSWS
KSLLMADMRCLPGMEGCPVFDEHARLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSDLLLEAYNAEERIGNDNVCISAVGNEAMIKEQKFEGTFSSIQENSGCCPFPFK
VEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTKVSGERSIENARLLQTHTEDSPCSIDNGVFGGKKSGNLTQNASKNASILLQDQLEDNKLSFANYG
HRNLRVRLNHAEPWIWCDAKVLYICKGPWDVALLQLEQIPEQLSPIIMDCSWPSSGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSSHQGDSLEYFPAMLETT
AAVHPGSSGGAVVNSEGRMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHKFSKDMEDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLLGEDHETKGKEGKG
SRFAKFIAEQREVFRKPTLHNKEERLPSDTIRSKL
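
The protein sequence above corresponds to the CDS translated codon by codon. The following is a minimal sketch of that check, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Wrapped sequence lines are joined before translation, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard code is the right one for this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from wrapped display lines.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGCACACGCCTCGCGCTTTGCCTAAAATA"))  # MHTPRALPKI
```

Passing the full CDS to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records are consistent.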