CuGenDBv2

Carg15301 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg15301
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: glyoxysomal processing protease, glyoxysomal isoform X1
Genome location: Carg_Chr03:9173080..9180597
RNA-Seq Expression: Carg15301
Synteny: Carg15301
Gene Ontology terms:
  GO:0016485 - protein processing (biological process)
  GO:0005777 - peroxisome (cellular component)
  GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domains:
  IPR009003 - Peptidase S1, PA clan
  IPR039245 - Peroxisomal/glyoxysomal leader peptide-processing protease
  IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold


Homology
GenBank top hits | e value | % identity | Alignment
KAG6604457.1 Glyoxysomal processing protease, glyoxysomal, partial [Cucurbita argyrosperma subsp. sororia] | 0.0e+00 | 99.48
Query:  PQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSS
        PQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSS
Subjt:  PQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSS

Query:  IFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRD
        IFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRD
Subjt:  IFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRD

Query:  SLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLI
        SLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLI
Subjt:  SLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLI

Query:  ADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCS
        ADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAY+AG+RIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCS
Subjt:  ADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCS

Query:  RPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANIL
        RPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANIL
Subjt:  RPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANIL

Query:  LQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVAN
        LQNQIEGSKLNFANYGRRNLRVRLNHAE WIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVAN
Subjt:  LQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVAN

Query:  VVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLS
        VVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLS
Subjt:  VVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLS

Query:  SIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        SIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHN+EEKLPSNVIRSKL
Subjt:  SIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

KAG7034601.1 Glyoxysomal processing protease, glyoxysomal, partial [Cucurbita argyrosperma subsp. argyrosperma] | 0.0e+00 | 100
Query:  THIPPFAASFRALLARAPPKKVYCLIASTDGCYPQGPQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTL
        THIPPFAASFRALLARAPPKKVYCLIASTDGCYPQGPQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTL
Subjt:  THIPPFAASFRALLARAPPKKVYCLIASTDGCYPQGPQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTL

Query:  SASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAA
        SASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAA
Subjt:  SASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAA

Query:  SALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLA
        SALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLA
Subjt:  SALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLA

Query:  VGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKR
        VGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKR
Subjt:  VGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKR

Query:  IDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKL
        IDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKL
Subjt:  IDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKL

Query:  LQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIM
        LQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIM
Subjt:  LQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIM

Query:  DFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHL
        DFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHL
Subjt:  DFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHL

Query:  NFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVI
        NFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVI
Subjt:  NFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVI

Query:  RSKL
        RSKL
Subjt:  RSKL

XP_022925830.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Cucurbita moschata] | 0.0e+00 | 100
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
        MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE

Query:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
        LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
Subjt:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL

Query:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
        DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
Subjt:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC

Query:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
        LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
Subjt:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG

Query:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
        EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
Subjt:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL

Query:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
        RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
Subjt:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP

Query:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
        AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
Subjt:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP

Query:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
Subjt:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

XP_022978651.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Cucurbita maxima] | 0.0e+00 | 97.19
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
        MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE

Query:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
        LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTA  ALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
Subjt:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL

Query:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
        DTEGSNKNNDLTIR+AILGVPSFSKD+PNIRLSPSRQRGSFLLAVGSPFGVLSP+HFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
Subjt:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC

Query:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
        L+GVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAY+AG+RIDNDNGCI+AVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
Subjt:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG

Query:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
        EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERS CSMHNG FG KKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
Subjt:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL

Query:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
        RVRLNHAE W WCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
Subjt:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP

Query:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
        AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIH FLRD KDLSV+K LDEPDEQLSSIWALMSQRSPKPSPLPDLP
Subjt:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP

Query:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        QLPGGDHETKGKGSRFAKFIAERREVFRK TLH++EEKLPSNVIRSKL
Subjt:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

XP_023544053.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Cucurbita pepo subsp. pepo] | 0.0e+00 | 97.99
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
        MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGM+LPETLYDT VAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE

Query:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
        LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
Subjt:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL

Query:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
        DTEGSNKNNDLTIR+AILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
Subjt:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC

Query:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
        LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAY+AG+RIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
Subjt:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG

Query:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
        EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANV GERSIENAKLLQSYTERS CSMHNG FGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
Subjt:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL

Query:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
        RVRLNHAE W WCDAKVLYICKGPWDVALLQLEQIPEQLSSI MDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSL YFP
Subjt:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP

Query:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
        AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIH FLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
Subjt:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP

Query:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        QLPGGDHETKGKGSRFAKFIAERREVFRKSTL N+EEKLPSNVIRSKL
Subjt:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KHN7 Uncharacterized protein | 0.0e+00 | 82.61
Query:  KVYCLIASTDGCYPQG--PQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAK
        +V C   ST  C  +   P HR LHL  S Y TA  PVMA RE+VDHARNFA+MVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTR AK
Subjt:  KVYCLIASTDGCYPQG--PQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAK

Query:  HLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRW
        HLGNYKDQFATLVLTVSSIFEPFMPLQHR+ IHKGKPELIPGVQIDIMVEG   + RDSDVSKTPHWHAAHLLALYDIPT+A+AL+ VMDAS+DSLHQRW
Subjt:  HLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRW

Query:  EVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSIS
        EVGWSLASY NGSPSFRDSLRGQIEN++ T  GSQ++LD EGS+KNNDLTIR+AILGVPS SKDMPNI +SPSRQRGSFLLAVGSPFGVLSPVHFLNS+S
Subjt:  EVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSIS

Query:  VGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMN
        VGSISNCYPP+S SKSLL+ADMRCLPGMEGCPVFDE A LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLG  + G+RIDNDN CI AVGN A+N
Subjt:  VGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMN

Query:  KEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDF
        KE K EG F SIQE+S CSRPFP KIEKA+ASVCLVT+GEGIWASGVLLNSQGL+LTNAHLIEPWRFGK NV GE+SIENAKLLQS+TE S CSM+N  F
Subjt:  KEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDF

Query:  GRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHG
        G ++ GN+  NASKN NILL NQ+E +KL+F NYGRRNL VRL+HAE WIWCDAK+LYICKG WDVALLQLEQIPEQLS I MD S P++GSKIHVIGHG
Subjt:  GRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHG

Query:  LLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLR
        LLGPKSG SPSVCSGVV+NVVKAKIP SYH+GDSLEYFPAMLETTAAVHPG SGGAVVNSEGHMIGLVTSNARHGRG IIPHLNFSIPCAALEPIH F +
Subjt:  LLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLR

Query:  DMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEK-LPSNVIRSKL
        DM+DLSV+KVLDEP+EQLSSIWALMSQRSPKPSP P LPQL G DHE+KGKGSRFAKFIAE+REV RK TLHN+ E+ LPS+++RSKL
Subjt:  DMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEK-LPSNVIRSKL

A0A1S3AZ98 glyoxysomal processing protease, glyoxysomal | 0.0e+00 | 86
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
        MA RE+VDHARNFA+MVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYD+R  KHLGNYKDQFATLVLTVSSIFEPFMPLQHR+TIHKGKPE
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE

Query:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
        LIPGVQIDIMVEG   + RDSDVSKTPHWHAAHLLALYDIPT+A+AL+ VMDASLDSLHQRWEVGWSLASY NGSPSFRDSLRGQIEN++ T  GSQR+L
Subjt:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL

Query:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVFDEHA
        D EGSNKNNDLTIR+AILGV S SKDMPNI +SPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPP+S SKSLL+ADMRCLPGMEGCPVFDE A
Subjt:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVFDEHA

Query:  CLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTI
         LIGVLIRPLVHYMTGAEIQLLIPWGAIATA SGLLLG  +AG+RIDNDNGCISAVGN A+NKE KFE  F SIQE+S+CSRPFP KIEKA+ASVCLVT+
Subjt:  CLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTI

Query:  GEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRN
        GEGIWASGVLLNSQGL+LTNAHLIEPWRFGK NVSGE+SIEN+KLLQS TE S CSM+NG FG +KSGN+  NASKN NILL NQ+E +KL+FANYGRRN
Subjt:  GEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRN

Query:  LRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYF
        LRVRL+HAE WIWCDAK+LYICKGPWDVALLQLE+IPEQLS IIMD S PS+GSKIHVIGHGLLGPKSG SPSVCSGVV+NVVKAKIP SYH+GDSLEY 
Subjt:  LRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYF

Query:  PAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDL
        PAMLETTAAVHPG SGGAVVNSEGHMIGLVTSNARHGRG IIPHLNFSIPCAALEPIH F +DM+DLSV+KVLDEP+EQLSSIWALMSQRSPKPSPLPDL
Subjt:  PAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDL

Query:  PQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEK-LPSNVIRSKL
        P+L G DH +KGKGSRFAKFIAERREV RK TLHN+ E+ LPS++ RSKL
Subjt:  PQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEK-LPSNVIRSKL

A0A6J1ED89 glyoxysomal processing protease, glyoxysomal isoform X2 | 0.0e+00 | 100
Query:  MVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNN
        MVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNN
Subjt:  MVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNN

Query:  DLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPL
        DLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPL
Subjt:  DLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPL

Query:  VHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVL
        VHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVL
Subjt:  VHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWASGVL

Query:  LNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEH
        LNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEH
Subjt:  LNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNHAEH

Query:  WIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAV
        WIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAV
Subjt:  WIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAV

Query:  HPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHET
        HPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHET
Subjt:  HPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHET

Query:  KGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        KGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
Subjt:  KGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

A0A6J1EJB5 glyoxysomal processing protease, glyoxysomal isoform X1 | 0.0e+00 | 100
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
        MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE

Query:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
        LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
Subjt:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL

Query:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
        DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
Subjt:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC

Query:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
        LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
Subjt:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG

Query:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
        EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
Subjt:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL

Query:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
        RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
Subjt:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP

Query:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
        AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
Subjt:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP

Query:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
Subjt:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

A0A6J1INF4 glyoxysomal processing protease, glyoxysomal isoform X1 | 0.0e+00 | 97.19
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
        MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPE

Query:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
        LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTA  ALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL
Subjt:  LIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYL

Query:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
        DTEGSNKNNDLTIR+AILGVPSFSKD+PNIRLSPSRQRGSFLLAVGSPFGVLSP+HFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC
Subjt:  DTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEHAC

Query:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
        L+GVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAY+AG+RIDNDNGCI+AVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG
Subjt:  LIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIG

Query:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
        EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERS CSMHNG FG KKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL
Subjt:  EGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNL

Query:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
        RVRLNHAE W WCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP
Subjt:  RVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFP

Query:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP
        AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIH FLRD KDLSV+K LDEPDEQLSSIWALMSQRSPKPSPLPDLP
Subjt:  AMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLP

Query:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
        QLPGGDHETKGKGSRFAKFIAERREVFRK TLH++EEKLPSNVIRSKL
Subjt:  QLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

SwissProt top hits | e value | % identity | Alignment
Q2FI55 Serine protease HtrA-like | 1.1e-04 | 28
Query:  GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCA
        G  I V+G+ L      F  +V  G+++  +   +P  + + +  +      +  A+V+PG SGGAVVN EG +IG+V +         + +++F+IP  
Subjt:  GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCA

Query:  ALEPIHWFLRDMKDLSVLKVLDEPD
         ++ I      +KDL     +D PD
Subjt:  ALEPIHWFLRDMKDLSVLKVLDEPD

Q2FZP2 Serine protease HtrA-like | 1.1e-04 | 28
Query:  GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCA
        G  I V+G+ L      F  +V  G+++  +   +P  + + +  +      +  A+V+PG SGGAVVN EG +IG+V +         + +++F+IP  
Subjt:  GSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCA

Query:  ALEPIHWFLRDMKDLSVLKVLDEPD
         ++ I      +KDL     +D PD
Subjt:  ALEPIHWFLRDMKDLSVLKVLDEPD

Q2T9J0 Peroxisomal leader peptide-processing protease | 2.3e-23 | 25.87
Query:  AILGVPSFSKDM-----PNIRLSP--SRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEH--ACLIGVLI
        A+LGV    +++     P + +SP  +  +G+ LL  GSPFG   P  FLN++S G +SN   P     LL+ D RCLPG EG  VF       L+ +++
Subjt:  AILGVPSFSKDM-----PNIRLSP--SRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNSSKSLLIADMRCLPGMEGCPVFDEH--ACLIGVLI

Query:  RPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWAS
         PL  +  G  +        +  A + L   A DA  R+ +    ++A+                  +       P         A+  LV  G  +W S
Subjt:  RPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVCLVTIGEGIWAS

Query:  GVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNH
        GV + +  LV+T  H + P    +             L++S T +S+                                                     
Subjt:  GVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANYGRRNLRVRLNH

Query:  AEHWIWCDAKVLYICKG--PWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLE
            IW   +V++  +   P+D+A++ LE+  + +  I +       G  + V+G G+ G   G  PSV SG+++ VV+            +   P ML+
Subjt:  AEHWIWCDAKVLYICKG--PWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLE

Query:  TTAAVHPGCSGGAVV-NSEGHMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKP
        TT AVH G SGG +  N  G+++G++TSN R +  GA  PHLNFSIP   L+P        +DL  L+ LD   E +  +W L    +  P
Subjt:  TTAAVHPGCSGGAVV-NSEGHMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKP

Q8VZD4 Glyoxysomal processing protease, glyoxysomal | 1.4e-182 | 48.21
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLY-DTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHR--NTIHKG
        M   +VV  +RNFAV+V+V+GPDPKGLKM+KHAFHQYHSG  TLSASG++LP  ++    VA  +     Q   LVLTV+S+ EPF+ L HR  ++I + 
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLY-DTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHR--NTIHKG

Query:  KPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGS-PSFRDSLRGQIENDENTFAGS
          +LIPG  I+IMVEG    E+++     P W  A LL+L D+P +++AL+ +++AS  S    W++GWSL S  NGS PS          N E+     
Subjt:  KPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGS-PSFRDSLRGQIENDENTFAGS

Query:  QRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVF
         +  +   +N       RMAILGVP      P++  + S  +G  L+A+GSPFG+LSPV+F NS+S GSI+N YP  S  KSL+IAD+RCLPGMEG PVF
Subjt:  QRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVF

Query:  DEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVC
         ++  LIG+LIRPL    +G EIQL++PWGAI TACS LLL          +  G  S  G+E ++ +             S  S P    IEKAM SVC
Subjt:  DEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVC

Query:  LVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANY
        L+T+ +G+WASG++LN  GL+LTNAHL+EPWR+GK  V GE         + +      S     F  +KS  L + A +N    +   I   K NF   
Subjt:  LVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANY

Query:  GRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQ-GD
        G R++RVRL H + W WC A V+YICK   D+ALLQLE +P +L  I  +FS P  G+  HV+GHGL GP+ G SPS+CSGVVA VV AK   +      
Subjt:  GRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQ-GD

Query:  SLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPK-P
         +  FPAMLETTAAVHPG SGGAV+NS GHMIGLVTSNARHG G +IPHLNFSIPCA L PI  F  DM++ ++L+ LD+P E+LSSIWALM   SPK  
Subjt:  SLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPK-P

Query:  SPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
          LP+LP+L    +  + KGS+FAKFIAE +++F K T      KL  +VI SKL
Subjt:  SPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL

Q9DBA6 Peroxisomal leader peptide-processing protease | 4.6e-16 | 35.06
Query:  PWDVALLQLEQIPEQLSSI--IMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNS
        P+D+A++ LE   E+L+ +   +       G  + V+G G+ G   G  PSV SG+++ VV+            ++  P ML+TT AVH G SGG + +S
Subjt:  PWDVALLQLEQIPEQLSSI--IMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLETTAAVHPGCSGGAVVNS

Query:  -EGHMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKP
          G ++G+V SN R +  GA  PHLNFSIP   L+P         DL  L+ LD   E +  +W L    S  P
Subjt:  -EGHMIGLVTSNAR-HGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKP

Arabidopsis top hits    e value    % identity
AT1G28320.1 protease-related    1.0e-183    48.21
Query:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLY-DTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHR--NTIHKG
        M   +VV  +RNFAV+V+V+GPDPKGLKM+KHAFHQYHSG  TLSASG++LP  ++    VA  +     Q   LVLTV+S+ EPF+ L HR  ++I + 
Subjt:  MATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPETLY-DTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHR--NTIHKG

Query:  KPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGS-PSFRDSLRGQIENDENTFAGS
          +LIPG  I+IMVEG    E+++     P W  A LL+L D+P +++AL+ +++AS  S    W++GWSL S  NGS PS          N E+     
Subjt:  KPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEVGWSLASYKNGS-PSFRDSLRGQIENDENTFAGS

Query:  QRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVF
         +  +   +N       RMAILGVP      P++  + S  +G  L+A+GSPFG+LSPV+F NS+S GSI+N YP  S  KSL+IAD+RCLPGMEG PVF
Subjt:  QRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS-SKSLLIADMRCLPGMEGCPVF

Query:  DEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVC
         ++  LIG+LIRPL    +G EIQL++PWGAI TACS LLL          +  G  S  G+E ++ +             S  S P    IEKAM SVC
Subjt:  DEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFPSKIEKAMASVC

Query:  LVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANY
        L+T+ +G+WASG++LN  GL+LTNAHL+EPWR+GK  V GE         + +      S     F  +KS  L + A +N    +   I   K NF   
Subjt:  LVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFANY

Query:  GRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQ-GD
        G R++RVRL H + W WC A V+YICK   D+ALLQLE +P +L  I  +FS P  G+  HV+GHGL GP+ G SPS+CSGVVA VV AK   +      
Subjt:  GRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQ-GD

Query:  SLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPK-P
         +  FPAMLETTAAVHPG SGGAV+NS GHMIGLVTSNARHG G +IPHLNFSIPCA L PI  F  DM++ ++L+ LD+P E+LSSIWALM   SPK  
Subjt:  SLEYFPAMLETTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPK-P

Query:  SPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
          LP+LP+L    +  + KGS+FAKFIAE +++F K T      KL  +VI SKL
Subjt:  SPLPDLPQLPGGDHETKGKGSRFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL


Sequences
CDS sequence
ACTCACATTCCCCCGTTCGCGGCCTCGTTTCGCGCACTCCTCGCCCGCGCGCCCCCCAAGAAGGTCTACTGCCTGATTGCTTCTACCGACGGCTGCTATCCTCAAGGACC
TCAACACCGTCTTCTTCATCTCGATTCTTCACTGTACCTCACAGCCTGTCCTCCTGTCATGGCTACGCGGGAAGTTGTGGATCATGCCAGAAATTTTGCCGTCATGGTCA
GAGTCCAAGGCCCTGACCCGAAGGGCCTGAAGATGCAAAAACATGCATTCCATCAGTATCATTCTGGGAGGACAACTCTTTCAGCATCTGGAATGATATTACCTGAAACC
CTGTATGATACCAGGGTCGCTAAGCATCTTGGTAATTATAAGGATCAATTTGCAACGTTGGTTCTGACTGTTTCCTCCATTTTTGAGCCTTTTATGCCACTTCAACACAG
AAATACTATTCACAAGGGAAAGCCTGAGCTAATTCCTGGTGTTCAGATTGATATTATGGTTGAGGGTAACTCATTGATGGAGAGAGATTCTGATGTTAGTAAAACTCCAC
ATTGGCATGCTGCCCACTTGTTGGCTTTGTATGATATACCTACAGCTGCCTCTGCTCTTAAACCAGTCATGGATGCTTCTTTGGATTCATTACATCAGAGATGGGAGGTC
GGCTGGTCTTTGGCCTCATATAAAAATGGTTCTCCATCCTTCAGGGATTCTCTTCGGGGACAGATTGAAAATGACGAGAATACCTTTGCTGGGAGCCAGAGATATTTGGA
TACGGAAGGATCTAACAAGAATAATGACTTGACAATAAGAATGGCTATTCTTGGTGTTCCCTCATTCTCAAAGGACATGCCAAACATCCGTTTATCTCCCTCAAGGCAGC
GAGGATCCTTTCTTCTTGCTGTTGGTTCCCCTTTTGGTGTTCTATCACCGGTGCATTTCCTTAACAGCATATCGGTTGGATCAATTTCCAATTGCTACCCTCCTAACTCG
AGCAAGTCATTGCTGATTGCTGACATGCGGTGCCTTCCTGGAATGGAAGGCTGTCCGGTTTTTGATGAACATGCATGTCTCATCGGTGTTCTGATTAGACCACTTGTGCA
TTATATGACTGGGGCTGAGATTCAGCTGTTGATTCCATGGGGAGCCATCGCCACTGCTTGCAGTGGTCTACTTCTAGGGGCTTATGATGCTGGAAAAAGGATTGACAACG
ACAATGGGTGTATTAGTGCTGTGGGGAATGAGGCAATGAATAAGGAACACAAATTTGAGGGAGCCTTTTGCAGTATCCAAGAAAACTCTAGTTGTTCTCGTCCTTTCCCA
TCTAAAATTGAGAAGGCAATGGCTTCTGTTTGTCTTGTTACAATTGGTGAAGGAATATGGGCATCTGGCGTTTTGCTCAATAGCCAAGGCCTAGTACTCACCAATGCCCA
CTTGATAGAGCCATGGAGATTTGGGAAAGCAAATGTTAGTGGAGAAAGATCGATTGAAAATGCCAAGCTGCTGCAGTCCTACACTGAGCGTTCTCTGTGTTCAATGCACA
ATGGTGATTTTGGCCGCAAAAAGAGTGGAAATTTAACACAAAATGCCTCTAAGAATGCAAATATTCTTCTCCAGAACCAAATTGAGGGTAGTAAGTTGAATTTTGCTAAC
TATGGTCGTAGAAACTTGCGTGTTCGCTTGAATCATGCTGAGCATTGGATATGGTGTGATGCTAAAGTGCTGTACATCTGCAAGGGACCTTGGGATGTTGCATTGTTGCA
GCTTGAGCAAATTCCAGAGCAGCTCTCATCTATTATTATGGATTTTTCATGGCCGTCCGCAGGATCAAAGATACATGTTATCGGACATGGACTTTTGGGACCGAAATCAG
GCTTCTCCCCATCTGTTTGCTCTGGTGTGGTAGCCAATGTGGTGAAAGCAAAGATTCCCCCATCTTATCATCAAGGAGATTCATTAGAATATTTTCCGGCGATGCTTGAA
ACAACAGCTGCAGTGCATCCTGGTTGTAGTGGGGGTGCTGTTGTCAATTCAGAAGGGCATATGATTGGACTTGTTACAAGCAATGCGAGGCATGGGCGAGGAGCTATTAT
TCCACACTTGAACTTCAGCATACCTTGTGCAGCTTTGGAACCCATTCATTGGTTCCTTAGAGACATGAAGGACCTCTCAGTCCTAAAAGTCCTGGATGAACCAGATGAAC
AGCTTTCTTCTATATGGGCATTGATGTCACAGCGATCTCCCAAGCCCTCTCCTCTGCCTGATCTGCCTCAATTGCCAGGTGGAGACCATGAAACAAAGGGGAAAGGTTCT
CGATTTGCGAAGTTCATTGCCGAACGACGAGAAGTATTCCGAAAGTCAACTCTTCATAACAAGGAGGAAAAACTTCCATCTAATGTGATCCGTAGCAAGCTATGA
mRNA sequence
ACTCACATTCCCCCGTTCGCGGCCTCGTTTCGCGCACTCCTCGCCCGCGCGCCCCCCAAGAAGGTCTACTGCCTGATTGCTTCTACCGACGGCTGCTATCCTCAAGGACC
TCAACACCGTCTTCTTCATCTCGATTCTTCACTGTACCTCACAGCCTGTCCTCCTGTCATGGCTACGCGGGAAGTTGTGGATCATGCCAGAAATTTTGCCGTCATGGTCA
GAGTCCAAGGCCCTGACCCGAAGGGCCTGAAGATGCAAAAACATGCATTCCATCAGTATCATTCTGGGAGGACAACTCTTTCAGCATCTGGAATGATATTACCTGAAACC
CTGTATGATACCAGGGTCGCTAAGCATCTTGGTAATTATAAGGATCAATTTGCAACGTTGGTTCTGACTGTTTCCTCCATTTTTGAGCCTTTTATGCCACTTCAACACAG
AAATACTATTCACAAGGGAAAGCCTGAGCTAATTCCTGGTGTTCAGATTGATATTATGGTTGAGGGTAACTCATTGATGGAGAGAGATTCTGATGTTAGTAAAACTCCAC
ATTGGCATGCTGCCCACTTGTTGGCTTTGTATGATATACCTACAGCTGCCTCTGCTCTTAAACCAGTCATGGATGCTTCTTTGGATTCATTACATCAGAGATGGGAGGTC
GGCTGGTCTTTGGCCTCATATAAAAATGGTTCTCCATCCTTCAGGGATTCTCTTCGGGGACAGATTGAAAATGACGAGAATACCTTTGCTGGGAGCCAGAGATATTTGGA
TACGGAAGGATCTAACAAGAATAATGACTTGACAATAAGAATGGCTATTCTTGGTGTTCCCTCATTCTCAAAGGACATGCCAAACATCCGTTTATCTCCCTCAAGGCAGC
GAGGATCCTTTCTTCTTGCTGTTGGTTCCCCTTTTGGTGTTCTATCACCGGTGCATTTCCTTAACAGCATATCGGTTGGATCAATTTCCAATTGCTACCCTCCTAACTCG
AGCAAGTCATTGCTGATTGCTGACATGCGGTGCCTTCCTGGAATGGAAGGCTGTCCGGTTTTTGATGAACATGCATGTCTCATCGGTGTTCTGATTAGACCACTTGTGCA
TTATATGACTGGGGCTGAGATTCAGCTGTTGATTCCATGGGGAGCCATCGCCACTGCTTGCAGTGGTCTACTTCTAGGGGCTTATGATGCTGGAAAAAGGATTGACAACG
ACAATGGGTGTATTAGTGCTGTGGGGAATGAGGCAATGAATAAGGAACACAAATTTGAGGGAGCCTTTTGCAGTATCCAAGAAAACTCTAGTTGTTCTCGTCCTTTCCCA
TCTAAAATTGAGAAGGCAATGGCTTCTGTTTGTCTTGTTACAATTGGTGAAGGAATATGGGCATCTGGCGTTTTGCTCAATAGCCAAGGCCTAGTACTCACCAATGCCCA
CTTGATAGAGCCATGGAGATTTGGGAAAGCAAATGTTAGTGGAGAAAGATCGATTGAAAATGCCAAGCTGCTGCAGTCCTACACTGAGCGTTCTCTGTGTTCAATGCACA
ATGGTGATTTTGGCCGCAAAAAGAGTGGAAATTTAACACAAAATGCCTCTAAGAATGCAAATATTCTTCTCCAGAACCAAATTGAGGGTAGTAAGTTGAATTTTGCTAAC
TATGGTCGTAGAAACTTGCGTGTTCGCTTGAATCATGCTGAGCATTGGATATGGTGTGATGCTAAAGTGCTGTACATCTGCAAGGGACCTTGGGATGTTGCATTGTTGCA
GCTTGAGCAAATTCCAGAGCAGCTCTCATCTATTATTATGGATTTTTCATGGCCGTCCGCAGGATCAAAGATACATGTTATCGGACATGGACTTTTGGGACCGAAATCAG
GCTTCTCCCCATCTGTTTGCTCTGGTGTGGTAGCCAATGTGGTGAAAGCAAAGATTCCCCCATCTTATCATCAAGGAGATTCATTAGAATATTTTCCGGCGATGCTTGAA
ACAACAGCTGCAGTGCATCCTGGTTGTAGTGGGGGTGCTGTTGTCAATTCAGAAGGGCATATGATTGGACTTGTTACAAGCAATGCGAGGCATGGGCGAGGAGCTATTAT
TCCACACTTGAACTTCAGCATACCTTGTGCAGCTTTGGAACCCATTCATTGGTTCCTTAGAGACATGAAGGACCTCTCAGTCCTAAAAGTCCTGGATGAACCAGATGAAC
AGCTTTCTTCTATATGGGCATTGATGTCACAGCGATCTCCCAAGCCCTCTCCTCTGCCTGATCTGCCTCAATTGCCAGGTGGAGACCATGAAACAAAGGGGAAAGGTTCT
CGATTTGCGAAGTTCATTGCCGAACGACGAGAAGTATTCCGAAAGTCAACTCTTCATAACAAGGAGGAAAAACTTCCATCTAATGTGATCCGTAGCAAGCTATGACCATT
CGAACTTTGGGCATTCTACAGGTTGAAACGTCTGATCAGATGAAAGAAAACTGATGGGAACTGCATCACAATGATTAAAAACAGCGTTGCTTTTGGTTTTGAGTGTGTTC
TTCTATATCTTTATATGATTGGCTCCATGGCTTCAGAGATGAAGACATTTAGCTACGATTCTAAATGTTCTGAATCGATTCTTTTATTTCACCAACAGGTATTTGAAATG
AAACCTCCCTTTACGATTAGTATTTTTCCTTTACGCATGGTCTTAAAAGTTGTATTGAACTTC
Protein sequence
THIPPFAASFRALLARAPPKKVYCLIASTDGCYPQGPQHRLLHLDSSLYLTACPPVMATREVVDHARNFAVMVRVQGPDPKGLKMQKHAFHQYHSGRTTLSASGMILPET
LYDTRVAKHLGNYKDQFATLVLTVSSIFEPFMPLQHRNTIHKGKPELIPGVQIDIMVEGNSLMERDSDVSKTPHWHAAHLLALYDIPTAASALKPVMDASLDSLHQRWEV
GWSLASYKNGSPSFRDSLRGQIENDENTFAGSQRYLDTEGSNKNNDLTIRMAILGVPSFSKDMPNIRLSPSRQRGSFLLAVGSPFGVLSPVHFLNSISVGSISNCYPPNS
SKSLLIADMRCLPGMEGCPVFDEHACLIGVLIRPLVHYMTGAEIQLLIPWGAIATACSGLLLGAYDAGKRIDNDNGCISAVGNEAMNKEHKFEGAFCSIQENSSCSRPFP
SKIEKAMASVCLVTIGEGIWASGVLLNSQGLVLTNAHLIEPWRFGKANVSGERSIENAKLLQSYTERSLCSMHNGDFGRKKSGNLTQNASKNANILLQNQIEGSKLNFAN
YGRRNLRVRLNHAEHWIWCDAKVLYICKGPWDVALLQLEQIPEQLSSIIMDFSWPSAGSKIHVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPPSYHQGDSLEYFPAMLE
TTAAVHPGCSGGAVVNSEGHMIGLVTSNARHGRGAIIPHLNFSIPCAALEPIHWFLRDMKDLSVLKVLDEPDEQLSSIWALMSQRSPKPSPLPDLPQLPGGDHETKGKGS
RFAKFIAERREVFRKSTLHNKEEKLPSNVIRSKL
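
The protein sequence listed above corresponds to the CDS read in frame from its first base. A minimal Python sketch of that relationship follows, assuming the standard genetic code (NCBI translation table 1). The helper name `translate` is illustrative and is not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 12 bases of the CDS sequence listed on this page.
print(translate("ACTCACATTCCCCCGTTC"))  # THIPPF
print(translate("AGCAAGCTATGA"))        # SKL
```

Whitespace is stripped before translation, so the wrapped CDS listing can be pasted in as-is and the result compared with the protein sequence.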