; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010875 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010875
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionglyoxysomal processing protease, glyoxysomal isoform X1
Genome locationscaffold35:2672268..2679674
RNA-Seq ExpressionMS010875
SyntenyMS010875
Gene Ontology termsGO:0016485 - protein processing (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR009003 - Peptidase S1, PA clan
IPR039245 - Peroxisomal/glyoxysomal leader peptide-processing protease
IPR043504 - Peptidase S1, PA clan, chymotrypsin-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034601.1 Glyoxysomal processing protease, glyoxysomal, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0079.44Show/hide
Query:  PVMATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQ
        PVMATRE+VD+ARNFA+MVRVQGPDPKGLKM KHAFHQYHSGRTTLSASGMILPE LYDT VAKHLGN+KDQFA+LVLT SSIFEPFMP QHR+ I   +
Subjt:  PVMATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQ

Query:  GKPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFV
        GKPELIPGVQIDIMVE NSLMERD +V    TPHWHAAHLLALYDIPT+A+AL+ VMDASLDS+HQRWEVGWSLASY NG PSFRD+L+ QIEND+ TF 
Subjt:  GKPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFV

Query:  GSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYM
        GSQ++LD EGSNK +DL +R+AILGVPS SKD+PNI +SPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+N YPP S +KSLL+ADMRCLP      
Subjt:  GSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYM

Query:  FYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQK
                              GMEGCPVFDEHA +IGVLIRPL+HYMTGAEIQLL+PWGAIATACS LL GAY AG+ I NDNGC + VGNEAM KE K
Subjt:  FYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQK

Query:  FEGTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKN
        FEG F SI ENS C  PFP+K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK N S ERSIENA+LLQ++TE S CSMHNG FG K 
Subjt:  FEGTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKN

Query:  SGSLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGP
        SG+L QNAS+NANIL+Q+Q++ +K +FANYGRRNLRVRLNHA+ WIWCDAKV+YIC+GPWDVALLQLEQIPEQLS I MD S PS+GSKI+VIGHGLLGP
Subjt:  SGSLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGP

Query:  KSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMED
        KSGFSPSVCSGVVANVVKAKIP S+HQGDSLEYFPA+LETTAAVHPG SGGAVVNSEGHM+GLVTSNARHGRG+IIPHLNFSIPCAALEPI+ F +DM+D
Subjt:  KSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMED

Query:  LSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        LSVLKVLDEPDEQLSSIWALM QRSPKPSP PDLPQL G DHETKGKGSRFAKFIAERREVF+K T+HNK E LPS  IRSKL
Subjt:  LSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

XP_022142190.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Momordica charantia]0.0e+0095.37Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK
        MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIR  QGK
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK

Query:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
        PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
Subjt:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS

Query:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY
        QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLP        
Subjt:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY

Query:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEG
                            GMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEG
Subjt:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEG

Query:  TFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSL
        TFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSL
Subjt:  TFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSL

Query:  KQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFS
        KQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFS
Subjt:  KQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFS

Query:  PSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLK
        PSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLK
Subjt:  PSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLK

Query:  VLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        VLDEPDEQLSS+WALMPQRSPK    PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKT+RSKL
Subjt:  VLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

XP_022142191.1 glyoxysomal processing protease, glyoxysomal isoform X2 [Momordica charantia]0.0e+0094.85Show/hide
Query:  MERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDLRVR
        MERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDLRVR
Subjt:  MERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDLRVR

Query:  IAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFYYILFYSGGFFLTTITILN
        IAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLP                          
Subjt:  IAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFYYILFYSGGFFLTTITILN

Query:  MLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKV
          GMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKV
Subjt:  MLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKV

Query:  EKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQD
        EKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQD
Subjt:  EKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQD

Query:  NKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPS
        NKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPS
Subjt:  NKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPS

Query:  SHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQ
        SHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSS+WALMPQ
Subjt:  SHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQ

Query:  RSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        RSPK    PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKT+RSKL
Subjt:  RSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

XP_038881508.1 glyoxysomal processing protease, glyoxysomal isoform X1 [Benincasa hispida]0.0e+0081.33Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK
        M T EIVD+ARNFAIMVRVQGPDPKGLKM KHAFHQYHSGRTTLSASGMILPE LYDT VAKHLGN+KDQFA+LVLT SSIFEPFM  QHRD I   +GK
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK

Query:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
        PELIPGVQIDIMVE NSLMERD +V    T HWHAAHLLALYDIPTSA ALQSVMDASLDS+HQRWEVGWSLASYTNG P FRD+ + QIENDK+TFVG+
Subjt:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS

Query:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY
        Q +LDMEGSNK +DL +RIAILGVPS SKD+PNISISPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+N YPP SW KSLLMADMRCLP        
Subjt:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY

Query:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE
                            GMEGCPVFDE AR+IGVLIRPL+HYMTGAEIQLL+PWGAI TACS LL GAY  GE IGNDNGC + VGNEAM KEQKF+
Subjt:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE

Query:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG
        G FSSI +NS    PFP +V+KAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTN S ERSIENA+LLQ HTE SPCSMH+GVFGGK SG
Subjt:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG

Query:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS
         + QNAS+NAN    DQL+DNK SFANYG RNLRVRLNHA+ WIWCDAKV+YIC+GPWDVALLQLEQ+PEQLSPI MDCS PSSGSKI+VIGHGLLGPKS
Subjt:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS

Query:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS
        GFSPSVCSGVVANVVKAKIPSS+HQGDSLEYFPA+LETTAAVHPGGSGGAVVNS+G M+GLVTSNARHGRG+IIPHLNFSIPCAALEPI+RFSKDMEDLS
Subjt:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS

Query:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEG-LPSKTIRSKL
        V+KVLDEPDEQLSSIWALM QRSPKPSP PDLPQL GEDHETKGKGSRFAKFIAE+REV +KPT+HN+GE  LPS  IRSKL
Subjt:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEG-LPSKTIRSKL

XP_038881509.1 glyoxysomal processing protease, glyoxysomal isoform X2 [Benincasa hispida]0.0e+0080.95Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK
        M T EIVD+ARNFAIMVRVQGPDPKGLKM KHAFHQYHSGRTTLSASGMILPE LYDT VAKHLGN+KDQFA+LVLT SSIFEPFM  QHRD I   +GK
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK

Query:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
        PELIPGVQIDIMVE NSLMERD +V    T HWHAAHLLALYDIPTSA ALQSVMDASLDS+HQRWEVGWSLASYTNG P FRD+ + QIENDK+TFVG+
Subjt:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS

Query:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY
        Q +LDMEGSNK +DL +RIAILGVPS SKD+PNISISPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+N YPP SW KSLLMADMRCLP        
Subjt:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY

Query:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE
                               GCPVFDE AR+IGVLIRPL+HYMTGAEIQLL+PWGAI TACS LL GAY  GE IGNDNGC + VGNEAM KEQKF+
Subjt:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE

Query:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG
        G FSSI +NS    PFP +V+KAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTN S ERSIENA+LLQ HTE SPCSMH+GVFGGK SG
Subjt:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG

Query:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS
         + QNAS+NAN    DQL+DNK SFANYG RNLRVRLNHA+ WIWCDAKV+YIC+GPWDVALLQLEQ+PEQLSPI MDCS PSSGSKI+VIGHGLLGPKS
Subjt:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS

Query:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS
        GFSPSVCSGVVANVVKAKIPSS+HQGDSLEYFPA+LETTAAVHPGGSGGAVVNS+G M+GLVTSNARHGRG+IIPHLNFSIPCAALEPI+RFSKDMEDLS
Subjt:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS

Query:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEG-LPSKTIRSKL
        V+KVLDEPDEQLSSIWALM QRSPKPSP PDLPQL GEDHETKGKGSRFAKFIAE+REV +KPT+HN+GE  LPS  IRSKL
Subjt:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEG-LPSKTIRSKL

TrEMBL top hitse value%identityAlignment
A0A0A0KHN7 Uncharacterized protein0.0e+0078.53Show/hide
Query:  AYLPVMATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIR
        A LPVMA REIVD+ARNFAIMVRVQGPDPKGLKM KHAFHQYHSGRTTLSASGMILPE LYDT  AKHLGN+KDQFA+LVLT SSIFEPFMP QHRD I 
Subjt:  AYLPVMATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIR

Query:  QLQGKPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKR
          +GKPELIPGVQIDIMVE    + RD +V    TPHWHAAHLLALYDIPTSATALQSVMDAS+DS+HQRWEVGWSLASYTNG PSFRD+L+ QIEN+KR
Subjt:  QLQGKPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKR

Query:  TFVGSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMY
        T VGSQK LD+EGS+K +DL +RIAILGVPSLSKD+PNISISPSRQRGSFLLAVGSPFGVLSPVHF NS+SVGSI+N YPP S +KSLLMADMRCLP   
Subjt:  TFVGSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMY

Query:  IYMFYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTK
                                 GMEGCPVFDE AR+IGVLIRPL+HYMTGAEIQLL+PWGAIATACS LL G    GE I NDN C   VGN A+ K
Subjt:  IYMFYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTK

Query:  EQKFEGTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFG
        EQK EG FSSI E+S C  PFP K+EKA+ASVCLVT+GEGIWASGVLLNSQGLILTNAHLIEPWRFGKTN   E+SIENA+LLQ+HTE SPCSM+N VFG
Subjt:  EQKFEGTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFG

Query:  GKNSGSLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGL
        G+  G+++ NAS+N NIL+ +QL+DNK SF NYGRRNL VRL+HA+ WIWCDAK++YIC+G WDVALLQLEQIPEQLSPI MDCSCP+SGSKI+VIGHGL
Subjt:  GKNSGSLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGL

Query:  LGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKD
        LGPKSG SPSVCSGVV+NVVKAKIPSS+H+GDSLEYFPA+LETTAAVHPGGSGGAVVNSEGHM+GLVTSNARHGRG IIPHLNFSIPCAALEPI+RFSKD
Subjt:  LGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKD

Query:  MEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEG-LPSKTIRSKL
        MEDLSV+KVLDEP+EQLSSIWALM QRSPKPSP P LPQL GEDHE+KGKGSRFAKFIAE+REV +KPT+HN+GE  LPS  +RSKL
Subjt:  MEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEG-LPSKTIRSKL

A0A6J1CK76 glyoxysomal processing protease, glyoxysomal isoform X10.0e+0095.37Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK
        MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIR  QGK
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK

Query:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
        PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
Subjt:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS

Query:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY
        QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLP        
Subjt:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY

Query:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEG
                            GMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEG
Subjt:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEG

Query:  TFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSL
        TFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSL
Subjt:  TFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSL

Query:  KQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFS
        KQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFS
Subjt:  KQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFS

Query:  PSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLK
        PSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLK
Subjt:  PSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLK

Query:  VLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        VLDEPDEQLSS+WALMPQRSPK    PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKT+RSKL
Subjt:  VLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

A0A6J1CLH2 glyoxysomal processing protease, glyoxysomal isoform X20.0e+0094.85Show/hide
Query:  MERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDLRVR
        MERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDLRVR
Subjt:  MERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDLRVR

Query:  IAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFYYILFYSGGFFLTTITILN
        IAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLP                          
Subjt:  IAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFYYILFYSGGFFLTTITILN

Query:  MLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKV
          GMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKV
Subjt:  MLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKV

Query:  EKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQD
        EKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQD
Subjt:  EKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQD

Query:  NKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPS
        NKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPS
Subjt:  NKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPS

Query:  SHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQ
        SHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSS+WALMPQ
Subjt:  SHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQ

Query:  RSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        RSPK    PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKT+RSKL
Subjt:  RSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

A0A6J1EJB5 glyoxysomal processing protease, glyoxysomal isoform X10.0e+0079.39Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK
        MATRE+VD+ARNFA+MVRVQGPDPKGLKM KHAFHQYHSGRTTLSASGMILPE LYDT VAKHLGN+KDQFA+LVLT SSIFEPFMP QHR+ I   +GK
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK

Query:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
        PELIPGVQIDIMVE NSLMERD +V    TPHWHAAHLLALYDIPT+A+AL+ VMDASLDS+HQRWEVGWSLASY NG PSFRD+L+ QIEND+ TF GS
Subjt:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS

Query:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY
        Q++LD EGSNK +DL +R+AILGVPS SKD+PNI +SPSRQRGSFLLAVGSPFGVLSPVHF NSISVGSI+N YPP S +KSLL+ADMRCLP        
Subjt:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY

Query:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE
                            GMEGCPVFDEHA +IGVLIRPL+HYMTGAEIQLL+PWGAIATACS LL GAY AG+ I NDNGC + VGNEAM KE KFE
Subjt:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE

Query:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG
        G F SI ENS C  PFP+K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK N S ERSIENA+LLQ++TE S CSMHNG FG K SG
Subjt:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG

Query:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS
        +L QNAS+NANIL+Q+Q++ +K +FANYGRRNLRVRLNHA+ WIWCDAKV+YIC+GPWDVALLQLEQIPEQLS I MD S PS+GSKI+VIGHGLLGPKS
Subjt:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS

Query:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS
        GFSPSVCSGVVANVVKAKIP S+HQGDSLEYFPA+LETTAAVHPG SGGAVVNSEGHM+GLVTSNARHGRG+IIPHLNFSIPCAALEPI+ F +DM+DLS
Subjt:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS

Query:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        VLKVLDEPDEQLSSIWALM QRSPKPSP PDLPQL G DHETKGKGSRFAKFIAERREVF+K T+HNK E LPS  IRSKL
Subjt:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

A0A6J1INF4 glyoxysomal processing protease, glyoxysomal isoform X10.0e+0079.13Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK
        MATRE+VD+ARNFA+MVRVQGPDPKGLKM KHAFHQYHSGRTTLSASGMILPE LYDT VAKHLGN+KDQFA+LVLT SSIFEPFMP QHR+ I   +GK
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGK

Query:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS
        PELIPGVQIDIMVE NSLMERD +V    TPHWHAAHLLALYDIPT+  AL+ VMDASLDS+HQRWEVGWSLASY NG PSFRD+L+ QIEND+ TF GS
Subjt:  PELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGS

Query:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY
        Q++LD EGSNK +DL +RIAILGVPS SKD+PNI +SPSRQRGSFLLAVGSPFGVLSP+HF NSISVGSI+N YPP S +KSLL+ADMRCLP        
Subjt:  QKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFY

Query:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE
                            GMEGCPVFDEHA ++GVLIRPL+HYMTGAEIQLL+PWGAIATACS LL GAY AGE I NDNGC N VGNEAM KE KFE
Subjt:  YILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTN-VGNEAMTKEQKFE

Query:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG
        G F SI ENS C  PFP+K+EKAMASVCLVTIGEGIWASGVLLNSQGL+LTNAHLIEPWRFGK N S ERSIENA+LLQ++TE SPCSMHNGVFGGK SG
Subjt:  GTFSSIHENSYCC-PFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG

Query:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS
        +L QNAS+NANIL+Q+Q++ +K +FANYGRRNLRVRLNHA+ W WCDAKV+YIC+GPWDVALLQLEQIPEQLS I MD S PS+GSKI+VIGHGLLGPKS
Subjt:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS

Query:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS
        GFSPSVCSGVVANVVKAKIP S+HQGDSLEYFPA+LETTAAVHPG SGGAVVNSEGHM+GLVTSNARHGRG+IIPHLNFSIPCAALEPI+RF +D +DLS
Subjt:  GFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLS

Query:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL
        V+K LDEPDEQLSSIWALM QRSPKPSP PDLPQL G DHETKGKGSRFAKFIAERREVF+K T+H++ E LPS  IRSKL
Subjt:  VLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSKTIRSKL

SwissProt top hitse value%identityAlignment
O22609 Protease Do-like 1, chloroplastic5.5e-0628.71Show/hide
Query:  GRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCP-SSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGD
        G  +LRV L  ADQ  + DAKV+   Q   DVA+L+++    +L PI +  S     G K++ IG+       G   ++ +GV++ +   +  SS   G 
Subjt:  GRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCP-SSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGD

Query:  SLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPC----AALEPIYRFSKDMEDLSVLKVL-DEPDEQLSSIWALMPQR
         ++    +++T AA++PG SGG +++S G ++G+ T  A +        + FSIP       ++ + RF K    +  +K   D+  EQL     L+   
Subjt:  SLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPC----AALEPIYRFSKDMEDLSVLKVL-DEPDEQLSSIWALMPQR

Query:  SP
         P
Subjt:  SP

P39668 Uncharacterized serine protease YyxA2.3e-0430.19Show/hide
Query:  SGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPC
        SG  +  IG+ L      F+ SV  GV++   +A IP   +     ++   +L+T AA++PG SGGA++N +G ++G+   N+     S +  +  SIP 
Subjt:  SGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPC

Query:  AALEPI
          + P+
Subjt:  AALEPI

Q2T9J0 Peroxisomal leader peptide-processing protease6.3e-1833.16Show/hide
Query:  IWCDAKVIYICQG--PWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAA
        IW   +V++  Q   P+D+A++ LE+  + + PI +       G  + V+G G+ G   G  PSV SG+++ VV+            +   P +L+TT A
Subjt:  IWCDAKVIYICQG--PWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAA

Query:  VHPGGSGGAVV-NSEGHMVGLVTSNAR-HGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP
        VH G SGG +  N  G+++G++TSN R +  G+  PHLNFSIP   L+P  +     +DL  L+ LD   E +  +W L    +  P
Subjt:  VHPGGSGGAVV-NSEGHMVGLVTSNAR-HGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP

Q8VZD4 Glyoxysomal processing protease, glyoxysomal1.6e-17845.43Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILP-EILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQG
        M   ++V ++RNFA++V+V+GPDPKGLKM KHAFHQYHSG  TLSASG++LP +I    EVA  +     Q  +LVLT +S+ EPF+   HR      Q 
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILP-EILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQG

Query:  KPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNG-PPSFRDALQRQIENDKRTFV
          +LIPG  I+IMVE     E++        P W  A LL+L D+P S+ ALQS+++AS  S    W++GWSL S  NG  PS        IE+  +  +
Subjt:  KPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNG-PPSFRDALQRQIENDKRTFV

Query:  GSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYM
              +   +N       R+AILGVP      P+++ + S  +G  L+A+GSPFG+LSPV+FFNS+S GSIANSYP  S  KSL++AD+RCLP      
Subjt:  GSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYM

Query:  FYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKF
                              GMEG PVF ++  +IG+LIRPL    +G EIQL+VPWGAI TACS LL       E    +   +  G+E ++     
Subjt:  FYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKF

Query:  EGTFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG
                ++    P    +EKAM SVCL+T+ +G+WASG++LN  GLILTNAHL+EPWR+GK     E       +L      S  S     F  + S 
Subjt:  EGTFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG

Query:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS
        +L + A  N    + + +++ K +F   G R++RVRL H D W WC A V+YIC+   D+ALLQLE +P +L PI  + S P  G+  +V+GHGL GP+ 
Subjt:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS

Query:  GFSPSVCSGVVANVVKAKIP-SSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDL
        G SPS+CSGVVA VV AK   ++      +  FPA+LETTAAVHPGGSGGAV+NS GHM+GLVTSNARHG G++IPHLNFSIPCA L PI++F++DM++ 
Subjt:  GFSPSVCSGVVANVVKAKIP-SSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDL

Query:  SVLKVLDEPDEQLSSIWALMPQRSPKPSPS-PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSK
        ++L+ LD+P E+LSSIWALMP  SPK   S P+LP+L  + +  + KGS+FAKFIAE +++F KPT  ++ + +PSK
Subjt:  SVLKVLDEPDEQLSSIWALMPQRSPKPSPS-PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSK

Q9DBA6 Peroxisomal leader peptide-processing protease1.2e-1633.86Show/hide
Query:  IWCDAKVIYICQ--GPWDVALLQLEQIPEQLS--PIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETT
        IW   +V++  Q   P+D+A++ LE   E+L+  P  +       G  + V+G G+ G   G  PSV SG+++ VV+            ++  P +L+TT
Subjt:  IWCDAKVIYICQ--GPWDVALLQLEQIPEQLS--PIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETT

Query:  AAVHPGGSGGAVVNS-EGHMVGLVTSNAR-HGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP
         AVH G SGG + +S  G ++G+V SN R +  G+  PHLNFSIP   L+P  +      DL  L+ LD   E +  +W L    S  P
Subjt:  AAVHPGGSGGAVVNS-EGHMVGLVTSNAR-HGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKP

Arabidopsis top hitse value%identityAlignment
AT1G28320.1 protease-related1.1e-17945.43Show/hide
Query:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILP-EILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQG
        M   ++V ++RNFA++V+V+GPDPKGLKM KHAFHQYHSG  TLSASG++LP +I    EVA  +     Q  +LVLT +S+ EPF+   HR      Q 
Subjt:  MATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILP-EILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQG

Query:  KPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNG-PPSFRDALQRQIENDKRTFV
          +LIPG  I+IMVE     E++        P W  A LL+L D+P S+ ALQS+++AS  S    W++GWSL S  NG  PS        IE+  +  +
Subjt:  KPELIPGVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNG-PPSFRDALQRQIENDKRTFV

Query:  GSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYM
              +   +N       R+AILGVP      P+++ + S  +G  L+A+GSPFG+LSPV+FFNS+S GSIANSYP  S  KSL++AD+RCLP      
Subjt:  GSQKHLDMEGSNKTDDLRVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYM

Query:  FYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKF
                              GMEG PVF ++  +IG+LIRPL    +G EIQL+VPWGAI TACS LL       E    +   +  G+E ++     
Subjt:  FYYILFYSGGFFLTTITILNMLGMEGCPVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKF

Query:  EGTFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG
                ++    P    +EKAM SVCL+T+ +G+WASG++LN  GLILTNAHL+EPWR+GK     E       +L      S  S     F  + S 
Subjt:  EGTFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIWASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSG

Query:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS
        +L + A  N    + + +++ K +F   G R++RVRL H D W WC A V+YIC+   D+ALLQLE +P +L PI  + S P  G+  +V+GHGL GP+ 
Subjt:  SLKQNASENANILIQDQLQDNK-SFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKS

Query:  GFSPSVCSGVVANVVKAKIP-SSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDL
        G SPS+CSGVVA VV AK   ++      +  FPA+LETTAAVHPGGSGGAV+NS GHM+GLVTSNARHG G++IPHLNFSIPCA L PI++F++DM++ 
Subjt:  GFSPSVCSGVVANVVKAKIP-SSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDL

Query:  SVLKVLDEPDEQLSSIWALMPQRSPKPSPS-PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSK
        ++L+ LD+P E+LSSIWALMP  SPK   S P+LP+L  + +  + KGS+FAKFIAE +++F KPT  ++ + +PSK
Subjt:  SVLKVLDEPDEQLSSIWALMPQRSPKPSPS-PDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNKGEGLPSK

AT3G27925.1 DegP protease 13.9e-0728.71Show/hide
Query:  GRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCP-SSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGD
        G  +LRV L  ADQ  + DAKV+   Q   DVA+L+++    +L PI +  S     G K++ IG+       G   ++ +GV++ +   +  SS   G 
Subjt:  GRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCP-SSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGD

Query:  SLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPC----AALEPIYRFSKDMEDLSVLKVL-DEPDEQLSSIWALMPQR
         ++    +++T AA++PG SGG +++S G ++G+ T  A +        + FSIP       ++ + RF K    +  +K   D+  EQL     L+   
Subjt:  SLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIPC----AALEPIYRFSKDMEDLSVLKVL-DEPDEQLSSIWALMPQR

Query:  SP
         P
Subjt:  SP

AT4G18370.1 DEGP protease 51.1e-0422.67Show/hide
Query:  TFSSIHENSY--CCPFPNKVEKAMASVCLVTIGEGIWASGVLL--NSQGLILTNAHLIEPWR----------------FGKTNSSAERSIENAQLLQTHT
        T S I+++ +   C   N V+       ++  G  +  +  LL  N Q L + +A  +E ++                F KT+ S    IE  +L +T +
Subjt:  TFSSIHENSY--CCPFPNKVEKAMASVCLVTIGEGIWASGVLL--NSQGLILTNAHLIEPWR----------------FGKTNSSAERSIENAQLLQTHT

Query:  EDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCP-
         D      NG   G  SG +        +I+    +    +   +G +  +V L  A    +     I       D+A+L++E    +L+P+ +  S   
Subjt:  EDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDAKVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCP-

Query:  SSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIP
          G   + IG+       G+  ++  GVV+ + + +IPS +  G S+      ++T A ++ G SGG +++S GH +G+ T+        +   +NF+IP
Subjt:  SSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHMVGLVTSNARHGRGSIIPHLNFSIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCCTATCTTCCTGTCATGGCTACGCGGGAAATCGTGGATTATGCCAGAAATTTTGCCATCATGGTCAGAGTCCAAGGTCCTGACCCGAAGGGCCTGAAGATGCACAAGCA
TGCATTCCATCAGTATCACTCTGGGAGGACAACTCTTTCAGCATCTGGAATGATATTACCTGAAATCCTTTATGACACTGAGGTCGCTAAGCATCTTGGTAATCATAAGG
ATCAATTTGCATCGTTGGTTCTGACTAATTCCTCAATTTTTGAGCCTTTTATGCCACAGCAACACAGAGATATCATTCGTCAGCTGCAGGGAAAGCCTGAGTTAATTCCT
GGTGTTCAGATTGACATTATGGTTGAGGATAACTCATTGATGGAGAGAGATTTTGAAGTACGCAATGGAGGAACTCCTCATTGGCATGCTGCGCACTTGTTGGCTTTGTA
TGATATACCTACATCTGCCACTGCTCTTCAGTCAGTCATGGATGCATCTTTAGATTCAATACATCAAAGATGGGAGGTCGGCTGGTCGTTGGCCTCATATACAAATGGTC
CTCCATCCTTCAGGGATGCTCTTCAGAGACAGATTGAAAATGACAAGAGAACTTTTGTTGGTAGCCAGAAGCATTTGGATATGGAAGGATCCAACAAGACTGATGACTTA
AGAGTAAGAATTGCCATTCTTGGTGTTCCGTCATTATCAAAGGATGTGCCAAACATCAGTATATCTCCCTCAAGGCAGAGAGGATCCTTTCTTCTTGCGGTTGGTTCTCC
TTTTGGTGTTCTATCACCGGTGCATTTCTTTAACAGCATATCAGTCGGGTCAATTGCCAATTCCTACCCTCCTCGCTCATGGAACAAGTCATTGCTGATGGCTGACATGC
GGTGTCTTCCTGGTATGTACATCTATATGTTTTATTATATTCTGTTTTATTCTGGTGGCTTCTTTCTTACCACCATTACGATACTTAATATGTTAGGAATGGAAGGCTGT
CCTGTTTTTGATGAACATGCACGTATCATCGGTGTTCTGATTAGGCCACTTATGCATTATATGACTGGTGCTGAGATTCAGTTGTTAGTTCCATGGGGAGCCATCGCAAC
TGCTTGCAGTGATTTGCTGTTTGGGGCTTATTATGCTGGAGAAGGGATTGGAAATGACAATGGGTGTACTAACGTGGGGAATGAGGCAATGACTAAGGAACAAAAATTTG
AGGGAACCTTCAGTAGTATTCATGAAAATTCTTATTGTTGTCCTTTCCCAAATAAAGTTGAGAAGGCAATGGCTTCTGTTTGTCTTGTTACAATTGGTGAAGGAATATGG
GCCTCTGGTGTTCTGCTCAATAGCCAAGGCCTAATACTCACAAATGCTCACTTGATAGAGCCGTGGAGATTTGGGAAAACAAACTCAAGTGCAGAAAGATCAATTGAAAA
TGCCCAGCTGCTGCAGACCCATACTGAGGATTCTCCATGTTCAATGCATAATGGTGTTTTTGGCGGGAAAAATAGTGGAAGTTTGAAACAAAATGCCTCTGAGAATGCAA
ATATTCTGATTCAGGACCAACTTCAGGATAATAAGAGTTTTGCTAACTATGGCCGTAGAAACTTGCGTGTTCGCTTGAACCATGCAGATCAGTGGATTTGGTGCGATGCT
AAAGTGATATATATCTGTCAGGGACCTTGGGATGTTGCCCTGTTACAGCTTGAGCAAATTCCAGAGCAGCTCTCACCTATTAAAATGGATTGTTCGTGTCCATCCTCAGG
GTCAAAGATATATGTTATCGGACATGGACTATTGGGACCAAAATCTGGCTTCTCTCCATCTGTTTGCTCTGGTGTGGTGGCGAATGTGGTGAAAGCAAAGATTCCCTCAT
CTCATCATCAAGGAGATTCACTAGAATATTTTCCTGCAATTCTTGAAACAACAGCTGCAGTCCATCCTGGTGGTAGTGGGGGTGCTGTTGTCAATTCAGAAGGCCATATG
GTTGGACTTGTTACAAGCAATGCGAGGCATGGGCGAGGATCTATTATTCCACACTTGAATTTCAGCATACCATGTGCAGCCTTGGAGCCCATTTATAGGTTCTCCAAAGA
CATGGAGGACCTCTCAGTCTTAAAAGTTCTTGATGAACCAGATGAGCAGCTTTCTTCTATATGGGCACTGATGCCACAACGATCTCCCAAGCCCTCTCCTTCGCCCGATC
TGCCACAACTGTTCGGTGAAGACCATGAAACAAAGGGGAAAGGTTCTCGATTTGCAAAGTTCATCGCCGAAAGGCGAGAAGTGTTCCAAAAGCCAACTGTTCATAACAAG
GGGGAGGGCCTTCCATCTAAGACAATCCGTAGCAAGTTA
mRNA sequenceShow/hide mRNA sequence
GCCTATCTTCCTGTCATGGCTACGCGGGAAATCGTGGATTATGCCAGAAATTTTGCCATCATGGTCAGAGTCCAAGGTCCTGACCCGAAGGGCCTGAAGATGCACAAGCA
TGCATTCCATCAGTATCACTCTGGGAGGACAACTCTTTCAGCATCTGGAATGATATTACCTGAAATCCTTTATGACACTGAGGTCGCTAAGCATCTTGGTAATCATAAGG
ATCAATTTGCATCGTTGGTTCTGACTAATTCCTCAATTTTTGAGCCTTTTATGCCACAGCAACACAGAGATATCATTCGTCAGCTGCAGGGAAAGCCTGAGTTAATTCCT
GGTGTTCAGATTGACATTATGGTTGAGGATAACTCATTGATGGAGAGAGATTTTGAAGTACGCAATGGAGGAACTCCTCATTGGCATGCTGCGCACTTGTTGGCTTTGTA
TGATATACCTACATCTGCCACTGCTCTTCAGTCAGTCATGGATGCATCTTTAGATTCAATACATCAAAGATGGGAGGTCGGCTGGTCGTTGGCCTCATATACAAATGGTC
CTCCATCCTTCAGGGATGCTCTTCAGAGACAGATTGAAAATGACAAGAGAACTTTTGTTGGTAGCCAGAAGCATTTGGATATGGAAGGATCCAACAAGACTGATGACTTA
AGAGTAAGAATTGCCATTCTTGGTGTTCCGTCATTATCAAAGGATGTGCCAAACATCAGTATATCTCCCTCAAGGCAGAGAGGATCCTTTCTTCTTGCGGTTGGTTCTCC
TTTTGGTGTTCTATCACCGGTGCATTTCTTTAACAGCATATCAGTCGGGTCAATTGCCAATTCCTACCCTCCTCGCTCATGGAACAAGTCATTGCTGATGGCTGACATGC
GGTGTCTTCCTGGTATGTACATCTATATGTTTTATTATATTCTGTTTTATTCTGGTGGCTTCTTTCTTACCACCATTACGATACTTAATATGTTAGGAATGGAAGGCTGT
CCTGTTTTTGATGAACATGCACGTATCATCGGTGTTCTGATTAGGCCACTTATGCATTATATGACTGGTGCTGAGATTCAGTTGTTAGTTCCATGGGGAGCCATCGCAAC
TGCTTGCAGTGATTTGCTGTTTGGGGCTTATTATGCTGGAGAAGGGATTGGAAATGACAATGGGTGTACTAACGTGGGGAATGAGGCAATGACTAAGGAACAAAAATTTG
AGGGAACCTTCAGTAGTATTCATGAAAATTCTTATTGTTGTCCTTTCCCAAATAAAGTTGAGAAGGCAATGGCTTCTGTTTGTCTTGTTACAATTGGTGAAGGAATATGG
GCCTCTGGTGTTCTGCTCAATAGCCAAGGCCTAATACTCACAAATGCTCACTTGATAGAGCCGTGGAGATTTGGGAAAACAAACTCAAGTGCAGAAAGATCAATTGAAAA
TGCCCAGCTGCTGCAGACCCATACTGAGGATTCTCCATGTTCAATGCATAATGGTGTTTTTGGCGGGAAAAATAGTGGAAGTTTGAAACAAAATGCCTCTGAGAATGCAA
ATATTCTGATTCAGGACCAACTTCAGGATAATAAGAGTTTTGCTAACTATGGCCGTAGAAACTTGCGTGTTCGCTTGAACCATGCAGATCAGTGGATTTGGTGCGATGCT
AAAGTGATATATATCTGTCAGGGACCTTGGGATGTTGCCCTGTTACAGCTTGAGCAAATTCCAGAGCAGCTCTCACCTATTAAAATGGATTGTTCGTGTCCATCCTCAGG
GTCAAAGATATATGTTATCGGACATGGACTATTGGGACCAAAATCTGGCTTCTCTCCATCTGTTTGCTCTGGTGTGGTGGCGAATGTGGTGAAAGCAAAGATTCCCTCAT
CTCATCATCAAGGAGATTCACTAGAATATTTTCCTGCAATTCTTGAAACAACAGCTGCAGTCCATCCTGGTGGTAGTGGGGGTGCTGTTGTCAATTCAGAAGGCCATATG
GTTGGACTTGTTACAAGCAATGCGAGGCATGGGCGAGGATCTATTATTCCACACTTGAATTTCAGCATACCATGTGCAGCCTTGGAGCCCATTTATAGGTTCTCCAAAGA
CATGGAGGACCTCTCAGTCTTAAAAGTTCTTGATGAACCAGATGAGCAGCTTTCTTCTATATGGGCACTGATGCCACAACGATCTCCCAAGCCCTCTCCTTCGCCCGATC
TGCCACAACTGTTCGGTGAAGACCATGAAACAAAGGGGAAAGGTTCTCGATTTGCAAAGTTCATCGCCGAAAGGCGAGAAGTGTTCCAAAAGCCAACTGTTCATAACAAG
GGGGAGGGCCTTCCATCTAAGACAATCCGTAGCAAGTTA
Protein sequenceShow/hide protein sequence
AYLPVMATREIVDYARNFAIMVRVQGPDPKGLKMHKHAFHQYHSGRTTLSASGMILPEILYDTEVAKHLGNHKDQFASLVLTNSSIFEPFMPQQHRDIIRQLQGKPELIP
GVQIDIMVEDNSLMERDFEVRNGGTPHWHAAHLLALYDIPTSATALQSVMDASLDSIHQRWEVGWSLASYTNGPPSFRDALQRQIENDKRTFVGSQKHLDMEGSNKTDDL
RVRIAILGVPSLSKDVPNISISPSRQRGSFLLAVGSPFGVLSPVHFFNSISVGSIANSYPPRSWNKSLLMADMRCLPGMYIYMFYYILFYSGGFFLTTITILNMLGMEGC
PVFDEHARIIGVLIRPLMHYMTGAEIQLLVPWGAIATACSDLLFGAYYAGEGIGNDNGCTNVGNEAMTKEQKFEGTFSSIHENSYCCPFPNKVEKAMASVCLVTIGEGIW
ASGVLLNSQGLILTNAHLIEPWRFGKTNSSAERSIENAQLLQTHTEDSPCSMHNGVFGGKNSGSLKQNASENANILIQDQLQDNKSFANYGRRNLRVRLNHADQWIWCDA
KVIYICQGPWDVALLQLEQIPEQLSPIKMDCSCPSSGSKIYVIGHGLLGPKSGFSPSVCSGVVANVVKAKIPSSHHQGDSLEYFPAILETTAAVHPGGSGGAVVNSEGHM
VGLVTSNARHGRGSIIPHLNFSIPCAALEPIYRFSKDMEDLSVLKVLDEPDEQLSSIWALMPQRSPKPSPSPDLPQLFGEDHETKGKGSRFAKFIAERREVFQKPTVHNK
GEGLPSKTIRSKL