; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021691 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021691
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionATP-dependent Clp protease proteolytic subunit
Genome locationtig00153809:438145..456154
RNA-Seq ExpressionSgr021691
SyntenySgr021691
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0009526 - plastid envelope (cellular component)
GO:0009570 - chloroplast stroma (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR001907 - ATP-dependent Clp protease proteolytic subunit
IPR023562 - Clp protease proteolytic subunit /Translocation-enhancing protein TepA
IPR029045 - ClpP/crotonase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF1878300.1 hypothetical protein Lal_00046967 [Lupinus albus]7.7e-29063.43Show/hide
Query:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFP-FSRSKTPCR-RYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT
        MA+S L  LS P +        ++KSSFL  T   FP F +S T    R F SPSAK+S DHIPKQFR +NLKDG+M+NYKNAPQYLYGL+PSQMDMFMT
Subjt:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFP-FSRSKTPCR-RYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT

Query:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNPIR+Q+E VTEE+ISS+ NYL+HGGMWS S   N   S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL

Query:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFA
        +ANTEYY+ELLAKGIGK KEEIAKDVQRPKY QAQEAI+YG+ADKIIDSRD+ F+KRNYDEM+AQSRA RR AGG+PQ APSGFRA  +     R+    
Subjt:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFA

Query:  IVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVV
        +    V         R S Q   VNA      +V+C            + TK  C   H+  V G+      +                           
Subjt:  IVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVV

Query:  WGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSK
                I  +RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLV+HIK FPP TAPAGY CP+CST IWPPK++KDS SRLHSK
Subjt:  WGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSK

Query:  LKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSS
        LKEAI+QTG+EKNLFGNHPV L  TESHGPPPAFASDPL+    + H    S+           +GFS  TG+  SK ++ DIVE+D P S GNF++ SS
Subjt:  LKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSS

Query:  PSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFRHKFLRALLPFW
        P G  PGATTRK     +RQNSEISYYADDED NRKKY RR                                            GPFRHKFLRALLPFW
Subjt:  PSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFRHKFLRALLPFW

Query:  STALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ
        S+ALPTLPVTAPPRKDA N  + SEGR RHQR SRMDPRKILL+IAIMAC+ATMGILYYRLVQRG GEEF  DEQ
Subjt:  STALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ

KAF1880709.1 hypothetical protein Lal_00011768 [Lupinus albus]1.1e-26750.75Show/hide
Query:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFPFSRSKT--PCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT
        MA+S L  LS P S    +   +SKSSF  GT   FP     T    RR F SP AK+S DHIP QFR ENL+DGLM+NYKN P+YLYGL+PSQMDMF+T
Subjt:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFPFSRSKT--PCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT

Query:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNPIR+QSE VTEE+ISS+ NYL+HGGMWS S   N   S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL

Query:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGF----------------
        +ANTEYY+ELLAKGIGK KEEIAKDVQRPKY QAQEAI+YG+ADKIIDSRD+ F+KRNYDEML+QSRA RR AGG+PQ APSGF                
Subjt:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGF----------------

Query:  ---------------------------RASEI-------------------YIPC---------RRIVSFAIVSI-------------------------
                                   R +E+                   ++ C          R V  A+V +                         
Subjt:  ---------------------------RASEI-------------------YIPC---------RRIVSFAIVSI-------------------------

Query:  ---------------RVEL--------------------------------------------------HF-------------------TRSSPRR--S
                        +EL                                                  H+                    R  P +  +
Subjt:  ---------------RVEL--------------------------------------------------HF-------------------TRSSPRR--S

Query:  VQWWS-VNAARPRSYI--------VLCTRFPYVENAFVSQNTKYACHGG-------------------------------HEFSVKGKMNSMSSN-----
        V W S + AAR    I         L    P     +V  +  YA  G                                HEF    K +          
Subjt:  VQWWS-VNAARPRSYI--------VLCTRFPYVENAFVSQNTKYACHGG-------------------------------HEFSVKGKMNSMSSN-----

Query:  ---GRDLRPV--------------------------ERKFIHFRL-----------------------------------------GRIAPMIVSV----
            R LR                            ER  I F L                                          R+   I S     
Subjt:  ---GRDLRPV--------------------------ERKFIHFRL-----------------------------------------GRIAPMIVSV----

Query:  -VWGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLH
         V    ++L    +RTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHVIHT+CLV+HIK FPP TAPAGY CP+CST IWPPK++KDSGSRLH
Subjt:  -VWGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLH

Query:  SKLKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKN
        SKLKE I+QTG+EKNLFGNHPV LS TES GPPPAFASDPL+    + H    SL           +G+S  TG+ +SK +  DIVEID   S GNFV+ 
Subjt:  SKLKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKN

Query:  SSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKIL
        SSP G  PGATTRKG    +RQNSEISYYADDED NRKKY RRGPFRHKFLRALLPFWS+ALPTLPV+APP+KDA N  + SEGR RHQR SRMDPRKIL
Subjt:  SSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKIL

Query:  LIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ
        L+IAIMAC+ATMGILYYRL QRG GEE   DEQ
Subjt:  LIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ

KAG6588711.1 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0075.21Show/hide
Query:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR
        MATSS  H+SA PSLAV SSKSSFL GT LPFP SR +T  RRYFLSPSAK SMDHIPKQFR ENLKDGLMENY+N PQ LYGLTPSQ+DMFMTEDNP+R
Subjt:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR

Query:  RQSELVTEENISSSHNYLNHGGMWSLSGKNEKS--SRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
        RQSELVTEE+ISS+H+YL +GGMWSLSG + K   S+YSMS SMYRGGGRG GR ++APPDLPSLLLDARI YLGMPIVPAVTELLVAQFMWLDYDNPSK
Subjt:  RQSELVTEENISSSHNYLNHGGMWSLSGKNEKS--SRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE
        PIYLYINSPGTQNEKME VG ETEAYAIADMMAYCK DVYT+NCGMA+GQAAMLLSLGTKGYRAVQPNSSAKLYLPKV+RSSG+VIDMWIKA+ELDANT+
Subjt:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE

Query:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIR
        YYIELLAKG GKP EEIAKD+QRPKYL  QEAIDYGL DKII SRDSAFEKRNYD+MLAQSRAMR+GAGG+PQAAPSG                      
Subjt:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIR

Query:  VELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMS
                          ++ + P + +V+C            + TK  C   H+  V G+      +                                
Subjt:  VELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMS

Query:  ALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAI
           I  +RTYSEWVLNGDYDWPPNCCLCHATLEEG GPQTTRLGCLHVIHTDCLV+HIK FPPSTAPAGYDCPACS SIWPPKNIKDSGSRLH+KLKEAI
Subjt:  ALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAI

Query:  LQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVT
        LQTGLEK+LFGNHPVGLSATESHGPPPAFASDPLVSSSGD HN KSSLNSIA V SN  EGFSATTGAGSSKNNI+DIVEI+ PG EGNFVK SSPS   
Subjt:  LQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVT

Query:  PGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMA
        P ATTRKGA NYDRQ+SEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKD+L  NDVSEGRVRHQRPSRMDPRKILL+IAIMA
Subjt:  PGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMA

Query:  CLATMGILYYRLVQRGIGEEFVDDEQQ
        CLATMGILYYRL QRGIGEE V+DEQQ
Subjt:  CLATMGILYYRLVQRGIGEEFVDDEQQ

OIV91138.1 hypothetical protein TanjilG_30360 [Lupinus angustifolius]7.7e-29866.55Show/hide
Query:  MATSSLAHLSAP--PSLAVDSS--KSSFLCGTKLPFP-FSRSKTP-CRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT
        MA+S L  LS P  PS   D+S  KSSF+ GT   FP F +S T   RR F SPSAK+S DHIPKQFR +NLKDG+M+NYKNAPQYLYGL+PSQMDMFMT
Subjt:  MATSSLAHLSAP--PSLAVDSS--KSSFLCGTKLPFP-FSRSKTP-CRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT

Query:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNPIR+Q+E VTEE+ISS+ NYL+HGGMWS S   N   S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL

Query:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFA
        +ANTEYYIELLAKGIGK KEEIAKDVQRPKY QAQEAI+YG+ DKIIDSRD+ F+KRNYDEM+AQSRA RR AGG+PQ APSGFR S             
Subjt:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFA

Query:  IVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVV
                    S     + W  +       + ++  + P+    F  + TK  C   H+  V G+      +                           
Subjt:  IVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVV

Query:  WGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSK
                I  +RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLV+HIK FPP TAPAGY CP+CST IWPPK++KDS SRLHSK
Subjt:  WGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSK

Query:  LKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSS
        LKEAI+QTG+EKNLFGNHPV LS TES GPPPAFASDPL+    + H    S+           +GFS  TG+  SK ++ DIVE+D P S GNF++ SS
Subjt:  LKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSS

Query:  PSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLI
        P G  PGATTRK     +RQNSEISYYADDED NRKKY RRGPFRHKFLRALLPFWS+ALPTLPVTAPPRKDA N  + SEGR RHQR SRMDPRKILL+
Subjt:  PSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLI

Query:  IAIMACLATMGILYYRLVQRGIGEEFVDDEQ
        IAIMACLATMGILYYRL QRG GEEF  DEQ
Subjt:  IAIMACLATMGILYYRLVQRGIGEEFVDDEQ

RDY10958.1 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic, partial [Mucuna pruriens]1.9e-27264.73Show/hide
Query:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR
        M++S    LSAP         SSFL GTKL FPFS    P  R F S SAK S+DHIPKQFR ENL+DGLMEN+KNAPQYLYGLTPSQMDMFMTEDNPIR
Subjt:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR

Query:  RQSELVTEENISSSHNYLNHGGMWSLS--GKNEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
        +Q+E VTEE+ISS+ NY++HGGMWSLS  GKN+ +S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNP+K
Subjt:  RQSELVTEENISSSHNYLNHGGMWSLS--GKNEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE
        PIYLYINS GTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL+ANTE
Subjt:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE

Query:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIR
        YYIELLAKGIGK KEEIAK+VQRPKY QAQEAIDYG+ADK IDSRD  FEKRNYDEMLAQSRA RR AGG+PQ            +  + I   A   I 
Subjt:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIR

Query:  VELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMS
         E              W                         S  TK  C   H+  V G+      +                                
Subjt:  VELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMS

Query:  ALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAI
           I  VRTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHV+HT+CLV+HIK F P TAPAGY CP+CSTSIWPPK++KDSGSRLHSKLKEAI
Subjt:  ALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAI

Query:  LQ------------TGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEG
        +Q            +G+EKN+FGNHPV LS TES  PPPAFASDPL+S   + H    S+           +GFS  TG+   K ++ DIVEID   S G
Subjt:  LQ------------TGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEG

Query:  NFVKNSSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMD
        NF+K+SSP  V PGATTRKG+ + +RQNSEISYYADDEDGNRKKY +RGPF HKFLRALLPFWS ALPTLPVTAP RKDA N  D SEGR RHQR S MD
Subjt:  NFVKNSSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMD

Query:  PRKILLIIAIM
        PRKILL+IAI+
Subjt:  PRKILLIIAIM

TrEMBL top hitse value%identityAlignment
A0A371I7F6 ATP-dependent Clp protease proteolytic subunit (Fragment)9.2e-27364.73Show/hide
Query:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR
        M++S    LSAP         SSFL GTKL FPFS    P  R F S SAK S+DHIPKQFR ENL+DGLMEN+KNAPQYLYGLTPSQMDMFMTEDNPIR
Subjt:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR

Query:  RQSELVTEENISSSHNYLNHGGMWSLS--GKNEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
        +Q+E VTEE+ISS+ NY++HGGMWSLS  GKN+ +S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNP+K
Subjt:  RQSELVTEENISSSHNYLNHGGMWSLS--GKNEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE
        PIYLYINS GTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL+ANTE
Subjt:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE

Query:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIR
        YYIELLAKGIGK KEEIAK+VQRPKY QAQEAIDYG+ADK IDSRD  FEKRNYDEMLAQSRA RR AGG+PQ            +  + I   A   I 
Subjt:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIR

Query:  VELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMS
         E              W                         S  TK  C   H+  V G+      +                                
Subjt:  VELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMS

Query:  ALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAI
           I  VRTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHV+HT+CLV+HIK F P TAPAGY CP+CSTSIWPPK++KDSGSRLHSKLKEAI
Subjt:  ALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAI

Query:  LQ------------TGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEG
        +Q            +G+EKN+FGNHPV LS TES  PPPAFASDPL+S   + H    S+           +GFS  TG+   K ++ DIVEID   S G
Subjt:  LQ------------TGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEG

Query:  NFVKNSSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMD
        NF+K+SSP  V PGATTRKG+ + +RQNSEISYYADDEDGNRKKY +RGPF HKFLRALLPFWS ALPTLPVTAP RKDA N  D SEGR RHQR S MD
Subjt:  NFVKNSSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMD

Query:  PRKILLIIAIM
        PRKILL+IAI+
Subjt:  PRKILLIIAIM

A0A5A7SMD0 ATP-dependent Clp protease proteolytic subunit1.6e-18787.34Show/hide
Query:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR
        MATSSL HLSAPPSLA+DSSKSSFLCGT+LP P SR +T CRRY LSPSA+ SMDHIPKQFRGENLKDGL+ENYKNAP+YLYGLTPSQMDMFMTEDNP+R
Subjt:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIR

Query:  RQSELVTEENISSSHNYLNHGGMWSLSGKNEKS-SRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKP
        RQSELVTE+NISSS+NYLNHGGMWSL+G + K  ++YSMSVSMYRGGGR +GRPR APPDLPSLLLDARI YLGMPIVPAVTELLVAQFMWLDYDNPSKP
Subjt:  RQSELVTEENISSSHNYLNHGGMWSLSGKNEKS-SRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKP

Query:  IYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEY
        IYLYINSPGTQNEKMETVGSETEAYA+ DMM+YCKSDVYTVN GMA+GQAAMLLSLGTKGYRA+QPNSS KLYLPKVNRSSG+VIDMWIKAKELDANTEY
Subjt:  IYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEY

Query:  YIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR
        Y+ELLAKGIGKPKEEI KD+QR KY QAQEAIDYG+ADKII S+DSAFEKRNYDEMLAQS+A RRGAGG+PQAAP+GFR
Subjt:  YIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR

A0A6A5N4U1 ATP-dependent Clp protease proteolytic subunit3.7e-29063.43Show/hide
Query:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFP-FSRSKTPCR-RYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT
        MA+S L  LS P +        ++KSSFL  T   FP F +S T    R F SPSAK+S DHIPKQFR +NLKDG+M+NYKNAPQYLYGL+PSQMDMFMT
Subjt:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFP-FSRSKTPCR-RYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT

Query:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNPIR+Q+E VTEE+ISS+ NYL+HGGMWS S   N   S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL

Query:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFA
        +ANTEYY+ELLAKGIGK KEEIAKDVQRPKY QAQEAI+YG+ADKIIDSRD+ F+KRNYDEM+AQSRA RR AGG+PQ APSGFRA  +     R+    
Subjt:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFA

Query:  IVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVV
        +    V         R S Q   VNA      +V+C            + TK  C   H+  V G+      +                           
Subjt:  IVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFVSQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVV

Query:  WGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSK
                I  +RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLV+HIK FPP TAPAGY CP+CST IWPPK++KDS SRLHSK
Subjt:  WGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSK

Query:  LKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSS
        LKEAI+QTG+EKNLFGNHPV L  TESHGPPPAFASDPL+    + H    S+           +GFS  TG+  SK ++ DIVE+D P S GNF++ SS
Subjt:  LKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSS

Query:  PSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFRHKFLRALLPFW
        P G  PGATTRK     +RQNSEISYYADDED NRKKY RR                                            GPFRHKFLRALLPFW
Subjt:  PSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFRHKFLRALLPFW

Query:  STALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ
        S+ALPTLPVTAPPRKDA N  + SEGR RHQR SRMDPRKILL+IAIMAC+ATMGILYYRLVQRG GEEF  DEQ
Subjt:  STALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ

A0A6A5N6H9 RING-type domain-containing protein5.2e-26850.75Show/hide
Query:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFPFSRSKT--PCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT
        MA+S L  LS P S    +   +SKSSF  GT   FP     T    RR F SP AK+S DHIP QFR ENL+DGLM+NYKN P+YLYGL+PSQMDMF+T
Subjt:  MATSSLAHLSAPPS----LAVDSSKSSFLCGTKLPFPFSRSKT--PCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMT

Query:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNPIR+QSE VTEE+ISS+ NYL+HGGMWS S   N   S+YSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPIRRQSELVTEENISSSHNYLNHGGMWSLSGK-NEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKEL

Query:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGF----------------
        +ANTEYY+ELLAKGIGK KEEIAKDVQRPKY QAQEAI+YG+ADKIIDSRD+ F+KRNYDEML+QSRA RR AGG+PQ APSGF                
Subjt:  DANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGF----------------

Query:  ---------------------------RASEI-------------------YIPC---------RRIVSFAIVSI-------------------------
                                   R +E+                   ++ C          R V  A+V +                         
Subjt:  ---------------------------RASEI-------------------YIPC---------RRIVSFAIVSI-------------------------

Query:  ---------------RVEL--------------------------------------------------HF-------------------TRSSPRR--S
                        +EL                                                  H+                    R  P +  +
Subjt:  ---------------RVEL--------------------------------------------------HF-------------------TRSSPRR--S

Query:  VQWWS-VNAARPRSYI--------VLCTRFPYVENAFVSQNTKYACHGG-------------------------------HEFSVKGKMNSMSSN-----
        V W S + AAR    I         L    P     +V  +  YA  G                                HEF    K +          
Subjt:  VQWWS-VNAARPRSYI--------VLCTRFPYVENAFVSQNTKYACHGG-------------------------------HEFSVKGKMNSMSSN-----

Query:  ---GRDLRPV--------------------------ERKFIHFRL-----------------------------------------GRIAPMIVSV----
            R LR                            ER  I F L                                          R+   I S     
Subjt:  ---GRDLRPV--------------------------ERKFIHFRL-----------------------------------------GRIAPMIVSV----

Query:  -VWGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLH
         V    ++L    +RTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHVIHT+CLV+HIK FPP TAPAGY CP+CST IWPPK++KDSGSRLH
Subjt:  -VWGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLH

Query:  SKLKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKN
        SKLKE I+QTG+EKNLFGNHPV LS TES GPPPAFASDPL+    + H    SL           +G+S  TG+ +SK +  DIVEID   S GNFV+ 
Subjt:  SKLKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKN

Query:  SSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKIL
        SSP G  PGATTRKG    +RQNSEISYYADDED NRKKY RRGPFRHKFLRALLPFWS+ALPTLPV+APP+KDA N  + SEGR RHQR SRMDPRKIL
Subjt:  SSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKIL

Query:  LIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ
        L+IAIMAC+ATMGILYYRL QRG GEE   DEQ
Subjt:  LIIAIMACLATMGILYYRLVQRGIGEEFVDDEQ

A0A6J1D6K6 ATP-dependent Clp protease proteolytic subunit3.4e-19892.89Show/hide
Query:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKN-SMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPI
        MATSSL++LSAPPSLA+DSSKS F+CGT+L FP SRS+T CRRYFLSPSAKN SMDHIPK+FRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPI
Subjt:  MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKN-SMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPI

Query:  RRQSELVTEENISSSHNYLNHGGMWSLSGKNEKS-SRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
        RRQSELVTEENISSSHNYLNHGGMWSLSG N+K  S+YSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
Subjt:  RRQSELVTEENISSSHNYLNHGGMWSLSGKNEKS-SRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE
        PIYLYINS GTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSS KLYLPKVNRSSG+VIDMWIKAKELDANTE
Subjt:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTE

Query:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR
        YYIELLAKGIGKPKEEI KDVQRPKY QAQEA+DYGLADKIIDS+DSAFEKRNYDEMLAQSRAMR+G GG+PQAAPSGFR
Subjt:  YYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR

SwissProt top hitse value%identityAlignment
P74466 Putative ATP-dependent Clp protease proteolytic subunit-like2.6e-3841.4Show/hide
Query:  MSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIA
        M ++  +    G    +T PPDL SLLL  RI YLGMP+  +          VT+L++AQ ++L +D+P KPIY YINS GT     + VG ETEA+AI 
Subjt:  MSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIA

Query:  DMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQA
        D + Y K  V+T+  G A G AAM+LS GTKGYRA  P+++  L   +   + G   D+ I+AKE+ +N +  +E+L+   G+ +E++AKD+ R  YL  
Subjt:  DMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQA

Query:  QEAIDYGLADKIIDS
         +A +YGL D++++S
Subjt:  QEAIDYGLADKIIDS

Q8L770 ATP-dependent Clp protease proteolytic subunit-related protein 3, chloroplastic5.4e-4449.21Show/hide
Query:  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAM
        RPRT PPDLPS+LLD RI Y+GMP+VPAVTEL+VA+ M+L + +P +PIY+YINS GT  +  ETVG E+E +AI D +   K++V+TV  G A GQA +
Subjt:  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAM

Query:  LLSLGTKGYRAVQPNSSAKLYLPKVNRSSG--SVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKII
        LLS GTKG R + P++ A +  P+V  SSG     D+ I+AKE+  N +  +ELL+K  G   E +A  ++RP Y+ A +A ++G+ D+I+
Subjt:  LLSLGTKGYRAVQPNSSAKLYLPKVNRSSG--SVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKII

Q8LB10 ATP-dependent Clp protease proteolytic subunit-related protein 4, chloroplastic7.5e-3842.4Show/hide
Query:  SLSGKNEKSSRYSMS-VSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEA
        SL+  N  SS+     V+M     +GS   +  PPDL S L   RI YLGM +VP+VTEL++A+F++L Y++  KPIYLYINS GT  +  E +G +TEA
Subjt:  SLSGKNEKSSRYSMS-VSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEA

Query:  YAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPK
        +AI D+M Y K  ++T+  G A+G+AA+LL+ G KG R+  P+S+  +  P + R  G   D+ I  KE+       ++L +K IGK  E+I  D++RPK
Subjt:  YAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPK

Query:  YLQAQEAIDYGLADKII
        Y    EA++YG+ DK++
Subjt:  YLQAQEAIDYGLADKII

Q9L4P4 Putative ATP-dependent Clp protease proteolytic subunit-like7.3e-4145Show/hide
Query:  RTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG
        RT PPDLPSLLL  RI YLGMP+  +          VTEL++AQ ++L++DNP KPIY YINS GT     + +G ETEA+AI D M Y K  V+T+  G
Subjt:  RTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG

Query:  MAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSR
         A G AAM+LS GT G RA  P+++  L  P+   + G   D+ I+AKE+ AN    +E+ A+  G+  + +A+D  R  Y+   +A++YGL D+++DSR
Subjt:  MAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKIIDSR

Q9XJ35 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic3.3e-15076.44Show/hide
Query:  KSSFLCGTKL---PFPFSRSKTPCRRYFLSPSAKN-SMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIRRQSELVTEENISSSHN
        KS F+ G+KL     P S      RR     SAK+ S DHIPKQFRG+NLKDG+M+N+KN PQY YGL  +QMDMFMTED+P+RRQ+E VTEE+ISS +N
Subjt:  KSSFLCGTKL---PFPFSRSKTPCRRYFLSPSAKN-SMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIRRQSELVTEENISSSHN

Query:  YLNHGGMWSLSGKNEKSS-RYSMSVSMYRGGGRGSG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEK
        YLN+GG+WS+SG N   + RYSMSV MYRGGG G G  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNP+KPIYLYINSPGTQNEK
Subjt:  YLNHGGMWSLSGKNEKSS-RYSMSVSMYRGGGRGSG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEK

Query:  METVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKE
        METVGSETEAYAIAD ++YCKSDVYT+NCGMA+GQAAMLLSLG KGYRAVQP+SS KLYLPKVNRSSG+ IDMWIKAKELDANTEYYIELLAKG GK KE
Subjt:  METVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKE

Query:  EIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR
        +I +D++RPKYLQAQ AIDYG+ADKI DS+DS+FEKR+YD  LAQ RAMR G GGSP AAP+G R
Subjt:  EIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR

Arabidopsis top hitse value%identityAlignment
AT1G09130.1 ATP-dependent caseinolytic (Clp) protease/crotonase family protein3.8e-4549.21Show/hide
Query:  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAM
        RPRT PPDLPS+LLD RI Y+GMP+VPAVTEL+VA+ M+L + +P +PIY+YINS GT  +  ETVG E+E +AI D +   K++V+TV  G A GQA +
Subjt:  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAM

Query:  LLSLGTKGYRAVQPNSSAKLYLPKVNRSSG--SVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKII
        LLS GTKG R + P++ A +  P+V  SSG     D+ I+AKE+  N +  +ELL+K  G   E +A  ++RP Y+ A +A ++G+ D+I+
Subjt:  LLSLGTKGYRAVQPNSSAKLYLPKVNRSSG--SVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKII

AT1G09130.2 ATP-dependent caseinolytic (Clp) protease/crotonase family protein3.8e-4549.21Show/hide
Query:  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAM
        RPRT PPDLPS+LLD RI Y+GMP+VPAVTEL+VA+ M+L + +P +PIY+YINS GT  +  ETVG E+E +AI D +   K++V+TV  G A GQA +
Subjt:  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAM

Query:  LLSLGTKGYRAVQPNSSAKLYLPKVNRSSG--SVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKII
        LLS GTKG R + P++ A +  P+V  SSG     D+ I+AKE+  N +  +ELL+K  G   E +A  ++RP Y+ A +A ++G+ D+I+
Subjt:  LLSLGTKGYRAVQPNSSAKLYLPKVNRSSG--SVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEAIDYGLADKII

AT1G49970.1 CLP protease proteolytic subunit 12.3e-15176.44Show/hide
Query:  KSSFLCGTKL---PFPFSRSKTPCRRYFLSPSAKN-SMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIRRQSELVTEENISSSHN
        KS F+ G+KL     P S      RR     SAK+ S DHIPKQFRG+NLKDG+M+N+KN PQY YGL  +QMDMFMTED+P+RRQ+E VTEE+ISS +N
Subjt:  KSSFLCGTKL---PFPFSRSKTPCRRYFLSPSAKN-SMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIRRQSELVTEENISSSHN

Query:  YLNHGGMWSLSGKNEKSS-RYSMSVSMYRGGGRGSG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEK
        YLN+GG+WS+SG N   + RYSMSV MYRGGG G G  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNP+KPIYLYINSPGTQNEK
Subjt:  YLNHGGMWSLSGKNEKSS-RYSMSVSMYRGGGRGSG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEK

Query:  METVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKE
        METVGSETEAYAIAD ++YCKSDVYT+NCGMA+GQAAMLLSLG KGYRAVQP+SS KLYLPKVNRSSG+ IDMWIKAKELDANTEYYIELLAKG GK KE
Subjt:  METVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKE

Query:  EIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR
        +I +D++RPKYLQAQ AIDYG+ADKI DS+DS+FEKR+YD  LAQ RAMR G GGSP AAP+G R
Subjt:  EIAKDVQRPKYLQAQEAIDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFR

AT2G14835.1 RING/U-box superfamily protein6.2e-11262.93Show/hide
Query:  VRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAILQTGLE
        VRTYSEWV++G+YD  P CC C AT +EG G Q TRLGCLH IHT CLV+ IK FPP TAPAGY CPACST IWPP  +KD+GSRLH+ L+E I QTGLE
Subjt:  VRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAILQTGLE

Query:  KNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVTPGATTR
        KNL GNHPV  S TES  PPPAFASD L++ S  +H ++          +N+ +G+S       SK+ +++IVEID P S G+++K+SSP      A  R
Subjt:  KNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVTPGATTR

Query:  KGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMG
        KG    DRQNSE  YYADDEDGNRKKY RRGP RHKFLRALLPFWS+ALPTLPVTAPPRKDA   +D SEGRVRHQR S+MD RKIL+ IA++AC+ATMG
Subjt:  KGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMG

Query:  ILYYRLVQRGIGEEFVDDEQQ
        ILYYRL  + IG+E  D+EQ+
Subjt:  ILYYRLVQRGIGEEFVDDEQQ

AT2G14835.2 RING/U-box superfamily protein6.2e-11262.93Show/hide
Query:  VRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAILQTGLE
        VRTYSEWV++G+YD  P CC C AT +EG G Q TRLGCLH IHT CLV+ IK FPP TAPAGY CPACST IWPP  +KD+GSRLH+ L+E I QTGLE
Subjt:  VRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTDCLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAILQTGLE

Query:  KNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVTPGATTR
        KNL GNHPV  S TES  PPPAFASD L++ S  +H ++          +N+ +G+S       SK+ +++IVEID P S G+++K+SSP      A  R
Subjt:  KNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGFSATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVTPGATTR

Query:  KGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMG
        KG    DRQNSE  YYADDEDGNRKKY RRGP RHKFLRALLPFWS+ALPTLPVTAPPRKDA   +D SEGRVRHQR S+MD RKIL+ IA++AC+ATMG
Subjt:  KGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMG

Query:  ILYYRLVQRGIGEEFVDDEQQ
        ILYYRL  + IG+E  D+EQ+
Subjt:  ILYYRLVQRGIGEEFVDDEQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACTTCTTCACTCGCCCATCTCTCTGCTCCGCCGTCGTTGGCCGTCGACAGTTCAAAGTCTTCTTTCCTCTGTGGCACCAAACTGCCTTTTCCATTCTCTCGTTC
AAAAACTCCATGTCGTAGATACTTTCTGTCTCCCTCAGCCAAAAATTCGATGGACCATATTCCTAAGCAGTTCAGAGGGGAAAATCTCAAAGATGGCTTGATGGAGAACT
ACAAGAATGCTCCTCAATATCTTTATGGCCTGACGCCTTCACAAATGGACATGTTCATGACAGAAGATAATCCCATTCGGCGACAGTCAGAATTAGTTACCGAGGAAAAC
ATCTCATCTTCCCACAACTACTTGAATCATGGAGGAATGTGGAGTCTATCAGGCAAAAATGAGAAGAGTTCAAGATATAGTATGAGTGTTAGTATGTATCGTGGGGGAGG
AAGAGGATCTGGAAGACCTCGAACAGCTCCTCCTGATTTGCCTTCTTTGCTCCTAGATGCTCGTATATGCTATTTGGGCATGCCAATTGTACCAGCAGTGACTGAGCTTC
TTGTTGCTCAGTTTATGTGGCTAGATTATGACAACCCATCAAAGCCTATATATCTCTACATAAACTCTCCTGGAACACAGAATGAGAAGATGGAGACTGTTGGATCTGAA
ACTGAAGCATATGCTATAGCAGATATGATGGCTTACTGCAAATCGGATGTCTATACAGTTAACTGTGGGATGGCATACGGTCAAGCAGCAATGCTTTTGTCACTTGGAAC
CAAGGGTTACCGTGCTGTCCAGCCTAACTCCTCCGCTAAACTATATCTGCCAAAAGTTAATAGATCAAGTGGTTCAGTCATAGATATGTGGATTAAGGCCAAGGAACTCG
ATGCCAACACCGAGTATTATATCGAGCTATTAGCTAAAGGAATCGGTAAACCCAAGGAAGAGATCGCTAAAGATGTCCAACGACCCAAATACCTTCAAGCACAAGAAGCT
ATTGACTATGGCCTTGCAGACAAGATAATCGACTCACGGGATTCTGCATTTGAAAAGAGGAATTATGATGAGATGCTTGCTCAGTCGAGAGCTATGAGGAGAGGAGCAGG
AGGCAGTCCACAAGCGGCCCCATCCGGGTTCAGAGCTTCGGAGATCTATATTCCATGTCGTCGAATTGTTAGTTTCGCCATTGTATCCATTCGAGTCGAGCTTCATTTCA
CCAGATCGTCTCCGCGCCGGTCAGTGCAATGGTGGTCTGTAAATGCCGCAAGGCCACGAAGTTATATTGTTTTGTGCACAAGGTTCCCGTATGTGGAGAATGCATTTGTT
TCCCAGAACACCAAATATGCGTGCCATGGCGGACACGAGTTCTCTGTCAAAGGAAAGATGAATTCCATGAGCTCCAATGGTCGGGATTTGAGGCCAGTTGAAAGGAAGTT
TATCCACTTCAGGTTGGGTCGGATAGCTCCGATGATAGTATCAGTAGTTTGGGGAAAAATGTCAGCTTTGTGGATTGATAAAGTTCGTACCTATTCAGAATGGGTATTGA
ATGGAGATTATGATTGGCCTCCAAATTGCTGCTTGTGTCATGCTACACTTGAGGAAGGAATTGGTCCTCAAACTACTCGATTGGGATGCTTGCATGTTATACATACGGAT
TGCTTGGTCGCACATATCAAGGGCTTTCCTCCATCCACTGCCCCTGCTGGATATGACTGTCCTGCCTGTTCCACTTCGATATGGCCTCCTAAAAATATTAAAGATTCAGG
ATCTCGCCTGCACTCAAAGCTAAAGGAAGCTATCTTGCAGACTGGTCTGGAAAAGAATTTGTTTGGCAATCATCCAGTAGGATTATCAGCAACAGAATCCCATGGTCCTC
CCCCTGCCTTTGCTTCAGATCCCTTGGTTTCCTCTTCAGGAGACACACATAACAAAAAGAGCTCATTAAACTCAATAGCAGAGGTTGAGTCGAATATGGCTGAAGGGTTT
TCAGCGACAACTGGAGCTGGATCTTCCAAAAATAACATTGCAGATATTGTAGAGATTGATACACCTGGTTCAGAAGGGAATTTTGTGAAAAACTCCAGTCCTTCAGGCGT
AACTCCCGGTGCTACGACAAGAAAAGGTGCATTCAACTACGACAGGCAAAATTCTGAAATTTCATATTATGCTGATGATGAAGATGGTAATCGCAAGAAGTATGTTCGCA
GAGGTCCTTTTAGGCACAAGTTTCTTAGGGCACTACTTCCTTTTTGGTCAACTGCATTGCCAACTCTACCCGTGACTGCACCTCCACGTAAAGATGCATTAAATGGCAAT
GATGTCAGTGAAGGTCGCGTTCGGCACCAAAGACCCTCCAGAATGGATCCGAGAAAAATTCTTCTTATCATAGCAATCATGGCATGTCTGGCAACCATGGGTATTTTGTA
CTACAGACTTGTACAACGTGGTATCGGAGAAGAATTTGTGGATGACGAGCAGCAACAAGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACTTCTTCACTCGCCCATCTCTCTGCTCCGCCGTCGTTGGCCGTCGACAGTTCAAAGTCTTCTTTCCTCTGTGGCACCAAACTGCCTTTTCCATTCTCTCGTTC
AAAAACTCCATGTCGTAGATACTTTCTGTCTCCCTCAGCCAAAAATTCGATGGACCATATTCCTAAGCAGTTCAGAGGGGAAAATCTCAAAGATGGCTTGATGGAGAACT
ACAAGAATGCTCCTCAATATCTTTATGGCCTGACGCCTTCACAAATGGACATGTTCATGACAGAAGATAATCCCATTCGGCGACAGTCAGAATTAGTTACCGAGGAAAAC
ATCTCATCTTCCCACAACTACTTGAATCATGGAGGAATGTGGAGTCTATCAGGCAAAAATGAGAAGAGTTCAAGATATAGTATGAGTGTTAGTATGTATCGTGGGGGAGG
AAGAGGATCTGGAAGACCTCGAACAGCTCCTCCTGATTTGCCTTCTTTGCTCCTAGATGCTCGTATATGCTATTTGGGCATGCCAATTGTACCAGCAGTGACTGAGCTTC
TTGTTGCTCAGTTTATGTGGCTAGATTATGACAACCCATCAAAGCCTATATATCTCTACATAAACTCTCCTGGAACACAGAATGAGAAGATGGAGACTGTTGGATCTGAA
ACTGAAGCATATGCTATAGCAGATATGATGGCTTACTGCAAATCGGATGTCTATACAGTTAACTGTGGGATGGCATACGGTCAAGCAGCAATGCTTTTGTCACTTGGAAC
CAAGGGTTACCGTGCTGTCCAGCCTAACTCCTCCGCTAAACTATATCTGCCAAAAGTTAATAGATCAAGTGGTTCAGTCATAGATATGTGGATTAAGGCCAAGGAACTCG
ATGCCAACACCGAGTATTATATCGAGCTATTAGCTAAAGGAATCGGTAAACCCAAGGAAGAGATCGCTAAAGATGTCCAACGACCCAAATACCTTCAAGCACAAGAAGCT
ATTGACTATGGCCTTGCAGACAAGATAATCGACTCACGGGATTCTGCATTTGAAAAGAGGAATTATGATGAGATGCTTGCTCAGTCGAGAGCTATGAGGAGAGGAGCAGG
AGGCAGTCCACAAGCGGCCCCATCCGGGTTCAGAGCTTCGGAGATCTATATTCCATGTCGTCGAATTGTTAGTTTCGCCATTGTATCCATTCGAGTCGAGCTTCATTTCA
CCAGATCGTCTCCGCGCCGGTCAGTGCAATGGTGGTCTGTAAATGCCGCAAGGCCACGAAGTTATATTGTTTTGTGCACAAGGTTCCCGTATGTGGAGAATGCATTTGTT
TCCCAGAACACCAAATATGCGTGCCATGGCGGACACGAGTTCTCTGTCAAAGGAAAGATGAATTCCATGAGCTCCAATGGTCGGGATTTGAGGCCAGTTGAAAGGAAGTT
TATCCACTTCAGGTTGGGTCGGATAGCTCCGATGATAGTATCAGTAGTTTGGGGAAAAATGTCAGCTTTGTGGATTGATAAAGTTCGTACCTATTCAGAATGGGTATTGA
ATGGAGATTATGATTGGCCTCCAAATTGCTGCTTGTGTCATGCTACACTTGAGGAAGGAATTGGTCCTCAAACTACTCGATTGGGATGCTTGCATGTTATACATACGGAT
TGCTTGGTCGCACATATCAAGGGCTTTCCTCCATCCACTGCCCCTGCTGGATATGACTGTCCTGCCTGTTCCACTTCGATATGGCCTCCTAAAAATATTAAAGATTCAGG
ATCTCGCCTGCACTCAAAGCTAAAGGAAGCTATCTTGCAGACTGGTCTGGAAAAGAATTTGTTTGGCAATCATCCAGTAGGATTATCAGCAACAGAATCCCATGGTCCTC
CCCCTGCCTTTGCTTCAGATCCCTTGGTTTCCTCTTCAGGAGACACACATAACAAAAAGAGCTCATTAAACTCAATAGCAGAGGTTGAGTCGAATATGGCTGAAGGGTTT
TCAGCGACAACTGGAGCTGGATCTTCCAAAAATAACATTGCAGATATTGTAGAGATTGATACACCTGGTTCAGAAGGGAATTTTGTGAAAAACTCCAGTCCTTCAGGCGT
AACTCCCGGTGCTACGACAAGAAAAGGTGCATTCAACTACGACAGGCAAAATTCTGAAATTTCATATTATGCTGATGATGAAGATGGTAATCGCAAGAAGTATGTTCGCA
GAGGTCCTTTTAGGCACAAGTTTCTTAGGGCACTACTTCCTTTTTGGTCAACTGCATTGCCAACTCTACCCGTGACTGCACCTCCACGTAAAGATGCATTAAATGGCAAT
GATGTCAGTGAAGGTCGCGTTCGGCACCAAAGACCCTCCAGAATGGATCCGAGAAAAATTCTTCTTATCATAGCAATCATGGCATGTCTGGCAACCATGGGTATTTTGTA
CTACAGACTTGTACAACGTGGTATCGGAGAAGAATTTGTGGATGACGAGCAGCAACAAGCGTGA
Protein sequenceShow/hide protein sequence
MATSSLAHLSAPPSLAVDSSKSSFLCGTKLPFPFSRSKTPCRRYFLSPSAKNSMDHIPKQFRGENLKDGLMENYKNAPQYLYGLTPSQMDMFMTEDNPIRRQSELVTEEN
ISSSHNYLNHGGMWSLSGKNEKSSRYSMSVSMYRGGGRGSGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSE
TEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSAKLYLPKVNRSSGSVIDMWIKAKELDANTEYYIELLAKGIGKPKEEIAKDVQRPKYLQAQEA
IDYGLADKIIDSRDSAFEKRNYDEMLAQSRAMRRGAGGSPQAAPSGFRASEIYIPCRRIVSFAIVSIRVELHFTRSSPRRSVQWWSVNAARPRSYIVLCTRFPYVENAFV
SQNTKYACHGGHEFSVKGKMNSMSSNGRDLRPVERKFIHFRLGRIAPMIVSVVWGKMSALWIDKVRTYSEWVLNGDYDWPPNCCLCHATLEEGIGPQTTRLGCLHVIHTD
CLVAHIKGFPPSTAPAGYDCPACSTSIWPPKNIKDSGSRLHSKLKEAILQTGLEKNLFGNHPVGLSATESHGPPPAFASDPLVSSSGDTHNKKSSLNSIAEVESNMAEGF
SATTGAGSSKNNIADIVEIDTPGSEGNFVKNSSPSGVTPGATTRKGAFNYDRQNSEISYYADDEDGNRKKYVRRGPFRHKFLRALLPFWSTALPTLPVTAPPRKDALNGN
DVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRGIGEEFVDDEQQQA