; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G032900 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G032900
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionATP-dependent Clp protease proteolytic subunit
Genome locationCiama_Chr02:8166642..8182269
RNA-Seq ExpressionCaUC02G032900
SyntenyCaUC02G032900
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0009526 - plastid envelope (cellular component)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR001907 - ATP-dependent Clp protease proteolytic subunit
IPR023562 - Clp protease proteolytic subunit /Translocation-enhancing protein TepA
IPR029045 - ClpP/crotonase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF1878300.1 hypothetical protein Lal_00046967 [Lupinus albus]8.8e-30865.62Show/hide
Query:  MTTSSLSHLSAPPS--LAIDSS--KSSFLFGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFM
        M +S L  LS P +     D+S  KSSFL  T   FPS   S ++ S R +  SPSAK S DHIPKQFR +NLKDG+M+NYKN P+YLYGL+PSQMDMFM
Subjt:  MTTSSLSHLSAPPS--LAIDSS--KSSFLFGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFM

Query:  TEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLD
        TEDNP+R+Q+E VTEE+ISS+ NYL+HGGMWS S M   GP+KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+V ELLVAQFMWLD
Subjt:  TEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLD

Query:  YDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKE
        YDNP KPIYLYINSSGTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKE
Subjt:  YDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKE

Query:  LDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEF
        L+ANTEYY+ELLAKG GK KEEIAKDVQRP+YFQAQEAI+YG+ADKIIDS+D  F+KRNYDEM+AQSRA RR  GGNPQ APSG R   A ++    +  
Subjt:  LDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEF

Query:  LVSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIA-CRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVV
           P+ R N    +S +++                               +NA ++  CR           +ATKLYCFVHKVPVCGECICFP+HQICV+
Subjt:  LVSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIA-CRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVV

Query:  RTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEK
        RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLVSHIKSFPP TAPAGY CP+CS  IWPPK+ KDS SRLH+KLKEAI+QTG+EK
Subjt:  RTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEK

Query:  NLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGA
        NLFGNHPV+L  TES GPPPAFASDPL+    + H N  S+           +GFS  TG+  SK ++ DIVE++ P S GNF++ SSP GP ATTRK  
Subjt:  NLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGA

Query:  LNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPR
        +  +RQNSEISYYADDED NRKKY RR                                            GPF+HKFLRALLPFWS+ALPTLPVTAPPR
Subjt:  LNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPR

Query:  KDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        KDA NA + SEGR RHQR SRMDPRKILL+IAIMAC+ATMGILYYRLVQR  GEEF  DEQ
Subjt:  KDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQ

KAF1880709.1 hypothetical protein Lal_00011768 [Lupinus albus]2.3e-27149.43Show/hide
Query:  MTTSSLSHLSAP--PSLAID--SSKSSFLFGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMT
        M +S L  LS P  P   ID  +SKSSF  GT   FPS    T+   RR   SP AK S DHIP QFR ENL+DGLM+NYKN PKYLYGL+PSQMDMF+T
Subjt:  MTTSSLSHLSAP--PSLAID--SSKSSFLFGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMT

Query:  EDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY
        EDNP+R+QSE VTEE+ISS+ NYL+HGGMWS S M   GP+KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+V ELLVAQFMWLDY
Subjt:  EDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL
        DNP KPIYLYINSSGTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKEL
Subjt:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL

Query:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR-----DLTAFKIYIH
        +ANTEYY+ELLAKG GK KEEIAKDVQRP+YFQAQEAI+YGIADKIIDS+D  F+KRNYDEML+QSRA RR  GGNPQ APSG       DL  +   + 
Subjt:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR-----DLTAFKIYIH

Query:  VV------EFLVSPLYRYNSNFISSD--------RLLAGYCNGGLQMPQG---------------------LVDIWDR----------------------
                +  +  L+ +N   + S           L   C+    +PQG                     LVD++ +                      
Subjt:  VV------EFLVSPLYRYNSNFISSD--------RLLAGYCNGGLQMPQG---------------------LVDIWDR----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------IHVWEFGIR--------------------SDLNAW--------------------------------LIAC--------------
                       I++ E  ++                    + +  W                                 + C              
Subjt:  ---------------IHVWEFGIR--------------------SDLNAW--------------------------------LIAC--------------

Query:  -------------------------RENCGIISSFSTRATKLYC-----------FVHKVPVCGEC---------------ICFPEHQI--CV-------
                                  +  G++S  S R    +             +  + VCG+C               I    +++  C+       
Subjt:  -------------------------RENCGIISSFSTRATKLYC-----------FVHKVPVCGEC---------------ICFPEHQI--CV-------

Query:  -------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLH
                     +RTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHVIHT+CLVSHIK FPP TAPAGY CP+CS  IWPPK+ KDSGSRLH
Subjt:  -------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLH

Query:  AKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKS
        +KLKE I+QTG+EKNLFGNHPV+LS TESRGPPPAFASDPL+    + H N  SL           +G+S  TG+ +SK +  DIVEI+   S GNFV+ 
Subjt:  AKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKS

Query:  SSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLI
        SSP GP ATTRKG +  +RQNSEISYYADDED NRKKY RRGPF+HKFLRALLPFWS+ALPTLPV+APP+KDA NA + SEGR RHQR SRMDPRKILL+
Subjt:  SSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLI

Query:  IAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        IAIMAC+ATMGILYYRL QR  GEE   DEQ
Subjt:  IAIMACLATMGILYYRLVQRDIGEEFVDDEQ

KAG6588711.1 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.95Show/hide
Query:  MTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVR
        M TSS  H+SA PSLA+ SSKSSFLFGT LPFPSSR RTS RRY LSPSAK+SMDHIPKQFR ENLKDGLMENY+N P+ LYGLTPSQ+DMFMTEDNPVR
Subjt:  MTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVR

Query:  RQSELVTEENISSSHNYLNHGGMWSLSGMDG-KGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSK
        RQSELVTEE+ISS+H+YL +GGMWSLSGMDG KGP+KYSMS SMYRGGGRG GR ++APPDLPSLLLDARI YLGMPIVP+VTELLVAQFMWLDYDNPSK
Subjt:  RQSELVTEENISSSHNYLNHGGMWSLSGMDG-KGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTE
        PIYLYINS GTQNEKME VG ETEAYAIADMMAYCK DVYT+NCGMAFGQAAMLLSLGTKGYR VQPNSS KLYLPKV+RSSGA IDMWIKA+ELDANT+
Subjt:  PIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTE

Query:  YYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLY
        YY+ELLAKGTGKP EEIAKD+QRP+Y   QEAIDYG+ DKII S+D+AFEKRNYD+MLAQSRAMR+G GGNPQAAPSGL                     
Subjt:  YYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLY

Query:  RYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIA-CRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEW
                                                + + + A ++  CR           +ATKLYCFVHKVPVCGECICFPEHQICV+RTYSEW
Subjt:  RYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIA-CRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEW

Query:  VLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNH
        VLNGDYDWPPNCCLCHATLEEG GPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACS+SIWPPKN KDSGSRLHAKLKEAILQTGLEK+LFGNH
Subjt:  VLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNH

Query:  PVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQ
        PV LS TES GPPPAFASDPLVSSSGD HNNKSSLNSIAN  SN GEGFSATTG GSSKNNI+DIVEIE+PG EGNFVK SSPS PVATTRKGA+NYDRQ
Subjt:  PVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQ

Query:  NSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQR
        +SEISYYADDEDGNRKKYVRRGPF+HKFLRALLPFWSTALPTLPVTAPPRKD+  ANDVSEGRVRHQRPSRMDPRKILL+IAIMACLATMGILYYRL QR
Subjt:  NSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQR

Query:  DIGEEFVDDEQQLQAAQ
         IGEE V+DEQQL+AAQ
Subjt:  DIGEEFVDDEQQLQAAQ

OIV91138.1 hypothetical protein TanjilG_30360 [Lupinus angustifolius]0.0e+0069.45Show/hide
Query:  MTTSSLSHLSAP--PSLAIDSS--KSSFLFGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMT
        M +S L  LS P  PS   D+S  KSSF++GT   FPS    T+   RR   SPSAK S DHIPKQFR +NLKDG+M+NYKN P+YLYGL+PSQMDMFMT
Subjt:  MTTSSLSHLSAP--PSLAIDSS--KSSFLFGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMT

Query:  EDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY
        EDNP+R+Q+E VTEE+ISS+ NYL+HGGMWS S M   GP+KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+V ELLVAQFMWLDY
Subjt:  EDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL
        DNP KPIYLYINSSGTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKEL
Subjt:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL

Query:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFL
        +ANTEYY+ELLAKG GK KEEIAKDVQRP+YFQAQEAI+YGI DKIIDS+D  F+KRNYDEM+AQSRA RR  GGNPQ APSG R     + +I++    
Subjt:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFL

Query:  VSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRT
                                            D IH+  F +  D   +  A          F   ATKLYCFVHKVPVCGECICFP+HQICV+RT
Subjt:  VSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRT

Query:  YSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNL
        YSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLVSHIKSFPP TAPAGY CP+CS  IWPPK+ KDS SRLH+KLKEAI+QTG+EKNL
Subjt:  YSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNL

Query:  FGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGALN
        FGNHPV+LS TESRGPPPAFASDPL+    + H N  S+           +GFS  TG+  SK ++ DIVE++ P S GNF++ SSP GP ATTRK  + 
Subjt:  FGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGALN

Query:  YDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYR
         +RQNSEISYYADDED NRKKY RRGPF+HKFLRALLPFWS+ALPTLPVTAPPRKDA N+ + SEGR RHQR SRMDPRKILL+IAIMACLATMGILYYR
Subjt:  YDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYR

Query:  LVQRDIGEEFVDDEQ
        L QR  GEEF  DEQ
Subjt:  LVQRDIGEEFVDDEQ

RDY10958.1 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic, partial [Mucuna pruriens]1.2e-29167.42Show/hide
Query:  TSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVRRQ
        +SSLS   + PS+      SSFL GT+L FP S       R   S SAK S+DHIPKQFR ENL+DGLMEN+KN P+YLYGLTPSQMDMFMTEDNP+R+Q
Subjt:  TSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVRRQ

Query:  SELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIY
        +E VTEE+ISS+ NY++HGGMWSLS M     +KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+VTELLVAQFMWLDYDNP+KPIY
Subjt:  SELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIY

Query:  LYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYL
        LYINSSGTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKEL+ANTEYY+
Subjt:  LYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYL

Query:  ELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLYRYN
        ELLAKG GK KEEIAK+VQRP+YFQAQEAIDYGIADK IDS+D  FEKRNYDEMLAQSRA RR  GGNPQ     ++ L       H + F         
Subjt:  ELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLYRYN

Query:  SNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNG
                                 + W               +W                 ATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWV++G
Subjt:  SNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNG

Query:  DYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQ------------TGL
        +YDWPP CC C A LEEG G QTTRLGCLHV+HT+CLVSHIKSF P TAPAGY CP+CS SIWPPK+ KDSGSRLH+KLKEAI+Q            +G+
Subjt:  DYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQ------------TGL

Query:  EKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRK
        EKN+FGNHPV+LS TESR PPPAFASDPL+S   + H N  S+           +GFS  TG+   K ++ DIVEI+   S GNF+KSSSP  P ATTRK
Subjt:  EKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRK

Query:  GALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIM
        G+++ +RQNSEISYYADDEDGNRKKY +RGPF HKFLRALLPFWS ALPTLPVTAP RKDA NA D SEGR RHQR S MDPRKILL+IAI+
Subjt:  GALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIM

TrEMBL top hitse value%identityAlignment
A0A371I7F6 ATP-dependent Clp protease proteolytic subunit (Fragment)5.6e-29267.42Show/hide
Query:  TSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVRRQ
        +SSLS   + PS+      SSFL GT+L FP S       R   S SAK S+DHIPKQFR ENL+DGLMEN+KN P+YLYGLTPSQMDMFMTEDNP+R+Q
Subjt:  TSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVRRQ

Query:  SELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIY
        +E VTEE+ISS+ NY++HGGMWSLS M     +KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+VTELLVAQFMWLDYDNP+KPIY
Subjt:  SELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIY

Query:  LYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYL
        LYINSSGTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKEL+ANTEYY+
Subjt:  LYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYL

Query:  ELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLYRYN
        ELLAKG GK KEEIAK+VQRP+YFQAQEAIDYGIADK IDS+D  FEKRNYDEMLAQSRA RR  GGNPQ     ++ L       H + F         
Subjt:  ELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLYRYN

Query:  SNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNG
                                 + W               +W                 ATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWV++G
Subjt:  SNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNG

Query:  DYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQ------------TGL
        +YDWPP CC C A LEEG G QTTRLGCLHV+HT+CLVSHIKSF P TAPAGY CP+CS SIWPPK+ KDSGSRLH+KLKEAI+Q            +G+
Subjt:  DYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQ------------TGL

Query:  EKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRK
        EKN+FGNHPV+LS TESR PPPAFASDPL+S   + H N  S+           +GFS  TG+   K ++ DIVEI+   S GNF+KSSSP  P ATTRK
Subjt:  EKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRK

Query:  GALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIM
        G+++ +RQNSEISYYADDEDGNRKKY +RGPF HKFLRALLPFWS ALPTLPVTAP RKDA NA D SEGR RHQR S MDPRKILL+IAI+
Subjt:  GALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIM

A0A5A7SMD0 ATP-dependent Clp protease proteolytic subunit3.0e-19787.68Show/hide
Query:  SDFDRRRTRPLKSIGKIRPSFSPLTFSMTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMEN
        SD DRRRTR  K I +  PS      SM TSSL HLSAPPSLAIDSSKSSFL GTQLP PSSR RTSCRRYVLSPSA+QSMDHIPKQFRGENLKDGL+EN
Subjt:  SDFDRRRTRPLKSIGKIRPSFSPLTFSMTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMEN

Query:  YKNTPKYLYGLTPSQMDMFMTEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYL
        YKN PKYLYGLTPSQMDMFMTEDNPVRRQSELVTE+NISSS+NYLNHGGMWSL+GMDGKGPAKYSMSVSMYRGGGR AGRPR APPDLPSLLLDARI YL
Subjt:  YKNTPKYLYGLTPSQMDMFMTEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYL

Query:  GMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLY
        GMPIVP+VTELLVAQFMWLDYDNPSKPIYLYINS GTQNEKMETVGSETEAYA+ DMM+YCKSDVYTVN GMAFGQAAMLLSLGTKGYR +QPNSSTKLY
Subjt:  GMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLY

Query:  LPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQA
        LPKVNRSSG  IDMWIKAKELDANTEYYLELLAKG GKPKEEI KD+QR +YFQAQEAIDYGIADKII SQD+AFEKRNYDEMLAQS+A RRG GGNPQA
Subjt:  LPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQA

Query:  APSGLR
        AP+G R
Subjt:  APSGLR

A0A6A5N4U1 ATP-dependent Clp protease proteolytic subunit4.3e-30865.62Show/hide
Query:  MTTSSLSHLSAPPS--LAIDSS--KSSFLFGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFM
        M +S L  LS P +     D+S  KSSFL  T   FPS   S ++ S R +  SPSAK S DHIPKQFR +NLKDG+M+NYKN P+YLYGL+PSQMDMFM
Subjt:  MTTSSLSHLSAPPS--LAIDSS--KSSFLFGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFM

Query:  TEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLD
        TEDNP+R+Q+E VTEE+ISS+ NYL+HGGMWS S M   GP+KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+V ELLVAQFMWLD
Subjt:  TEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLD

Query:  YDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKE
        YDNP KPIYLYINSSGTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKE
Subjt:  YDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKE

Query:  LDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEF
        L+ANTEYY+ELLAKG GK KEEIAKDVQRP+YFQAQEAI+YG+ADKIIDS+D  F+KRNYDEM+AQSRA RR  GGNPQ APSG R   A ++    +  
Subjt:  LDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEF

Query:  LVSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIA-CRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVV
           P+ R N    +S +++                               +NA ++  CR           +ATKLYCFVHKVPVCGECICFP+HQICV+
Subjt:  LVSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIA-CRENCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVV

Query:  RTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEK
        RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLVSHIKSFPP TAPAGY CP+CS  IWPPK+ KDS SRLH+KLKEAI+QTG+EK
Subjt:  RTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEK

Query:  NLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGA
        NLFGNHPV+L  TES GPPPAFASDPL+    + H N  S+           +GFS  TG+  SK ++ DIVE++ P S GNF++ SSP GP ATTRK  
Subjt:  NLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKSSSPSGPVATTRKGA

Query:  LNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPR
        +  +RQNSEISYYADDED NRKKY RR                                            GPF+HKFLRALLPFWS+ALPTLPVTAPPR
Subjt:  LNYDRQNSEISYYADDEDGNRKKYVRR--------------------------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPR

Query:  KDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        KDA NA + SEGR RHQR SRMDPRKILL+IAIMAC+ATMGILYYRLVQR  GEEF  DEQ
Subjt:  KDAPNANDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQ

A0A6A5N6H9 RING-type domain-containing protein1.1e-27149.43Show/hide
Query:  MTTSSLSHLSAP--PSLAID--SSKSSFLFGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMT
        M +S L  LS P  P   ID  +SKSSF  GT   FPS    T+   RR   SP AK S DHIP QFR ENL+DGLM+NYKN PKYLYGL+PSQMDMF+T
Subjt:  MTTSSLSHLSAP--PSLAID--SSKSSFLFGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMT

Query:  EDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY
        EDNP+R+QSE VTEE+ISS+ NYL+HGGMWS S M   GP+KYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVP+V ELLVAQFMWLDY
Subjt:  EDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL
        DNP KPIYLYINSSGTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKEL
Subjt:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL

Query:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR-----DLTAFKIYIH
        +ANTEYY+ELLAKG GK KEEIAKDVQRP+YFQAQEAI+YGIADKIIDS+D  F+KRNYDEML+QSRA RR  GGNPQ APSG       DL  +   + 
Subjt:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR-----DLTAFKIYIH

Query:  VV------EFLVSPLYRYNSNFISSD--------RLLAGYCNGGLQMPQG---------------------LVDIWDR----------------------
                +  +  L+ +N   + S           L   C+    +PQG                     LVD++ +                      
Subjt:  VV------EFLVSPLYRYNSNFISSD--------RLLAGYCNGGLQMPQG---------------------LVDIWDR----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------IHVWEFGIR--------------------SDLNAW--------------------------------LIAC--------------
                       I++ E  ++                    + +  W                                 + C              
Subjt:  ---------------IHVWEFGIR--------------------SDLNAW--------------------------------LIAC--------------

Query:  -------------------------RENCGIISSFSTRATKLYC-----------FVHKVPVCGEC---------------ICFPEHQI--CV-------
                                  +  G++S  S R    +             +  + VCG+C               I    +++  C+       
Subjt:  -------------------------RENCGIISSFSTRATKLYC-----------FVHKVPVCGEC---------------ICFPEHQI--CV-------

Query:  -------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLH
                     +RTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHVIHT+CLVSHIK FPP TAPAGY CP+CS  IWPPK+ KDSGSRLH
Subjt:  -------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLH

Query:  AKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKS
        +KLKE I+QTG+EKNLFGNHPV+LS TESRGPPPAFASDPL+    + H N  SL           +G+S  TG+ +SK +  DIVEI+   S GNFV+ 
Subjt:  AKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADIVEIEMPGSEGNFVKS

Query:  SSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLI
        SSP GP ATTRKG +  +RQNSEISYYADDED NRKKY RRGPF+HKFLRALLPFWS+ALPTLPV+APP+KDA NA + SEGR RHQR SRMDPRKILL+
Subjt:  SSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPSRMDPRKILLI

Query:  IAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        IAIMAC+ATMGILYYRL QR  GEE   DEQ
Subjt:  IAIMACLATMGILYYRLVQRDIGEEFVDDEQ

A0A6J1D6K6 ATP-dependent Clp protease proteolytic subunit1.0e-19791.58Show/hide
Query:  MTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAK-QSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPV
        M TSSLS+LSAPPSLAIDSSKS F+ GTQL FPSSRSRT+CRRY LSPSAK  SMDHIPK+FRGENLKDGLMENYKN P+YLYGLTPSQMDMFMTEDNP+
Subjt:  MTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFPSSRSRTSCRRYVLSPSAK-QSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPV

Query:  RRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSK
        RRQSELVTEENISSSHNYLNHGGMWSLSGM+ KGP+KYSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVP+VTELLVAQFMWLDYDNPSK
Subjt:  RRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTE
        PIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMA+GQAAMLLSLGTKGYR VQPNSSTKLYLPKVNRSSGA IDMWIKAKELDANTE
Subjt:  PIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTE

Query:  YYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR
        YY+ELLAKG GKPKEEI KDVQRP+YFQAQEA+DYG+ADKIIDSQD+AFEKRNYDEMLAQSRAMR+G GGNPQAAPSG R
Subjt:  YYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR

SwissProt top hitse value%identityAlignment
P74466 Putative ATP-dependent Clp protease proteolytic subunit-like2.5e-3944.22Show/hide
Query:  RTAPPDLPSLLLDARICYLGMPIVPS----------VTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG
        +T PPDL SLLL  RI YLGMP+  S          VT+L++AQ ++L +D+P KPIY YINS+GT     + VG ETEA+AI D + Y K  V+T+  G
Subjt:  RTAPPDLPSLLLDARICYLGMPIVPS----------VTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG

Query:  MAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDS
         A G AAM+LS GTKGYR   P+++  L   +   + G A D+ I+AKE+ +N +  LE+L+  TG+ +E++AKD+ R  Y    +A +YG+ D++++S
Subjt:  MAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDS

Q8L770 ATP-dependent Clp protease proteolytic subunit-related protein 3, chloroplastic2.6e-4446.19Show/hide
Query:  PAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAY
        P   S    + R       RPRT PPDLPS+LLD RI Y+GMP+VP+VTEL+VA+ M+L + +P +PIY+YINS+GT  +  ETVG E+E +AI D +  
Subjt:  PAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAY

Query:  CKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSG--AAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEA
         K++V+TV  G A GQA +LLS GTKG R + P++   +  P+V  SSG   A D+ I+AKE+  N +  +ELL+K TG   E +A  ++RP Y  A +A
Subjt:  CKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSG--AAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEA

Query:  IDYGIADKII
         ++G+ D+I+
Subjt:  IDYGIADKII

Q8LB10 ATP-dependent Clp protease proteolytic subunit-related protein 4, chloroplastic7.3e-3946.2Show/hide
Query:  PPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLG
        PPDL S L   RI YLGM +VPSVTEL++A+F++L Y++  KPIYLYINS+GT  +  E +G +TEA+AI D+M Y K  ++T+  G A+G+AA+LL+ G
Subjt:  PPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLG

Query:  TKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKII
         KG R   P+S+  +  P + R  G A D+ I  KE+       ++L +K  GK  E+I  D++RP+YF   EA++YGI DK++
Subjt:  TKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKII

Q9L4P4 Putative ATP-dependent Clp protease proteolytic subunit-like1.6e-4145.5Show/hide
Query:  RTAPPDLPSLLLDARICYLGMPIVPS----------VTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG
        RT PPDLPSLLL  RI YLGMP+  S          VTEL++AQ ++L++DNP KPIY YINS+GT     + +G ETEA+AI D M Y K  V+T+  G
Subjt:  RTAPPDLPSLLLDARICYLGMPIVPS----------VTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG

Query:  MAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQ
         A G AAM+LS GT G R   P+++  L  P+   + G A D+ I+AKE+ AN    LE+ A+ TG+  + +A+D  R  Y    +A++YG+ D+++DS+
Subjt:  MAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQ

Q9XJ35 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic5.4e-15171.95Show/hide
Query:  SMTTSSLSHLSAPPSLAIDSS---KSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTED
        S  TS L+H +      +  S     S LF + +P  +   RT  R +  + +   S DHIPKQFRG+NLKDG+M+N+KN P+Y YGL  +QMDMFMTED
Subjt:  SMTTSSLSHLSAPPSLAIDSS---KSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTED

Query:  NPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY
        +PVRRQ+E VTEE+ISS +NYLN+GG+WS+SGM+     +YSMSV MYRGGG G G  RPRTAPPDLPSLLLDARICYLGMPIVP+VTELLVAQFMWLDY
Subjt:  NPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL
        DNP+KPIYLYINS GTQNEKMETVGSETEAYAIAD ++YCKSDVYT+NCGMAFGQAAMLLSLG KGYR VQP+SSTKLYLPKVNRSSGAAIDMWIKAKEL
Subjt:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL

Query:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR
        DANTEYY+ELLAKGTGK KE+I +D++RP+Y QAQ AIDYGIADKI DSQD++FEKR+YD  LAQ RAMR  PGG   AAP+GLR
Subjt:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR

Arabidopsis top hitse value%identityAlignment
AT1G09130.1 ATP-dependent caseinolytic (Clp) protease/crotonase family protein1.8e-4546.19Show/hide
Query:  PAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAY
        P   S    + R       RPRT PPDLPS+LLD RI Y+GMP+VP+VTEL+VA+ M+L + +P +PIY+YINS+GT  +  ETVG E+E +AI D +  
Subjt:  PAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAY

Query:  CKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSG--AAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEA
         K++V+TV  G A GQA +LLS GTKG R + P++   +  P+V  SSG   A D+ I+AKE+  N +  +ELL+K TG   E +A  ++RP Y  A +A
Subjt:  CKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSG--AAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEA

Query:  IDYGIADKII
         ++G+ D+I+
Subjt:  IDYGIADKII

AT1G09130.2 ATP-dependent caseinolytic (Clp) protease/crotonase family protein1.8e-4546.19Show/hide
Query:  PAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAY
        P   S    + R       RPRT PPDLPS+LLD RI Y+GMP+VP+VTEL+VA+ M+L + +P +PIY+YINS+GT  +  ETVG E+E +AI D +  
Subjt:  PAKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAY

Query:  CKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSG--AAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEA
         K++V+TV  G A GQA +LLS GTKG R + P++   +  P+V  SSG   A D+ I+AKE+  N +  +ELL+K TG   E +A  ++RP Y  A +A
Subjt:  CKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSG--AAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEA

Query:  IDYGIADKII
         ++G+ D+I+
Subjt:  IDYGIADKII

AT1G49970.1 CLP protease proteolytic subunit 13.8e-15271.95Show/hide
Query:  SMTTSSLSHLSAPPSLAIDSS---KSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTED
        S  TS L+H +      +  S     S LF + +P  +   RT  R +  + +   S DHIPKQFRG+NLKDG+M+N+KN P+Y YGL  +QMDMFMTED
Subjt:  SMTTSSLSHLSAPPSLAIDSS---KSSFLFGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTED

Query:  NPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY
        +PVRRQ+E VTEE+ISS +NYLN+GG+WS+SGM+     +YSMSV MYRGGG G G  RPRTAPPDLPSLLLDARICYLGMPIVP+VTELLVAQFMWLDY
Subjt:  NPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL
        DNP+KPIYLYINS GTQNEKMETVGSETEAYAIAD ++YCKSDVYT+NCGMAFGQAAMLLSLG KGYR VQP+SSTKLYLPKVNRSSGAAIDMWIKAKEL
Subjt:  DNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKEL

Query:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR
        DANTEYY+ELLAKGTGK KE+I +D++RP+Y QAQ AIDYGIADKI DSQD++FEKR+YD  LAQ RAMR  PGG   AAP+GLR
Subjt:  DANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAFEKRNYDEMLAQSRAMRRGPGGNPQAAPSGLR

AT2G14835.1 RING/U-box superfamily protein2.9e-13166.28Show/hide
Query:  RATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVS
        +AT+LYCFVHK PVCGECICFPEHQ CVVRTYSEWV++G+YD  P CC C AT +EG G Q TRLGCLH IHT CLVS IKSFPP TAPAGY CPACS  
Subjt:  RATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVS

Query:  IWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADI
        IWPP   KD+GSRLHA L+E I QTGLEKNL GNHPV+ S TESR PPPAFASD L++ S  +H  +          +NL +G+S       SK+ +++I
Subjt:  IWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADI

Query:  VEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVR
        VEI++P S G+++KSSSP    A  RKG    DRQNSE  YYADDEDGNRKKY RRGP +HKFLRALLPFWS+ALPTLPVTAPPRKDA  A+D SEGRVR
Subjt:  VEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVR

Query:  HQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ
        HQR S+MD RKIL+ IA++AC+ATMGILYYRL  + IG+E  D+EQ+
Subjt:  HQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ

AT2G14835.2 RING/U-box superfamily protein2.9e-13166.28Show/hide
Query:  RATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVS
        +AT+LYCFVHK PVCGECICFPEHQ CVVRTYSEWV++G+YD  P CC C AT +EG G Q TRLGCLH IHT CLVS IKSFPP TAPAGY CPACS  
Subjt:  RATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVS

Query:  IWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADI
        IWPP   KD+GSRLHA L+E I QTGLEKNL GNHPV+ S TESR PPPAFASD L++ S  +H  +          +NL +G+S       SK+ +++I
Subjt:  IWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADI

Query:  VEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVR
        VEI++P S G+++KSSSP    A  RKG    DRQNSE  YYADDEDGNRKKY RRGP +HKFLRALLPFWS+ALPTLPVTAPPRKDA  A+D SEGRVR
Subjt:  VEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVR

Query:  HQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ
        HQR S+MD RKIL+ IA++AC+ATMGILYYRL  + IG+E  D+EQ+
Subjt:  HQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATTACAGGCCGGGCAGTCGGGACATGTGAGAGAAAGGGCGACAGAAACATGTGGTGGCATGATCTGCACGTCAAAGACTCGAGCGATGGTTCAGATCTG
ACGTGCGGCACGAACGGAGCGGTGAACGGTAGTGATTTCGACCGGAGAAGAACTCGACCTCTAAAATCTATCGGGAAAATCCGGCCAAGCTTTTCTCCTTTAACT
TTCTCCATGACGACCTCTTCACTCTCCCATCTCTCTGCTCCGCCGTCGTTAGCCATCGACAGCTCAAAGTCTTCCTTCCTCTTCGGTACACAGCTGCCCTTCCCA
TCCTCTCGCTCAAGAACTTCATGTCGCAGATATGTCCTATCTCCCTCCGCCAAACAATCTATGGACCACATTCCTAAGCAGTTTAGAGGAGAAAATCTCAAAGAT
GGATTGATGGAGAACTACAAGAATACTCCTAAGTATCTTTATGGCCTTACACCTTCACAAATGGACATGTTCATGACAGAAGATAATCCCGTCCGGCGGCAGTCA
GAATTAGTTACTGAGGAAAACATATCATCTTCCCACAACTACTTGAATCATGGAGGAATGTGGAGTCTATCTGGCATGGATGGGAAGGGTCCAGCAAAATATAGT
ATGAGTGTTAGCATGTATCGTGGGGGAGGAAGAGGAGCCGGGCGACCTCGAACAGCTCCTCCTGATTTGCCCTCTTTGCTCTTAGATGCTCGTATATGCTATTTG
GGCATGCCCATTGTACCATCAGTGACTGAGCTTCTTGTTGCTCAGTTTATGTGGCTAGATTATGATAACCCGTCAAAGCCTATATATCTCTACATAAACTCTTCT
GGGACACAGAATGAGAAGATGGAGACGGTTGGGTCTGAAACTGAAGCATACGCTATAGCAGATATGATGGCTTACTGCAAATCGGATGTCTATACCGTTAACTGT
GGAATGGCATTCGGTCAAGCGGCAATGCTTTTGTCACTTGGAACCAAGGGTTACCGTGGTGTCCAACCTAACTCTTCCACTAAACTATATCTGCCTAAAGTTAAT
AGATCAAGTGGTGCGGCCATAGATATGTGGATTAAGGCCAAAGAACTCGATGCCAACACCGAGTACTATCTCGAGCTATTAGCTAAAGGAACTGGCAAACCCAAG
GAAGAGATTGCCAAAGATGTCCAACGACCCAGATACTTCCAAGCACAAGAAGCTATTGACTACGGCATTGCAGACAAAATAATCGACTCTCAAGATACCGCATTT
GAAAAAAGGAATTACGACGAGATGCTTGCGCAGTCGAGAGCTATGAGGAGAGGACCAGGAGGCAATCCACAAGCGGCACCGTCTGGGCTCAGAGATTTGACAGCT
TTCAAGATCTACATCCATGTCGTTGAATTTTTAGTTTCCCCATTGTATCGATACAACTCCAACTTCATTTCATCGGACCGTCTCCTCGCCGGTTACTGCAATGGT
GGTCTGCAAATGCCGCAAGGTTTGGTTGATATTTGGGATCGGATCCACGTTTGGGAGTTCGGAATTAGAAGCGATCTGAATGCTTGGTTAATTGCTTGTAGGGAG
AATTGCGGAATTATAAGCTCATTTTCTACGAGGGCCACGAAGTTGTATTGTTTTGTTCATAAGGTTCCCGTTTGTGGAGAATGCATTTGTTTCCCAGAACACCAA
ATATGCGTGGTTCGTACCTATTCAGAGTGGGTATTGAATGGAGATTATGATTGGCCTCCAAATTGCTGCCTGTGTCATGCTACACTCGAGGAAGGAGTTGGTCCT
CAAACTACTCGATTGGGATGTTTGCATGTCATACATACGGATTGCTTGGTCTCACATATCAAGAGCTTTCCTCCATCCACTGCCCCTGCTGGATATGACTGTCCT
GCCTGTTCTGTTTCGATTTGGCCTCCGAAGAACTTTAAAGATTCAGGATCTCGTCTACATGCAAAGCTAAAGGAAGCCATCTTGCAGACTGGTCTGGAAAAGAAT
TTGTTTGGAAATCATCCAGTTGCATTATCACCTACAGAATCCCGTGGTCCTCCTCCTGCCTTTGCCTCAGATCCCTTGGTTTCCTCTTCAGGAGACACACATAAC
AATAAGAGTTCATTAAATTCAATAGCAAACACTGAGTCAAATCTGGGTGAAGGGTTCTCCGCCACAACTGGAAACGGGTCTTCCAAAAATAACATTGCAGATATT
GTAGAGATTGAAATGCCTGGTTCAGAAGGAAATTTTGTGAAAAGCTCAAGTCCTTCAGGCCCTGTTGCTACCACAAGAAAAGGTGCACTCAATTATGACAGGCAA
AATTCTGAAATTTCATATTATGCTGATGATGAAGACGGTAATCGTAAGAAGTATGTTCGAAGAGGTCCTTTTAAGCACAAGTTTCTTAGAGCACTACTTCCTTTC
TGGTCAACTGCATTGCCAACGCTACCCGTGACTGCACCTCCTCGTAAAGATGCACCGAATGCCAATGATGTCAGTGAAGGTCGCGTTCGGCACCAAAGACCCTCC
AGAATGGATCCGAGAAAAATTCTTCTTATCATAGCAATCATGGCATGTCTGGCAACAATGGGTATTTTGTACTACAGACTTGTACAAAGAGATATTGGAGAAGAA
TTTGTGGATGACGAGCAGCAATTGCAAGCAGCACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATAATTACAGGCCGGGCAGTCGGGACATGTGAGAGAAAGGGCGACAGAAACATGTGGTGGCATGATCTGCACGTCAAAGACTCGAGCGATGGTTCAGATCTG
ACGTGCGGCACGAACGGAGCGGTGAACGGTAGTGATTTCGACCGGAGAAGAACTCGACCTCTAAAATCTATCGGGAAAATCCGGCCAAGCTTTTCTCCTTTAACT
TTCTCCATGACGACCTCTTCACTCTCCCATCTCTCTGCTCCGCCGTCGTTAGCCATCGACAGCTCAAAGTCTTCCTTCCTCTTCGGTACACAGCTGCCCTTCCCA
TCCTCTCGCTCAAGAACTTCATGTCGCAGATATGTCCTATCTCCCTCCGCCAAACAATCTATGGACCACATTCCTAAGCAGTTTAGAGGAGAAAATCTCAAAGAT
GGATTGATGGAGAACTACAAGAATACTCCTAAGTATCTTTATGGCCTTACACCTTCACAAATGGACATGTTCATGACAGAAGATAATCCCGTCCGGCGGCAGTCA
GAATTAGTTACTGAGGAAAACATATCATCTTCCCACAACTACTTGAATCATGGAGGAATGTGGAGTCTATCTGGCATGGATGGGAAGGGTCCAGCAAAATATAGT
ATGAGTGTTAGCATGTATCGTGGGGGAGGAAGAGGAGCCGGGCGACCTCGAACAGCTCCTCCTGATTTGCCCTCTTTGCTCTTAGATGCTCGTATATGCTATTTG
GGCATGCCCATTGTACCATCAGTGACTGAGCTTCTTGTTGCTCAGTTTATGTGGCTAGATTATGATAACCCGTCAAAGCCTATATATCTCTACATAAACTCTTCT
GGGACACAGAATGAGAAGATGGAGACGGTTGGGTCTGAAACTGAAGCATACGCTATAGCAGATATGATGGCTTACTGCAAATCGGATGTCTATACCGTTAACTGT
GGAATGGCATTCGGTCAAGCGGCAATGCTTTTGTCACTTGGAACCAAGGGTTACCGTGGTGTCCAACCTAACTCTTCCACTAAACTATATCTGCCTAAAGTTAAT
AGATCAAGTGGTGCGGCCATAGATATGTGGATTAAGGCCAAAGAACTCGATGCCAACACCGAGTACTATCTCGAGCTATTAGCTAAAGGAACTGGCAAACCCAAG
GAAGAGATTGCCAAAGATGTCCAACGACCCAGATACTTCCAAGCACAAGAAGCTATTGACTACGGCATTGCAGACAAAATAATCGACTCTCAAGATACCGCATTT
GAAAAAAGGAATTACGACGAGATGCTTGCGCAGTCGAGAGCTATGAGGAGAGGACCAGGAGGCAATCCACAAGCGGCACCGTCTGGGCTCAGAGATTTGACAGCT
TTCAAGATCTACATCCATGTCGTTGAATTTTTAGTTTCCCCATTGTATCGATACAACTCCAACTTCATTTCATCGGACCGTCTCCTCGCCGGTTACTGCAATGGT
GGTCTGCAAATGCCGCAAGGTTTGGTTGATATTTGGGATCGGATCCACGTTTGGGAGTTCGGAATTAGAAGCGATCTGAATGCTTGGTTAATTGCTTGTAGGGAG
AATTGCGGAATTATAAGCTCATTTTCTACGAGGGCCACGAAGTTGTATTGTTTTGTTCATAAGGTTCCCGTTTGTGGAGAATGCATTTGTTTCCCAGAACACCAA
ATATGCGTGGTTCGTACCTATTCAGAGTGGGTATTGAATGGAGATTATGATTGGCCTCCAAATTGCTGCCTGTGTCATGCTACACTCGAGGAAGGAGTTGGTCCT
CAAACTACTCGATTGGGATGTTTGCATGTCATACATACGGATTGCTTGGTCTCACATATCAAGAGCTTTCCTCCATCCACTGCCCCTGCTGGATATGACTGTCCT
GCCTGTTCTGTTTCGATTTGGCCTCCGAAGAACTTTAAAGATTCAGGATCTCGTCTACATGCAAAGCTAAAGGAAGCCATCTTGCAGACTGGTCTGGAAAAGAAT
TTGTTTGGAAATCATCCAGTTGCATTATCACCTACAGAATCCCGTGGTCCTCCTCCTGCCTTTGCCTCAGATCCCTTGGTTTCCTCTTCAGGAGACACACATAAC
AATAAGAGTTCATTAAATTCAATAGCAAACACTGAGTCAAATCTGGGTGAAGGGTTCTCCGCCACAACTGGAAACGGGTCTTCCAAAAATAACATTGCAGATATT
GTAGAGATTGAAATGCCTGGTTCAGAAGGAAATTTTGTGAAAAGCTCAAGTCCTTCAGGCCCTGTTGCTACCACAAGAAAAGGTGCACTCAATTATGACAGGCAA
AATTCTGAAATTTCATATTATGCTGATGATGAAGACGGTAATCGTAAGAAGTATGTTCGAAGAGGTCCTTTTAAGCACAAGTTTCTTAGAGCACTACTTCCTTTC
TGGTCAACTGCATTGCCAACGCTACCCGTGACTGCACCTCCTCGTAAAGATGCACCGAATGCCAATGATGTCAGTGAAGGTCGCGTTCGGCACCAAAGACCCTCC
AGAATGGATCCGAGAAAAATTCTTCTTATCATAGCAATCATGGCATGTCTGGCAACAATGGGTATTTTGTACTACAGACTTGTACAAAGAGATATTGGAGAAGAA
TTTGTGGATGACGAGCAGCAATTGCAAGCAGCACAATGAAAAAAAA
Protein sequenceShow/hide protein sequence
MIITGRAVGTCERKGDRNMWWHDLHVKDSSDGSDLTCGTNGAVNGSDFDRRRTRPLKSIGKIRPSFSPLTFSMTTSSLSHLSAPPSLAIDSSKSSFLFGTQLPFP
SSRSRTSCRRYVLSPSAKQSMDHIPKQFRGENLKDGLMENYKNTPKYLYGLTPSQMDMFMTEDNPVRRQSELVTEENISSSHNYLNHGGMWSLSGMDGKGPAKYS
MSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPSVTELLVAQFMWLDYDNPSKPIYLYINSSGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNC
GMAFGQAAMLLSLGTKGYRGVQPNSSTKLYLPKVNRSSGAAIDMWIKAKELDANTEYYLELLAKGTGKPKEEIAKDVQRPRYFQAQEAIDYGIADKIIDSQDTAF
EKRNYDEMLAQSRAMRRGPGGNPQAAPSGLRDLTAFKIYIHVVEFLVSPLYRYNSNFISSDRLLAGYCNGGLQMPQGLVDIWDRIHVWEFGIRSDLNAWLIACRE
NCGIISSFSTRATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCP
ACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVALSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANTESNLGEGFSATTGNGSSKNNIADI
VEIEMPGSEGNFVKSSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDAPNANDVSEGRVRHQRPS
RMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQLQAAQ