; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018694 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018694
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionATP-dependent Clp protease proteolytic subunit
Genome locationChr04:6934603..6947147
RNA-Seq ExpressionHG10018694
SyntenyHG10018694
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0009526 - plastid envelope (cellular component)
GO:0009570 - chloroplast stroma (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR001907 - ATP-dependent Clp protease proteolytic subunit
IPR023562 - Clp protease proteolytic subunit /Translocation-enhancing protein TepA
IPR029045 - ClpP/crotonase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF1878300.1 hypothetical protein Lal_00046967 [Lupinus albus]0.0e+0070.1Show/hide
Query:  MATSSLSHLSAPPS--LAIDSS--KSSFLCGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFM
        MA+S L  LS P +     D+S  KSSFL  T   FPS   S ++ S R +  SPSAK S DHIPK+FR +NLKDG+M+NYKNAP+YLYGL+PSQMDMFM
Subjt:  MATSSLSHLSAPPS--LAIDSS--KSSFLCGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFM

Query:  TEDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLD
        TEDNP+R+Q+E VTE++ISS+ NYL+HGGMWS S M   GPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLD
Subjt:  TEDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLD

Query:  YDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKE
        YDNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKE
Subjt:  YDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKE

Query:  LDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFH----RSVSS
        L+ANTEYY+ELLAKGIGK KEEI KDVQRPKYFQAQEAI+YG+ADKIIDSRD+ F+KRNYDEM+AQSR      GGNPQ APSGF          R    
Subjt:  LDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFH----RSVSS

Query:  P---------------VTAMVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTD
        P               V AMVVCKCRKATKLYCFVHKVPVCGECICFP+HQICV+RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT 
Subjt:  P---------------VTAMVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTD

Query:  CLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIA
        CLVSHIKSFPP TAPAGY CP+CS  IWPPK+ KDS SRLH+KLKEAI+QTG+EKNLFGNHPV L  TES GPPPAFASDPL+    + H N        
Subjt:  CLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIA

Query:  NIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRR------------------
           SD  +GFS  TG+  SK  + DIVE++ P S GNF++GSSP GP ATTRK  +  +RQNSEISYYADDED NRKKY RR                  
Subjt:  NIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRR------------------

Query:  --------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYY
                                  GPF+HKFLRALLPFWS+ALPTLPVTAPPRKDA+N  + SEGR RHQR SRMDPRKILL+IAIMAC+ATMGILYY
Subjt:  --------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYY

Query:  RLVQRDIGEEFVDDEQ
        RLVQR  GEEF  DEQ
Subjt:  RLVQRDIGEEFVDDEQ

KAF1880709.1 hypothetical protein Lal_00011768 [Lupinus albus]4.0e-26148.76Show/hide
Query:  MATSSLSHLSAP--PSLAID--SSKSSFLCGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMT
        MA+S L  LS P  P   ID  +SKSSF  GT   FPS    T+   RR   SP AK S DHIP +FR ENL+DGLM+NYKN PKYLYGL+PSQMDMF+T
Subjt:  MATSSLSHLSAP--PSLAID--SSKSSFLCGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMT

Query:  EDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNP+R+QSE VTE++ISS+ NYL+HGGMWS S M   GPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL

Query:  DANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTS------------
        +ANTEYY+ELLAKGIGK KEEI KDVQRPKYFQAQEAI+YGIADKIIDSRD+ F+KRNYDEML+QSR      GGNPQ APSGFS  +            
Subjt:  DANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTS------------

Query:  ---------------------------------------------------------------FHRSVSSPVTAM-------------------------
                                                                        +R V + +  M                         
Subjt:  ---------------------------------------------------------------FHRSVSSPVTAM-------------------------

Query:  ----------------------------------------------------------------------------VVCK--------------------
                                                                                    V+C+                    
Subjt:  ----------------------------------------------------------------------------VVCK--------------------

Query:  -------------------------------------------------------------------------------------CRKATKLYC------
                                                                                             C K T  YC      
Subjt:  -------------------------------------------------------------------------------------CRKATKLYC------

Query:  --------------------------------------------------------FVHKVPVCGEC---------------ICFPEHQI--CV------
                                                                 +  + VCG+C               I    +++  C+      
Subjt:  --------------------------------------------------------FVHKVPVCGEC---------------ICFPEHQI--CV------

Query:  --------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRL
                      +RTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHVIHT+CLVSHIK FPP TAPAGY CP+CS  IWPPK+ KDSGSRL
Subjt:  --------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRL

Query:  HAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVK
        H+KLKE I+QTG+EKNLFGNHPV LS TESRGPPPAFASDPL+    + H N  SL           +G+S  TG+ +SK    DIVEI+   S GNFV+
Subjt:  HAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVK

Query:  GSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILL
        GSSP GP ATTRKG +  +RQNSEISYYADDED NRKKY RRGPF+HKFLRALLPFWS+ALPTLPV+APP+KDA+N  + SEGR RHQR SRMDPRKILL
Subjt:  GSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILL

Query:  IIAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        +IAIMAC+ATMGILYYRL QR  GEE   DEQ
Subjt:  IIAIMACLATMGILYYRLVQRDIGEEFVDDEQ

KAG6588711.1 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.85Show/hide
Query:  MATSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVR
        MATSS  H+SA PSLA+ SSKSSFL GT LPFPSSR RTS RRY LSPSAK+SMDHIPK+FR ENLKDGLMENY+N P+ LYGLTPSQ+DMFMTEDNPVR
Subjt:  MATSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVR

Query:  RQSELVTEQNISSSHNYLNHGGMWSLSGMNG-KGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
        RQSELVTE++ISS+H+YL +GGMWSLSGM+G KGPSKYSMS SMYRGGGRG GR ++APPDLPSLLLDARI YLGMPIVPAVTELLVAQFMWLDYDNPSK
Subjt:  RQSELVTEQNISSSHNYLNHGGMWSLSGMNG-KGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTE
        PIYLYINSPGTQNEKME VG ETEAYAIADMMAYCK DVYT+NCGMA+GQAAMLLSLGTKGYRAVQPNSS KLYLPKV+RSSGAVIDMWIKA+ELDANT+
Subjt:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTE

Query:  YYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSR----GPGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKC
        YY+ELLAKG GKP EEI KD+QRPKY   QEAIDYG+ DKII SRDSAFEKRNYD+MLAQSR    G GGNPQAAPSG        SVS+PVTAMVVCKC
Subjt:  YYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSR----GPGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKC

Query:  RKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSV
        RKATKLYCFVHKVPVCGECICFPEHQICV+RTYSEWVLNGDYDWPPNCCLCHATLEEG GPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACS+
Subjt:  RKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSV

Query:  SIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIAD
        SIWPPKN KDSGSRLHAKLKEAILQTGLEK+LFGNHPVGLS TES GPPPAFASDPLVSSSGD HNNKSSLNSIAN+ S+ GEGFSATTG GSSKN+I+D
Subjt:  SIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIAD

Query:  IVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRV
        IVEIE+PG EGNFVKGSSPS PVATTRKGA+NYDRQ+SEISYYADDEDGNRKKYVRRGPF+HKFLRALLPFWSTALPTLPVTAPPRKD+   NDVSEGRV
Subjt:  IVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRV

Query:  RHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQLQAAQ
        RHQRPSRMDPRKILL+IAIMACLATMGILYYRL QR IGEE V+DEQQL+AAQ
Subjt:  RHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQLQAAQ

OIV91138.1 hypothetical protein TanjilG_30360 [Lupinus angustifolius]0.0e+0073.99Show/hide
Query:  MATSSLSHLSAP--PSLAIDSS--KSSFLCGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMT
        MA+S L  LS P  PS   D+S  KSSF+ GT   FPS    T+   RR   SPSAK S DHIPK+FR +NLKDG+M+NYKNAP+YLYGL+PSQMDMFMT
Subjt:  MATSSLSHLSAP--PSLAIDSS--KSSFLCGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMT

Query:  EDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNP+R+Q+E VTE++ISS+ NYL+HGGMWS S M   GPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL

Query:  DANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGF-----SPTSFHRSVSS
        +ANTEYY+ELLAKGIGK KEEI KDVQRPKYFQAQEAI+YGI DKIIDSRD+ F+KRNYDEM+AQSR      GGNPQ APSGF     S +  +     
Subjt:  DANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGF-----SPTSFHRSVSS

Query:  PVTAMVVCKCR--------KATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIK
         + +M+  K          +ATKLYCFVHKVPVCGECICFP+HQICV+RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT CLVSHIK
Subjt:  PVTAMVVCKCR--------KATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIK

Query:  SFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLG
        SFPP TAPAGY CP+CS  IWPPK+ KDS SRLH+KLKEAI+QTG+EKNLFGNHPV LS TESRGPPPAFASDPL+    + H N           SD  
Subjt:  SFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLG

Query:  EGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVT
        +GFS  TG+  SK  + DIVE++ P S GNF++GSSP GP ATTRK  +  +RQNSEISYYADDED NRKKY RRGPF+HKFLRALLPFWS+ALPTLPVT
Subjt:  EGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVT

Query:  APPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        APPRKDA+N+ + SEGR RHQR SRMDPRKILL+IAIMACLATMGILYYRL QR  GEEF  DEQ
Subjt:  APPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQ

RDY10958.1 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic, partial [Mucuna pruriens]3.8e-29672.98Show/hide
Query:  TSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVRRQ
        +SSLS   + PS+      SSFL GT+L FP S       R   S SAK S+DHIPK+FR ENL+DGLMEN+KNAP+YLYGLTPSQMDMFMTEDNP+R+Q
Subjt:  TSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVRRQ

Query:  SELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIY
        +E VTE++ISS+ NY++HGGMWSLS M     SKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNP+KPIY
Subjt:  SELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIY

Query:  LYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYL
        LYINS GTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL+ANTEYY+
Subjt:  LYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYL

Query:  ELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKCRKA
        ELLAKGIGK KEEI K+VQRPKYFQAQEAIDYGIADK IDSRD  FEKRNYDEMLAQSR      GGNPQ       P + H         +       A
Subjt:  ELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKCRKA

Query:  TKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIW
        TKLYCFVHKVPVCGECICFPEHQICVVRTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHV+HT+CLVSHIKSF P TAPAGY CP+CS SIW
Subjt:  TKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIW

Query:  PPKNFKDSGSRLHAKLKEAILQ------------TGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGN
        PPK+ KDSGSRLH+KLKEAI+Q            +G+EKN+FGNHPV LS TESR PPPAFASDPL+S   + H N  S+           +GFS  TG+
Subjt:  PPKNFKDSGSRLHAKLKEAILQ------------TGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGN

Query:  GSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASN
           K  + DIVEI+   S GNF+K SSP  P ATTRKG+++ +RQNSEISYYADDEDGNRKKY +RGPF HKFLRALLPFWS ALPTLPVTAP RKDASN
Subjt:  GSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASN

Query:  TNDVSEGRVRHQRPSRMDPRKILLIIAIM
          D SEGR RHQR S MDPRKILL+IAI+
Subjt:  TNDVSEGRVRHQRPSRMDPRKILLIIAIM

TrEMBL top hitse value%identityAlignment
A0A1S3C0P4 zinc finger protein-like 1 homolog8.5e-20195.26Show/hide
Query:  MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD
        MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNG+YDWPPNCCLCHATL EGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD
Subjt:  MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD

Query:  CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSS
        CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGN+PVG SPTES  PPPAFASDPLVS+S DTHNNKSSLNS+ANIES+LGEGFSATTG GSS
Subjt:  CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSS

Query:  KNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTND
        KN+IADIVEIEMPGSEGNFVKGSSPSGPVATTRKGA+NYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDA N ND
Subjt:  KNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTND

Query:  VSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQLQAAQ
        V+EGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRD+GEEFVDDEQQLQAAQ
Subjt:  VSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQLQAAQ

A0A371I7F6 ATP-dependent Clp protease proteolytic subunit (Fragment)1.8e-29672.98Show/hide
Query:  TSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVRRQ
        +SSLS   + PS+      SSFL GT+L FP S       R   S SAK S+DHIPK+FR ENL+DGLMEN+KNAP+YLYGLTPSQMDMFMTEDNP+R+Q
Subjt:  TSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVRRQ

Query:  SELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIY
        +E VTE++ISS+ NY++HGGMWSLS M     SKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNP+KPIY
Subjt:  SELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIY

Query:  LYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYL
        LYINS GTQNEK ETVGSETEAY+IADMM+Y K+DVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL+ANTEYY+
Subjt:  LYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYL

Query:  ELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKCRKA
        ELLAKGIGK KEEI K+VQRPKYFQAQEAIDYGIADK IDSRD  FEKRNYDEMLAQSR      GGNPQ       P + H         +       A
Subjt:  ELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKCRKA

Query:  TKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIW
        TKLYCFVHKVPVCGECICFPEHQICVVRTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHV+HT+CLVSHIKSF P TAPAGY CP+CS SIW
Subjt:  TKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIW

Query:  PPKNFKDSGSRLHAKLKEAILQ------------TGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGN
        PPK+ KDSGSRLH+KLKEAI+Q            +G+EKN+FGNHPV LS TESR PPPAFASDPL+S   + H N  S+           +GFS  TG+
Subjt:  PPKNFKDSGSRLHAKLKEAILQ------------TGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGN

Query:  GSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASN
           K  + DIVEI+   S GNF+K SSP  P ATTRKG+++ +RQNSEISYYADDEDGNRKKY +RGPF HKFLRALLPFWS ALPTLPVTAP RKDASN
Subjt:  GSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASN

Query:  TNDVSEGRVRHQRPSRMDPRKILLIIAIM
          D SEGR RHQR S MDPRKILL+IAI+
Subjt:  TNDVSEGRVRHQRPSRMDPRKILLIIAIM

A0A6A5N4U1 ATP-dependent Clp protease proteolytic subunit0.0e+0070.1Show/hide
Query:  MATSSLSHLSAPPS--LAIDSS--KSSFLCGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFM
        MA+S L  LS P +     D+S  KSSFL  T   FPS   S ++ S R +  SPSAK S DHIPK+FR +NLKDG+M+NYKNAP+YLYGL+PSQMDMFM
Subjt:  MATSSLSHLSAPPS--LAIDSS--KSSFLCGTQLPFPS---SRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFM

Query:  TEDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLD
        TEDNP+R+Q+E VTE++ISS+ NYL+HGGMWS S M   GPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLD
Subjt:  TEDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLD

Query:  YDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKE
        YDNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKE
Subjt:  YDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKE

Query:  LDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFH----RSVSS
        L+ANTEYY+ELLAKGIGK KEEI KDVQRPKYFQAQEAI+YG+ADKIIDSRD+ F+KRNYDEM+AQSR      GGNPQ APSGF          R    
Subjt:  LDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTSFH----RSVSS

Query:  P---------------VTAMVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTD
        P               V AMVVCKCRKATKLYCFVHKVPVCGECICFP+HQICV+RTYSEWV++G+YDWPP CC C + LEEG G QTTRLGCLHVIHT 
Subjt:  P---------------VTAMVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTD

Query:  CLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIA
        CLVSHIKSFPP TAPAGY CP+CS  IWPPK+ KDS SRLH+KLKEAI+QTG+EKNLFGNHPV L  TES GPPPAFASDPL+    + H N        
Subjt:  CLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIA

Query:  NIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRR------------------
           SD  +GFS  TG+  SK  + DIVE++ P S GNF++GSSP GP ATTRK  +  +RQNSEISYYADDED NRKKY RR                  
Subjt:  NIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRR------------------

Query:  --------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYY
                                  GPF+HKFLRALLPFWS+ALPTLPVTAPPRKDA+N  + SEGR RHQR SRMDPRKILL+IAIMAC+ATMGILYY
Subjt:  --------------------------GPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYY

Query:  RLVQRDIGEEFVDDEQ
        RLVQR  GEEF  DEQ
Subjt:  RLVQRDIGEEFVDDEQ

A0A6A5N6H9 RING-type domain-containing protein1.9e-26148.76Show/hide
Query:  MATSSLSHLSAP--PSLAID--SSKSSFLCGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMT
        MA+S L  LS P  P   ID  +SKSSF  GT   FPS    T+   RR   SP AK S DHIP +FR ENL+DGLM+NYKN PKYLYGL+PSQMDMF+T
Subjt:  MATSSLSHLSAP--PSLAID--SSKSSFLCGTQLPFPSSRSRTS--CRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMT

Query:  EDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY
        EDNP+R+QSE VTE++ISS+ NYL+HGGMWS S M   GPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAV ELLVAQFMWLDY
Subjt:  EDNPVRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDY

Query:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL
        DNP KPIYLYINS GTQNEK ETVGSETEAY+IADMM+Y KSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL
Subjt:  DNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKEL

Query:  DANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTS------------
        +ANTEYY+ELLAKGIGK KEEI KDVQRPKYFQAQEAI+YGIADKIIDSRD+ F+KRNYDEML+QSR      GGNPQ APSGFS  +            
Subjt:  DANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSRG----PGGNPQAAPSGFSPTS------------

Query:  ---------------------------------------------------------------FHRSVSSPVTAM-------------------------
                                                                        +R V + +  M                         
Subjt:  ---------------------------------------------------------------FHRSVSSPVTAM-------------------------

Query:  ----------------------------------------------------------------------------VVCK--------------------
                                                                                    V+C+                    
Subjt:  ----------------------------------------------------------------------------VVCK--------------------

Query:  -------------------------------------------------------------------------------------CRKATKLYC------
                                                                                             C K T  YC      
Subjt:  -------------------------------------------------------------------------------------CRKATKLYC------

Query:  --------------------------------------------------------FVHKVPVCGEC---------------ICFPEHQI--CV------
                                                                 +  + VCG+C               I    +++  C+      
Subjt:  --------------------------------------------------------FVHKVPVCGEC---------------ICFPEHQI--CV------

Query:  --------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRL
                      +RTYSEWV++G+YDWPP CC C A LEEG G QTTRLGCLHVIHT+CLVSHIK FPP TAPAGY CP+CS  IWPPK+ KDSGSRL
Subjt:  --------------VRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRL

Query:  HAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVK
        H+KLKE I+QTG+EKNLFGNHPV LS TESRGPPPAFASDPL+    + H N  SL           +G+S  TG+ +SK    DIVEI+   S GNFV+
Subjt:  HAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVK

Query:  GSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILL
        GSSP GP ATTRKG +  +RQNSEISYYADDED NRKKY RRGPF+HKFLRALLPFWS+ALPTLPV+APP+KDA+N  + SEGR RHQR SRMDPRKILL
Subjt:  GSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILL

Query:  IIAIMACLATMGILYYRLVQRDIGEEFVDDEQ
        +IAIMAC+ATMGILYYRL QR  GEE   DEQ
Subjt:  IIAIMACLATMGILYYRLVQRDIGEEFVDDEQ

A0A6J1D6K6 ATP-dependent Clp protease proteolytic subunit6.1e-19993.67Show/hide
Query:  MATSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAK-QSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPV
        MATSSLS+LSAPPSLAIDSSKS F+CGTQL FPSSRSRT+CRRY LSPSAK  SMDHIPK+FRGENLKDGLMENYKNAP+YLYGLTPSQMDMFMTEDNP+
Subjt:  MATSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAK-QSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPV

Query:  RRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
        RRQSELVTE+NISSSHNYLNHGGMWSLSGMN KGPSKYSMSVSMYRGGGRG+GRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK
Subjt:  RRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSK

Query:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTE
        PIYLYINS GTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTE
Subjt:  PIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTE

Query:  YYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSR----GPGGNPQAAPSGF
        YY+ELLAKGIGKPKEEITKDVQRPKYFQAQEA+DYG+ADKIIDS+DSAFEKRNYDEMLAQSR    G GGNPQAAPSGF
Subjt:  YYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQSR----GPGGNPQAAPSGF

SwissProt top hitse value%identityAlignment
P74466 Putative ATP-dependent Clp protease proteolytic subunit-like1.5e-3742.71Show/hide
Query:  RTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG
        +T PPDL SLLL  RI YLGMP+  +          VT+L++AQ ++L +D+P KPIY YINS GT     + VG ETEA+AI D + Y K  V+T+  G
Subjt:  RTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG

Query:  MAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDS
         A G AAM+LS GTKGYRA  P+++  L   +   + G   D+ I+AKE+ +N +  LE+L+   G+ +E++ KD+ R  Y    +A +YG+ D++++S
Subjt:  MAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDS

Q8L770 ATP-dependent Clp protease proteolytic subunit-related protein 3, chloroplastic4.1e-4345.24Show/hide
Query:  PSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAY
        P   S    + R       RPRT PPDLPS+LLD RI Y+GMP+VPAVTEL+VA+ M+L + +P +PIY+YINS GT  +  ETVG E+E +AI D +  
Subjt:  PSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAY

Query:  CKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAV--IDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEA
         K++V+TV  G A GQA +LLS GTKG R + P++   +  P+V  SSG +   D+ I+AKE+  N +  +ELL+K  G   E +   ++RP Y  A +A
Subjt:  CKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAV--IDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEA

Query:  IDYGIADKII
         ++G+ D+I+
Subjt:  IDYGIADKII

Q8LB10 ATP-dependent Clp protease proteolytic subunit-related protein 4, chloroplastic1.6e-3943.9Show/hide
Query:  PPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLG
        PPDL S L   RI YLGM +VP+VTEL++A+F++L Y++  KPIYLYINS GT  +  E +G +TEA+AI D+M Y K  ++T+  G A+G+AA+LL+ G
Subjt:  PPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLG

Query:  TKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEML
         KG R+  P+S+  +  P + R  G   D+ I  KE+       ++L +K IGK  E+I  D++RPKYF   EA++YGI DK++           Y+E  
Subjt:  TKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEML

Query:  AQSRG
        +Q RG
Subjt:  AQSRG

Q9L4P4 Putative ATP-dependent Clp protease proteolytic subunit-like3.3e-4044.5Show/hide
Query:  RTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG
        RT PPDLPSLLL  RI YLGMP+  +          VTEL++AQ ++L++DNP KPIY YINS GT     + +G ETEA+AI D M Y K  V+T+  G
Subjt:  RTAPPDLPSLLLDARICYLGMPIVPA----------VTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCG

Query:  MAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSR
         A G AAM+LS GT G RA  P+++  L  P+   + G   D+ I+AKE+ AN    LE+ A+  G+  + + +D  R  Y    +A++YG+ D+++DSR
Subjt:  MAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSR

Q9XJ35 ATP-dependent Clp protease proteolytic subunit-related protein 1, chloroplastic1.2e-14872.49Show/hide
Query:  TSSLSHLSAPPSLAIDSSKSSFLCGTQL---PFPSSRSRTSCRRYVLSPSAK-QSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNP
        TS L+H +      +   KS F+ G++L     P S      RR     SAK  S DHIPK+FRG+NLKDG+M+N+KN P+Y YGL  +QMDMFMTED+P
Subjt:  TSSLSHLSAPPSLAIDSSKSSFLCGTQL---PFPSSRSRTSCRRYVLSPSAK-QSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNP

Query:  VRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDN
        VRRQ+E VTE++ISS +NYLN+GG+WS+SGMN     +YSMSV MYRGGG G G  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDN
Subjt:  VRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDN

Query:  PSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDA
        P+KPIYLYINSPGTQNEKMETVGSETEAYAIAD ++YCKSDVYT+NCGMA+GQAAMLLSLG KGYRAVQP+SSTKLYLPKVNRSSGA IDMWIKAKELDA
Subjt:  PSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDA

Query:  NTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQ-SRGPGGNPQAAPSG
        NTEYY+ELLAKG GK KE+I +D++RPKY QAQ AIDYGIADKI DS+DS+FEKR+YD  LAQ +  PGG   AAP+G
Subjt:  NTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQ-SRGPGGNPQAAPSG

Arabidopsis top hitse value%identityAlignment
AT1G09130.1 ATP-dependent caseinolytic (Clp) protease/crotonase family protein2.9e-4445.24Show/hide
Query:  PSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAY
        P   S    + R       RPRT PPDLPS+LLD RI Y+GMP+VPAVTEL+VA+ M+L + +P +PIY+YINS GT  +  ETVG E+E +AI D +  
Subjt:  PSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAY

Query:  CKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAV--IDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEA
         K++V+TV  G A GQA +LLS GTKG R + P++   +  P+V  SSG +   D+ I+AKE+  N +  +ELL+K  G   E +   ++RP Y  A +A
Subjt:  CKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAV--IDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEA

Query:  IDYGIADKII
         ++G+ D+I+
Subjt:  IDYGIADKII

AT1G09130.2 ATP-dependent caseinolytic (Clp) protease/crotonase family protein2.9e-4445.24Show/hide
Query:  PSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAY
        P   S    + R       RPRT PPDLPS+LLD RI Y+GMP+VPAVTEL+VA+ M+L + +P +PIY+YINS GT  +  ETVG E+E +AI D +  
Subjt:  PSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAY

Query:  CKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAV--IDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEA
         K++V+TV  G A GQA +LLS GTKG R + P++   +  P+V  SSG +   D+ I+AKE+  N +  +ELL+K  G   E +   ++RP Y  A +A
Subjt:  CKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAV--IDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEA

Query:  IDYGIADKII
         ++G+ D+I+
Subjt:  IDYGIADKII

AT1G49970.1 CLP protease proteolytic subunit 18.8e-15072.49Show/hide
Query:  TSSLSHLSAPPSLAIDSSKSSFLCGTQL---PFPSSRSRTSCRRYVLSPSAK-QSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNP
        TS L+H +      +   KS F+ G++L     P S      RR     SAK  S DHIPK+FRG+NLKDG+M+N+KN P+Y YGL  +QMDMFMTED+P
Subjt:  TSSLSHLSAPPSLAIDSSKSSFLCGTQL---PFPSSRSRTSCRRYVLSPSAK-QSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNP

Query:  VRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDN
        VRRQ+E VTE++ISS +NYLN+GG+WS+SGMN     +YSMSV MYRGGG G G  RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDN
Subjt:  VRRQSELVTEQNISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAG--RPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDN

Query:  PSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDA
        P+KPIYLYINSPGTQNEKMETVGSETEAYAIAD ++YCKSDVYT+NCGMA+GQAAMLLSLG KGYRAVQP+SSTKLYLPKVNRSSGA IDMWIKAKELDA
Subjt:  PSKPIYLYINSPGTQNEKMETVGSETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDA

Query:  NTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQ-SRGPGGNPQAAPSG
        NTEYY+ELLAKG GK KE+I +D++RPKY QAQ AIDYGIADKI DS+DS+FEKR+YD  LAQ +  PGG   AAP+G
Subjt:  NTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQEAIDYGIADKIIDSRDSAFEKRNYDEMLAQ-SRGPGGNPQAAPSG

AT2G14835.1 RING/U-box superfamily protein4.7e-13566.1Show/hide
Query:  MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD
        MVVCKC+KAT+LYCFVHK PVCGECICFPEHQ CVVRTYSEWV++G+YD  P CC C AT +EG G Q TRLGCLH IHT CLVS IKSFPP TAPAGY 
Subjt:  MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD

Query:  CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSS
        CPACS  IWPP   KD+GSRLHA L+E I QTGLEKNL GNHPV  S TESR PPPAFASD L++ S  +H  +          ++L +G+S       S
Subjt:  CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSS

Query:  KNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTND
        K+ +++IVEI++P S G+++K SSP    A  RKG    DRQNSE  YYADDEDGNRKKY RRGP +HKFLRALLPFWS+ALPTLPVTAPPRKDA+  +D
Subjt:  KNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTND

Query:  VSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ
         SEGRVRHQR S+MD RKIL+ IA++AC+ATMGILYYRL  + IG+E  D+EQ+
Subjt:  VSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ

AT2G14835.2 RING/U-box superfamily protein4.7e-13566.1Show/hide
Query:  MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD
        MVVCKC+KAT+LYCFVHK PVCGECICFPEHQ CVVRTYSEWV++G+YD  P CC C AT +EG G Q TRLGCLH IHT CLVS IKSFPP TAPAGY 
Subjt:  MVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWPPNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYD

Query:  CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSS
        CPACS  IWPP   KD+GSRLHA L+E I QTGLEKNL GNHPV  S TESR PPPAFASD L++ S  +H  +          ++L +G+S       S
Subjt:  CPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASDPLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSS

Query:  KNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTND
        K+ +++IVEI++P S G+++K SSP    A  RKG    DRQNSE  YYADDEDGNRKKY RRGP +HKFLRALLPFWS+ALPTLPVTAPPRKDA+  +D
Subjt:  KNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFLRALLPFWSTALPTLPVTAPPRKDASNTND

Query:  VSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ
         SEGRVRHQR S+MD RKIL+ IA++AC+ATMGILYYRL  + IG+E  D+EQ+
Subjt:  VSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACTTCTTCACTCTCCCATCTCTCTGCTCCACCGTCGTTGGCCATCGACAGCTCAAAGTCTTCCTTCCTCTGCGGTACGCAGCTGCCCTTCCCTTCCTCTCGCTC
AAGAACTTCATGTCGCAGATACGTCCTATCTCCCTCCGCCAAACAATCGATGGATCACATTCCTAAGGAGTTTAGAGGAGAAAATCTCAAAGATGGATTGATGGAGAACT
ACAAGAATGCTCCTAAGTATCTTTATGGCCTTACACCTTCACAAATGGACATGTTCATGACAGAAGATAATCCCGTCCGGCGGCAGTCAGAATTAGTTACAGAGCAAAAC
ATATCATCTTCCCACAACTACTTGAATCATGGAGGAATGTGGAGTCTATCTGGCATGAATGGGAAGGGTCCATCAAAATATAGTATGAGTGTTAGCATGTATCGTGGGGG
AGGAAGAGGAGCTGGACGACCTCGAACAGCTCCTCCTGATTTGCCCTCTTTGCTCTTAGATGCTCGTATATGCTATTTGGGCATGCCTATTGTACCAGCAGTGACTGAGC
TTCTTGTTGCTCAGTTTATGTGGCTAGATTATGATAACCCGTCAAAGCCTATATATCTCTACATAAACTCTCCTGGGACACAGAATGAGAAGATGGAGACTGTTGGATCT
GAAACTGAAGCATATGCTATAGCAGATATGATGGCTTACTGCAAATCGGATGTCTATACCGTCAACTGTGGAATGGCATATGGTCAAGCGGCAATGCTTTTGTCACTTGG
AACCAAGGGCTACCGTGCTGTCCAACCTAACTCTTCCACTAAACTATATCTACCTAAAGTTAATAGATCAAGTGGTGCGGTCATAGATATGTGGATTAAGGCTAAAGAAC
TCGATGCCAACACGGAGTACTATCTCGAGCTATTAGCTAAAGGAATTGGTAAACCCAAGGAAGAGATTACTAAAGATGTCCAACGACCCAAATACTTCCAAGCACAAGAA
GCTATTGACTACGGCATTGCAGACAAAATAATCGACTCTCGAGATTCCGCATTTGAAAAAAGGAATTACGACGAGATGCTTGCGCAGTCGAGAGGACCAGGAGGCAATCC
ACAAGCAGCACCATCCGGGTTCAGTCCAACTTCATTTCATCGGAGCGTCTCCTCGCCGGTTACTGCAATGGTGGTCTGCAAATGCCGCAAGGCCACGAAATTGTATTGTT
TTGTTCATAAGGTTCCCGTATGTGGAGAATGCATTTGTTTCCCAGAACACCAAATATGCGTGGTTCGTACCTATTCAGAGTGGGTATTGAATGGAGATTATGATTGGCCT
CCAAATTGCTGCCTGTGTCATGCTACACTTGAGGAAGGAGTTGGTCCTCAAACTACTCGATTGGGATGCTTGCATGTCATACATACGGATTGCTTGGTCTCACATATCAA
GAGCTTTCCTCCATCCACTGCCCCTGCTGGATATGACTGTCCTGCCTGTTCAGTTTCGATATGGCCTCCGAAGAACTTTAAAGATTCAGGATCTCGTCTACATGCAAAGC
TAAAGGAAGCTATCTTGCAGACTGGTCTGGAAAAGAATTTGTTTGGAAATCATCCAGTTGGATTATCACCAACAGAATCCCGTGGTCCTCCCCCTGCCTTTGCCTCAGAT
CCCTTGGTTTCCTCTTCAGGAGACACACATAACAATAAGAGTTCATTAAATTCAATAGCAAACATTGAGTCAGATCTGGGTGAAGGGTTCTCCGCCACAACTGGAAATGG
GTCTTCCAAAAATGACATTGCAGATATTGTAGAGATTGAAATGCCTGGTTCAGAAGGAAATTTTGTGAAAGGCTCAAGTCCTTCAGGCCCTGTTGCTACCACAAGAAAAG
GTGCACTCAATTATGACAGGCAAAATTCTGAAATTTCATATTATGCTGATGATGAAGACGGTAATCGCAAGAAGTATGTTCGAAGAGGTCCTTTTAAGCACAAGTTTCTT
AGAGCCCTACTTCCTTTCTGGTCAACTGCATTGCCAACTCTACCCGTTACTGCACCGCCTCGTAAAGATGCATCAAATACCAATGATGTCAGTGAAGGTCGTGTTCGGCA
CCAAAGACCCTCCAGAATGGATCCAAGAAAAATTCTTCTTATCATAGCAATCATGGCATGCCTGGCAACGATGGGTATTTTATACTACAGACTTGTACAAAGAGATATCG
GAGAAGAATTTGTGGATGACGAGCAGCAACTGCAAGCAGCACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACTTCTTCACTCTCCCATCTCTCTGCTCCACCGTCGTTGGCCATCGACAGCTCAAAGTCTTCCTTCCTCTGCGGTACGCAGCTGCCCTTCCCTTCCTCTCGCTC
AAGAACTTCATGTCGCAGATACGTCCTATCTCCCTCCGCCAAACAATCGATGGATCACATTCCTAAGGAGTTTAGAGGAGAAAATCTCAAAGATGGATTGATGGAGAACT
ACAAGAATGCTCCTAAGTATCTTTATGGCCTTACACCTTCACAAATGGACATGTTCATGACAGAAGATAATCCCGTCCGGCGGCAGTCAGAATTAGTTACAGAGCAAAAC
ATATCATCTTCCCACAACTACTTGAATCATGGAGGAATGTGGAGTCTATCTGGCATGAATGGGAAGGGTCCATCAAAATATAGTATGAGTGTTAGCATGTATCGTGGGGG
AGGAAGAGGAGCTGGACGACCTCGAACAGCTCCTCCTGATTTGCCCTCTTTGCTCTTAGATGCTCGTATATGCTATTTGGGCATGCCTATTGTACCAGCAGTGACTGAGC
TTCTTGTTGCTCAGTTTATGTGGCTAGATTATGATAACCCGTCAAAGCCTATATATCTCTACATAAACTCTCCTGGGACACAGAATGAGAAGATGGAGACTGTTGGATCT
GAAACTGAAGCATATGCTATAGCAGATATGATGGCTTACTGCAAATCGGATGTCTATACCGTCAACTGTGGAATGGCATATGGTCAAGCGGCAATGCTTTTGTCACTTGG
AACCAAGGGCTACCGTGCTGTCCAACCTAACTCTTCCACTAAACTATATCTACCTAAAGTTAATAGATCAAGTGGTGCGGTCATAGATATGTGGATTAAGGCTAAAGAAC
TCGATGCCAACACGGAGTACTATCTCGAGCTATTAGCTAAAGGAATTGGTAAACCCAAGGAAGAGATTACTAAAGATGTCCAACGACCCAAATACTTCCAAGCACAAGAA
GCTATTGACTACGGCATTGCAGACAAAATAATCGACTCTCGAGATTCCGCATTTGAAAAAAGGAATTACGACGAGATGCTTGCGCAGTCGAGAGGACCAGGAGGCAATCC
ACAAGCAGCACCATCCGGGTTCAGTCCAACTTCATTTCATCGGAGCGTCTCCTCGCCGGTTACTGCAATGGTGGTCTGCAAATGCCGCAAGGCCACGAAATTGTATTGTT
TTGTTCATAAGGTTCCCGTATGTGGAGAATGCATTTGTTTCCCAGAACACCAAATATGCGTGGTTCGTACCTATTCAGAGTGGGTATTGAATGGAGATTATGATTGGCCT
CCAAATTGCTGCCTGTGTCATGCTACACTTGAGGAAGGAGTTGGTCCTCAAACTACTCGATTGGGATGCTTGCATGTCATACATACGGATTGCTTGGTCTCACATATCAA
GAGCTTTCCTCCATCCACTGCCCCTGCTGGATATGACTGTCCTGCCTGTTCAGTTTCGATATGGCCTCCGAAGAACTTTAAAGATTCAGGATCTCGTCTACATGCAAAGC
TAAAGGAAGCTATCTTGCAGACTGGTCTGGAAAAGAATTTGTTTGGAAATCATCCAGTTGGATTATCACCAACAGAATCCCGTGGTCCTCCCCCTGCCTTTGCCTCAGAT
CCCTTGGTTTCCTCTTCAGGAGACACACATAACAATAAGAGTTCATTAAATTCAATAGCAAACATTGAGTCAGATCTGGGTGAAGGGTTCTCCGCCACAACTGGAAATGG
GTCTTCCAAAAATGACATTGCAGATATTGTAGAGATTGAAATGCCTGGTTCAGAAGGAAATTTTGTGAAAGGCTCAAGTCCTTCAGGCCCTGTTGCTACCACAAGAAAAG
GTGCACTCAATTATGACAGGCAAAATTCTGAAATTTCATATTATGCTGATGATGAAGACGGTAATCGCAAGAAGTATGTTCGAAGAGGTCCTTTTAAGCACAAGTTTCTT
AGAGCCCTACTTCCTTTCTGGTCAACTGCATTGCCAACTCTACCCGTTACTGCACCGCCTCGTAAAGATGCATCAAATACCAATGATGTCAGTGAAGGTCGTGTTCGGCA
CCAAAGACCCTCCAGAATGGATCCAAGAAAAATTCTTCTTATCATAGCAATCATGGCATGCCTGGCAACGATGGGTATTTTATACTACAGACTTGTACAAAGAGATATCG
GAGAAGAATTTGTGGATGACGAGCAGCAACTGCAAGCAGCACAATGA
Protein sequenceShow/hide protein sequence
MATSSLSHLSAPPSLAIDSSKSSFLCGTQLPFPSSRSRTSCRRYVLSPSAKQSMDHIPKEFRGENLKDGLMENYKNAPKYLYGLTPSQMDMFMTEDNPVRRQSELVTEQN
ISSSHNYLNHGGMWSLSGMNGKGPSKYSMSVSMYRGGGRGAGRPRTAPPDLPSLLLDARICYLGMPIVPAVTELLVAQFMWLDYDNPSKPIYLYINSPGTQNEKMETVGS
ETEAYAIADMMAYCKSDVYTVNCGMAYGQAAMLLSLGTKGYRAVQPNSSTKLYLPKVNRSSGAVIDMWIKAKELDANTEYYLELLAKGIGKPKEEITKDVQRPKYFQAQE
AIDYGIADKIIDSRDSAFEKRNYDEMLAQSRGPGGNPQAAPSGFSPTSFHRSVSSPVTAMVVCKCRKATKLYCFVHKVPVCGECICFPEHQICVVRTYSEWVLNGDYDWP
PNCCLCHATLEEGVGPQTTRLGCLHVIHTDCLVSHIKSFPPSTAPAGYDCPACSVSIWPPKNFKDSGSRLHAKLKEAILQTGLEKNLFGNHPVGLSPTESRGPPPAFASD
PLVSSSGDTHNNKSSLNSIANIESDLGEGFSATTGNGSSKNDIADIVEIEMPGSEGNFVKGSSPSGPVATTRKGALNYDRQNSEISYYADDEDGNRKKYVRRGPFKHKFL
RALLPFWSTALPTLPVTAPPRKDASNTNDVSEGRVRHQRPSRMDPRKILLIIAIMACLATMGILYYRLVQRDIGEEFVDDEQQLQAAQ