; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026904 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026904
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionChaperone protein ClpB
Genome locationtig00153047:2010300..2022147
RNA-Seq ExpressionSgr026904
SyntenySgr026904
Gene Ontology termsGO:0009570 - chloroplast stroma (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
InterPro domainsIPR001270 - ClpA/B family
IPR003593 - AAA+ ATPase domain
IPR003959 - ATPase, AAA-type, core
IPR019489 - Clp ATPase, C-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR028299 - ClpA/B, conserved site 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057986.1 chaperone protein ClpB1 [Cucumis melo var. makuwa]1.7e-11265.89Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   ++A   +   A  A P QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPGYVGY EGGQLTEPV+ RPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL
        VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHLL   SGKYC M          VK HFKPEF+N LDEIL+ +PLS DQ 
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        RRI KSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT+LSKMLI+EEI E+ TVY+DA  +GKDL Y VEKN  LIN  +
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS
          +YEILIQ+P+ +K  D     D GGN E++  T    TSDS
Subjt:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS

KAE8652431.1 hypothetical protein Csa_013437 [Cucumis sativus]1.5e-11366.07Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   ++A   +   A  ALP QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPGYVGY EGGQLTEPV+RRPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL
        VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHL S    KYCPM          VK HFKPEF+N LDEIL+ +PLS  Q 
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        RR+TKSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT +SKML++EEI E+ TVY+DA  DGKDL Y VEKN  LI+  +
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKG--DGAGSVDGGGNGEDHETT
        D +YEILIQ+P+ +K   D +   +GG   ED ETT
Subjt:  DWKYEILIQMPSAQKG--DGAGSVDGGGNGEDHETT

XP_008453261.1 PREDICTED: chaperone protein ClpB1 [Cucumis melo]1.7e-11265.89Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   ++A   +   A  A P QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPGYVGY EGGQLTEPV+ RPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL
        VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHLL   SGKYC M          VK HFKPEF+N LDEIL+ +PLS DQ 
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        RRI KSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT+LSKMLI+EEI E+ TVY+DA  +GKDL Y VEKN  LIN  +
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS
          +YEILIQ+P+ +K  D     D GGN E++  T    TSDS
Subjt:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS

XP_031737029.1 chaperone protein ClpB1 [Cucumis sativus]6.4e-11265.49Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPG---YVGYDEGGQLTEPVRRRP
        ++   ++A   +   A  ALP QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPG   YVGY EGGQLTEPV+RRP
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPG---YVGYDEGGQLTEPVRRRP

Query:  YCFVLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSL
        YC VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHL S    KYCPM          VK HFKPEF+N LDEIL+ +PLS 
Subjt:  YCFVLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSL

Query:  DQLRRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLIN
         Q RR+TKSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT +SKML++EEI E+ TVY+DA  DGKDL Y VEKN  LI+
Subjt:  DQLRRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLIN

Query:  ATADWKYEILIQMPSAQKG--DGAGSVDGGGNGEDHETT
          +D +YEILIQ+P+ +K   D +   +GG   ED ETT
Subjt:  ATADWKYEILIQMPSAQKG--DGAGSVDGGGNGEDHETT

XP_038880335.1 chaperone protein ClpB1 [Benincasa hispida]5.1e-11770.81Show/hide
Query:  LPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQ
        LP QP GSFLFLGPSGVGKTELAK LA ELF+DE HMVRIDMSE+ EKHSVSRLIGAPPGYVGY EGGQLTEPVR+RPYC VL DEVEKA+V VLN+LLQ
Subjt:  LPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQ

Query:  VLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAERS
        VLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHLLS Q  KYC M          VK HFKPEFLN LDEIL+ QPLS DQ RRITKSM+KDV  RL    
Subjt:  VLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAERS

Query:  GVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATADWKYEILIQMPSAQKGDGA
         +AL VTEA L++VL +SFD VYG+R IRRWLEKK+VTELSKMLI+EEIDE  TVYIDA   GKDL Y VEKN  L N  +D KYE+LIQ+PS +K    
Subjt:  GVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATADWKYEILIQMPSAQKGDGA

Query:  GSVDGGGNGEDHETTPLCNTSD
         S D   +G+D +     + SD
Subjt:  GSVDGGGNGEDHETTPLCNTSD

TrEMBL top hitse value%identityAlignment
A0A0A0LPI1 Uncharacterized protein7.4e-11466.07Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   ++A   +   A  ALP QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPGYVGY EGGQLTEPV+RRPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL
        VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHL S    KYCPM          VK HFKPEF+N LDEIL+ +PLS  Q 
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        RR+TKSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT +SKML++EEI E+ TVY+DA  DGKDL Y VEKN  LI+  +
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKG--DGAGSVDGGGNGEDHETT
        D +YEILIQ+P+ +K   D +   +GG   ED ETT
Subjt:  DWKYEILIQMPSAQKG--DGAGSVDGGGNGEDHETT

A0A1S3BWY7 chaperone protein ClpB18.1e-11365.89Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   ++A   +   A  A P QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPGYVGY EGGQLTEPV+ RPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL
        VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHLL   SGKYC M          VK HFKPEF+N LDEIL+ +PLS DQ 
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        RRI KSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT+LSKMLI+EEI E+ TVY+DA  +GKDL Y VEKN  LIN  +
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS
          +YEILIQ+P+ +K  D     D GGN E++  T    TSDS
Subjt:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS

A0A2C9VQM0 Clp R domain-containing protein4.0e-11267.7Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        D+   A+A   L   A    P+QPTGSFLFLGP+GVGKTELAKALAE+LF DE  +VRIDMSE+ E+HSV+RLIGAPPGYVG++EGGQLTE VRRRPY  
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPM---------HVKAHFKPEFLNCLDEILVLQPLSLDQL
        VLFDEVEKA++SV N LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGA HLLS   GK C M          V+ HF+PE LN LDEI+V  PLS DQL
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPM---------HVKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
         ++ +  MKDV  RLAER G+AL VT+A L+Y+L ES+D VYG+R IRRWLEKKVVTELS+ML+REEIDE+STVYIDAG  G DLVYTVEKNG L+NA  
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKGDGAGSV
          K E+LIQ+PS  + D A +V
Subjt:  DWKYEILIQMPSAQKGDGAGSV

A0A371E943 Chaperone protein ClpB1 (Fragment)5.3e-11268.01Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        D+   A+A   L   A    P+QPTGSFLFLGP+GVGKTELAKALAE+LF DE  +VRIDMSE+ E+HSVSRLIGAPPGYVG++EGGQLTE VRRRPY  
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPM---------HVKAHFKPEFLNCLDEILVLQPLSLDQL
        VLFDEVEKA+ SV N LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGA HLLS  SGK C M          V+ HF+PE LN LDEI+V  PLS DQL
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPM---------HVKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        R++ +  MKDV  RLAER G+A+ VT+A L+Y+LGES+D VYG+R IRRWLEKKVVTELS+MLIREEIDE+STVYIDAG  G +LVY VEKNG ++N T 
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKGDGAGSV
          K +ILIQ+P   K D A +V
Subjt:  DWKYEILIQMPSAQKGDGAGSV

A0A5A7UUZ9 Chaperone protein ClpB18.1e-11365.89Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   ++A   +   A  A P QP GSFLFLGPSGVGKTELAK LA ELF+DE  MVRIDMSE+ EKHSVSRLIG+PPGYVGY EGGQLTEPV+ RPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL
        VL DEVEKA+V VLN+LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGAGHLL   SGKYC M          VK HFKPEF+N LDEIL+ +PLS DQ 
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH---------VKAHFKPEFLNCLDEILVLQPLSLDQL

Query:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA
        RRI KSMMKDV  RL+E+ G+A+ VT++ L++VL +SFD VYG+R IRRWLEKKVVT+LSKMLI+EEI E+ TVY+DA  +GKDL Y VEKN  LIN  +
Subjt:  RRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATA

Query:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS
          +YEILIQ+P+ +K  D     D GGN E++  T    TSDS
Subjt:  DWKYEILIQMPSAQKG-DGAGSVDGGGNGEDHETTPLCNTSDS

SwissProt top hitse value%identityAlignment
P31543 Heat shock protein 1002.5e-7450.34Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        D+    +A   +   A  + P  PT SFLFLGP+GVGKTEL KA+A ELF DE HMVRIDMSE+ E+HSVSRLIGAPPGY+G+DEGGQLTEPVRRRP+  
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQ--SGKYCPMH------VKAHFKPEFLNCLDEILVLQPLSLDQLR
        VLFDEVEKA+ +V NVLLQVLDDGRLTD +G TVDF NT+I+MTSNLG+ HLL+ +  +  Y  +       V+++F+PE +N LD+I+V + L  + LR
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQ--SGKYCPMH------VKAHFKPEFLNCLDEILVLQPLSLDQLR

Query:  RITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEK
         +  +++  V  RL + SG ++++ +   +++L    D   G+R +RRW+EK +VTE+ +MLI +E+  +ST+ +     G  L + V++
Subjt:  RITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEK

P42730 Chaperone protein ClpB17.1e-10664.78Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   A++   L   A    P+QPTGSFLFLGP+GVGKTELAKALAE+LF DE  +VRIDMSE+ E+HSVSRLIGAPPGYVG++EGGQLTE VRRRPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKY-------CPM-HVKAHFKPEFLNCLDEILVLQPLSLDQLR
        +LFDEVEKA+V+V N LLQVLDDGRLTDGQG TVDFRN+VIIMTSNLGA HLL+  +GK        C M  V+ HF+PE LN LDEI+V  PLS DQLR
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKY-------CPM-HVKAHFKPEFLNCLDEILVLQPLSLDQLR

Query:  RITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATAD
        ++ +  MKDV  RLAER GVAL VT+A L+Y+L ES+D VYG+R IRRW+EKKVVTELSKM++REEIDE+STVYIDAG    DLVY VE  G L++A+  
Subjt:  RITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATAD

Query:  WKYEILIQMPSAQKGDGA
         K ++LI + +  K   A
Subjt:  WKYEILIQMPSAQKGDGA

Q6F2Y7 Chaperone protein ClpB11.3e-10262.11Show/hide
Query:  KTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFV
        +   A+A   L   A    P+QPTGSFLFLGP+GVGKTELAKALAE+LF DE  +VRIDMSE+ E+HSV+RLIGAPPGYVG++EGGQLTE VRRRPY  +
Subjt:  KTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFV

Query:  LFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGK--------YCPMHVKAHFKPEFLNCLDEILVLQPLSLDQLRR
        LFDEVEKA+V+V N LLQVLDDGRLTDGQG TVDFRNTVIIMTSNLGA HLL+   GK             V+ HF+PE LN LDEI++  PLS +QLR+
Subjt:  LFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGK--------YCPMHVKAHFKPEFLNCLDEILVLQPLSLDQLRR

Query:  ITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATADW
        + +  MKDV  RLAER GVAL VT+A L+ +L  S+D VYG+R IRRW+EK+VVT+LSKMLI+EEIDE+ TVYIDA     +L Y V+  G L+NA    
Subjt:  ITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATADW

Query:  KYEILIQMP--SAQKGDGAGSV
        K +ILIQ+P  +A   D A +V
Subjt:  KYEILIQMP--SAQKGDGAGSV

Q74FF1 Chaperone protein ClpB1.6e-7354.34Show/hide
Query:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV
        P +P GSFLFLGP+GVGKTE AKALAE LF+D+  +VRIDMSE+ EKH+V+RLIGAPPGYVGY+EGGQLTE VRRRPY  VLFDE+EKA+  V NVLLQV
Subjt:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV

Query:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH------VKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAERSGVAL
        LDDGRLTDGQG TVDFRNTVIIMTSNLG+  +    S  Y  M       +K  FKPEFLN +DEI++   L L+Q+++I    ++ ++ RLA+R  + L
Subjt:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMH------VKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAERSGVAL

Query:  VVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYT
         +++    Y+  E +D  YG+R ++R +++K+   L+  L+  +  E  TV +D    G++LV T
Subjt:  VVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYT

Q81TT4 Chaperone protein ClpB2.7e-7350.74Show/hide
Query:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV
        P +P GSF+FLGP+GVGKTELAK LA+ LF  E  M+RIDMSE+ EKH+VSRLIGAPPGYVGY+EGGQLTE VRR+PY  +L DE+EKA+  V N+LLQ+
Subjt:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV

Query:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA---------QSGKYCPMHVKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAERSG
        LDDGR+TD QG TVDF+NTVIIMTSN+G+ HLL           +S +     ++ HF+PEFLN +DEI++ +PL+ ++++ I   ++K+++ RLA+R  
Subjt:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA---------QSGKYCPMHVKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAERSG

Query:  VALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVE
        + + +TEA   +V+   FD +YG+R ++R+++++V T+L++ LI   I ++S V +D   +  +LV  V+
Subjt:  VALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVE

Arabidopsis top hitse value%identityAlignment
AT1G74310.1 heat shock protein 1015.1e-10764.78Show/hide
Query:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF
        ++   A++   L   A    P+QPTGSFLFLGP+GVGKTELAKALAE+LF DE  +VRIDMSE+ E+HSVSRLIGAPPGYVG++EGGQLTE VRRRPYC 
Subjt:  DKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCF

Query:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKY-------CPM-HVKAHFKPEFLNCLDEILVLQPLSLDQLR
        +LFDEVEKA+V+V N LLQVLDDGRLTDGQG TVDFRN+VIIMTSNLGA HLL+  +GK        C M  V+ HF+PE LN LDEI+V  PLS DQLR
Subjt:  VLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKY-------CPM-HVKAHFKPEFLNCLDEILVLQPLSLDQLR

Query:  RITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATAD
        ++ +  MKDV  RLAER GVAL VT+A L+Y+L ES+D VYG+R IRRW+EKKVVTELSKM++REEIDE+STVYIDAG    DLVY VE  G L++A+  
Subjt:  RITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATAD

Query:  WKYEILIQMPSAQKGDGA
         K ++LI + +  K   A
Subjt:  WKYEILIQMPSAQKGDGA

AT2G25140.1 casein lytic proteinase B41.3e-6545.88Show/hide
Query:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV
        P +P  SF+F+GP+GVGKTELAKALA  LF+ E  +VR+DMSE+ EKHSVSRL+GAPPGYVGY+EGGQLTE VRRRPY  VLFDE+EKA+  V N+LLQ+
Subjt:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV

Query:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA----QSGKYCPMHV---------KAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLA
        LDDGR+TD QG TV F+N V+IMTSN+G+ H+L      +  K     +         + +F+PEF+N +DE +V QPL  +++ +I +  M+ V+  L 
Subjt:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA----QSGKYCPMHV---------KAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLA

Query:  ERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDA---GGDGKDLVYTVEKN
        E+  + L  T+  ++ +    FD  YG+R ++R +++ V  E++  +++ +  E+ TV +D      D K ++  +E N
Subjt:  ERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDA---GGDGKDLVYTVEKN

AT3G48870.1 Clp ATPase2.2e-6244.37Show/hide
Query:  ATPALSKAAKNAL-----PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVL
        A  A+S+A + A      P +P  SF+F GP+GVGK+ELAKALA   F  E  M+R+DMSEF E+H+VS+LIG+PPGYVGY EGGQLTE VRRRPY  VL
Subjt:  ATPALSKAAKNAL-----PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVL

Query:  FDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA------------QSGKY------CPMHVKAHFKPEFLNCLDEILVLQ
        FDE+EKA+  V N++LQ+L+DGRLTD +G TVDF+NT++IMTSN+G+  +               +   Y          +K +F+PEFLN LDE++V +
Subjt:  FDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA------------QSGKY------CPMHVKAHFKPEFLNCLDEILVLQ

Query:  PLSLDQLRRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLV
         L+  +++ I   M+K+V  RL E   + L VTE     V+ E FD  YG+R +RR + + +   +++ ++  +I E  +V +D   +G  +V
Subjt:  PLSLDQLRRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLV

AT3G48870.2 Clp ATPase2.2e-6244.37Show/hide
Query:  ATPALSKAAKNAL-----PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVL
        A  A+S+A + A      P +P  SF+F GP+GVGK+ELAKALA   F  E  M+R+DMSEF E+H+VS+LIG+PPGYVGY EGGQLTE VRRRPY  VL
Subjt:  ATPALSKAAKNAL-----PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVL

Query:  FDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA------------QSGKY------CPMHVKAHFKPEFLNCLDEILVLQ
        FDE+EKA+  V N++LQ+L+DGRLTD +G TVDF+NT++IMTSN+G+  +               +   Y          +K +F+PEFLN LDE++V +
Subjt:  FDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSA------------QSGKY------CPMHVKAHFKPEFLNCLDEILVLQ

Query:  PLSLDQLRRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLV
         L+  +++ I   M+K+V  RL E   + L VTE     V+ E FD  YG+R +RR + + +   +++ ++  +I E  +V +D   +G  +V
Subjt:  PLSLDQLRRITKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLV

AT5G15450.1 casein lytic proteinase B32.1e-6549.03Show/hide
Query:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV
        P +P  SF+F+GP+GVGKTELAKALA  +F+ E  +VRIDMSE+ EKH+VSRLIGAPPGYVGY+EGGQLTE VRRRPY  +LFDE+EKA+  V NV LQ+
Subjt:  PRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYDEGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQV

Query:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHL----------LSAQSGKYCPMH-VKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAER
        LDDGR+TD QG TV F NTVIIMTSN+G+  +          LS ++ K   M+  ++ F+PEF+N +DE +V +PL  +Q+ RI +  +  V+ R+A+R
Subjt:  LDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHL----------LSAQSGKYCPMH-VKAHFKPEFLNCLDEILVLQPLSLDQLRRITKSMMKDVECRLAER

Query:  SGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYID
          + + +T+A ++ +    +D  YG+R ++R +++ +  EL+K ++R +  E+  + ID
Subjt:  SGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYID


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAAGGTAAGATCGACATAGTTTTCGTATCAGTGCTTAAAGAGAAATACCCTGAACATGACACCGGCGATGCCGAAAACTACGGCGATGAAACCGGAATAATCGA
CCGGCAGATCTTGCGAATTCACCGCCGGGGCAACGTAAGGCTTCGCCGCCGACGGCTGCCGTGGATCGTTAGCCTGTGTCGACATCTCTTGACTCCCTCAAGTCTCGACG
ACTCCGTCTCTCTCCACTCGCTCGAGATGCTATGGTACTCCTCTCTTGTCAACCACATCTCTCTCTTTCGCGTTTCAGAAGTCTGCAGAAAAACCGAATCCTCACGGTCT
TATCCCGCCCCTCCCTTTTCTTCCTTCCGCCACCCCTCTGCTTCTGATCGCCACGACGCGACGGGACGAAAACCCTTAACACGATGGAGACTCAGTATTTCTCTACACCT
CCTTCTTCTAAAGGAACTACTTGGAACTCTGGAAGCAGAACACCAAGCTCTTCGCCTCTACTTCCGTCGGTACATGCCTTCTATGGAACTGGGTTCTCTCAGCAACATGC
TTGATATACGACAAAAAGCTTGTTTGAAATTGTCTAAGCAGCAGGAACTTCATCAAAGAAAACTCTTATCTTCATACAAGGACATGGTTGATGTTGTAGTTCAGATGGTC
AACAATAGCAGATTGATGAGATGTTATTTCAAGGGATCAAGTAGTAGCACACTCATACAATTTTCTACTAGCTCCGAGGATAATCAAGATAATCACAATGATGCTGGAGA
TGGTGGCGGCATCTCAGTATTTACATTTTTGACAGTCCCTTGCTTTGAGAAATTGGCAGAGGAACTTGTTCAAATGTTTGGGTTGGAGCTCAATCTGAAGCGGCTACTTT
TGATTGAACTACTTTCTATTAGTTCTGAAGTTCCATCACGAAATAGCTTGGTCTGGTCAGAGGAGTTTTACCCGGGAGAATTCGACGACTTGCGTTTGTGCAACTTGTAC
TCCGAGGAATCTGGTGAGCTGCTCCATCCAACCTTGAAGGGTCATGAATCCAATACACCTATTGTTAGTCGTAGCAATCAGCCAAACCATGAAGTTTTACAGGTCTATCT
TGTTGCATGGCTTGCAGAAGTGAACATACACACAAAAAGGAGTGCTATGTCAAAGTTATCAGGAACAAGCTTTGTGATTCTACGCTGTGGAACGATCGGCATTGGGCCCC
CTCCGAGAGCAGAAGACTGCGCCGTCCATCTGTTCTCGGCGAAAGCCCTAGATGATCATCGACGCGTACGCGAGAAAACAGGTTTTCGTTTTCGAAAAATTGGGCAGCCA
GCCAAGTTAAATTTCCTGAGTGGCCCAAAGCCACAAACTGCCCAGTGTGTCCGTGACCTTGTCAAAGAAATTAATCCCTTCTCCCCTCTCTCTCTTGGATATTGGGGGGC
TTGTAGGAATTTGTCTGTGTCTTCTGGGTTGGTTGTGAACTTGATGATAGTGCTGATTCTTATAAATTCTCAACTACAGCCCAGCTTGAAGTTTAATCCACAAAGATCTT
CATTCCGAGATAGAGATCAGCTTCTACAAGAAGCCGAAAGAGGACACATATTCAACGGTTACTGCGAAGGAACATTGTTGTCGTTGAAACCGACCTCAAAAAGGACGTCG
GAAGTAGCAAATGCGACACGAGTTATAAGCCTCTCAAGTTCAGGGCCGGCAATCGCGGCGGCTGGCCACCCCGTCTCCGACAAGACCACCGGAGCCCTCGCCACCCCAGC
TCTCTCCAAAGCCGCCAAAAACGCATTGCCGCGGCAGCCGACCGGATCTTTCCTTTTCTTGGGTCCGTCTGGGGTTGGGAAGACGGAGCTTGCCAAAGCTCTGGCTGAAG
AACTTTTTCACGATGAGTATCATATGGTGCGAATCGACATGTCGGAGTTCACGGAGAAACACTCCGTTTCGCGGCTCATCGGAGCTCCGCCGGGGTATGTTGGGTACGAT
GAAGGAGGGCAACTCACAGAGCCTGTAAGGCGAAGGCCGTACTGTTTCGTCCTCTTTGATGAAGTGGAAAAAGCAAACGTCTCCGTTTTGAATGTTCTGCTTCAAGTTTT
AGACGACGGCCGCCTTACCGACGGCCAAGGTTGCACCGTCGACTTCAGAAACACCGTGATTATCATGACTTCGAACCTCGGAGCCGGCCATCTTCTTTCCGCCCAATCGG
GAAAGTACTGCCCGATGCATGTGAAAGCACACTTCAAGCCGGAGTTTCTCAATTGCCTCGACGAGATTCTGGTGCTCCAGCCACTTTCTCTAGATCAACTGAGGAGGATC
ACAAAATCGATGATGAAAGACGTTGAGTGCCGCCTTGCCGAGAGATCAGGCGTTGCGTTGGTGGTGACGGAGGCGCCTCTAAACTACGTTCTCGGAGAGAGCTTTGATCA
GGTTTACGGCAGTAGATCGATCAGGCGGTGGCTGGAGAAGAAAGTGGTGACGGAGCTTTCAAAGATGCTGATAAGGGAAGAGATCGACGAGGACTCCACCGTATACATCG
ACGCCGGTGGCGACGGAAAAGATTTGGTGTATACGGTGGAGAAAAACGGAAGGCTGATAAATGCAACCGCTGATTGGAAATATGAGATATTGATTCAAATGCCATCTGCG
CAGAAAGGTGACGGCGCCGGCAGTGTGGACGGAGGAGGAAATGGAGAGGACCACGAAACGACACCGCTTTGTAATACAAGTGATAGTGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAAGGTAAGATCGACATAGTTTTCGTATCAGTGCTTAAAGAGAAATACCCTGAACATGACACCGGCGATGCCGAAAACTACGGCGATGAAACCGGAATAATCGA
CCGGCAGATCTTGCGAATTCACCGCCGGGGCAACGTAAGGCTTCGCCGCCGACGGCTGCCGTGGATCGTTAGCCTGTGTCGACATCTCTTGACTCCCTCAAGTCTCGACG
ACTCCGTCTCTCTCCACTCGCTCGAGATGCTATGGTACTCCTCTCTTGTCAACCACATCTCTCTCTTTCGCGTTTCAGAAGTCTGCAGAAAAACCGAATCCTCACGGTCT
TATCCCGCCCCTCCCTTTTCTTCCTTCCGCCACCCCTCTGCTTCTGATCGCCACGACGCGACGGGACGAAAACCCTTAACACGATGGAGACTCAGTATTTCTCTACACCT
CCTTCTTCTAAAGGAACTACTTGGAACTCTGGAAGCAGAACACCAAGCTCTTCGCCTCTACTTCCGTCGGTACATGCCTTCTATGGAACTGGGTTCTCTCAGCAACATGC
TTGATATACGACAAAAAGCTTGTTTGAAATTGTCTAAGCAGCAGGAACTTCATCAAAGAAAACTCTTATCTTCATACAAGGACATGGTTGATGTTGTAGTTCAGATGGTC
AACAATAGCAGATTGATGAGATGTTATTTCAAGGGATCAAGTAGTAGCACACTCATACAATTTTCTACTAGCTCCGAGGATAATCAAGATAATCACAATGATGCTGGAGA
TGGTGGCGGCATCTCAGTATTTACATTTTTGACAGTCCCTTGCTTTGAGAAATTGGCAGAGGAACTTGTTCAAATGTTTGGGTTGGAGCTCAATCTGAAGCGGCTACTTT
TGATTGAACTACTTTCTATTAGTTCTGAAGTTCCATCACGAAATAGCTTGGTCTGGTCAGAGGAGTTTTACCCGGGAGAATTCGACGACTTGCGTTTGTGCAACTTGTAC
TCCGAGGAATCTGGTGAGCTGCTCCATCCAACCTTGAAGGGTCATGAATCCAATACACCTATTGTTAGTCGTAGCAATCAGCCAAACCATGAAGTTTTACAGGTCTATCT
TGTTGCATGGCTTGCAGAAGTGAACATACACACAAAAAGGAGTGCTATGTCAAAGTTATCAGGAACAAGCTTTGTGATTCTACGCTGTGGAACGATCGGCATTGGGCCCC
CTCCGAGAGCAGAAGACTGCGCCGTCCATCTGTTCTCGGCGAAAGCCCTAGATGATCATCGACGCGTACGCGAGAAAACAGGTTTTCGTTTTCGAAAAATTGGGCAGCCA
GCCAAGTTAAATTTCCTGAGTGGCCCAAAGCCACAAACTGCCCAGTGTGTCCGTGACCTTGTCAAAGAAATTAATCCCTTCTCCCCTCTCTCTCTTGGATATTGGGGGGC
TTGTAGGAATTTGTCTGTGTCTTCTGGGTTGGTTGTGAACTTGATGATAGTGCTGATTCTTATAAATTCTCAACTACAGCCCAGCTTGAAGTTTAATCCACAAAGATCTT
CATTCCGAGATAGAGATCAGCTTCTACAAGAAGCCGAAAGAGGACACATATTCAACGGTTACTGCGAAGGAACATTGTTGTCGTTGAAACCGACCTCAAAAAGGACGTCG
GAAGTAGCAAATGCGACACGAGTTATAAGCCTCTCAAGTTCAGGGCCGGCAATCGCGGCGGCTGGCCACCCCGTCTCCGACAAGACCACCGGAGCCCTCGCCACCCCAGC
TCTCTCCAAAGCCGCCAAAAACGCATTGCCGCGGCAGCCGACCGGATCTTTCCTTTTCTTGGGTCCGTCTGGGGTTGGGAAGACGGAGCTTGCCAAAGCTCTGGCTGAAG
AACTTTTTCACGATGAGTATCATATGGTGCGAATCGACATGTCGGAGTTCACGGAGAAACACTCCGTTTCGCGGCTCATCGGAGCTCCGCCGGGGTATGTTGGGTACGAT
GAAGGAGGGCAACTCACAGAGCCTGTAAGGCGAAGGCCGTACTGTTTCGTCCTCTTTGATGAAGTGGAAAAAGCAAACGTCTCCGTTTTGAATGTTCTGCTTCAAGTTTT
AGACGACGGCCGCCTTACCGACGGCCAAGGTTGCACCGTCGACTTCAGAAACACCGTGATTATCATGACTTCGAACCTCGGAGCCGGCCATCTTCTTTCCGCCCAATCGG
GAAAGTACTGCCCGATGCATGTGAAAGCACACTTCAAGCCGGAGTTTCTCAATTGCCTCGACGAGATTCTGGTGCTCCAGCCACTTTCTCTAGATCAACTGAGGAGGATC
ACAAAATCGATGATGAAAGACGTTGAGTGCCGCCTTGCCGAGAGATCAGGCGTTGCGTTGGTGGTGACGGAGGCGCCTCTAAACTACGTTCTCGGAGAGAGCTTTGATCA
GGTTTACGGCAGTAGATCGATCAGGCGGTGGCTGGAGAAGAAAGTGGTGACGGAGCTTTCAAAGATGCTGATAAGGGAAGAGATCGACGAGGACTCCACCGTATACATCG
ACGCCGGTGGCGACGGAAAAGATTTGGTGTATACGGTGGAGAAAAACGGAAGGCTGATAAATGCAACCGCTGATTGGAAATATGAGATATTGATTCAAATGCCATCTGCG
CAGAAAGGTGACGGCGCCGGCAGTGTGGACGGAGGAGGAAATGGAGAGGACCACGAAACGACACCGCTTTGTAATACAAGTGATAGTGGGTGA
Protein sequenceShow/hide protein sequence
MSKGKIDIVFVSVLKEKYPEHDTGDAENYGDETGIIDRQILRIHRRGNVRLRRRRLPWIVSLCRHLLTPSSLDDSVSLHSLEMLWYSSLVNHISLFRVSEVCRKTESSRS
YPAPPFSSFRHPSASDRHDATGRKPLTRWRLSISLHLLLLKELLGTLEAEHQALRLYFRRYMPSMELGSLSNMLDIRQKACLKLSKQQELHQRKLLSSYKDMVDVVVQMV
NNSRLMRCYFKGSSSSTLIQFSTSSEDNQDNHNDAGDGGGISVFTFLTVPCFEKLAEELVQMFGLELNLKRLLLIELLSISSEVPSRNSLVWSEEFYPGEFDDLRLCNLY
SEESGELLHPTLKGHESNTPIVSRSNQPNHEVLQVYLVAWLAEVNIHTKRSAMSKLSGTSFVILRCGTIGIGPPPRAEDCAVHLFSAKALDDHRRVREKTGFRFRKIGQP
AKLNFLSGPKPQTAQCVRDLVKEINPFSPLSLGYWGACRNLSVSSGLVVNLMIVLILINSQLQPSLKFNPQRSSFRDRDQLLQEAERGHIFNGYCEGTLLSLKPTSKRTS
EVANATRVISLSSSGPAIAAAGHPVSDKTTGALATPALSKAAKNALPRQPTGSFLFLGPSGVGKTELAKALAEELFHDEYHMVRIDMSEFTEKHSVSRLIGAPPGYVGYD
EGGQLTEPVRRRPYCFVLFDEVEKANVSVLNVLLQVLDDGRLTDGQGCTVDFRNTVIIMTSNLGAGHLLSAQSGKYCPMHVKAHFKPEFLNCLDEILVLQPLSLDQLRRI
TKSMMKDVECRLAERSGVALVVTEAPLNYVLGESFDQVYGSRSIRRWLEKKVVTELSKMLIREEIDEDSTVYIDAGGDGKDLVYTVEKNGRLINATADWKYEILIQMPSA
QKGDGAGSVDGGGNGEDHETTPLCNTSDSG