; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013636 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013636
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionChaperone protein
Genome locationChr02:3245357..3260627
RNA-Seq ExpressionHG10013636
SyntenyHG10013636
Gene Ontology termsGO:0034605 - cellular response to heat (biological process)
GO:0042026 - protein refolding (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR003593 - AAA+ ATPase domain
IPR003959 - ATPase, AAA-type, core
IPR004176 - Clp, repeat (R) domain
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR018368 - ClpA/B, conserved site 1
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036628 - Clp, N-terminal domain superfamily
IPR041546 - ClpA/ClpB, AAA lid domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145507.1 U-box domain-containing protein 7 [Cucumis sativus]6.2e-24691.04Show/hide
Query:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD
        MAKCQRN+VGSV FDR S S+ AGSHFRLC  FS ASFRRK+FDAVSCGGSSRY YHHDGNVGGGDGTVS+AIRSLSEIVKEREA RPKRSNVKSEKLFD
Subjt:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD

Query:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
        LLKLESSPES+PETKKKEEVLEEFKR VKKLQDEDLVERRAAAS VRLLAKED EAR TL MLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
Subjt:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA

Query:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL
        AIAKAGT+HKMLKLIESE+ PNP VSEAIVANFLGLSALDTNKL+IGSSGAIPFLVKNLYDPH++SSSQVKQDALRALYNLSIFPSN+PFILETKL+PFL
Subjt:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL

Query:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
        LNALGDMEVSERALSVLSNV+ST +GRKAIST+PNSFPIL DVLNWADSPGCQEK SYILMVMAHKSYSDRQAMIEAG+SSALLELTLLGSTLAQKRASR
Subjt:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR

Query:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
        +LESLRVDKGKQISDH GGNSSAP+  SL+SFTN ILGSAE LEG DDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSD+FKSLTSSSTSKSL
Subjt:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL

Query:  PF
        PF
Subjt:  PF

XP_008452863.1 PREDICTED: chaperone protein ClpB4, mitochondrial [Cucumis melo]3.3e-23989.68Show/hide
Query:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTD
        MATRRVSKLTR ALAAIDAPK PHSR LLSR    S SSSSSLGN I   SVAK +GSRPV+G+SMASA+YLATIFTRNFHST PSRYSATASSQINQTD
Subjt:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTD

Query:  FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF
        FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVL ATVDFISQQPKVTGETSGPIIGTHL L+LDNARKHKKEMGDDF
Subjt:  FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF

Query:  LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII
        LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYG+DLTE ARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII
Subjt:  LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII

Query:  GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR
        GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDA NLLKPMLGR
Subjt:  GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR

Query:  GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA
        GELRCIGATTL EYRKYIEKDPALERRFQQVFCG+PSVEDTISILRGLRERYELHHGVKISD +  S  V          +   A D V  +AA
Subjt:  GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA

XP_008452864.1 PREDICTED: U-box domain-containing protein 15-like [Cucumis melo]8.6e-24892.23Show/hide
Query:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD
        MAKCQRNDVGSV FDR STS+ AGSHFRLC  FS ASFRRK+FDAVSCGGSSRY YHHDGNVGGGDGTVS+AIRSLSEIVKEREA RPKRSN KSEKLFD
Subjt:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD

Query:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
        LLKLESSPES+PETKKKEEVLEEFK  VKKLQDEDL ERRAAAS VRLLAKEDAEAR TL MLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
Subjt:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA

Query:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL
        AIAKAGTVHKMLKLIESES PNP VSEAIVANFLGLSALDTNKL+IGSSGAIPFLVKNLYDPH++SSSQVKQDALRALYNLSI PSN+PFILETKLIPFL
Subjt:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL

Query:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
        LNALGDME+SERALS+LSNVVSTPEGRKAIST+PNSFPIL DVLNWADSPGCQEK SYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
Subjt:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR

Query:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
        +LESLRVDKGKQISDH GGNSSAPI  SL+SFTN ILGSAE LEG DDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
Subjt:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL

Query:  PF
        PF
Subjt:  PF

XP_038897265.1 U-box domain-containing protein 15-like [Benincasa hispida]4.7e-25494.82Show/hide
Query:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD
        MAKCQRNDVGSV FDRVSTS AAGSHFRLCTSFS ASFRRK+FDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKR+NVKSEKLFD
Subjt:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD

Query:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
        LLKLESSPES+PETKKKEEVLE+FK  VKKLQDEDLVERRAAASRVRLLAKEDAEAR TLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
Subjt:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA

Query:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL
        AIAKAGTVHKMLKLIESES PNPSVSEAIVANFLGLSALDTNKLVIGSSGA+PFLVKNLYDPH++SSSQVKQDALRALYNLSIFPSN+P ILETKLIPFL
Subjt:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL

Query:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
        LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSF IL DVLNWADSPGCQEKASYILMVMAHKSYSDRQ MIEAGISSALLELTLLGSTLAQKRASR
Subjt:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR

Query:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
        ILESLR+DKGKQISDHFGGNSSAPI  SLSSFTN ILGSAEGLEG DDLVSEEKKAVKQLVR SLQNNM+RIVKRANLPQDFVPSDHFKSLTSSSTSKSL
Subjt:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL

Query:  PF
        PF
Subjt:  PF

XP_038898368.1 chaperone protein ClpB4, mitochondrial isoform X1 [Benincasa hispida]5.2e-24591.5Show/hide
Query:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTD
        MATRRVSKLTR ALAAIDA K  HSRS+ S SPALSRSSSSSL NSIG  SVAK +GSRPVNGASMASAKYLATIFTRNFHST PSRYSATASSQINQTD
Subjt:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTD

Query:  FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF
        FTEMAWEGIVGAVDTARANKQQVVESEHLMK LLEQKDGLARRIFSKAGLDNSSVL ATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF
Subjt:  FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF

Query:  LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII
        LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTE ARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII
Subjt:  LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII

Query:  GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR
        GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAK+RGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR
Subjt:  GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR

Query:  GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA
        GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD +  S  V          +   A D V  +AA
Subjt:  GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA

TrEMBL top hitse value%identityAlignment
A0A0A0L4B6 Uncharacterized protein3.0e-24691.04Show/hide
Query:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD
        MAKCQRN+VGSV FDR S S+ AGSHFRLC  FS ASFRRK+FDAVSCGGSSRY YHHDGNVGGGDGTVS+AIRSLSEIVKEREA RPKRSNVKSEKLFD
Subjt:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD

Query:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
        LLKLESSPES+PETKKKEEVLEEFKR VKKLQDEDLVERRAAAS VRLLAKED EAR TL MLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
Subjt:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA

Query:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL
        AIAKAGT+HKMLKLIESE+ PNP VSEAIVANFLGLSALDTNKL+IGSSGAIPFLVKNLYDPH++SSSQVKQDALRALYNLSIFPSN+PFILETKL+PFL
Subjt:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL

Query:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
        LNALGDMEVSERALSVLSNV+ST +GRKAIST+PNSFPIL DVLNWADSPGCQEK SYILMVMAHKSYSDRQAMIEAG+SSALLELTLLGSTLAQKRASR
Subjt:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR

Query:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
        +LESLRVDKGKQISDH GGNSSAP+  SL+SFTN ILGSAE LEG DDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSD+FKSLTSSSTSKSL
Subjt:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL

Query:  PF
        PF
Subjt:  PF

A0A0A0L5L9 Clp R domain-containing protein2.7e-23990.1Show/hide
Query:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATA-SSQINQT
        MATRRVSKLTR ALAAIDAPK PHSR LLSR    SRSSSSSL N I   SVAK +GSR V+G+SMASAKYLATIFTRNFHST PSRYSATA SSQINQT
Subjt:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATA-SSQINQT

Query:  DFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDD
        DFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVL ATVDFI+QQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDD
Subjt:  DFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDD

Query:  FLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVI
        FLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYG+DLTE ARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVI
Subjt:  FLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVI

Query:  IGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLG
        IGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLG
Subjt:  IGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLG

Query:  RGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA
        RGELRCIGATTLKEYRKYIEKDPALERRFQQVFCG+PSVEDTISILRGLRERYELHHGVKISD +  S  V          +   A D V  +AA
Subjt:  RGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA

A0A1S3BUA9 chaperone protein ClpB4, mitochondrial1.6e-23989.68Show/hide
Query:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTD
        MATRRVSKLTR ALAAIDAPK PHSR LLSR    S SSSSSLGN I   SVAK +GSRPV+G+SMASA+YLATIFTRNFHST PSRYSATASSQINQTD
Subjt:  MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTD

Query:  FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF
        FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVL ATVDFISQQPKVTGETSGPIIGTHL L+LDNARKHKKEMGDDF
Subjt:  FTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDF

Query:  LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII
        LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYG+DLTE ARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII
Subjt:  LSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVII

Query:  GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR
        GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDA NLLKPMLGR
Subjt:  GEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGR

Query:  GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA
        GELRCIGATTL EYRKYIEKDPALERRFQQVFCG+PSVEDTISILRGLRERYELHHGVKISD +  S  V          +   A D V  +AA
Subjt:  GELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAA

A0A1S3BUW8 U-box domain-containing protein 15-like4.2e-24892.23Show/hide
Query:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD
        MAKCQRNDVGSV FDR STS+ AGSHFRLC  FS ASFRRK+FDAVSCGGSSRY YHHDGNVGGGDGTVS+AIRSLSEIVKEREA RPKRSN KSEKLFD
Subjt:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD

Query:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
        LLKLESSPES+PETKKKEEVLEEFK  VKKLQDEDL ERRAAAS VRLLAKEDAEAR TL MLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
Subjt:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA

Query:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL
        AIAKAGTVHKMLKLIESES PNP VSEAIVANFLGLSALDTNKL+IGSSGAIPFLVKNLYDPH++SSSQVKQDALRALYNLSI PSN+PFILETKLIPFL
Subjt:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL

Query:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
        LNALGDME+SERALS+LSNVVSTPEGRKAIST+PNSFPIL DVLNWADSPGCQEK SYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
Subjt:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR

Query:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
        +LESLRVDKGKQISDH GGNSSAPI  SL+SFTN ILGSAE LEG DDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
Subjt:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL

Query:  PF
        PF
Subjt:  PF

A0A5D3D8U0 U-box domain-containing protein 15-like4.2e-24892.23Show/hide
Query:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD
        MAKCQRNDVGSV FDR STS+ AGSHFRLC  FS ASFRRK+FDAVSCGGSSRY YHHDGNVGGGDGTVS+AIRSLSEIVKEREA RPKRSN KSEKLFD
Subjt:  MAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEKLFD

Query:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
        LLKLESSPES+PETKKKEEVLEEFK  VKKLQDEDL ERRAAAS VRLLAKEDAEAR TL MLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA
Subjt:  LLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYALLNLGIGNDLNKA

Query:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL
        AIAKAGTVHKMLKLIESES PNP VSEAIVANFLGLSALDTNKL+IGSSGAIPFLVKNLYDPH++SSSQVKQDALRALYNLSI PSN+PFILETKLIPFL
Subjt:  AIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIPFL

Query:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
        LNALGDME+SERALS+LSNVVSTPEGRKAIST+PNSFPIL DVLNWADSPGCQEK SYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR
Subjt:  LNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASR

Query:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
        +LESLRVDKGKQISDH GGNSSAPI  SL+SFTN ILGSAE LEG DDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL
Subjt:  ILESLRVDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSL

Query:  PF
        PF
Subjt:  PF

SwissProt top hitse value%identityAlignment
Q0E3C8 Chaperone protein ClpB3, mitochondrial5.0e-18281.56Show/hide
Query:  RNFHSTRPSRYSATASSQINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPI
        R FH T+ +RYS ++SSQI   +FTEMAWEG+VGAVD AR +KQQVVE+EHLMKALLEQKDGLARRIFSKAG+DN+SVL AT +FIS+QPKV G+TSGPI
Subjt:  RNFHSTRPSRYSATASSQINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPI

Query:  IGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIG
        IG+    ILDNARKHKKE  D+F+SVEH + AF  DKRFGQQLF++L++ E +LK+A+ AVRG+QRVTDQNPEGKY+AL+KYG D+TE ARRGKLDPVIG
Subjt:  IGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIG

Query:  RDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHT
        RDDE+RRCIQIL RRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPL NRKLISLDMG+L+AGAK++G FEERLKAVLKE+TASNGQIILFIDEIHT
Subjt:  RDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHT

Query:  VVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD
        +VGAGA GGAMDAGNLLKPMLGRGELRCIGATTL EYRKYIEKD ALERRFQQV+CG+P+VEDTISILRGLRERYELHHGVKISD
Subjt:  VVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD

Q75GT3 Chaperone protein ClpB2, chloroplastic5.4e-16074.26Show/hide
Query:  ATASSQINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNA
        A ++ +I Q +FTEMAW+ IV + + A+ +K Q+VE+EHLMK+LLEQ++GLARRIFSKAG+DN+ +LDAT  FI +QPKV GE  G ++G  L  ++  A
Subjt:  ATASSQINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNA

Query:  RKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQIL
        R  KKE GD F+SVEH VL F  DKRFG+QLFK+ Q++ + LK A++++RG Q V DQ+PEGKYEALDKYG DLT  AR+GKLDPVIGRDDEIRRCIQIL
Subjt:  RKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQIL

Query:  SRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMD
        SRRTKNNPV+IGEPGVGKTAIAEGLAQRIV+GDVP+ L NR+LI+LDMG+L+AGAKYRG+FE+RLKAVLKEVT S+GQ ILFIDEIHTVVGAGAT GAMD
Subjt:  SRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMD

Query:  AGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD
        AGNLLKPMLGRGELRCIGATTL EYRKYIEKDPALERRFQQV+  QPSVEDTISILRGLRERYELHHGV+ISD
Subjt:  AGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD

Q8DJ40 Chaperone protein ClpB 12.4e-14471.31Show/hide
Query:  NQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEM
        N   FTE AW  I    D A+  + Q +ESEHLMK+LLEQ +GLA +IF KAG     + D T +FIS+QPK++   SG  +G  L  +LD A + +K+ 
Subjt:  NQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEM

Query:  GDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNN
        GD+F+S+EH VLAF  D RFG++LF+++ LSEK L++A+Q +RG+Q+VTDQNPEGKY AL+KYG DLT  AR+GKLDPVIGRDDEIRR IQILSRRTKNN
Subjt:  GDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNN

Query:  PVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKP
        PV+IGEPGVGKTAIAEGLAQRIV  DVP+ L +R+LI+LDMG+L+AGAKYRG+FEERLKAVLKEVT SNGQIILFIDEIHTVVGAGAT GAMDAGNLLKP
Subjt:  PVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKP

Query:  MLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD
        ML RGELRCIGATTL EYRKYIEKD ALERRFQQV+  QPSVEDTISILRGL+ERYE+HHGVKISD
Subjt:  MLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD

Q8VYJ7 Chaperone protein ClpB4, mitochondrial1.1e-16869.18Show/hide
Query:  MATRRVSKLTRFALAAIDAPKFPHSR-SLLSRSPALSRSSS-SSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQ
        MA RR+SK    A+ A    ++  SR S L RS +LS S   +S+G    SF + K      +N +S+  A    T   + F  + P R+  T ++Q+NQ
Subjt:  MATRRVSKLTRFALAAIDAPKFPHSR-SLLSRSPALSRSSS-SSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQ

Query:  TDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGD
         +FTEMAWEG++ A D AR +KQQ+VESEHLMKALLEQKDG+AR+IF+KAG+DNSSVL AT  FIS+QP V+ + SG  +G+ L +IL+NA++HKK+M D
Subjt:  TDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGD

Query:  DFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPV
         ++SVEHF+LA++SD RFGQ+ F++++L  + LKDA++ VRG+QRVTD+NPE KY+AL+KYG DLTE ARRGKLDPVIGRDDEIRRCIQIL RRTKNNPV
Subjt:  DFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPV

Query:  IIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPML
        IIGEPGVGKTAIAEGLAQRIVRGDVPEPL+NRKLISLDMGSL+AGAK+RGDFEERLKAV+KEV+ASNGQ ILFIDEIHTVVGAGA  GAMDA NLLKPML
Subjt:  IIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPML

Query:  GRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD
        GRGELRCIGATTL EYRKYIEKDPALERRFQQV C QPSVEDTISILRGLRERYELHHGV ISD
Subjt:  GRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD

Q9LF37 Chaperone protein ClpB3, chloroplastic2.8e-16169.69Show/hide
Query:  IGSFSVAKSYGSRPVNGASMAS--AKYLATIFTRNFHSTRPSRYSATASS-QINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARR
        I SFS  +   + P   +S  S   K  A +  R  H     R  A++S+ ++ Q +FTEMAW+ IV + D A+ NKQQ+VE+EHLMKALLEQK+GLARR
Subjt:  IGSFSVAKSYGSRPVNGASMAS--AKYLATIFTRNFHSTRPSRYSATASS-QINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARR

Query:  IFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQR
        IFSK G+DN+ VL+AT  FI +QPKV G+ +G ++G  L  +   AR+ KK++ D ++SVEH VLAF  DKRFG+QLFK+ Q+SE+ LK A++++RG Q 
Subjt:  IFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQR

Query:  VTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAG
        V DQ+PEGKYEAL+KYG DLT  AR GKLDPVIGRDDEIRRCIQILSRRTKNNPV+IGEPGVGKTAI+EGLAQRIV+GDVP+ L+NRKLISLDMG+L+AG
Subjt:  VTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAG

Query:  AKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTIS
        AKYRG+FE+RLKAVLKEVT S GQIILFIDEIHTVVGAGAT GAMDAGNLLKPMLGRGELRCIGATTL EYRKYIEKDPALERRFQQV+  QP+VEDTIS
Subjt:  AKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTIS

Query:  ILRGLRERYELHHGVKISD
        ILRGLRERYELHHGV+ISD
Subjt:  ILRGLRERYELHHGVKISD

Arabidopsis top hitse value%identityAlignment
AT2G25130.1 ARM repeat superfamily protein1.8e-13159.49Show/hide
Query:  MAKCQRNDVGSVAFDRV---STSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEK
        MAKC RN+V  +   R+   S+S+ +G+      +FS +S RR +FDA+SCGGSSRYR                 +R   +    +  +  + S  K EK
Subjt:  MAKCQRNDVGSVAFDRV---STSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSEIVKEREAVRPKRSNVKSEK

Query:  LFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVE------RRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLE--DDESKIASLYALL
        L DLL L +  ES  ETKKKEE LE  KR VK LQ E   E      + AAAS VRLLAK+D EAR TLAMLGAIPPLV M+D E   +++ IASLYALL
Subjt:  LFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVE------RRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLE--DDESKIASLYALL

Query:  NLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVP
        NLGIGND+NKAAI KAG VHKMLKL+ES   PN +++EAIVANFLGLSALD+NK +IGSSGAI FLVK L +  + SSSQ ++DALRALYNLSI+  NV 
Subjt:  NLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVP

Query:  FILETKLIPFLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLL
        FILET LIPFLLN LGDMEVSER L++L+NVVS PEGRKAI     +FPIL DVLNW DS  CQEKA YILM+MAHK Y DR AMIEAGI S+LLELTL+
Subjt:  FILETKLIPFLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLL

Query:  GSTLAQKRASRILESLR-VDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFV-PSDH
        GS LAQKRASR+LE LR VDKGKQ+S    G SS              LG      G D  +++E+KAVKQLV+QSLQ+NM+RIVKRANLP DFV  S H
Subjt:  GSTLAQKRASRILESLR-VDKGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFV-PSDH

Query:  F-KSLT
        F KSLT
Subjt:  F-KSLT

AT2G25140.1 casein lytic proteinase B47.7e-17069.18Show/hide
Query:  MATRRVSKLTRFALAAIDAPKFPHSR-SLLSRSPALSRSSS-SSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQ
        MA RR+SK    A+ A    ++  SR S L RS +LS S   +S+G    SF + K      +N +S+  A    T   + F  + P R+  T ++Q+NQ
Subjt:  MATRRVSKLTRFALAAIDAPKFPHSR-SLLSRSPALSRSSS-SSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQ

Query:  TDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGD
         +FTEMAWEG++ A D AR +KQQ+VESEHLMKALLEQKDG+AR+IF+KAG+DNSSVL AT  FIS+QP V+ + SG  +G+ L +IL+NA++HKK+M D
Subjt:  TDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGD

Query:  DFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPV
         ++SVEHF+LA++SD RFGQ+ F++++L  + LKDA++ VRG+QRVTD+NPE KY+AL+KYG DLTE ARRGKLDPVIGRDDEIRRCIQIL RRTKNNPV
Subjt:  DFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPV

Query:  IIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPML
        IIGEPGVGKTAIAEGLAQRIVRGDVPEPL+NRKLISLDMGSL+AGAK+RGDFEERLKAV+KEV+ASNGQ ILFIDEIHTVVGAGA  GAMDA NLLKPML
Subjt:  IIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPML

Query:  GRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD
        GRGELRCIGATTL EYRKYIEKDPALERRFQQV C QPSVEDTISILRGLRERYELHHGV ISD
Subjt:  GRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTISILRGLRERYELHHGVKISD

AT4G31890.1 ARM repeat superfamily protein1.0e-15060.91Show/hide
Query:  MAKCQRNDVGSVAFDR---VSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYH-----HDGN---VGGGDGTVSSAIRSLSEIVKEREAVRPK
        MAKC RN++GS+  DR    S+S+ +G HFRL ++FS ++FRRK+ DAVSCGGSSRYR+       +G+   V       SS     + I      V  +
Subjt:  MAKCQRNDVGSVAFDR---VSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYH-----HDGN---VGGGDGTVSSAIRSLSEIVKEREAVRPK

Query:  RSNVKSEKLFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDE-----------DLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGML-DLED
         ++ KSEKL DLL L +  E++ ET KKEE LE  KR V++LQ             D  ++  AAS VRLLAKED+EAR TLAMLGAIPPLV M+ D   
Subjt:  RSNVKSEKLFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDE-----------DLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGML-DLED

Query:  DESKIASLYALLNLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRA
         +++IASLYALLNLGIGND NKAAI KAG VHKMLKLIES + P+  ++EA+VANFLGLSALD+NK +IGSSGAI FLVK L +  + SSSQ ++DALRA
Subjt:  DESKIASLYALLNLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRA

Query:  LYNLSIFPSNVPFILETKLIPFLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEA
        LYNLSI+  NV FILET LI +LLN LGDMEVSER L++LSN+V+ PEGRKAI    ++FP+L DVLNW DSPGCQEKA+YILM+MAHK Y DRQ MIEA
Subjt:  LYNLSIFPSNVPFILETKLIPFLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEA

Query:  GISSALLELTLLGSTLAQKRASRILESLRVDKGKQISDHFG--GNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKR
        GI SALLELTLLGS LAQKRASRILE LRVDKGKQ+ D  G  G  SAPI  +  +  +         E  D ++SEE+KAVKQLV+QSLQ+NM+RIVKR
Subjt:  GISSALLELTLLGSTLAQKRASRILESLRVDKGKQISDHFG--GNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKR

Query:  ANLPQDFVPSDHFKSLTSSSTSKSLPF
        ANLPQDFVPS+HFKSL+ SSTSKSLPF
Subjt:  ANLPQDFVPSDHFKSLTSSSTSKSLPF

AT4G31890.2 ARM repeat superfamily protein1.0e-15060.91Show/hide
Query:  MAKCQRNDVGSVAFDR---VSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYH-----HDGN---VGGGDGTVSSAIRSLSEIVKEREAVRPK
        MAKC RN++GS+  DR    S+S+ +G HFRL ++FS ++FRRK+ DAVSCGGSSRYR+       +G+   V       SS     + I      V  +
Subjt:  MAKCQRNDVGSVAFDR---VSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYH-----HDGN---VGGGDGTVSSAIRSLSEIVKEREAVRPK

Query:  RSNVKSEKLFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDE-----------DLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGML-DLED
         ++ KSEKL DLL L +  E++ ET KKEE LE  KR V++LQ             D  ++  AAS VRLLAKED+EAR TLAMLGAIPPLV M+ D   
Subjt:  RSNVKSEKLFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDE-----------DLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGML-DLED

Query:  DESKIASLYALLNLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRA
         +++IASLYALLNLGIGND NKAAI KAG VHKMLKLIES + P+  ++EA+VANFLGLSALD+NK +IGSSGAI FLVK L +  + SSSQ ++DALRA
Subjt:  DESKIASLYALLNLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRA

Query:  LYNLSIFPSNVPFILETKLIPFLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEA
        LYNLSI+  NV FILET LI +LLN LGDMEVSER L++LSN+V+ PEGRKAI    ++FP+L DVLNW DSPGCQEKA+YILM+MAHK Y DRQ MIEA
Subjt:  LYNLSIFPSNVPFILETKLIPFLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEA

Query:  GISSALLELTLLGSTLAQKRASRILESLRVDKGKQISDHFG--GNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKR
        GI SALLELTLLGS LAQKRASRILE LRVDKGKQ+ D  G  G  SAPI  +  +  +         E  D ++SEE+KAVKQLV+QSLQ+NM+RIVKR
Subjt:  GISSALLELTLLGSTLAQKRASRILESLRVDKGKQISDHFG--GNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKR

Query:  ANLPQDFVPSDHFKSLTSSSTSKSLPF
        ANLPQDFVPS+HFKSL+ SSTSKSLPF
Subjt:  ANLPQDFVPSDHFKSLTSSSTSKSLPF

AT5G15450.1 casein lytic proteinase B32.0e-16269.69Show/hide
Query:  IGSFSVAKSYGSRPVNGASMAS--AKYLATIFTRNFHSTRPSRYSATASS-QINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARR
        I SFS  +   + P   +S  S   K  A +  R  H     R  A++S+ ++ Q +FTEMAW+ IV + D A+ NKQQ+VE+EHLMKALLEQK+GLARR
Subjt:  IGSFSVAKSYGSRPVNGASMAS--AKYLATIFTRNFHSTRPSRYSATASS-QINQTDFTEMAWEGIVGAVDTARANKQQVVESEHLMKALLEQKDGLARR

Query:  IFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQR
        IFSK G+DN+ VL+AT  FI +QPKV G+ +G ++G  L  +   AR+ KK++ D ++SVEH VLAF  DKRFG+QLFK+ Q+SE+ LK A++++RG Q 
Subjt:  IFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQLFKNLQLSEKDLKDAVQAVRGNQR

Query:  VTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAG
        V DQ+PEGKYEAL+KYG DLT  AR GKLDPVIGRDDEIRRCIQILSRRTKNNPV+IGEPGVGKTAI+EGLAQRIV+GDVP+ L+NRKLISLDMG+L+AG
Subjt:  VTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNRKLISLDMGSLVAG

Query:  AKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTIS
        AKYRG+FE+RLKAVLKEVT S GQIILFIDEIHTVVGAGAT GAMDAGNLLKPMLGRGELRCIGATTL EYRKYIEKDPALERRFQQV+  QP+VEDTIS
Subjt:  AKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVEDTIS

Query:  ILRGLRERYELHHGVKISD
        ILRGLRERYELHHGV+ISD
Subjt:  ILRGLRERYELHHGVKISD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACCAGAAGAGTTTCGAAGCTCACAAGGTTTGCTTTAGCCGCCATTGACGCTCCAAAATTTCCTCATTCTCGTTCCCTTCTCTCTCGTTCACCTGCACTTTCGCG
TTCTTCATCTTCTTCTCTCGGTAATTCCATCGGTTCCTTTTCTGTTGCCAAGTCTTATGGTTCCAGACCCGTCAATGGCGCTTCCATGGCGTCGGCCAAGTATTTGGCTA
CGATTTTCACTCGGAACTTCCACTCTACGCGTCCTTCTCGCTACTCTGCTACGGCTTCTTCTCAGATAAATCAGACAGACTTCACTGAGATGGCATGGGAAGGCATAGTT
GGTGCAGTTGATACTGCACGGGCGAATAAACAACAAGTTGTTGAGAGTGAACATTTAATGAAAGCACTTCTTGAACAGAAGGATGGCTTAGCAAGGAGAATATTTTCTAA
GGCCGGACTTGACAATTCATCAGTTTTGGATGCTACAGTTGATTTTATATCTCAACAACCAAAGGTAACAGGCGAAACTAGTGGTCCAATAATAGGCACGCATCTAGGTT
TGATTTTGGACAATGCTCGAAAACATAAAAAAGAAATGGGAGACGATTTTCTATCTGTGGAACATTTTGTGTTAGCCTTCCATTCAGATAAGAGATTTGGGCAGCAACTA
TTTAAGAACTTGCAACTTAGTGAAAAGGATTTGAAGGATGCTGTTCAGGCTGTTCGTGGAAATCAGAGGGTGACTGATCAAAATCCTGAAGGAAAATATGAAGCTCTTGA
CAAGTACGGGACTGACTTAACTGAATTTGCTAGACGCGGTAAGCTTGATCCAGTTATTGGAAGAGATGATGAAATACGGCGATGCATCCAAATTCTATCAAGGAGAACTA
AAAACAATCCCGTAATCATTGGTGAGCCAGGTGTTGGGAAAACTGCAATCGCTGAAGGACTAGCTCAACGAATTGTGCGCGGGGATGTTCCAGAACCTTTGTTGAATAGA
AAGTTAATATCTCTGGACATGGGTTCACTGGTTGCTGGTGCAAAATACCGTGGAGATTTTGAGGAAAGATTGAAGGCTGTGCTAAAGGAAGTCACTGCTTCAAATGGGCA
AATTATCTTGTTCATAGATGAAATTCATACAGTTGTTGGTGCAGGGGCTACTGGTGGTGCGATGGATGCTGGCAATCTCTTGAAACCAATGCTTGGTCGAGGTGAACTAC
GGTGTATTGGTGCAACTACATTAAAGGAGTATAGAAAATACATTGAGAAAGATCCTGCACTCGAACGTAGATTTCAGCAAGTGTTTTGTGGCCAACCATCTGTTGAAGAT
ACAATCTCTATTCTTCGTGGGTTACGAGAGCGATATGAACTACATCATGGTGTAAAGATTTCCGATATTTCCAGAAGCAGTTTTCCAGTTTCCGATATGGCCAAGTGTCA
AAGAAACGACGTTGGATCTGTAGCTTTTGACCGAGTCTCCACTTCCGCTGCCGCCGGAAGCCATTTCCGTCTCTGCACTTCCTTCTCCGCCGCTTCATTCCGTAGAAAGG
TTTTTGACGCTGTAAGTTGTGGCGGAAGTTCTCGCTATCGTTATCACCACGACGGCAATGTCGGTGGCGGCGATGGTACTGTTTCCTCGGCCATTAGGTCGTTGTCCGAG
ATTGTGAAGGAAAGGGAGGCAGTGAGGCCGAAACGGTCCAATGTGAAGTCGGAGAAGCTGTTCGATCTTCTTAAGTTGGAGTCGTCGCCGGAATCGGAGCCGGAGACGAA
GAAGAAGGAGGAGGTGCTAGAAGAGTTCAAAAGGTCGGTGAAGAAGTTGCAGGATGAGGATTTGGTGGAGAGGAGAGCGGCTGCAAGTCGGGTTAGGTTGCTTGCAAAAG
AGGATGCAGAAGCGAGGCGAACGCTTGCAATGCTCGGAGCCATTCCGCCGCTAGTTGGAATGCTTGATTTGGAAGATGATGAATCTAAGATCGCCTCACTTTATGCATTG
CTCAATCTTGGAATTGGAAACGATTTGAACAAGGCGGCCATTGCTAAAGCGGGTACTGTTCACAAAATGCTCAAGCTGATCGAATCTGAAAGTTACCCAAATCCATCTGT
TTCAGAAGCTATAGTTGCGAATTTCCTCGGGTTGAGCGCATTAGATACGAACAAACTGGTAATTGGGTCATCAGGTGCAATTCCATTCTTGGTAAAGAACTTATACGACC
CACATAAAAAAAGTAGTTCACAAGTCAAGCAAGACGCTCTACGTGCGCTTTATAATCTCTCTATTTTCCCATCCAATGTTCCATTTATCTTAGAAACCAAGTTGATCCCA
TTTCTTCTAAACGCATTGGGGGACATGGAAGTAAGTGAAAGAGCCCTCTCTGTTCTAAGCAATGTGGTATCAACCCCAGAAGGTCGAAAGGCCATAAGCACTTTCCCAAA
TTCATTTCCAATACTGACAGATGTCTTGAATTGGGCTGATTCACCAGGCTGCCAAGAGAAAGCATCCTACATTTTAATGGTAATGGCGCATAAATCTTACAGTGATAGAC
AAGCAATGATTGAAGCTGGGATTTCATCAGCTTTGTTGGAACTAACTCTTTTGGGCAGTACATTGGCTCAGAAGAGGGCCTCGAGGATTTTGGAGTCTTTGAGGGTTGAT
AAAGGGAAACAGATCTCTGATCATTTTGGAGGAAATTCTTCTGCTCCAATTTCTAGTTCTTTATCTTCTTTTACCAACCAAATTCTAGGTTCTGCCGAAGGTTTGGAAGG
AGTAGATGATTTGGTGAGTGAAGAGAAGAAAGCCGTGAAGCAATTGGTGCGCCAAAGTTTGCAAAACAATATGAGGAGAATTGTGAAGAGAGCCAATTTGCCTCAGGATT
TTGTGCCTTCTGATCATTTCAAGTCGCTCACATCAAGTTCCACTTCAAAAAGCTTGCCATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACCAGAAGAGTTTCGAAGCTCACAAGGTTTGCTTTAGCCGCCATTGACGCTCCAAAATTTCCTCATTCTCGTTCCCTTCTCTCTCGTTCACCTGCACTTTCGCG
TTCTTCATCTTCTTCTCTCGGTAATTCCATCGGTTCCTTTTCTGTTGCCAAGTCTTATGGTTCCAGACCCGTCAATGGCGCTTCCATGGCGTCGGCCAAGTATTTGGCTA
CGATTTTCACTCGGAACTTCCACTCTACGCGTCCTTCTCGCTACTCTGCTACGGCTTCTTCTCAGATAAATCAGACAGACTTCACTGAGATGGCATGGGAAGGCATAGTT
GGTGCAGTTGATACTGCACGGGCGAATAAACAACAAGTTGTTGAGAGTGAACATTTAATGAAAGCACTTCTTGAACAGAAGGATGGCTTAGCAAGGAGAATATTTTCTAA
GGCCGGACTTGACAATTCATCAGTTTTGGATGCTACAGTTGATTTTATATCTCAACAACCAAAGGTAACAGGCGAAACTAGTGGTCCAATAATAGGCACGCATCTAGGTT
TGATTTTGGACAATGCTCGAAAACATAAAAAAGAAATGGGAGACGATTTTCTATCTGTGGAACATTTTGTGTTAGCCTTCCATTCAGATAAGAGATTTGGGCAGCAACTA
TTTAAGAACTTGCAACTTAGTGAAAAGGATTTGAAGGATGCTGTTCAGGCTGTTCGTGGAAATCAGAGGGTGACTGATCAAAATCCTGAAGGAAAATATGAAGCTCTTGA
CAAGTACGGGACTGACTTAACTGAATTTGCTAGACGCGGTAAGCTTGATCCAGTTATTGGAAGAGATGATGAAATACGGCGATGCATCCAAATTCTATCAAGGAGAACTA
AAAACAATCCCGTAATCATTGGTGAGCCAGGTGTTGGGAAAACTGCAATCGCTGAAGGACTAGCTCAACGAATTGTGCGCGGGGATGTTCCAGAACCTTTGTTGAATAGA
AAGTTAATATCTCTGGACATGGGTTCACTGGTTGCTGGTGCAAAATACCGTGGAGATTTTGAGGAAAGATTGAAGGCTGTGCTAAAGGAAGTCACTGCTTCAAATGGGCA
AATTATCTTGTTCATAGATGAAATTCATACAGTTGTTGGTGCAGGGGCTACTGGTGGTGCGATGGATGCTGGCAATCTCTTGAAACCAATGCTTGGTCGAGGTGAACTAC
GGTGTATTGGTGCAACTACATTAAAGGAGTATAGAAAATACATTGAGAAAGATCCTGCACTCGAACGTAGATTTCAGCAAGTGTTTTGTGGCCAACCATCTGTTGAAGAT
ACAATCTCTATTCTTCGTGGGTTACGAGAGCGATATGAACTACATCATGGTGTAAAGATTTCCGATATTTCCAGAAGCAGTTTTCCAGTTTCCGATATGGCCAAGTGTCA
AAGAAACGACGTTGGATCTGTAGCTTTTGACCGAGTCTCCACTTCCGCTGCCGCCGGAAGCCATTTCCGTCTCTGCACTTCCTTCTCCGCCGCTTCATTCCGTAGAAAGG
TTTTTGACGCTGTAAGTTGTGGCGGAAGTTCTCGCTATCGTTATCACCACGACGGCAATGTCGGTGGCGGCGATGGTACTGTTTCCTCGGCCATTAGGTCGTTGTCCGAG
ATTGTGAAGGAAAGGGAGGCAGTGAGGCCGAAACGGTCCAATGTGAAGTCGGAGAAGCTGTTCGATCTTCTTAAGTTGGAGTCGTCGCCGGAATCGGAGCCGGAGACGAA
GAAGAAGGAGGAGGTGCTAGAAGAGTTCAAAAGGTCGGTGAAGAAGTTGCAGGATGAGGATTTGGTGGAGAGGAGAGCGGCTGCAAGTCGGGTTAGGTTGCTTGCAAAAG
AGGATGCAGAAGCGAGGCGAACGCTTGCAATGCTCGGAGCCATTCCGCCGCTAGTTGGAATGCTTGATTTGGAAGATGATGAATCTAAGATCGCCTCACTTTATGCATTG
CTCAATCTTGGAATTGGAAACGATTTGAACAAGGCGGCCATTGCTAAAGCGGGTACTGTTCACAAAATGCTCAAGCTGATCGAATCTGAAAGTTACCCAAATCCATCTGT
TTCAGAAGCTATAGTTGCGAATTTCCTCGGGTTGAGCGCATTAGATACGAACAAACTGGTAATTGGGTCATCAGGTGCAATTCCATTCTTGGTAAAGAACTTATACGACC
CACATAAAAAAAGTAGTTCACAAGTCAAGCAAGACGCTCTACGTGCGCTTTATAATCTCTCTATTTTCCCATCCAATGTTCCATTTATCTTAGAAACCAAGTTGATCCCA
TTTCTTCTAAACGCATTGGGGGACATGGAAGTAAGTGAAAGAGCCCTCTCTGTTCTAAGCAATGTGGTATCAACCCCAGAAGGTCGAAAGGCCATAAGCACTTTCCCAAA
TTCATTTCCAATACTGACAGATGTCTTGAATTGGGCTGATTCACCAGGCTGCCAAGAGAAAGCATCCTACATTTTAATGGTAATGGCGCATAAATCTTACAGTGATAGAC
AAGCAATGATTGAAGCTGGGATTTCATCAGCTTTGTTGGAACTAACTCTTTTGGGCAGTACATTGGCTCAGAAGAGGGCCTCGAGGATTTTGGAGTCTTTGAGGGTTGAT
AAAGGGAAACAGATCTCTGATCATTTTGGAGGAAATTCTTCTGCTCCAATTTCTAGTTCTTTATCTTCTTTTACCAACCAAATTCTAGGTTCTGCCGAAGGTTTGGAAGG
AGTAGATGATTTGGTGAGTGAAGAGAAGAAAGCCGTGAAGCAATTGGTGCGCCAAAGTTTGCAAAACAATATGAGGAGAATTGTGAAGAGAGCCAATTTGCCTCAGGATT
TTGTGCCTTCTGATCATTTCAAGTCGCTCACATCAAGTTCCACTTCAAAAAGCTTGCCATTTTGA
Protein sequenceShow/hide protein sequence
MATRRVSKLTRFALAAIDAPKFPHSRSLLSRSPALSRSSSSSLGNSIGSFSVAKSYGSRPVNGASMASAKYLATIFTRNFHSTRPSRYSATASSQINQTDFTEMAWEGIV
GAVDTARANKQQVVESEHLMKALLEQKDGLARRIFSKAGLDNSSVLDATVDFISQQPKVTGETSGPIIGTHLGLILDNARKHKKEMGDDFLSVEHFVLAFHSDKRFGQQL
FKNLQLSEKDLKDAVQAVRGNQRVTDQNPEGKYEALDKYGTDLTEFARRGKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEPLLNR
KLISLDMGSLVAGAKYRGDFEERLKAVLKEVTASNGQIILFIDEIHTVVGAGATGGAMDAGNLLKPMLGRGELRCIGATTLKEYRKYIEKDPALERRFQQVFCGQPSVED
TISILRGLRERYELHHGVKISDISRSSFPVSDMAKCQRNDVGSVAFDRVSTSAAAGSHFRLCTSFSAASFRRKVFDAVSCGGSSRYRYHHDGNVGGGDGTVSSAIRSLSE
IVKEREAVRPKRSNVKSEKLFDLLKLESSPESEPETKKKEEVLEEFKRSVKKLQDEDLVERRAAASRVRLLAKEDAEARRTLAMLGAIPPLVGMLDLEDDESKIASLYAL
LNLGIGNDLNKAAIAKAGTVHKMLKLIESESYPNPSVSEAIVANFLGLSALDTNKLVIGSSGAIPFLVKNLYDPHKKSSSQVKQDALRALYNLSIFPSNVPFILETKLIP
FLLNALGDMEVSERALSVLSNVVSTPEGRKAISTFPNSFPILTDVLNWADSPGCQEKASYILMVMAHKSYSDRQAMIEAGISSALLELTLLGSTLAQKRASRILESLRVD
KGKQISDHFGGNSSAPISSSLSSFTNQILGSAEGLEGVDDLVSEEKKAVKQLVRQSLQNNMRRIVKRANLPQDFVPSDHFKSLTSSSTSKSLPF