; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004080 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004080
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCS domain-containing protein
Genome locationChr08:13473128..13475225
RNA-Seq ExpressionHG10004080
SyntenyHG10004080
Gene Ontology termsGO:0006457 - protein folding (biological process)
GO:0051131 - chaperone-mediated protein complex assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0051087 - chaperone binding (molecular function)
GO:0051879 - Hsp90 protein binding (molecular function)
InterPro domainsIPR007052 - CS domain
IPR008978 - HSP20-like chaperone


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578857.1 Co-chaperone protein p23-1, partial [Cucurbita argyrosperma subsp. sororia]1.9e-7578.01Show/hide
Query:  NRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVAF
        +RHPTLRWAQTSDRLFITID+PDAQDVKLKLEPEGK  FSAV GAEKIPYE+DIDLYDKVDINESKA VGMRNIRY+IEKAEKKWWSRLLKQEGKPPV F
Subjt:  NRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVAF

Query:  LTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        + + +          +  + NDMDF N+DFSKLG GAGEGMGADAFGEDD+D+NDI++EGDDKEGV++DRAP+AE V EPGSTSSKPDA+A
Subjt:  LTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

KAG7016389.1 hypothetical protein SDJN02_21498, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-7577.08Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTSDRLFITID+PDAQDVKLKLEPEGK  FSAV GAEKIPYE+DIDLYDKVDINESKA VGMRNIRY+IEKAEKKWWSRLLKQEGKPPV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +          +  + NDMDF N+DFSKLG GAGEGMGADAFGEDD+D+NDI++EGDDKEGV++D+AP+AE V EPGSTSSKPDA+A
Subjt:  FLTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

XP_004140943.1 co-chaperone protein p23-1 isoform X1 [Cucumis sativus]2.8e-7175Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTSDRLFITIDLPDAQDVKLKL+PEGKF FSAVSG EKIPYEVDIDLYDKVDINESKA +GMRNI YLIEKAEKKWWSRLLKQEGKPPV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +             + NDMDFS+LDFSKLG+  G GMGADAFGEDDEDDNDI+DEG++KEG KVD+ PLA  +NE GS+S +PDA+A
Subjt:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

XP_022938739.1 uncharacterized protein At3g03773-like [Cucurbita moschata]1.8e-7880.21Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTSDRLF+TID+PDAQDVKLKLEPEGKF FSAV GAEKIPYE+DIDLYDKVDINESKA VGMRNIRY+IEKAEKKWWSRLLKQEGKPPV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +          +  + NDMDFSN+DFSKLGIGAGEGMGADAFGEDDEDDNDI++EGDDKEGV++DRAPLAE V EPGSTSSKPDA+A
Subjt:  FLTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

XP_038886632.1 co-chaperone protein p23-1-like [Benincasa hispida]2.7e-7479.17Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTS+RLFITIDLPDAQDVKLKLEP+GKFSFSAVSGAEKIPYEVDIDLYDKVDINESKA VGMRNIRYLIEKAEKKWWSRLLKQEGKPPV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +             +AN+MDFSNLDFSKLGI AGEG+GAD FGE DE DN+++ EGD+KEG KVDR PLAEA+NEPGSTSSKPDA+A
Subjt:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

TrEMBL top hitse value%identityAlignment
A0A0A0K8N2 CS domain-containing protein1.4e-7175Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTSDRLFITIDLPDAQDVKLKL+PEGKF FSAVSG EKIPYEVDIDLYDKVDINESKA +GMRNI YLIEKAEKKWWSRLLKQEGKPPV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +             + NDMDFS+LDFSKLG+  G GMGADAFGEDDEDDNDI+DEG++KEG KVD+ PLA  +NE GS+S +PDA+A
Subjt:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

A0A1S3C4J1 uncharacterized protein At3g03773-like isoform X13.7e-6975.52Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTS+RLFITIDLPDAQDVKLKLEPEGKF FSAVSGAEKIP+EVDIDLYDKVDINESKA +GMRNIRYLIEKAEKKWWSRLLKQEGK PV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +             + NDMDFS+LDFSKLG+  G G GADAFGEDDEDDNDINDEGDDKEG K D+ PLA  VNE  S+S KPD +A
Subjt:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

A0A5A7SN48 CS domain-containing protein8.2e-6976.32Show/hide
Query:  RHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVAFL
        RHPTLRWAQTS+RLFITIDLPDAQDVKLKLEPEGKF FSAVSGAEKIP+EVDIDLYDKVDINESKA +GMRNIRYLIEKAEKKWWSRLLKQEGK PV F+
Subjt:  RHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVAFL

Query:  TLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
         + +             + NDMDFS+LDFSKLG+  G G GADAFGEDDEDDNDINDEGDDKEG K D+ PLA  VNE  S+S KPD +A
Subjt:  TLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

A0A5D3BY68 CS domain-containing protein3.7e-6975.52Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTS+RLFITIDLPDAQDVKLKLEPEGKF FSAVSGAEKIP+EVDIDLYDKVDINESKA +GMRNIRYLIEKAEKKWWSRLLKQEGK PV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +             + NDMDFS+LDFSKLG+  G G GADAFGEDDEDDNDINDEGDDKEG K D+ PLA  VNE  S+S KPD +A
Subjt:  FLTLSFK----------CAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

A0A6J1FEZ5 uncharacterized protein At3g03773-like8.8e-7980.21Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPTLRWAQTSDRLF+TID+PDAQDVKLKLEPEGKF FSAV GAEKIPYE+DIDLYDKVDINESKA VGMRNIRY+IEKAEKKWWSRLLKQEGKPPV 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA
        F+ + +          +  + NDMDFSN+DFSKLGIGAGEGMGADAFGEDDEDDNDI++EGDDKEGV++DRAPLAE V EPGSTSSKPDA+A
Subjt:  FLTLSF----------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA

SwissProt top hitse value%identityAlignment
P0C8Z0 Uncharacterized protein OsI_0279401.2e-2441.73Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHP ++WAQ  D+++IT+ L DA+D K+ LEPEG FSFSA +G +   YE  ++L DKV++ ESK  VG+R+I  ++EKAE KWW +L++ + K P  
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSF-----KCAAANDMDFSNLDFSKL-GIGAGEGMG
        F+ + +     +     D++   +DFS   G+G   GMG
Subjt:  FLTLSF-----KCAAANDMDFSNLDFSKL-GIGAGEGMG

Q6ID70 Co-chaperone protein p23-21.0e-2047.42Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKP
        ++R+P + WAQ SD++++T+ LPDA+D+ +K EP+G FSFSA+ GA+   +E  ++LY K+ + E +  VG+RNI + I+K E+ WW+RLLK E KP
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKP

Q6YYB0 Uncharacterized protein Os08g03595001.2e-2441.73Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHP ++WAQ  D+++IT+ L DA+D K+ LEPEG FSFSA +G +   YE  ++L DKV++ ESK  VG+R+I  ++EKAE KWW +L++ + K P  
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSF-----KCAAANDMDFSNLDFSKL-GIGAGEGMG
        F+ + +     +     D++   +DFS   G+G   GMG
Subjt:  FLTLSF-----KCAAANDMDFSNLDFSKL-GIGAGEGMG

Q8L7U4 Co-chaperone protein p23-18.0e-2138.03Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHP ++WA+T++++F+T+ L D +D K+ L+PEG F FSA  G E   YE+ ++L DKV++ ESK  +G R+I  +IEKAE + W++LL+   K P  
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK---------CAAANDMDFSNLDFSKLGIGAGEGMG
        ++ + +           A A DMD + ++    G+G   GMG
Subjt:  FLTLSFK---------CAAANDMDFSNLDFSKLGIGAGEGMG

Q9FR62 Co-chaperone protein p23-11.4e-3647.19Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHPT++WAQ SD ++IT++LPDA+DVKLKLEPEGKF FSA SGA K  YEVD+DL D VD+NESKA V  R++ YL++KAE KWW+RL K EGK P+ 
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSF---------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVN
        +L + +               DMDF + DF+ L +G  + +G +   ED + + +   E  +K   K+D     E VN
Subjt:  FLTLSF---------KCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVN

Arabidopsis top hitse value%identityAlignment
AT3G03773.1 HSP20-like chaperones superfamily protein7.5e-2247.42Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKP
        ++R+P + WAQ SD++++T+ LPDA+D+ +K EP+G FSFSA+ GA+   +E  ++LY K+ + E +  VG+RNI + I+K E+ WW+RLLK E KP
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKP

AT3G03773.2 HSP20-like chaperones superfamily protein1.3e-2146.46Show/hide
Query:  VSLNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKP
        + + R+P + WAQ SD++++T+ LPDA+D+ +K EP+G FSFSA+ GA+   +E  ++LY K+ + E +  VG+RNI + I+K E+ WW+RLLK E KP
Subjt:  VSLNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKP

AT4G02450.1 HSP20-like chaperones superfamily protein5.7e-2238.03Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHP ++WA+T++++F+T+ L D +D K+ L+PEG F FSA  G E   YE+ ++L DKV++ ESK  +G R+I  +IEKAE + W++LL+   K P  
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK---------CAAANDMDFSNLDFSKLGIGAGEGMG
        ++ + +           A A DMD + ++    G+G   GMG
Subjt:  FLTLSFK---------CAAANDMDFSNLDFSKLGIGAGEGMG

AT4G02450.2 HSP20-like chaperones superfamily protein5.7e-2238.03Show/hide
Query:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA
        ++RHP ++WA+T++++F+T+ L D +D K+ L+PEG F FSA  G E   YE+ ++L DKV++ ESK  +G R+I  +IEKAE + W++LL+   K P  
Subjt:  LNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAEKKWWSRLLKQEGKPPVA

Query:  FLTLSFK---------CAAANDMDFSNLDFSKLGIGAGEGMG
        ++ + +           A A DMD + ++    G+G   GMG
Subjt:  FLTLSFK---------CAAANDMDFSNLDFSKLGIGAGEGMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTATATCTCTCGCTTCTTGATTCTTTTCGTTGCTTGTTCTATACAAAATGATGAGGAATCGGCGGTGTTTGTTAGTTTAAACCGACACCCTACTTTAAGATGGGC
ACAAACATCAGATAGGCTATTCATAACAATTGACTTGCCAGATGCCCAGGATGTGAAGCTTAAGTTGGAACCTGAAGGGAAATTTTCGTTTTCTGCAGTTAGTGGAGCAG
AGAAGATTCCATATGAAGTTGACATTGATCTCTATGATAAAGTTGACATAAATGAGAGCAAGGCTTGTGTTGGCATGAGAAACATCCGGTACTTGATAGAAAAGGCTGAA
AAGAAGTGGTGGAGCAGATTGTTGAAGCAAGAAGGGAAACCTCCTGTAGCCTTTCTTACCCTGTCATTCAAATGCGCAGCTGCTAATGATATGGACTTTAGCAACTTGGA
CTTTTCAAAACTCGGTATTGGTGCTGGTGAAGGCATGGGTGCCGATGCGTTTGGAGAGGATGATGAAGATGACAACGACATCAACGACGAGGGTGACGATAAAGAAGGGG
TAAAAGTCGATCGAGCACCTCTTGCAGAGGCTGTTAATGAGCCTGGCTCTACAAGCAGCAAACCTGATGCTGAAGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTATATCTCTCGCTTCTTGATTCTTTTCGTTGCTTGTTCTATACAAAATGATGAGGAATCGGCGGTGTTTGTTAGTTTAAACCGACACCCTACTTTAAGATGGGC
ACAAACATCAGATAGGCTATTCATAACAATTGACTTGCCAGATGCCCAGGATGTGAAGCTTAAGTTGGAACCTGAAGGGAAATTTTCGTTTTCTGCAGTTAGTGGAGCAG
AGAAGATTCCATATGAAGTTGACATTGATCTCTATGATAAAGTTGACATAAATGAGAGCAAGGCTTGTGTTGGCATGAGAAACATCCGGTACTTGATAGAAAAGGCTGAA
AAGAAGTGGTGGAGCAGATTGTTGAAGCAAGAAGGGAAACCTCCTGTAGCCTTTCTTACCCTGTCATTCAAATGCGCAGCTGCTAATGATATGGACTTTAGCAACTTGGA
CTTTTCAAAACTCGGTATTGGTGCTGGTGAAGGCATGGGTGCCGATGCGTTTGGAGAGGATGATGAAGATGACAACGACATCAACGACGAGGGTGACGATAAAGAAGGGG
TAAAAGTCGATCGAGCACCTCTTGCAGAGGCTGTTAATGAGCCTGGCTCTACAAGCAGCAAACCTGATGCTGAAGCTTAA
Protein sequenceShow/hide protein sequence
MRYISRFLILFVACSIQNDEESAVFVSLNRHPTLRWAQTSDRLFITIDLPDAQDVKLKLEPEGKFSFSAVSGAEKIPYEVDIDLYDKVDINESKACVGMRNIRYLIEKAE
KKWWSRLLKQEGKPPVAFLTLSFKCAAANDMDFSNLDFSKLGIGAGEGMGADAFGEDDEDDNDINDEGDDKEGVKVDRAPLAEAVNEPGSTSSKPDAEA