CuGenDBv2

Spg012980 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg012980
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: chaperone protein dnaJ 72
Genome location: scaffold1:18244186..18258237
RNA-Seq Expression: Spg012980
Synteny: Spg012980
Gene Ontology terms:
  GO:0006397 - mRNA processing (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR001623 - DnaJ domain
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR018253 - DnaJ domain, conserved site
  IPR036869 - Chaperone J-domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6595464.1 Chaperone protein dnaJ 72, partial [Cucurbita argyrosperma subsp. sororia]; e-value 4.2e-84; identity 95.86%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPK+VRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFASRSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGGMFAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

KAG7027464.1 Chaperone protein dnaJ 72 [Cucurbita argyrosperma subsp. argyrosperma]; e-value 4.2e-84; identity 95.86%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPK+VRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFASRSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGGMFAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

XP_022925035.1 chaperone protein dnaJ 72 [Cucurbita moschata]; e-value 1.2e-83; identity 95.27%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPK+VRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYAN SGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFASRSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGGMFAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

XP_022966465.1 chaperone protein dnaJ 72 [Cucurbita maxima]; e-value 2.7e-83; identity 95.27%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPKAVRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGP VNQHYYSSYNSYANASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFASRSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGG+FAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

XP_023518661.1 chaperone protein dnaJ 72 [Cucurbita pepo subsp. pepo]; e-value 2.1e-83; identity 95.27%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPKAVRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSV QHYYSSYNSYANASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFA+RSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGGMFAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3B4T7 chaperone protein dnaJ 72; e-value 2.4e-77; identity 91.12%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGL RSATKEEIK+AFRKLAKEFHPDKHSQSPK VRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQ YYSSYNSYA ASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R  YGSSSGFASRS FN DGL TNFHMLLRFLTTRAFLLNFAFAGVL GGM AID+SGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

A0A5A7TEJ7 Chaperone protein dnaJ 72; e-value 1.2e-65; identity 89.4%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGL RSATKEEIK+AFRKLAKEFHPDKHSQSPK VRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQ YYSSYNSYA ASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGM
        R  YGSSSGFASRS FN DGL TNFHMLLRFLTTRAFLLNFAFAG L   +
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGM

A0A6J1DJW0 chaperone protein dnaJ 72; e-value 1.6e-76; identity 90.59%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYY-SSYNSYANASG
        MD RDHYKVLGLNR ATKEEIKEAFRKLAKEFHPDKHSQSPKA+RDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVN HYY SSYNSYANASG
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYY-SSYNSYANASG

Query:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        PR  Y SSSGFASRSG NV+G  TN HMLLRFLTTRAFLLNFAFAG LVGGMFAIDTSGEALWKM NSGK
Subjt:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

A0A6J1EGR8 chaperone protein dnaJ 72; e-value 5.9e-84; identity 95.27%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPK+VRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYAN SGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFASRSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGGMFAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

A0A6J1HRQ1 chaperone protein dnaJ 72; e-value 1.3e-83; identity 95.27%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        MDVRDHYKVLGLNRSATKEE+KEAFRKLAKEFHPDKHSQSPKAVRDSAT+RFKQVSEAYEILGDDCKRADYNIRSRCASGP VNQHYYSSYNSYANASGP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
        R+GYGSSSGFASRSG NVDGLFTNFHMLLRFLTTRAFLLN AFAGVLVGG+FAIDTSGEALWKM NSGK
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

SwissProt top hits (e-value, % identity, alignment)
A9VHU0 Chaperone protein DnaJ; e-value 1.5e-12; identity 36.8%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP
        M+ RD+Y+VLGL++ A+K+EIK+A+R+LAK++HPD   +      ++A  +FK+V EAYE+L DD KRA                     Y+ + +A GP
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGP

Query:  RHGYGSSSGFASRSGFNVDGLFTNF
          G+G    F    GF  + +F++F
Subjt:  RHGYGSSSGFASRSGFNVDGLFTNF

B1HUD0 Chaperone protein DnaJ; e-value 9.0e-13; identity 50%
Query:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYN
        M+ RD+Y+VLGL +SATK+EIK+A+RKL+K++HPD + +        A  +FK+++EAYE+L DD K+A Y+
Subjt:  MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYN

Q0WTI8 Chaperone protein dnaJ 72; e-value 1.3e-35; identity 53.53%
Query:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRS-----RCASGPSVNQHYYSSYNSYANASG
        DHY+VLG+ R+ATK+E+K+AFR+LA ++HPDKH+QSP+ VR +AT+RFK VSEAYE+L DD KRA YN  S     R  SG   N   Y +    A  SG
Subjt:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRS-----RCASGPSVNQHYYSSYNSYANASG

Query:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
          +GYG S+     S F+     + F    R+LTTRAFLLN A AG L     AIDTSGE LWKM+NSGK
Subjt:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

Q6KHF9 Chaperone protein DnaJ; e-value 1.1e-13; identity 40.98%
Query:  RDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGPRHG
        +D+Y++LGL +SA+K+EIK+A+R LAK +HPD + ++      +A  +FK+++EAYEIL DD KR  YN     A  P  N   +   N + NA G    
Subjt:  RDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGPRHG

Query:  YGSSSGFASRSGFNVDGLFTNF
            SGF+  SGF    +FT+F
Subjt:  YGSSSGFASRSGFNVDGLFTNF

Q8GUP1 Cleavage stimulation factor subunit 77; e-value 6.6e-24; identity 68.49%
Query:  KYNVEVAESVANEAQRLPILEATPLYEQLLTVYPTAAKYWKQYVEAHMVINNDDATKQIFSRCLLNCLHIPLW
        KY VE AE++A  A   PI +ATP+YEQLL++YPT+A++WKQYVEA M +NNDDATKQIFSRCLL CL +PLW
Subjt:  KYNVEVAESVANEAQRLPILEATPLYEQLLTVYPTAAKYWKQYVEAHMVINNDDATKQIFSRCLLNCLHIPLW

Arabidopsis top hits (e-value, % identity, alignment)
AT1G17760.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e-value 4.7e-25; identity 68.49%
Query:  KYNVEVAESVANEAQRLPILEATPLYEQLLTVYPTAAKYWKQYVEAHMVINNDDATKQIFSRCLLNCLHIPLW
        KY VE AE++A  A   PI +ATP+YEQLL++YPT+A++WKQYVEA M +NNDDATKQIFSRCLL CL +PLW
Subjt:  KYNVEVAESVANEAQRLPILEATPLYEQLLTVYPTAAKYWKQYVEAHMVINNDDATKQIFSRCLLNCLHIPLW

AT1G59725.1 DNAJ heat shock family protein; e-value 3.9e-11; identity 37.86%
Query:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNI-----RSRCASGPSVNQHYYSSYNSYANASG
        D+Y VL +N SAT++++K+++R+LA ++HPDK   +P +++  A  +FKQ+SEAY++L D  KR  Y+       +   +  S  QH YSS N+    +G
Subjt:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNI-----RSRCASGPSVNQHYYSSYNSYANASG

Query:  PRH
         R+
Subjt:  PRH

AT2G41000.1 Chaperone DnaJ-domain superfamily protein; e-value 9.2e-37; identity 53.53%
Query:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRS-----RCASGPSVNQHYYSSYNSYANASG
        DHY+VLG+ R+ATK+E+K+AFR+LA ++HPDKH+QSP+ VR +AT+RFK VSEAYE+L DD KRA YN  S     R  SG   N   Y +    A  SG
Subjt:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRS-----RCASGPSVNQHYYSSYNSYANASG

Query:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK
          +GYG S+     S F+     + F    R+LTTRAFLLN A AG L     AIDTSGE LWKM+NSGK
Subjt:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGK

AT2G41000.2 Chaperone DnaJ-domain superfamily protein; e-value 3.5e-36; identity 53.25%
Query:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRS-----RCASGPSVNQHYYSSYNSYANASG
        DHY+VLG+ R+ATK+E+K+AFR+LA ++HPDKH+QSP+ VR +AT+RFK VSEAYE+L DD KRA YN  S     R  SG   N   Y +    A  SG
Subjt:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRS-----RCASGPSVNQHYYSSYNSYANASG

Query:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSG
          +GYG S+     S F+     + F    R+LTTRAFLLN A AG L     AIDTSGE LWKM+NSG
Subjt:  PRHGYGSSSGFASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSG

AT3G08910.1 DNAJ heat shock family protein; e-value 5.6e-10; identity 36.59%
Query:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGPRHGY
        D+YKVL ++R+A  +++K+A+RKLA ++HPDK+  + K     A  +FKQ+SEAY++L D  KRA Y+               Y      + A  P  G 
Subjt:  DHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGPRHGY

Query:  GSSSGFAS--RSGFNVDGLFTNF
        G S G AS   +G + D +F+ F
Subjt:  GSSSGFAS--RSGFNVDGLFTNF


Sequences
CDS sequence
ATGGATGTTCGCGATCACTACAAGGTACTGGGCTTGAACAGAAGCGCCACAAAGGAAGAAATCAAGGAGGCATTCAGGAAGCTGGCGAAGGAATTTCACCCTGATAAGCA
TTCCCAATCGCCGAAGGCTGTTAGGGACTCCGCTACTATGAGATTCAAACAGGTTTCGGAAGCGTATGAAATCCTTGGCGATGATTGTAAGCGTGCTGATTATAATATTC
GGTCTCGTTGTGCCTCTGGTCCGTCCGTTAATCAACATTATTACTCCTCGTATAATTCTTATGCTAATGCGAGTGGACCTCGACATGGATATGGTTCTTCTTCTGGATTT
GCCTCTCGCTCTGGTTTCAATGTCGATGGGTTGTTTACGAATTTTCATATGCTGCTGCGCTTTCTCACAACGCGTGCATTTCTTCTCAATTTTGCTTTTGCCGGTGTTTT
AGTTGGTGGAATGTTTGCGATCGACACAAGTGGGGAGGCCTTGTGGAAGATGCAAAATTCTGGGAAGCTCGTCGGCGTCGGAGCTGGGGAGAAACGACGGCATCGAAGCT
TTTCGGCGTCAGATCTGAAAGGGGAGGGCCGATGGTCAGATCTGAAAAAGGGAGGGGAAGGGAAGGGAAGGGAAAACGAAAGGGAAGGGAAGGGAAAGGAATGCGAAAGG
GAAGGGAAAGGGGAAGGAGTGGTCGACGACGATTCGAGGGGCGGTAATGCTTGCCAGTGCACTATTCCTCAGCTTCTTCAATCTTCTTCCAAATCTAACCTTCAGCCTTC
TAATTCTAATTCTCAAATGCAAAGGATTTATGAACTAAACCTGCTTTTGCTCGAGCTCGCAGCTAAGCAGACCAGGATAACTCATACCATCATAGCTGTATTTTCACGCG
TTCTCTTCATTATGACATCAGAAGGACCTGAATCAAAGGATAAAACAGCCAGTAATAAACTTTTGGATGATTTGAAGTACAATGTTGAAGTGGCAGAAAGCGTTGCTAAT
GAGGCGCAGCGTTTGCCAATATTGGAGGCAACACCATTATATGAGCAACTGCTGACGGTGTATCCCACTGCTGCTAAATATTGGAAGCAATATGTGGAGGCACACATGGT
TATAAATAATGATGATGCTACAAAACAGATATTTAGCCGGTGCTTATTGAACTGTCTTCACATTCCTCTTTGGTATGATCCTTGTTTCACCTATTAG
mRNA sequence
ATGGATGTTCGCGATCACTACAAGGTACTGGGCTTGAACAGAAGCGCCACAAAGGAAGAAATCAAGGAGGCATTCAGGAAGCTGGCGAAGGAATTTCACCCTGATAAGCA
TTCCCAATCGCCGAAGGCTGTTAGGGACTCCGCTACTATGAGATTCAAACAGGTTTCGGAAGCGTATGAAATCCTTGGCGATGATTGTAAGCGTGCTGATTATAATATTC
GGTCTCGTTGTGCCTCTGGTCCGTCCGTTAATCAACATTATTACTCCTCGTATAATTCTTATGCTAATGCGAGTGGACCTCGACATGGATATGGTTCTTCTTCTGGATTT
GCCTCTCGCTCTGGTTTCAATGTCGATGGGTTGTTTACGAATTTTCATATGCTGCTGCGCTTTCTCACAACGCGTGCATTTCTTCTCAATTTTGCTTTTGCCGGTGTTTT
AGTTGGTGGAATGTTTGCGATCGACACAAGTGGGGAGGCCTTGTGGAAGATGCAAAATTCTGGGAAGCTCGTCGGCGTCGGAGCTGGGGAGAAACGACGGCATCGAAGCT
TTTCGGCGTCAGATCTGAAAGGGGAGGGCCGATGGTCAGATCTGAAAAAGGGAGGGGAAGGGAAGGGAAGGGAAAACGAAAGGGAAGGGAAGGGAAAGGAATGCGAAAGG
GAAGGGAAAGGGGAAGGAGTGGTCGACGACGATTCGAGGGGCGGTAATGCTTGCCAGTGCACTATTCCTCAGCTTCTTCAATCTTCTTCCAAATCTAACCTTCAGCCTTC
TAATTCTAATTCTCAAATGCAAAGGATTTATGAACTAAACCTGCTTTTGCTCGAGCTCGCAGCTAAGCAGACCAGGATAACTCATACCATCATAGCTGTATTTTCACGCG
TTCTCTTCATTATGACATCAGAAGGACCTGAATCAAAGGATAAAACAGCCAGTAATAAACTTTTGGATGATTTGAAGTACAATGTTGAAGTGGCAGAAAGCGTTGCTAAT
GAGGCGCAGCGTTTGCCAATATTGGAGGCAACACCATTATATGAGCAACTGCTGACGGTGTATCCCACTGCTGCTAAATATTGGAAGCAATATGTGGAGGCACACATGGT
TATAAATAATGATGATGCTACAAAACAGATATTTAGCCGGTGCTTATTGAACTGTCTTCACATTCCTCTTTGGTATGATCCTTGTTTCACCTATTAG
Protein sequence
MDVRDHYKVLGLNRSATKEEIKEAFRKLAKEFHPDKHSQSPKAVRDSATMRFKQVSEAYEILGDDCKRADYNIRSRCASGPSVNQHYYSSYNSYANASGPRHGYGSSSGF
ASRSGFNVDGLFTNFHMLLRFLTTRAFLLNFAFAGVLVGGMFAIDTSGEALWKMQNSGKLVGVGAGEKRRHRSFSASDLKGEGRWSDLKKGGEGKGRENEREGKGKECER
EGKGEGVVDDDSRGGNACQCTIPQLLQSSSKSNLQPSNSNSQMQRIYELNLLLLELAAKQTRITHTIIAVFSRVLFIMTSEGPESKDKTASNKLLDDLKYNVEVAESVAN
EAQRLPILEATPLYEQLLTVYPTAAKYWKQYVEAHMVINNDDATKQIFSRCLLNCLHIPLWYDPCFTY