; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020651 (gene) of Snake gourd v1 genome

Gene IDTan0020651
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionChaperone protein dnaJ 20
Genome locationLG10:3978875..3980342
RNA-Seq ExpressionTan0020651
SyntenyTan0020651
Gene Ontology termsGO:0061077 - chaperone-mediated protein folding (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domainsIPR001623 - DnaJ domain
IPR018253 - DnaJ domain, conserved site
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597272.1 Chaperone protein dnaJ 20, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.7e-9587.62Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+S  D R  IPS+PSITGRR ISG RSR+FFPNTPPCNSTR PSLSIRAKASFN G ASSE ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRRY  DE VP+KS WK+CW+ QISELKRRSMDKDS+HNLSWGARMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

KAG7028743.1 Chaperone protein dnaJ 20, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]6.0e-9587.13Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+S  D R  IPS+PSITGRR ISG RSR+FFPNTPPCNSTR PSLSIRAKASFN G ASSE ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRRY  DE VP+KS WK+CW+ QISELK+RSMDKDS+HNLSWGARMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

XP_022949677.1 chaperone protein dnaJ 20, chloroplastic-like [Cucurbita moschata]6.0e-9587.13Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+S  D R  IPS+PSITGRR ISG RSR+FFPNTPPCNSTR PSLSIRAKASFN G ASSE ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRRY  DE VP+KS WK+CW+ QISELK+RSMDKDS+HNLSWGARMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

XP_022974737.1 chaperone protein dnaJ 20, chloroplastic-like [Cucurbita maxima]2.7e-9587.13Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+SG D R  IPS+PSITGRR ISG +SR+FFPNTPPCNSTR PSLSIRAKASFN G AS E ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRRY  DE VP+KS WK+CW+ QISELKRRSMDKDS+HNLSWGARMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

XP_023539417.1 chaperone protein dnaJ 20, chloroplastic-like [Cucurbita pepo subsp. pepo]1.0e-9486.63Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+SG D R  IPS+PSITGRR ISG RSR+FFPNTPPCNS+R PSLSIRAKASFN G ASSE ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRR+  DE VP+KS WK+CW+ QISELKRRSMDKDS+HNLSWG RMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

TrEMBL top hitse value%identityAlignment
A0A1S3AWS2 chaperone protein dnaJ 20, chloroplastic-like7.9e-9385.15Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFTFSGSDSRF+IPSNP+I+ RRPISG+RSR+FFP+TPP NSTR PSLSIRAKASFNEG  SSEVA+GSFYDLLGIS+SGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPG AEEYTK FIRVQEAYETLSDPRRRALYDRD +GGLQVAFSARRRY   E V EKSGW+N W+ QISELKRRSM+KD R N+SWGARMRRQM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

A0A6J1E8M4 chaperone protein dnaJ 20, chloroplastic-like isoform X13.7e-9084.16Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQC DFTFSGSDSRF+IPS+PSITGRRPI+GHRS IFFPNT PCN  R PSLSIRAKASFNEG  SSEVADGSFYDLLG+SES SLAEIKRAYK LARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSP G AEEYTKRFIR QEAYETLSDPRRRALYDRD VGGLQVAFSA R Y  D+  PEKSGW++CWQ QISELKRRSM+KDS  N+SWGARMRRQM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

A0A6J1GCT3 chaperone protein dnaJ 20, chloroplastic-like2.9e-9587.13Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+S  D R  IPS+PSITGRR ISG RSR+FFPNTPPCNSTR PSLSIRAKASFN G ASSE ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRRY  DE VP+KS WK+CW+ QISELK+RSMDKDS+HNLSWGARMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

A0A6J1IB32 chaperone protein dnaJ 20, chloroplastic-like1.3e-9587.13Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFT+SG D R  IPS+PSITGRR ISG +SR+FFPNTPPCNSTR PSLSIRAKASFN G AS E ADGSFYDLLGISESGSLAEIKRAYKQLARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRD VGGLQVAFSARRRY  DE VP+KS WK+CW+ QISELKRRSMDKDS+HNLSWGARMR+QM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

A0A6J1IRB6 chaperone protein dnaJ 20, chloroplastic-like isoform X15.7e-9184.16Show/hide
Query:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY
        MQCSDFTFSGSDSRF+IPS+PSITGRRPI+GHRS IFFPNT PCN TR PSLSIRAKASFNEG  SSEV+DGSFYDLLG+SES SLAEIKRAYK LARKY
Subjt:  MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKY

Query:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM
        HPDVSP G+AEEYTKRFIRV EAYETLSDPRRRALYDRD VGGLQVAFSA R Y  D+  PEKSGW++CWQ QISEL RRSM+KDS  N+SWGARMRRQM
Subjt:  HPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQM

Query:  NE
        NE
Subjt:  NE

SwissProt top hitse value%identityAlignment
A5VJE8 Chaperone protein DnaJ1.0e-1249.37Show/hide
Query:  VADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA
        +A+  +YD+LG+S+  S  +IKRAY++LA KYHPDV+    AEE   +F ++ EAYETLSD ++RA YD+    G Q A
Subjt:  VADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA

B2G6W4 Chaperone protein DnaJ1.0e-1249.37Show/hide
Query:  VADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA
        +A+  +YD+LG+S+  S  +IKRAY++LA KYHPDV+    AEE   +F ++ EAYETLSD ++RA YD+    G Q A
Subjt:  VADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA

B2GBQ6 Chaperone protein DnaJ6.0e-1346.59Show/hide
Query:  VADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDR-DTVGGLQVAFSARRRYG
        +A+   YD+LG+ +  S AEIKRAY++LA KYHPDV+    AE   K+F ++ EAYETLSD ++RA YD+  T G     F  +  +G
Subjt:  VADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDR-DTVGGLQVAFSARRRYG

P95830 Chaperone protein DnaJ7.8e-1345.12Show/hide
Query:  FYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYG
        FYD LG+S++ S  EIK+AY++L++KYHPD++    AE+   ++  VQEAYETLSD ++RA YD+    G    F     +G
Subjt:  FYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYG

Q9SDN0 Chaperone protein dnaJ 20, chloroplastic6.6e-4462.42Show/hide
Query:  TRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQV
        TRF S  I+++ + ++    SE  D SFYDLLG++ES +L EIK+AYKQLARKYHPDVSPP R EEYT RFIRVQEAYETLSDPRRR LYDRD   G   
Subjt:  TRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQV

Query:  AFSARRRYGTD-EVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQMNE
        +FS RR+   D EVV EKS WK  WQ Q+S L+RRS  KD+ + +SW ARMRRQ  E
Subjt:  AFSARRRYGTD-EVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQMNE

Arabidopsis top hitse value%identityAlignment
AT1G80030.1 Molecular chaperone Hsp40/DnaJ family protein8.9e-1246.84Show/hide
Query:  GSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYD-------RDTVGG
        G +Y  LG+S+S +  EIK AY++LAR+YHPDV+    A   T++F  +  AYE LSD ++RALYD       + TVGG
Subjt:  GSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYD-------RDTVGG

AT2G22360.1 DNAJ heat shock family protein2.8e-1347.44Show/hide
Query:  ADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA
        AD  +Y +LG+S++ + AEIK AY++LAR YHPDV+    AEE   +F  +  AYE LSD  +++LYDR    GL+ A
Subjt:  ADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA

AT4G13830.1 DNAJ-like 204.1e-3363.16Show/hide
Query:  TRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQV
        TRF S  I+++ + ++    SE  D SFYDLLG++ES +L EIK+AYKQLARKYHPDVSPP R EEYT RFIRVQEAYETLSDPRRR LYDRD   G   
Subjt:  TRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQV

Query:  AFSARRRYGTDEVV
        +FS RR+   D+V+
Subjt:  AFSARRRYGTDEVV

AT4G13830.2 DNAJ-like 204.7e-4562.42Show/hide
Query:  TRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQV
        TRF S  I+++ + ++    SE  D SFYDLLG++ES +L EIK+AYKQLARKYHPDVSPP R EEYT RFIRVQEAYETLSDPRRR LYDRD   G   
Subjt:  TRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQV

Query:  AFSARRRYGTD-EVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQMNE
        +FS RR+   D EVV EKS WK  WQ Q+S L+RRS  KD+ + +SW ARMRRQ  E
Subjt:  AFSARRRYGTD-EVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQMNE

AT4G39960.1 Molecular chaperone Hsp40/DnaJ family protein3.6e-1347.44Show/hide
Query:  ADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA
        AD  FY +LG+S++ + AEIK AY++LAR YHPDV+    AE+   +F  +  AYE LSD  +R+LYDR    G++ A
Subjt:  ADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRAEEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTGCTCCGATTTTACCTTTTCAGGAAGCGATTCACGCTTCTTCATCCCTTCGAACCCTAGCATTACCGGCCGGCGACCAATTTCCGGACACCGATCTCGGATTTT
CTTCCCCAACACTCCGCCGTGCAATTCGACTCGCTTTCCGTCTCTTTCGATCAGAGCCAAGGCCTCGTTCAACGAAGGAACCGCCTCTTCCGAGGTTGCCGATGGCTCCT
TCTACGACTTGCTAGGCATTTCCGAGTCCGGATCCTTGGCGGAAATCAAGCGCGCCTACAAGCAGCTCGCTCGGAAGTACCATCCGGACGTGTCGCCGCCGGGCCGGGCG
GAGGAGTATACGAAGAGGTTCATTCGTGTTCAGGAGGCGTACGAAACATTGTCGGATCCACGAAGGAGAGCGCTGTATGATAGAGATACGGTTGGAGGCCTTCAAGTCGC
GTTCTCTGCTCGGAGACGATACGGCACCGATGAGGTAGTTCCAGAGAAAAGTGGATGGAAAAATTGCTGGCAAGTTCAGATCTCAGAATTGAAGAGAAGAAGCATGGACA
AGGATTCGAGACATAATTTGTCATGGGGAGCTCGAATGCGCCGGCAAATGAATGAACATTGA
mRNA sequenceShow/hide mRNA sequence
GAGGCACGGAGAAAAAACAGAGAAATGCAGTGCTCCGATTTTACCTTTTCAGGAAGCGATTCACGCTTCTTCATCCCTTCGAACCCTAGCATTACCGGCCGGCGACCAAT
TTCCGGACACCGATCTCGGATTTTCTTCCCCAACACTCCGCCGTGCAATTCGACTCGCTTTCCGTCTCTTTCGATCAGAGCCAAGGCCTCGTTCAACGAAGGAACCGCCT
CTTCCGAGGTTGCCGATGGCTCCTTCTACGACTTGCTAGGCATTTCCGAGTCCGGATCCTTGGCGGAAATCAAGCGCGCCTACAAGCAGCTCGCTCGGAAGTACCATCCG
GACGTGTCGCCGCCGGGCCGGGCGGAGGAGTATACGAAGAGGTTCATTCGTGTTCAGGAGGCGTACGAAACATTGTCGGATCCACGAAGGAGAGCGCTGTATGATAGAGA
TACGGTTGGAGGCCTTCAAGTCGCGTTCTCTGCTCGGAGACGATACGGCACCGATGAGGTAGTTCCAGAGAAAAGTGGATGGAAAAATTGCTGGCAAGTTCAGATCTCAG
AATTGAAGAGAAGAAGCATGGACAAGGATTCGAGACATAATTTGTCATGGGGAGCTCGAATGCGCCGGCAAATGAATGAACATTGAACAGGTTTGATGAAAATTATAGAT
GAAAAAACCAAACCATGTTTGACTCAGTATATTCATACTTGAGCAGCTTGTAAATAGAACACGTAGCGTCCTTGAGAATAATAATGATTCTGTTTTTAGTCT
Protein sequenceShow/hide protein sequence
MQCSDFTFSGSDSRFFIPSNPSITGRRPISGHRSRIFFPNTPPCNSTRFPSLSIRAKASFNEGTASSEVADGSFYDLLGISESGSLAEIKRAYKQLARKYHPDVSPPGRA
EEYTKRFIRVQEAYETLSDPRRRALYDRDTVGGLQVAFSARRRYGTDEVVPEKSGWKNCWQVQISELKRRSMDKDSRHNLSWGARMRRQMNEH