; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G070650 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G070650
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionMitochondrial glycoprotein
Genome locationCla97Chr04:14121196..14125390
RNA-Seq ExpressionCla97C04G070650
SyntenyCla97C04G070650
Gene Ontology termsGO:0005759 - mitochondrial matrix (cellular component)
InterPro domainsIPR003428 - Mitochondrial glycoprotein
IPR036561 - Mitochondrial glycoprotein superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603196.1 hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sororia]2.4e-9184.69Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SRDVVLRRKLESGEE+AISAL G   FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVS+DGH GSPFKIYNAYYLQSS CL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        IAKRQ  +L
Subjt:  IAKRQPKQL

XP_022153881.1 mitochondrial acidic protein mam33 [Momordica charantia]8.7e-8981.34Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+DVVLRRKLESGEEVA+SAL G  RFG EGAFPREILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+H+HK EQGQYLNWLQ +ES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        +AK QP +L
Subjt:  IAKRQPKQL

XP_022928687.1 uncharacterized protein LOC111435528 [Cucurbita moschata]5.4e-9185.17Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G   FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        IAKRQ  +L
Subjt:  IAKRQPKQL

XP_022967898.1 uncharacterized protein LOC111467274 [Cucurbita maxima]3.2e-9185.17Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G   FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        IAKRQ  +L
Subjt:  IAKRQPKQL

XP_023544221.1 uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo]1.7e-9285.65Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SRDVVLRRKLESGEE+AISAL G   FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALKE+LISRGVEESLT+FL+IHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        IAKRQ  +L
Subjt:  IAKRQPKQL

TrEMBL top hitse value%identityAlignment
A0A1S3BK83 uncharacterized protein LOC103490527 isoform X14.7e-8080.1Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M R  Q+FRKARK   DL LL+ILQSEIAHELSST   N+E NN SSS F VE+DS  S+DVVLRRK++SGEEV ISAL G  RFG++GAFPREILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQK
        VSKPGVSSLLQFDCGVSEDGH GSPFK+YNAYYL+SS CLGP VYRGP FS+LDP+LQDALKEYLISRGVEESLTNFLLIHLHK EQGQYLNWL+K
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQK

A0A5N6RKE9 Uncharacterized protein3.3e-7064.85Show/hide
Query:  ANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSK
        A+ + R+ RK L D DLLK+LQSEI HELSS RFLN++  +GS  DF VE+DS  S+DVVLRRK E GEEV +SA+ G F +G E  FPR +LMK+C+ K
Subjt:  ANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSK

Query:  PGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAK
        PG+ S+LQFDCGVS+ G++GS F I+NA Y+QSSA LGPS YRGP+FS+LDPQLQDALKEYL+S+G+ E+LTNFLL+HLHK EQGQY+NWL K+ES +AK
Subjt:  PGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAK

Query:  RQ
         +
Subjt:  RQ

A0A6J1DM07 mitochondrial acidic protein mam334.2e-8981.34Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+DVVLRRKLESGEEVA+SAL G  RFG EGAFPREILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+H+HK EQGQYLNWLQ +ES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        +AK QP +L
Subjt:  IAKRQPKQL

A0A6J1EKM2 uncharacterized protein LOC1114355282.6e-9185.17Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G   FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        IAKRQ  +L
Subjt:  IAKRQPKQL

A0A6J1HWH3 uncharacterized protein LOC1114672741.5e-9185.17Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G   FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPKQL
        IAKRQ  +L
Subjt:  IAKRQPKQL

SwissProt top hitse value%identityAlignment
O94675 Mitochondrial acidic protein mam333.5e-0833.6Show/hide
Query:  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP----------SVYRGPLFSTLDPQLQDALKEYLISRGVEE
        E  FP E L +     I +SKPG  +L+ F+    +DG     F I N Y+ +    L              Y GP F  LDP+LQD    YL  R ++E
Subjt:  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP----------SVYRGPLFSTLDPQLQDALKEYLISRGVEE

Query:  SLTNFLLIHLHKNEQGQYLNWLQKV
        SL++F++      E  +Y+NWL+ V
Subjt:  SLTNFLLIHLHKNEQGQYLNWLQKV

P40513 Mitochondrial acidic protein MAM332.7e-0833.64Show/hide
Query:  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQG
        ++ K   S+P VS  L  +    ++G     S +P+   +A   QS+        VY GP FS LD +LQ++L+ YL SRGV E L +F+  +    E  
Subjt:  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQG

Query:  QYLNWLQKVE
        +Y++WL+K++
Subjt:  QYLNWLQKVE

Arabidopsis top hitse value%identityAlignment
AT2G41600.1 Mitochondrial glycoprotein family protein8.2e-2944.08Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P       +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.2 Mitochondrial glycoprotein family protein1.7e-2942.33Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P       +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDAL
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F +   Q   A+
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDAL

AT2G41600.3 Mitochondrial glycoprotein family protein3.2e-4948.04Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P       +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+K EQ QY+NWL+++E
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE

Query:  SSIA
        S+++
Subjt:  SSIA

AT2G41600.4 Mitochondrial glycoprotein family protein8.2e-2944.08Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P       +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.5 Mitochondrial glycoprotein family protein2.5e-4947.87Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P       +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRFRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+K EQ QY+NWL+++E
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE

Query:  SSIAKRQPKQL
        S+++   PK L
Subjt:  SSIAKRQPKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATT
TCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGG
TCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTG
CAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGG
CCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATA
AAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATT
TCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGG
TCGCGATTTCTGCTCTACCGGGTCGTTTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTG
CAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGG
CCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATA
AAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAAACAACTTTAA
Protein sequenceShow/hide protein sequence
MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRFRFGHEGAFPREILMKICVSKPGVSSLL
QFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAKRQPKQL