; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G062010 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G062010
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionMitochondrial glycoprotein
Genome locationCicolChr04:14264902..14269317
RNA-Seq ExpressionCcUC04G062010
SyntenyCcUC04G062010
Gene Ontology termsGO:0005759 - mitochondrial matrix (cellular component)
InterPro domainsIPR003428 - Mitochondrial glycoprotein
IPR036561 - Mitochondrial glycoprotein superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603196.1 hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sororia]1.2e-9084.21Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVS+DGH GSPFKIYNAYYLQSS CL PSVYRGPLFS+LDP LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

XP_022153881.1 mitochondrial acidic protein mam33 [Momordica charantia]8.7e-8981.34Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+DVVLRRKLESGEEVA+SAL G LRFG EGAFPREILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDPRLQDALK+YLISRGVEESLTNFLL+H+HK EQGQYLNWLQ +ES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        +AK QPN+L
Subjt:  IAKRQPNQL

XP_022928687.1 uncharacterized protein LOC111435528 [Cucurbita moschata]2.7e-9084.69Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

XP_022967898.1 uncharacterized protein LOC111467274 [Cucurbita maxima]1.6e-9084.69Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

XP_023544221.1 uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo]8.4e-9285.17Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP LQ ALKE+LISRGVEESLT+FL+IHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

TrEMBL top hitse value%identityAlignment
A0A1S3BK83 uncharacterized protein LOC103490527 isoform X11.6e-8080.61Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R  Q+FR+ARK   DL LL+ILQSEIAHELSST   N+E NN SSS F VE+DS  S+DVVLRRK++SGEEV ISAL G LRFG++GAFPREILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQK
        VSKPGVSSLLQFDCGVSEDGH GSPFK+YNAYYL+SS CLGP VYRGP FS+LDPRLQDALKEYLISRGVEESLTNFLLIHLHK EQGQYLNWL+K
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQK

A0A6J1DM07 mitochondrial acidic protein mam334.2e-8981.34Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+DVVLRRKLESGEEVA+SAL G LRFG EGAFPREILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDPRLQDALK+YLISRGVEESLTNFLL+H+HK EQGQYLNWLQ +ES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        +AK QPN+L
Subjt:  IAKRQPNQL

A0A6J1EKM2 uncharacterized protein LOC1114355281.3e-9084.69Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

A0A6J1HWH3 uncharacterized protein LOC1114672747.7e-9184.69Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R NQIFR+ARKALHDL LLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

A0A7N2M2K1 Uncharacterized protein1.8e-6863.9Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R   I R+ARKAL D  L+K LQ+EI HELSST F   +  + S  DF VE+DSP S+DVVLRR  ESGEEVA+SA+ G + +G EG FPR++LMK+C
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        + KPG+SS+LQFDCGV E G+ GS F I+NAYY+QS  C+ PS YRGPLFS+LDP+LQDALKEYL++RG+ ESLTNFLL HLHK EQGQY+NWL+ +ESS
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQ
        +AK +
Subjt:  IAKRQ

SwissProt top hitse value%identityAlignment
O94675 Mitochondrial acidic protein mam336.0e-0833.6Show/hide
Query:  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP----------SVYRGPLFSTLDPRLQDALKEYLISRGVEE
        E  FP E L +     I +SKPG  +L+ F+    +DG     F I N Y+ +    L              Y GP F  LDP LQD    YL  R ++E
Subjt:  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP----------SVYRGPLFSTLDPRLQDALKEYLISRGVEE

Query:  SLTNFLLIHLHKNEQGQYLNWLQKV
        SL++F++      E  +Y+NWL+ V
Subjt:  SLTNFLLIHLHKNEQGQYLNWLQKV

P40513 Mitochondrial acidic protein MAM334.6e-0833.64Show/hide
Query:  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQG
        ++ K   S+P VS  L  +    ++G     S +P+   +A   QS+        VY GP FS LD  LQ++L+ YL SRGV E L +F+  +    E  
Subjt:  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQG

Query:  QYLNWLQKVE
        +Y++WL+K++
Subjt:  QYLNWLQKVE

Arabidopsis top hitse value%identityAlignment
AT2G41600.1 Mitochondrial glycoprotein family protein1.2e-2743.42Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + +   KA+ +  LLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.2 Mitochondrial glycoprotein family protein1.2e-2743.42Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + +   KA+ +  LLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.3 Mitochondrial glycoprotein family protein3.6e-4847.55Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + +   KA+ +  LLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+K EQ QY+NWL+++E
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE

Query:  SSIA
        S+++
Subjt:  SSIA

AT2G41600.4 Mitochondrial glycoprotein family protein1.2e-2743.42Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + +   KA+ +  LLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.5 Mitochondrial glycoprotein family protein3.6e-4847.55Show/hide
Query:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + +   KA+ +  LLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+K EQ QY+NWL+++E
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE

Query:  SSIA
        S+++
Subjt:  SSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGGGGTAATCAAATATTTCGCGAGGCCCGTAAAGCGCTCCATGATCTTCACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATT
TCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTAGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGG
TCGCGATTTCTGCTCTACCGGGTCGTCTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTG
CAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAGATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGG
CCCTTTGTTCAGCACGTTAGATCCTCGGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATA
AAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAACCAACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATTATCTCCGGACGTTAGAGACTGAGAGGTCGCACTGGGAGAAGAAGGGACAGTTGAAGAATGGGGAGGGGTAATCAAATATTTCGCGAGGCCCGTAAAGCGCTCCATGA
TCTTCACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATTTCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAAT
ATGACTCGCCCATGTCCCGAGACGTAGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGGTCGCGATTTCTGCTCTACCGGGTCGTCTCAGATTTGGACACGAAGGGGCT
TTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAA
GATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGGCCCTTTGTTCAGCACGTTAGATCCTCGGTTACAAGACGCGCTCAAGGAAT
ACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATAAAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCG
ATAGCAAAAAGACAACCAAACCAACTTTAACGACTTCGTATCAAAACTGTATGCCTGAATTTCATACGGGAAGAGAAGGCCCAACTTTAAGTGCAGGTTTTGATCATTTA
GGTTGATTGCTAAATAAGAGCTTTCTTTCATTCTGCTTGCATTTTTTTGTTTCTGCAAGTGTATCCTGTTTCTTGCACATGGAACATTCAGCCAGATCTGTTTGTGAACA
AAACAAAACTGTGGTACTCATTTAGTCTGAGCCAAGGGGAGAAATTCAATGGAAATGTTGTAACAACTTTTTGTGAATTTCTTGAATCATATTCTTGGTTTTATTATCGT
TTTTATTGACTGTTTTTATATTCAATCGAT
Protein sequenceShow/hide protein sequence
MGRGNQIFREARKALHDLHLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKICVSKPGVSSLL
QFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPRLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAKRQPNQL