; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC04G069710 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC04G069710
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionMitochondrial glycoprotein
Genome locationCiama_Chr04:12369998..12374158
RNA-Seq ExpressionCaUC04G069710
SyntenyCaUC04G069710
Gene Ontology termsGO:0005759 - mitochondrial matrix (cellular component)
InterPro domainsIPR003428 - Mitochondrial glycoprotein
IPR036561 - Mitochondrial glycoprotein superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603196.1 hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sororia]1.3e-9285.65Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVS+DGH GSPFKIYNAYYLQSS CL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

XP_022153881.1 mitochondrial acidic protein mam33 [Momordica charantia]6.0e-9082.3Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+DVVLRRKLESGEEVA+SAL G LRFG EGAFPREILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+H+HK EQGQYLNWLQ +ES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        +AK QPN+L
Subjt:  IAKRQPNQL

XP_022928687.1 uncharacterized protein LOC111435528 [Cucurbita moschata]2.9e-9286.12Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

XP_022967898.1 uncharacterized protein LOC111467274 [Cucurbita maxima]1.7e-9286.12Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

XP_023544221.1 uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo]9.0e-9486.6Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DSP SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALKE+LISRGVEESLT+FL+IHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

TrEMBL top hitse value%identityAlignment
A0A1S3BK83 uncharacterized protein LOC103490527 isoform X12.1e-8080.61Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R  Q+FRKARK   DL LL+ILQSEIAHELSST   N+E NN SSS F VE+DS  S+DVVLRRK++SGEEV ISAL G LRFG++GAFPREILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQK
        VSKPGVSSLLQFDCGVSEDGH GSPFK+YNAYYL+SS CLGP VYRGP FS+LDP+LQDALKEYLISRGVEESLTNFLLIHLHK EQGQYLNWL+K
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQK

A0A6J1DM07 mitochondrial acidic protein mam332.9e-9082.3Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSSTRF + E  +G S DF VE+DSP S+DVVLRRKLESGEEVA+SAL G LRFG EGAFPREILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSSA LG SVYRGP FS+LDP+LQDALK+YLISRGVEESLTNFLL+H+HK EQGQYLNWLQ +ES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        +AK QPN+L
Subjt:  IAKRQPNQL

A0A6J1EKM2 uncharacterized protein LOC1114355281.4e-9286.12Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSSACL PSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

A0A6J1HWH3 uncharacterized protein LOC1114672748.2e-9386.12Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M RANQIFRKARKALHDLDLLKILQSEI HELSST F NHE   GSSSDFAVE+DS  SRDVVLRRKLESGEE+AISAL G L FG EGAF REILMKIC
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        VSKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSSACLGPSVYRGPLFS+LDP+LQ ALK +LISRGVEESLT+FLLIHLHK EQGQYLNWLQ VES 
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQPNQL
        IAKRQ N+L
Subjt:  IAKRQPNQL

A0A7N2M2K1 Uncharacterized protein2.0e-7065.37Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC
        M R   I RKARKAL D DL+K LQ+EI HELSST F   +  + S  DF VE+DSP S+DVVLRR  ESGEEVA+SA+ G + +G EG FPR++LMK+C
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKIC

Query:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS
        + KPG+SS+LQFDCGV E G+ GS F I+NAYY+QS  C+ PS YRGPLFS+LDPQLQDALKEYL++RG+ ESLTNFLL HLHK EQGQY+NWL+ +ESS
Subjt:  VSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESS

Query:  IAKRQ
        +AK +
Subjt:  IAKRQ

SwissProt top hitse value%identityAlignment
O94675 Mitochondrial acidic protein mam333.5e-0833.6Show/hide
Query:  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP----------SVYRGPLFSTLDPQLQDALKEYLISRGVEE
        E  FP E L +     I +SKPG  +L+ F+    +DG     F I N Y+ +    L              Y GP F  LDP+LQD    YL  R ++E
Subjt:  EGAFPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGP----------SVYRGPLFSTLDPQLQDALKEYLISRGVEE

Query:  SLTNFLLIHLHKNEQGQYLNWLQKV
        SL++F++      E  +Y+NWL+ V
Subjt:  SLTNFLLIHLHKNEQGQYLNWLQKV

P40513 Mitochondrial acidic protein MAM333.5e-0833.64Show/hide
Query:  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQG
        ++ K   S+P VS  L  +    ++G     S +P+   +A   QS+        VY GP FS LD +LQ++L+ YL SRGV E L +F+  +    E  
Subjt:  ILMKICVSKPGVSSLLQFDCGVSEDG----HSGSPFKIYNAYYLQSSAC--LGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQG

Query:  QYLNWLQKVE
        +Y++WL+K++
Subjt:  QYLNWLQKVE

Arabidopsis top hitse value%identityAlignment
AT2G41600.1 Mitochondrial glycoprotein family protein6.3e-2944.08Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.2 Mitochondrial glycoprotein family protein1.3e-2942.33Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDAL
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F +   Q   A+
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDAL

AT2G41600.3 Mitochondrial glycoprotein family protein2.5e-4948.04Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+K EQ QY+NWL+++E
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE

Query:  SSIA
        S+++
Subjt:  SSIA

AT2G41600.4 Mitochondrial glycoprotein family protein6.3e-2944.08Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLF

AT2G41600.5 Mitochondrial glycoprotein family protein2.5e-4948.04Show/hide
Query:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK
        M + N + ++  KA+ + DLLKILQSEI HE+S  RF   E   GS  DF +++DSP S+D+VL+R+ +SGE+V +SAL  P  +    +  FPRE   K
Subjt:  MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISAL--PGRLRFGHEGAFPREILMK

Query:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE
        +C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+K EQ QY+NWL+++E
Subjt:  ICVSKPGVSSLLQFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVE

Query:  SSIA
        S+++
Subjt:  SSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATT
TCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGG
TCGCGATTTCTGCTCTACCGGGTCGTCTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTG
CAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGG
CCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATA
AAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAACCAACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAGGGCTAATCAAATATTTCGCAAGGCCCGTAAAGCGCTCCATGATCTTGACCTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCGATT
TCTGAACCATGAACATAATAATGGCAGTTCCAGCGATTTCGCTGTGGAATATGACTCGCCCATGTCCCGAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGCGAGGAGG
TCGCGATTTCTGCTCTACCGGGTCGTCTCAGATTTGGACACGAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTG
CAGTTTGATTGTGGGGTTTCAGAGGATGGTCATAGTGGGTCTCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCTGCTTGTTTGGGACCTTCTGTTTATAGAGG
CCCTTTGTTCAGCACGTTAGATCCTCAGTTACAAGACGCGCTCAAGGAATACCTAATCAGTAGAGGTGTTGAAGAAAGCCTGACCAATTTCCTTCTCATTCACCTGCATA
AAAATGAGCAAGGTCAGTATTTGAATTGGTTGCAAAAGGTCGAATCTTCGATAGCAAAAAGACAACCAAACCAACTTTAA
Protein sequenceShow/hide protein sequence
MGRANQIFRKARKALHDLDLLKILQSEIAHELSSTRFLNHEHNNGSSSDFAVEYDSPMSRDVVLRRKLESGEEVAISALPGRLRFGHEGAFPREILMKICVSKPGVSSLL
QFDCGVSEDGHSGSPFKIYNAYYLQSSACLGPSVYRGPLFSTLDPQLQDALKEYLISRGVEESLTNFLLIHLHKNEQGQYLNWLQKVESSIAKRQPNQL