; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017670 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017670
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMitochondrial glycoprotein
Genome locationChr03:18163966..18167934
RNA-Seq ExpressionHG10017670
SyntenyHG10017670
Gene Ontology termsGO:0005759 - mitochondrial matrix (cellular component)
InterPro domainsIPR003428 - Mitochondrial glycoprotein
IPR036561 - Mitochondrial glycoprotein superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603196.1 hypothetical protein SDJN03_03805, partial [Cucurbita argyrosperma subsp. sororia]2.5e-9687.5Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST FQNH+  G SSDFAVEHDSPKS+DVVLRRKLESGEE+AISA SGPL FG +GAF REILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGVSSLLQFDCGVS+DGH GSPFKIYNAYYLQSS CL PSVYRGPLFSSLDP+LQ ALK +LISRGVEESLT+FLLIHLHKKEQGQYLNWLQNVESLI
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AKRQ NEL
Subjt:  AKRQSNEL

XP_022153881.1 mitochondrial acidic protein mam33 [Momordica charantia]7.5e-9384.62Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST+FQ+ D +G S DF VEHDSPKSQDVVLRRKLESGEEVA+SA SGPLRFG +GAFPREILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSS  LG SVYRGP FSSLDP+LQDALK+YLISRGVEESLTNFLL+H+HKKEQGQYLNWLQN+ESL+
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AK Q NEL
Subjt:  AKRQSNEL

XP_022928687.1 uncharacterized protein LOC111435528 [Cucurbita moschata]9.5e-9687.5Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST FQNH+  G SSDFAVEHDS KS+DVVLRRKLESGEE+AISA SGPL FG +GAF REILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSS CL PSVYRGPLFSSLDP+LQ ALK +LISRGVEESLT+FLLIHLHKKEQGQYLNWLQNVESLI
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AKRQ NEL
Subjt:  AKRQSNEL

XP_022967898.1 uncharacterized protein LOC111467274 [Cucurbita maxima]5.6e-9687.5Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST FQNH+  G SSDFAVEHDS KS+DVVLRRKLESGEE+AISA SGPL FG +GAF REILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSS CLGPSVYRGPLFSSLDP+LQ ALK +LISRGVEESLT+FLLIHLHKKEQGQYLNWLQNVESLI
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AKRQ NEL
Subjt:  AKRQSNEL

XP_023544221.1 uncharacterized protein LOC111803860 [Cucurbita pepo subsp. pepo]3.0e-9787.98Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST FQNH+  G SSDFAVEHDSPKS+DVVLRRKLESGEE+AISA SGPL FG +GAF REILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSS CL PSVYRGPLFSSLDP+LQ ALKE+LISRGVEESLT+FL+IHLHKKEQGQYLNWLQNVESLI
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AKRQ NEL
Subjt:  AKRQSNEL

TrEMBL top hitse value%identityAlignment
A0A1S3BK83 uncharacterized protein LOC103490527 isoform X14.5e-8381.96Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M R  Q+F KARK   DL LL+ILQSEIAHELSST  QN+++N  SS F VEHDS KSQDVVLRRK++SGEEV ISA  GPLRFG+ GAFPREILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQ
        SKPGVSSLLQFDCGVSEDGH GSPFK+YNAYYL+SS CLGP VYRGP FSSLDP+LQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWL+
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQ

A0A6J1DM07 mitochondrial acidic protein mam333.7e-9384.62Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST+FQ+ D +G S DF VEHDSPKSQDVVLRRKLESGEEVA+SA SGPLRFG +GAFPREILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGV S+LQFDCGVSED H GSPFKIYNAYYLQSS  LG SVYRGP FSSLDP+LQDALK+YLISRGVEESLTNFLL+H+HKKEQGQYLNWLQN+ESL+
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AK Q NEL
Subjt:  AKRQSNEL

A0A6J1EKM2 uncharacterized protein LOC1114355284.6e-9687.5Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST FQNH+  G SSDFAVEHDS KS+DVVLRRKLESGEE+AISA SGPL FG +GAF REILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGVSSLLQFDCGVSEDGH GSPFKIYNAYYLQSS CL PSVYRGPLFSSLDP+LQ ALK +LISRGVEESLT+FLLIHLHKKEQGQYLNWLQNVESLI
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AKRQ NEL
Subjt:  AKRQSNEL

A0A6J1HWH3 uncharacterized protein LOC1114672742.7e-9687.5Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M RANQIF KARKALHDL+LLKILQSEI HELSST FQNH+  G SSDFAVEHDS KS+DVVLRRKLESGEE+AISA SGPL FG +GAF REILMKICV
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
        SKPGV+SLLQFDCGVSEDGH GSPFKIYNAYYLQSS CLGPSVYRGPLFSSLDP+LQ ALK +LISRGVEESLT+FLLIHLHKKEQGQYLNWLQNVESLI
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQSNEL
        AKRQ NEL
Subjt:  AKRQSNEL

A0A7N2M2K1 Uncharacterized protein1.4e-7166.67Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV
        M R   I  KARKAL D +L+K LQ+EI HELSST FQ+ D +    DF VE DSP+SQDVVLRR  ESGEEVA+SA  GP+ +G +G FPR++LMK+C+
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICV

Query:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI
         KPG+SS+LQFDCGV E G++GS F I+NAYY+QS TC+ PS YRGPLFSSLDPQLQDALKEYL++RG+ ESLTNFLL HLHKKEQGQY+NWL+ +ES +
Subjt:  SKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLI

Query:  AKRQ
        AK +
Subjt:  AKRQ

SwissProt top hitse value%identityAlignment
O94675 Mitochondrial acidic protein mam334.1e-0933.33Show/hide
Query:  FPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGP----------SVYRGPLFSSLDPQLQDALKEYLISRGVEESLT
        FP E L +     I +SKPG  +L+ F+    +DG     F I N Y+ +    L              Y GP F  LDP+LQD    YL  R ++ESL+
Subjt:  FPREILMK-----ICVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGP----------SVYRGPLFSSLDPQLQDALKEYLISRGVEESLT

Query:  NFLLIHLHKKEQGQYLNWLQNVESLI
        +F++     KE  +Y+NWL++V   +
Subjt:  NFLLIHLHKKEQGQYLNWLQNVESLI

P40513 Mitochondrial acidic protein MAM331.3e-0745.45Show/hide
Query:  VYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVE
        VY GP FS+LD +LQ++L+ YL SRGV E L +F+  +   KE  +Y++WL+ ++
Subjt:  VYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVE

Arabidopsis top hitse value%identityAlignment
AT2G41600.1 Mitochondrial glycoprotein family protein7.7e-2742.38Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI
        M + N +  +  KA+ + +LLKILQSEI HE+S  +FQ  +  G   DF ++ DSP+SQD+VL+R+ +SGE+V +SA     P+       FPRE   K+
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI

Query:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLF
        C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLF

AT2G41600.2 Mitochondrial glycoprotein family protein9.1e-2841.36Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI
        M + N +  +  KA+ + +LLKILQSEI HE+S  +FQ  +  G   DF ++ DSP+SQD+VL+R+ +SGE+V +SA     P+       FPRE   K+
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI

Query:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDAL
        C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F S   Q   A+
Subjt:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDAL

AT2G41600.3 Mitochondrial glycoprotein family protein2.3e-4747.29Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI
        M + N +  +  KA+ + +LLKILQSEI HE+S  +FQ  +  G   DF ++ DSP+SQD+VL+R+ +SGE+V +SA     P+       FPRE   K+
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI

Query:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVES
        C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+KKEQ QY+NWL+ +ES
Subjt:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVES

Query:  LIA
         ++
Subjt:  LIA

AT2G41600.4 Mitochondrial glycoprotein family protein7.7e-2742.38Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI
        M + N +  +  KA+ + +LLKILQSEI HE+S  +FQ  +  G   DF ++ DSP+SQD+VL+R+ +SGE+V +SA     P+       FPRE   K+
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI

Query:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLF
        C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    F
Subjt:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLF

AT2G41600.5 Mitochondrial glycoprotein family protein2.3e-4747.29Show/hide
Query:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI
        M + N +  +  KA+ + +LLKILQSEI HE+S  +FQ  +  G   DF ++ DSP+SQD+VL+R+ +SGE+V +SA     P+       FPRE   K+
Subjt:  MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISA--TSGPLRFGHQGAFPREILMKI

Query:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVES
        C+ KPG+SS+LQF C V E G   S F I +AY+++S      S Y    FS +DP+L  AL++YLIS+GV E LTNFLL HL+KKEQ QY+NWL+ +ES
Subjt:  CVSKPGVSSLLQFDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVES

Query:  LIA
         ++
Subjt:  LIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGGGCTAATCAAATATTTCTCAAGGCCCGTAAAGCCCTCCATGATCTCGAACTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCAATT
TCAGAACCATGATCACAACGGCATTTCCAGCGATTTCGCTGTGGAACATGACTCGCCCAAGTCCCAAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGTGAGGAGGTCG
CAATTTCTGCTACATCGGGCCCTCTCAGATTTGGACACCAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAG
TTTGATTGTGGGGTTTCAGAGGACGGTCATGATGGGTCGCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCAACTTGTTTGGGACCTTCTGTTTATAGAGGCCC
TTTGTTCAGCTCATTAGATCCTCAATTACAAGACGCGCTTAAAGAATACCTAATCAGTAGAGGCGTTGAAGAAAGCCTGACCAACTTCCTTCTCATTCACCTGCATAAAA
AAGAGCAAGGTCAGTATTTGAATTGGTTGCAAAATGTCGAATCTTTGATAGCTAAAAGACAATCGAACGAACTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGGGCTAATCAAATATTTCTCAAGGCCCGTAAAGCCCTCCATGATCTCGAACTCCTCAAGATCTTGCAATCCGAGATAGCCCACGAGCTTTCTTCAACCCAATT
TCAGAACCATGATCACAACGGCATTTCCAGCGATTTCGCTGTGGAACATGACTCGCCCAAGTCCCAAGACGTGGTGTTGCGGCGAAAATTGGAATCGGGTGAGGAGGTCG
CAATTTCTGCTACATCGGGCCCTCTCAGATTTGGACACCAAGGGGCTTTTCCGAGGGAGATTTTGATGAAGATTTGTGTGAGTAAGCCTGGAGTTAGCTCTCTTTTGCAG
TTTGATTGTGGGGTTTCAGAGGACGGTCATGATGGGTCGCCTTTCAAAATCTACAATGCCTATTATCTTCAATCTTCAACTTGTTTGGGACCTTCTGTTTATAGAGGCCC
TTTGTTCAGCTCATTAGATCCTCAATTACAAGACGCGCTTAAAGAATACCTAATCAGTAGAGGCGTTGAAGAAAGCCTGACCAACTTCCTTCTCATTCACCTGCATAAAA
AAGAGCAAGGTCAGTATTTGAATTGGTTGCAAAATGTCGAATCTTTGATAGCTAAAAGACAATCGAACGAACTTTAG
Protein sequenceShow/hide protein sequence
MARANQIFLKARKALHDLELLKILQSEIAHELSSTQFQNHDHNGISSDFAVEHDSPKSQDVVLRRKLESGEEVAISATSGPLRFGHQGAFPREILMKICVSKPGVSSLLQ
FDCGVSEDGHDGSPFKIYNAYYLQSSTCLGPSVYRGPLFSSLDPQLQDALKEYLISRGVEESLTNFLLIHLHKKEQGQYLNWLQNVESLIAKRQSNEL