; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g14670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g14670
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCytochrome c oxidase subunit Vc family protein
Genome locationchr8:11264787..11272660
RNA-Seq ExpressionMoc08g14670
SyntenyMoc08g14670
Gene Ontology termsGO:0005746 - mitochondrial respirasome (cellular component)
InterPro domainsIPR008432 - Cytochrome c oxidase subunit 5c


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141102.1 uncharacterized protein LOC101216879 [Cucumis sativus]1.2e-6763.4Show/hide
Query:  MASLAKLLRPLPRAFVFA-SSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSIS
        MA  +K LRPLPRAFVFA SSSSSSSSSF N ARF C +FEP K FP NFGS LCNR+ N R   S    IYL     LVG+QLS+ TILG SVV GSI+
Subjt:  MASLAKLLRPLPRAFVFA-SSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSIS

Query:  LWPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKK
         WPN S AM+D  +D  Q+ +D     K    +WELALRLWLPFL CWTVL NLNHP+ V  KV+LFL+STKPSPLSVYIFV++L  SS  EP LSN KK
Subjt:  LWPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKK

Query:  CVVASKVEVQDYKVACVARVEIEHQKITLLGILGG
         +VA KVEV+DYKV CVA+VE++HQ  T++G+LGG
Subjt:  CVVASKVEVQDYKVACVARVEIEHQKITLLGILGG

XP_008443551.1 PREDICTED: uncharacterized protein LOC103487112 isoform X1 [Cucumis melo]3.9e-7165.38Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSISL
        MA  +K LRPLPRAFVFASSSSSSSS  FN ARF C +FEP   FP NFGS LCNR+ N R   S    IYL     L+G+Q S+ TILG SVV GSIS 
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSISL

Query:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC
        WPN SLAMDD  +D  Q+D+D  D  K     WELALRLWLPFL CWTVL NLNHP++V  KV+LFL+STKPSPLSVYIFVE+L  SS QEP LSN KK 
Subjt:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC

Query:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG
        +VA KVEV+DYKV CVA+VE++HQ  TL+G+LGG
Subjt:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG

XP_008443552.1 PREDICTED: uncharacterized protein LOC103487112 isoform X2 [Cucumis melo]8.8e-5556.03Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP
        MA  +K LRPLPRAFVFASSSSSSSS  FN ARF C +FEP   FP NFGS LCNR+ N R   S                      GA           
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP

Query:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV
         + L + D  +D  Q+D+D  D  K     WELALRLWLPFL CWTVL NLNHP++V  KV+LFL+STKPSPLSVYIFVE+L  SS QEP LSN KK +V
Subjt:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV

Query:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG
        A KVEV+DYKV CVA+VE++HQ  TL+G+LGG
Subjt:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG

XP_022152133.1 uncharacterized protein LOC111019921 [Momordica charantia]9.9e-12399.57Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP
        MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVG+QLSRGTILGASVVSGSISLWP
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP

Query:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV
        NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV
Subjt:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV

Query:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG
        ASKVEVQDYKVACVARVEIEHQKITLLGILGG
Subjt:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG

XP_038903549.1 uncharacterized protein LOC120090109 [Benincasa hispida]5.7e-7870.09Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFR--FKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISL
        MA   K LRPLPRAFVFA   SSSSSSFFN  RF C +F+  KSFP NFGSLLCNR+SNFR  F  ++ IYL RK  L+G+Q S+GTILGASVV GSISL
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFR--FKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISL

Query:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC
        WPN SLAMDD  MD  Q+DLDA D  K     WELALRLWLPFL CWTVL NLNHP+LV  KV+LFL+STKPSPLSVYIFVE+L  SS QEP LSN KKC
Subjt:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC

Query:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG
        + A KVEV+DYK+ CVA+VE++HQK TL+GILGG
Subjt:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG

TrEMBL top hitse value%identityAlignment
A0A0A0LI06 Uncharacterized protein5.7e-6863.4Show/hide
Query:  MASLAKLLRPLPRAFVFA-SSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSIS
        MA  +K LRPLPRAFVFA SSSSSSSSSF N ARF C +FEP K FP NFGS LCNR+ N R   S    IYL     LVG+QLS+ TILG SVV GSI+
Subjt:  MASLAKLLRPLPRAFVFA-SSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSIS

Query:  LWPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKK
         WPN S AM+D  +D  Q+ +D     K    +WELALRLWLPFL CWTVL NLNHP+ V  KV+LFL+STKPSPLSVYIFV++L  SS  EP LSN KK
Subjt:  LWPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKK

Query:  CVVASKVEVQDYKVACVARVEIEHQKITLLGILGG
         +VA KVEV+DYKV CVA+VE++HQ  T++G+LGG
Subjt:  CVVASKVEVQDYKVACVARVEIEHQKITLLGILGG

A0A1S3B8A6 uncharacterized protein LOC103487112 isoform X11.9e-7165.38Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSISL
        MA  +K LRPLPRAFVFASSSSSSSS  FN ARF C +FEP   FP NFGS LCNR+ N R   S    IYL     L+G+Q S+ TILG SVV GSIS 
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSISL

Query:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC
        WPN SLAMDD  +D  Q+D+D  D  K     WELALRLWLPFL CWTVL NLNHP++V  KV+LFL+STKPSPLSVYIFVE+L  SS QEP LSN KK 
Subjt:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC

Query:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG
        +VA KVEV+DYKV CVA+VE++HQ  TL+G+LGG
Subjt:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG

A0A1S3B8D2 uncharacterized protein LOC103487112 isoform X24.3e-5556.03Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP
        MA  +K LRPLPRAFVFASSSSSSSS  FN ARF C +FEP   FP NFGS LCNR+ N R   S                      GA           
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP

Query:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV
         + L + D  +D  Q+D+D  D  K     WELALRLWLPFL CWTVL NLNHP++V  KV+LFL+STKPSPLSVYIFVE+L  SS QEP LSN KK +V
Subjt:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV

Query:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG
        A KVEV+DYKV CVA+VE++HQ  TL+G+LGG
Subjt:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG

A0A5A7UGM7 Uncharacterized protein1.9e-7165.38Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSISL
        MA  +K LRPLPRAFVFASSSSSSSS  FN ARF C +FEP   FP NFGS LCNR+ N R   S    IYL     L+G+Q S+ TILG SVV GSIS 
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDT--IYLRRKCLLVGNQLSRGTILGASVVSGSISL

Query:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC
        WPN SLAMDD  +D  Q+D+D  D  K     WELALRLWLPFL CWTVL NLNHP++V  KV+LFL+STKPSPLSVYIFVE+L  SS QEP LSN KK 
Subjt:  WPNVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKC

Query:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG
        +VA KVEV+DYKV CVA+VE++HQ  TL+G+LGG
Subjt:  VVASKVEVQDYKVACVARVEIEHQKITLLGILGG

A0A6J1DE17 uncharacterized protein LOC1110199214.8e-12399.57Show/hide
Query:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP
        MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVG+QLSRGTILGASVVSGSISLWP
Subjt:  MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWP

Query:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV
        NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV
Subjt:  NVSLAMDDGFMDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVV

Query:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG
        ASKVEVQDYKVACVARVEIEHQKITLLGILGG
Subjt:  ASKVEVQDYKVACVARVEIEHQKITLLGILGG

SwissProt top hitse value%identityAlignment
P19173 Cytochrome c oxidase subunit 5C5.8e-1754.69Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  V H  YKGPS++KE+V G  L L+AGG WKM HWN QRRTKEFY++L++G + VV   +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

Q9FLK2 Probable cytochrome c oxidase subunit 5C-34.0e-1859.38Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  + HA  KGPS++KE+V GL L L AGGLWKM HWNEQR+T+ FY+LL+RG++GVVV  +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

Q9FNE0 Putative cytochrome c oxidase subunit 5C-45.2e-1861.02Show/hide
Query:  VGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        +G A YKGPS++KEI+YG+ L    GGLWKM HWN QRRTKEFY+LL++G++ VVV+ +
Subjt:  VGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

Q9LZQ0 Cytochrome c oxidase subunit 5C-26.8e-1857.81Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  V HA  KGPS++KE++ GL L L AGGLWKM HWNEQR+T+ FY+LL+RG++GVV   +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

Q9SXX7 Cytochrome c oxidase subunit 5C7.6e-1756.45Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQ
        + G  + HA  KGPS++KEI  GL L L+AGGLWKM HWNEQR+T+ FY++L++G + VVV+
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQ

Arabidopsis top hitse value%identityAlignment
AT5G40382.1 Cytochrome c oxidase subunit Vc family protein3.7e-1961.02Show/hide
Query:  VGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        +G A YKGPS++KEI+YG+ L    GGLWKM HWN QRRTKEFY+LL++G++ VVV+ +
Subjt:  VGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

AT5G61310.1 Cytochrome c oxidase subunit Vc family protein2.8e-1959.38Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  + HA  KGPS++KE+V GL L L AGGLWKM HWNEQR+T+ FY+LL+RG++GVVV  +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

AT5G61310.2 Cytochrome c oxidase subunit Vc family protein2.8e-1959.38Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  + HA  KGPS++KE+V GL L L AGGLWKM HWNEQR+T+ FY+LL+RG++GVVV  +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

AT5G61310.3 Cytochrome c oxidase subunit Vc family protein2.8e-1959.38Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  + HA  KGPS++KE+V GL L L AGGLWKM HWNEQR+T+ FY+LL+RG++GVVV  +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD

AT5G61310.4 Cytochrome c oxidase subunit Vc family protein2.8e-1959.38Show/hide
Query:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD
        + G  + HA  KGPS++KE+V GL L L AGGLWKM HWNEQR+T+ FY+LL+RG++GVVV  +
Subjt:  LGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCTTTAGCAAAACTTCTTCGGCCGCTCCCCAGAGCTTTCGTCTTCGCTTCATCGTCGTCGTCTTCTTCTTCTTCGTTCTTCAATTCCGCAAGATTCTCCTGCAC
CAGATTTGAGCCCTCGAAATCGTTTCCGAAGAACTTCGGATCACTACTCTGCAACCGCGTTTCCAATTTCAGATTTAAAATTTCCGATACAATTTACCTGCGGCGGAAGT
GTCTTTTGGTAGGCAATCAGTTAAGTAGGGGAACCATTCTGGGTGCATCCGTTGTTTCTGGATCAATCAGTTTGTGGCCCAATGTTTCATTGGCTATGGATGACGGATTC
ATGGATGATCGCCAAGACGATTTAGATGCATTAGATAATGGAAAACCTGAAGGAACCTTGTGGGAATTAGCATTAAGACTCTGGTTGCCTTTTCTTTTCTGTTGGACTGT
GTTGACGAACTTGAATCATCCTCTTCTGGTTGCGAGCAAAGTGATCCTATTCCTTCTTAGCACAAAACCCAGTCCTCTCTCTGTTTACATTTTTGTGGAGCAGCTTTGCC
ACTCTTCATGCCAGGAGCCTCGTCTCTCCAATGAGAAGAAGTGTGTGGTTGCAAGTAAAGTGGAAGTCCAAGACTACAAGGTTGCGTGTGTGGCCAGAGTCGAAATAGAA
CATCAAAAGATCACTCTGCTGGGAATTCTTGGAGGGAAAATGGTTGGTCATGCTGCATATAAGGGACCAAGCATAATAAAGGAGATTGTGTATGGATTAGGGCTGGCCTT
AATGGCTGGTGGCCTCTGGAAAATGCGCCATTGGAATGAACAGAGGAGGACCAAAGAGTTTTATGAGTTACTCGACCGAGGTGACGTCGGGGTTGTAGTTCAACACGACT
AA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCTTTAGCAAAACTTCTTCGGCCGCTCCCCAGAGCTTTCGTCTTCGCTTCATCGTCGTCGTCTTCTTCTTCTTCGTTCTTCAATTCCGCAAGATTCTCCTGCAC
CAGATTTGAGCCCTCGAAATCGTTTCCGAAGAACTTCGGATCACTACTCTGCAACCGCGTTTCCAATTTCAGATTTAAAATTTCCGATACAATTTACCTGCGGCGGAAGT
GTCTTTTGGTAGGCAATCAGTTAAGTAGGGGAACCATTCTGGGTGCATCCGTTGTTTCTGGATCAATCAGTTTGTGGCCCAATGTTTCATTGGCTATGGATGACGGATTC
ATGGATGATCGCCAAGACGATTTAGATGCATTAGATAATGGAAAACCTGAAGGAACCTTGTGGGAATTAGCATTAAGACTCTGGTTGCCTTTTCTTTTCTGTTGGACTGT
GTTGACGAACTTGAATCATCCTCTTCTGGTTGCGAGCAAAGTGATCCTATTCCTTCTTAGCACAAAACCCAGTCCTCTCTCTGTTTACATTTTTGTGGAGCAGCTTTGCC
ACTCTTCATGCCAGGAGCCTCGTCTCTCCAATGAGAAGAAGTGTGTGGTTGCAAGTAAAGTGGAAGTCCAAGACTACAAGGTTGCGTGTGTGGCCAGAGTCGAAATAGAA
CATCAAAAGATCACTCTGCTGGGAATTCTTGGAGGGAAAATGGTTGGTCATGCTGCATATAAGGGACCAAGCATAATAAAGGAGATTGTGTATGGATTAGGGCTGGCCTT
AATGGCTGGTGGCCTCTGGAAAATGCGCCATTGGAATGAACAGAGGAGGACCAAAGAGTTTTATGAGTTACTCGACCGAGGTGACGTCGGGGTTGTAGTTCAACACGACT
AA
Protein sequenceShow/hide protein sequence
MASLAKLLRPLPRAFVFASSSSSSSSSFFNSARFSCTRFEPSKSFPKNFGSLLCNRVSNFRFKISDTIYLRRKCLLVGNQLSRGTILGASVVSGSISLWPNVSLAMDDGF
MDDRQDDLDALDNGKPEGTLWELALRLWLPFLFCWTVLTNLNHPLLVASKVILFLLSTKPSPLSVYIFVEQLCHSSCQEPRLSNEKKCVVASKVEVQDYKVACVARVEIE
HQKITLLGILGGKMVGHAAYKGPSIIKEIVYGLGLALMAGGLWKMRHWNEQRRTKEFYELLDRGDVGVVVQHD