CuGenDBv2

MC06g1782 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1782
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: mavicyanin-like
Genome location: MC06:25107178..25109490
RNA-Seq Expression: MC06g1782
Synteny: MC06g1782
Gene Ontology terms:
GO:0022900 - electron transport chain (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0046658 - anchored component of plasma membrane (cellular component)
GO:0009055 - electron transfer activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR003245 - Phytocyanin domain
IPR008972 - Cupredoxin
IPR028871 - Blue (type 1) copper protein, binding site
IPR039391 - Phytocyanin
IPR041845 - Mavicyanin
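
The genome location in the record above follows a `chromosome:start..end` pattern. A minimal sketch of how such a string could be split into its parts is given here; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("MC06:25107178..25109490")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the locus spans 2313 bp on MC06.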


Homology
GenBank top hits | e value | % identity
KAG6571468.1 hypothetical protein SDJN03_28196, partial [Cucurbita argyrosperma subsp. sororia] | 5.83e-76 | 70.65
Query:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT
        KM WA+ WI  +GLF +S  A VHKVGDS GWTTL+P DYAKWASSN FHVGDSLLF+YNNKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GT
Subjt:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT

Query:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL
        FYFLCG PGHCQ GQKVEIKV   SSSA LA S  PG SPLPNGPAP          PSAAST S + L  L L A++VAFY L
Subjt:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL

KAG7011230.1 hypothetical protein SDJN02_26133, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.94e-75 | 70.17
Query:  WWAMWI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYF
        W  +WI  +GLF +S  A VHKVGDS GWTTL+P DYAKWASSN FHVGDSLLF+YNNKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GTFYF
Subjt:  WWAMWI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYF

Query:  LCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL
        LCG PGHCQ GQKVEIKV   SSSA LA S  PG SPLPNGPAP          PSAAST S + L  L L A++VAFY L
Subjt:  LCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL

XP_022159180.1 mavicyanin-like [Momordica charantia] | 1.19e-119 | 96.17
Query:  MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTG
        MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTG
Subjt:  MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTG

Query:  TFYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPSP----LPNGPAPTPSAASTRSPHHLPLLLL---PAFIVAFYHLFV
        TFYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPSP    LPNGPAPTPSAASTRSPHHLPLLLL   PAFIVAFYHLFV
Subjt:  TFYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPSP----LPNGPAPTPSAASTRSPHHLPLLLL---PAFIVAFYHLFV

XP_022963846.1 mavicyanin [Cucurbita moschata] | 3.36e-75 | 70.11
Query:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT
        KM WA+ WI  +GLF +S  A VHKVGDS GWTTL+P DYAKWASSN FHVGDSLLF+Y+NKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GT
Subjt:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT

Query:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL
        FYFLCG PGHCQ GQKVEIKV   SSSA LA S  PG SPLPNGPAP          PSAAST S + L  L L A++VAFY L
Subjt:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL

XP_023553590.1 mavicyanin [Cucurbita pepo subsp. pepo] | 6.77e-75 | 70.11
Query:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT
        KM WA+ WI  +GLF +S  A VHKVGDS GWTTL+P DYAKWASSN FHVGDSLLF+YNNKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GT
Subjt:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT

Query:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPS--PLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL
        FYFLCG PGHCQ GQKVEIKV   SSSA LA S  PS  PLPNGPAP          PSAAST S + L  L L A++VAFY L
Subjt:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPS--PLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL

TrEMBL top hits | e value | % identity
A0A1S3BM10 mavicyanin-like | 1.30e-70 | 72.5
Query:  WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGF
        WI  + LF +SV A VH+VGDS+GWTTLIPVDYAKWASS  FHVGDSLLF YNN FHNVLQV Q+Q+ +CNSSSPAASYNSGADSI LKR GTFYFLCGF
Subjt:  WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGF

Query:  PGHCQEGQKVEIKVTRASSS--AALALSPGPSPL--PNGPAPTPSAASTRSPHHLPLLLL
        PGHCQ GQKVE+KVT ASSS   A + SPGPSP+  P+  APTPSAAST S +   +L L
Subjt:  PGHCQEGQKVEIKVTRASSS--AALALSPGPSPL--PNGPAPTPSAASTRSPHHLPLLLL

A0A5D3CSL9 Mavicyanin-like | 1.30e-70 | 72.5
Query:  WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGF
        WI  + LF +SV A VH+VGDS+GWTTLIPVDYAKWASS  FHVGDSLLF YNN FHNVLQV Q+Q+ +CNSSSPAASYNSGADSI LKR GTFYFLCGF
Subjt:  WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGF

Query:  PGHCQEGQKVEIKVTRASSS--AALALSPGPSPL--PNGPAPTPSAASTRSPHHLPLLLL
        PGHCQ GQKVE+KVT ASSS   A + SPGPSP+  P+  APTPSAAST S +   +L L
Subjt:  PGHCQEGQKVEIKVTRASSS--AALALSPGPSPL--PNGPAPTPSAASTRSPHHLPLLLL

A0A6J1DXY6 mavicyanin-like | 5.78e-120 | 96.17
Query:  MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTG
        MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTG
Subjt:  MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTG

Query:  TFYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPSP----LPNGPAPTPSAASTRSPHHLPLLLL---PAFIVAFYHLFV
        TFYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPSP    LPNGPAPTPSAASTRSPHHLPLLLL   PAFIVAFYHLFV
Subjt:  TFYFLCGFPGHCQEGQKVEIKVTRASSSAALALSPGPSP----LPNGPAPTPSAASTRSPHHLPLLLL---PAFIVAFYHLFV

A0A6J1HH72 mavicyanin | 1.63e-75 | 70.11
Query:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT
        KM WA+ WI  +GLF +S  A VHKVGDS GWTTL+P DYAKWASSN FHVGDSLLF+Y+NKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GT
Subjt:  KMWWAM-WI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGT

Query:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL
        FYFLCG PGHCQ GQKVEIKV   SSSA LA S  PG SPLPNGPAP          PSAAST S + L  L L A++VAFY L
Subjt:  FYFLCGFPGHCQEGQKVEIKVTRASSSAALALS--PGPSPLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFYHL

A0A6J1HQP3 mavicyanin | 1.26e-72 | 69.83
Query:  WWAMWI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYF
        W  +WI  +GLF VS  A VHKVGDS GWTTL+P DYAKWASSN F VGDSLLF+YNNKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GTFYF
Subjt:  WWAMWI--IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYF

Query:  LCGFPGHCQEGQKVEIKVTRASSSAALALSPGPS--PLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFY
        LCG PGHCQ GQKVEIKV   SSSA LA S  PS  PLPNGPAP          PSAAST S + L  L L A+ VAFY
Subjt:  LCGFPGHCQEGQKVEIKVTRASSSAALALSPGPS--PLPNGPAPT---------PSAASTRSPHHLPLLLLPAFIVAFY

SwissProt top hits | e value | % identity
O82081 Uclacyanin 1 | 1.8e-18 | 36.11
Query:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK
        A  H +G  +GWT  +      WA+   F VGD+L+FSY   FH+V++V + ++ SC +  P  ++ +G   + L   G  YF+CG PGHC +G K+E+ 
Subjt:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK

Query:  VTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLP
        V   ++ A  A  P P+ +P+  AP+PS+     P  LPL  +P
Subjt:  VTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLP

P00302 Stellacyanin | 1.3e-24 | 53.4
Query:  VHKVGDSAGWTTLI--PVDYA-KWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEI
        V+ VGDSAGW       VDY  KWAS+  FH+GD L+F Y+ +FHNV +V Q+ Y SCN ++P ASYN+G + I LK  G  Y++CG P HC  GQKV I
Subjt:  VHKVGDSAGWTTLI--PVDYA-KWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEI

Query:  KVT
         VT
Subjt:  KVT

P80728 Mavicyanin | 1.9e-47 | 83.33
Query:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK
        A VHKVGDS GWTTL+P DYAKWASSN FHVGDSLLF+YNNKFHNVLQV+Q+Q+ SCNSSSPAASY SGADSI LKR GTFYFLCG PGHCQ GQKVEIK
Subjt:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK

Query:  VTRASSSA
        V   SSSA
Subjt:  VTRASSSA

Q41001 Blue copper protein | 7.5e-20 | 42.96
Query:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK
        A V+ VGD++GW  +I  DY+ WAS   F VGDSL+F+Y    H V +V +  Y SC S +  ++ ++GA +I LK+ G  YF+CG PGH   G K+ IK
Subjt:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK

Query:  VTRASSSAALALSPGPSPLPNGPA---PTPSAAST
        V +ASS ++ A S  PS    G      TP+A +T
Subjt:  VTRASSSAALALSPGPSPLPNGPA---PTPSAAST

Q9SK27 Early nodulin-like protein 1 | 2.0e-17 | 36.42
Query:  IIGLFAVSVGAAVHKVGDSAGWTTLIP----VDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCG
        +I LF+++    V   G S  W   IP      + +WA    F VGD ++F Y +   +VL+V ++ Y SCN+++P A+Y  G   + L R+G FYF+ G
Subjt:  IIGLFAVSVGAAVHKVGDSAGWTTLIP----VDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCG

Query:  FPGHCQEGQKVEIKVTRASSSAALALSPGPSPL--PNGP--APTPSAASTR
          GHC++GQK+ + V     S    +SP PSP+   +GP  AP P + S R
Subjt:  FPGHCQEGQKVEIKVTRASSSAALALSPGPSPL--PNGP--APTPSAASTR

Arabidopsis top hits | e value | % identity
AT2G26720.1 Cupredoxin superfamily protein | 2.6e-28 | 39.47
Query:  IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHC
        + LF V+VG  VHKVG++ GW T+I  DY  WASS  F VGD+L+F+YN  +H+V +V    +  C SS P   Y +G+DSI+L + G  +F+CG PGHC
Subjt:  IGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHC

Query:  QEGQKVEIKVTRAS------------SSAALALSPGPSPLPNGP---------APTPSAASTRS-------PHHLPLLLLPAFIVAFYHL
        ++GQK++I V  AS             S + + SP PSPL + P          PTP++ S  S          L L+ L  F + F+ L
Subjt:  QEGQKVEIKVTRAS------------SSAALALSPGPSPLPNGP---------APTPSAASTRS-------PHHLPLLLLPAFIVAFYHL

AT2G31050.1 Cupredoxin superfamily protein | 7.4e-31 | 41.57
Query:  IIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGH
        ++ LF +SVG  VHKVGDS GW T++ V+Y  WAS+  F VGDSL+F YN  FH+V +V    Y  C  S P A Y +G+D + L + G  +F+CGFPGH
Subjt:  IIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGH

Query:  CQEGQKVEIKVTRASSSAALALSPGP------------SPLPN------------GPAPTPSAASTRSPHHLPLLLLP
        C  GQK++I V  AS     A  PGP            SPL              GP+P P +A++ S   + L  LP
Subjt:  CQEGQKVEIKVTRASSSAALALSPGP------------SPLPN------------GPAPTPSAASTRSPHHLPLLLLP

AT2G32300.1 uclacyanin 1 | 1.3e-19 | 36.11
Query:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK
        A  H +G  +GWT  +      WA+   F VGD+L+FSY   FH+V++V + ++ SC +  P  ++ +G   + L   G  YF+CG PGHC +G K+E+ 
Subjt:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK

Query:  VTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLP
        V   ++ A  A  P P+ +P+  AP+PS+     P  LPL  +P
Subjt:  VTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLP

AT3G60270.1 Cupredoxin superfamily protein | 2.4e-21 | 41.04
Query:  SVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKV
        +V A   +VGD+ GWT  I V+Y  W S   F VGD+L F Y    H+V  VN+  Y  C +S P  S++ G   I L + G  +FLC  PGHC  G K+
Subjt:  SVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKV

Query:  EIKVTRASSSAALALSPGPSPLPNGPAPTPSAAS
         ++V  A S         PSP P+ P+P+PSA S
Subjt:  EIKVTRASSSAALALSPGPSPLPNGPAPTPSAAS

AT5G26330.1 Cupredoxin superfamily protein | 4.3e-31 | 45.64
Query:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK
        AAV+KVGDSAGWTT+  VDY  WAS+  FH+GD++LF YN +FHNV++V    Y SCN+S P +++ +G DSI L   G  +F CG PGHC  GQK+++ 
Subjt:  AAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPGHCQEGQKVEIK

Query:  VTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLPAFIVA
        V   +SS  L+  P  S   + P+ T  AA    P       LP+ + A
Subjt:  VTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLPAFIVA
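
In the alignment blocks above, the unlabeled middle line follows the usual BLAST-style convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A rough sketch of how percent identity could be counted from one such block is shown below. The function name `percent_identity` is hypothetical, and the value for a single block need not equal the whole-alignment figure listed in the hit tables, whose exact calculation is not described on this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical columns / aligned columns, in percent, for one block.

    ASSUMPTION: letters in the midline mark identities; '+' and spaces
    do not count. Aligned columns are taken as the query line's length,
    gap characters included.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Pad in case trailing spaces of the midline were stripped.
    midline = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / columns


# Second block of the P80728 (Mavicyanin) alignment shown above.
print(percent_identity("VTRASSSA", "V   SSSA"))
```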


Sequences
CDS sequence
ATGGTTTCCTGGAAGATGTGGTGGGCGATGTGGATAATAGGGCTGTTTGCAGTGTCGGTGGGTGCTGCGGTGCACAAAGTCGGCGACTCTGCCGGGTGGACCACACTCAT
CCCCGTCGATTATGCCAAATGGGCTTCTTCCAACAACTTCCATGTCGGCGACTCTCTCCTGTTCTCTTACAACAACAAATTCCACAATGTGCTGCAAGTGAACCAGCAGC
AGTATGGGTCGTGCAACTCGTCTTCTCCGGCTGCGTCTTACAACTCCGGCGCCGACTCCATCGCCCTGAAAAGGACCGGAACCTTCTACTTCCTCTGCGGCTTCCCAGGC
CACTGTCAAGAGGGTCAGAAGGTGGAGATCAAGGTCACCCGAGCCTCATCCTCTGCAGCACTTGCCCTCTCTCCCGGCCCAAGCCCACTCCCAAATGGGCCTGCCCCCAC
TCCAAGCGCGGCCTCAACTCGCAGTCCCCACCACTTGCCCTTGCTGCTGCTGCCTGCCTTTATTGTCGCATTTTACCATCTCTTTGTTTGA
mRNA sequence
ATTGAGTTGAAATTAGACCGAACTGTGTCGATAGTCTTTTCCATCCTTGATCTCTGGAAAATAATTGGATAGTTGGTCGCACAAAGTTATCTAATCTTAGATTACCTAAT
TCAAAGATACGGCAAGTCTCCAAACCAATGATCATAGGAACTTACTACATCTTTCTCAATCAAGGTTTTCTCCGCTTCAATTTCTTCGATAGTTGCAGCCTTCACAACTC
CTCTTCTAAAATTTTTTCAGGGCCAAATAAAGAAGGGAGAAGAAGATCAACATAAATGAAATGCTGTCGCAATCAACTTTCGATAGCAATAACGCAATGACCAATCCAAA
TGAAAAATTCATGTACGTTTGCCAGACTGCCCCATCATGTTAGCACCCCCAAAATGTATAAAACTGGTCGTAAAAACGAGTCAAAGAACATAATTGACGAATACGTAGAG
TTTCATACGCCAAATAATTGGTTGGTTATAGTCACTGCAACTCAAGAAAGTGAATGAAATAGAAGAAATTCCCTTGCCTGTTTTTCAACAGACTCCCAGAAACAATGACG
TGGGAAGAGATCATTAGGGCATGCCGCTTGAGTCTGACATAGTTTGCAGCAGAGTTCCATGAATAAACATGCGCCGTATTCAACCCACATTTGACGCAATTTAACCCTCC
ATTATAATCTAAAAGGCGACCCCCATTTGGGGATGAAGAAATAAACATATAAACAAACAGAAATCAAGAAAGAGAATGCAATTTTTGGTAGAACAGTGACGTACCTCGTA
ATGAGAGGAGTGCTCTGGCGCGGTTGTGTTTGGGTTGGAAGACGGAACGAACAGAGATTGTTGAGAAAGGCGTCGCCATGAAAACCTAAATGAATTGAAGATGTTGACAC
TGAGAGAGAGCGAAGGAGAGGAATAGGATCAGGATGATAAGCAGTAGAACCACGGCTGGAGTGGACCTTGTCTACGACAACGGAGTGAAGGAGGTGTGGATAAGAAATTG
AAATCGAACTGCTATCAGTGAGGGCTGTTTGGGTGCGGCGAATACATTATTCTGCGAGTGGATAAGCCTCTCCTGTGCCCGCATAGCATCAATTCACGAGTCGCCAACGC
CAATAAATACAAGATAAGAAGATACTAAAGTTAACAGTGAAATCCTAATGAGGATGGAACTCAGCTGGGGCCGGGTGGTAATAAAAGTTATAGAGAGACATAAATAATGA
GCCCCAGACCCAGTCCATTACTAGAAAGTTGAAACACGTAAAAAGATGGATTAACAGGAGTAATTAATTAACAATCTCCAACAACTTTTAACTTTTTATAAATGTACTGT
AGAGGGTAGATTCCATTTCTATCCCCATTTTGACCCCTTCCAGTTCCACAAAATTAAACCAATCTCCAATTCTAGTTCTTTCTACTCCACTCCAGTTCGAGACCCACAAT
TTCTCTCTGTTCAGCCCCCGTAAGCCATGGTTTCCTGGAAGATGTGGTGGGCGATGTGGATAATAGGGCTGTTTGCAGTGTCGGTGGGTGCTGCGGTGCACAAAGTCGGC
GACTCTGCCGGGTGGACCACACTCATCCCCGTCGATTATGCCAAATGGGCTTCTTCCAACAACTTCCATGTCGGCGACTCTCTCCTGTTCTCTTACAACAACAAATTCCA
CAATGTGCTGCAAGTGAACCAGCAGCAGTATGGGTCGTGCAACTCGTCTTCTCCGGCTGCGTCTTACAACTCCGGCGCCGACTCCATCGCCCTGAAAAGGACCGGAACCT
TCTACTTCCTCTGCGGCTTCCCAGGCCACTGTCAAGAGGGTCAGAAGGTGGAGATCAAGGTCACCCGAGCCTCATCCTCTGCAGCACTTGCCCTCTCTCCCGGCCCAAGC
CCACTCCCAAATGGGCCTGCCCCCACTCCAAGCGCGGCCTCAACTCGCAGTCCCCACCACTTGCCCTTGCTGCTGCTGCCTGCCTTTATTGTCGCATTTTACCATCTCTT
TGTTTGAATTTGAATGCTTGGGTTGGGCTGTACCTCTTACATCTATCTGCTACTCTAAATATCGCACCTCTGTTGTTGCACTTGCGTTAAGGTGTTATAATCTTCTTCTT
TGTACCATGCATGATATTTTATTCATGAATATGATGGATATGCCCTTGTCACGTTTATATTCTGCCCTCTCCTAACTAATTATGCTAATTTAGTGCTATATTTTCC
Protein sequence
MVSWKMWWAMWIIGLFAVSVGAAVHKVGDSAGWTTLIPVDYAKWASSNNFHVGDSLLFSYNNKFHNVLQVNQQQYGSCNSSSPAASYNSGADSIALKRTGTFYFLCGFPG
HCQEGQKVEIKVTRASSSAALALSPGPSPLPNGPAPTPSAASTRSPHHLPLLLLPAFIVAFYHLFV
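
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS shown above.
print(translate("ATGGTTTCCTGGAAGATGTGGTGGGCGATG"))
print(translate("CTCTTTGTTTGA"))
```

The two fragments translate to the start (`MVSWKMWWAM`) and end (`LFV`) of the listed protein sequence; running the full CDS through the same function would allow a complete comparison.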