; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g11680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g11680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPolyprotein
Genome locationchr6:8827234..8828046
RNA-Seq ExpressionMoc06g11680
SyntenyMoc06g11680
Gene Ontology termsNA
InterPro domainsIPR028919 - Viral movement protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588193.1 hypothetical protein SDJN03_16758, partial [Cucurbita argyrosperma subsp. sororia]7.9e-4244.62Show/hide
Query:  RSCSSKFGGDHALEGEEY--VEKG-NHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMNIGSV
        RS  SK    H L+ +EY  VEKG + L+ W +P +PTSKIY+  T  FT FS +   ++T EG     NGGG   L  E    +  + KY  I +IG V
Subjt:  RSCSSKFGGDHALEGEEY--VEKG-NHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMNIGSV

Query:  QLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSS
        Q+GVK  TR  P NA ++LCLRD R     D ++ +VE+NL DGP YFNVFPNI  SLS   L ++L ++  +  FEQLPEG + IV+  R CYK+    
Subjt:  QLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSS

Query:  FGPEALIESPV-GKTVFFQTQIRES-NDVVQKVTKWEHVQLPPNWPPQLQK
             L+++P  G+T+FFQT    + ND  Q VT W  VQLP NWP    K
Subjt:  FGPEALIESPV-GKTVFFQTQIRES-NDVVQKVTKWEHVQLPPNWPPQLQK

KAG6590198.1 hypothetical protein SDJN03_15621, partial [Cucurbita argyrosperma subsp. sororia]5.0e-9766.42Show/hide
Query:  MSQFFRSCSSKFGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKP--FTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMN
        MS FFRSCSSKF  DH+LE EEYVEKGN+LVKW MP +P  K+YE  TK   FT  +  DRSIR +EG++SFGNGGG+FKL+ +APSSYG KGK+Y++MN
Subjt:  MSQFFRSCSSKFGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKP--FTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMN

Query:  IGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSL-TNVLSIHVLINGFEQLPEGSQPIVLSCRTCYK
        IG VQ+GVKT TRKIPSNASIILC+RDNR+EK  D ++A+VES LGDGPFYFNVFPN+N SL + SL  NVLS+HVL+ GF+QL +GS+PIV+SCRTCYK
Subjt:  IGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSL-TNVLSIHVLINGFEQLPEGSQPIVLSCRTCYK

Query:  LSSSSFGPEALIESPVGKTVFFQTQI--------RESNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENG
        L+ + FGPEAL+ESPVGKTVFFQTQI            DVVQKVT+W+ VQLP NWPPQL   K P +    NG
Subjt:  LSSSSFGPEALIESPVGKTVFFQTQI--------RESNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENG

KGN66493.1 hypothetical protein Csa_007053 [Cucumis sativus]4.3e-8863.14Show/hide
Query:  MSQFFRSCSSK-FGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKP--FTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK--YYN
        MS FF+SCSSK F GDH+LE EEYVEKG  LVKW+MP++P  KIYE+  K   F   S +D SIRT+EG++SFGN GG+FKL+ + PSSY  + +  ++N
Subjt:  MSQFFRSCSSK-FGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKP--FTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK--YYN

Query:  IMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTC
         MNIG VQ+GVKT T+KIP NASIILCLRDNR+EK  D ++ALVES LGDGPFYFNVFPNINLSL   S+TNVLS+HVL+ G +++P+GS PIV++CRTC
Subjt:  IMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTC

Query:  YKLSSSSFGPEALIESPVGKTVFFQTQIRE--SNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENGTARV
        YKL+ + FG EALIESPVGKTVFFQ +I E   +DVVQKVT W  VQLP +WPP+L     P LV G +  A+V
Subjt:  YKLSSSSFGPEALIESPVGKTVFFQTQIRE--SNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENGTARV

KGN66494.1 hypothetical protein Csa_006902 [Cucumis sativus]9.3e-5148.93Show/hide
Query:  SSKF-GGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK-YYNIMNIGSVQLGV
        SS F GG H+L+ EEY++KG +L+KW++PKIPT+KIY+  + PF  F  SD  I+T E  +   NG   F+L    P     + K +Y  +N+G +Q+GV
Subjt:  SSKF-GGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK-YYNIMNIGSVQLGV

Query:  KTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPE
        KT T KIPSNASIILC+ D R + F D ++ LVES L DGP +FN+FPNI + +    L     +  ++ GFEQLP+G+ PI L  RTCYKL +S+  P 
Subjt:  KTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPE

Query:  ALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV
        ALIESP GKTVFFQT    S    QKV++W+ V
Subjt:  ALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV

XP_038880673.1 uncharacterized protein LOC120072292 [Benincasa hispida]2.8e-4747.86Show/hide
Query:  CSSKF-GGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAP-SSYGGKGKYYNIMNIGSVQLG
        CSS F GG HAL+ EEY++KGN+L+KW++PK+PT+KIY++   PFT F  SD SI+T E K+S  NG  AF+L  + P  +     ++Y  +N+G +Q+G
Subjt:  CSSKF-GGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAP-SSYGGKGKYYNIMNIGSVQLG

Query:  VKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGP
        VK  T KIPSNASIILC+ D+R E F D ++ LVESNL                                 GFEQLPEG++PI L  RTCYKL  S+  P
Subjt:  VKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGP

Query:  EALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV
         AL+ESP GKTV+FQT ++ S   VQKV+KW+ V
Subjt:  EALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV

TrEMBL top hitse value%identityAlignment
A0A0A0LXM3 Uncharacterized protein2.1e-8863.14Show/hide
Query:  MSQFFRSCSSK-FGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKP--FTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK--YYN
        MS FF+SCSSK F GDH+LE EEYVEKG  LVKW+MP++P  KIYE+  K   F   S +D SIRT+EG++SFGN GG+FKL+ + PSSY  + +  ++N
Subjt:  MSQFFRSCSSK-FGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKP--FTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK--YYN

Query:  IMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTC
         MNIG VQ+GVKT T+KIP NASIILCLRDNR+EK  D ++ALVES LGDGPFYFNVFPNINLSL   S+TNVLS+HVL+ G +++P+GS PIV++CRTC
Subjt:  IMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTC

Query:  YKLSSSSFGPEALIESPVGKTVFFQTQIRE--SNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENGTARV
        YKL+ + FG EALIESPVGKTVFFQ +I E   +DVVQKVT W  VQLP +WPP+L     P LV G +  A+V
Subjt:  YKLSSSSFGPEALIESPVGKTVFFQTQIRE--SNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENGTARV

A0A0A0LZS0 Uncharacterized protein4.5e-5148.93Show/hide
Query:  SSKF-GGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK-YYNIMNIGSVQLGV
        SS F GG H+L+ EEY++KG +L+KW++PKIPT+KIY+  + PF  F  SD  I+T E  +   NG   F+L    P     + K +Y  +N+G +Q+GV
Subjt:  SSKF-GGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGK-YYNIMNIGSVQLGV

Query:  KTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPE
        KT T KIPSNASIILC+ D R + F D ++ LVES L DGP +FN+FPNI + +    L     +  ++ GFEQLP+G+ PI L  RTCYKL +S+  P 
Subjt:  KTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPE

Query:  ALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV
        ALIESP GKTVFFQT    S    QKV++W+ V
Subjt:  ALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV

A0A5A7U9X3 Polyprotein1.3e-3751.18Show/hide
Query:  NGGGAFKLFDEAP--SSYGGKGKYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVL
        NG  AF+L  + P  +S+  K ++  ++N+G +Q+GVKT T KI SNASIILC+ D R + F D ++ LVE+ L DGP +FN+FPNI +SL    L   L
Subjt:  NGGGAFKLFDEAP--SSYGGKGKYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVL

Query:  SIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV
         +  ++ GFEQLP+G+ PI L  RTCYKL  S+F P ALIESP GKTVFFQT    S   VQKV++W+ V
Subjt:  SIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV

A0A5D3D5V1 Polyprotein1.5e-3851.76Show/hide
Query:  NGGGAFKLFDEAP--SSYGGKGKYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVL
        NG  AF+L  + P  +S+  K ++  ++N+G +Q+GVKT T KIPSNASIILC+ D R + F D ++ LVE+ L DGP +FN+FPNI +SL    L   L
Subjt:  NGGGAFKLFDEAP--SSYGGKGKYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVL

Query:  SIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV
         +  ++ GFEQLP+G+ PI L  RTCYKL  S+F P ALIESP GKTVFFQT    S   VQKV++W+ V
Subjt:  SIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESPVGKTVFFQTQIRESNDVVQKVTKWEHV

A5C0V2 Uncharacterized protein5.9e-2731.32Show/hide
Query:  DHALEGEE--YVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMNIGSVQLGVKTTTRK
        D  +  EE  Y +  N L  W++PK+   +IY+K T  F     +D +I+TSE  VS        +L D   S    K   YN ++ G +Q+  K  TR 
Subjt:  DHALEGEE--YVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMNIGSVQLGVKTTTRK

Query:  IPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESP
        +  N SI++CLRDNR  ++ D ++  V++ L DGP YF  +PN  + + D  + + + +H+  +GF+  P G+ P+ +  R  YK  ++S G  AL  SP
Subjt:  IPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESP

Query:  VGKTVFFQTQIRESND-VVQKVTKWEHVQLPPNW-----PPQLQKRKRPV--LVSGENGTARVVF
         G+T +F + + + +D ++ K   W+ V  P NW      P + +R   +  +V   +G   +VF
Subjt:  VGKTVFFQTQIRESND-VVQKVTKWEHVQLPPNW-----PPQLQKRKRPV--LVSGENGTARVVF

SwissProt top hitse value%identityAlignment
P03545 Movement protein3.0e-0427.43Show/hide
Query:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI
        K  +++++G+V++ +K   R    +  I + L D+R+    DC++   + NL  G F F V+P   +SL+   L   LS   LI+ FE    + +G + +
Subjt:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI

Query:  VLSCRTCYKLSSS
         ++    Y L++S
Subjt:  VLSCRTCYKLSSS

P03546 Movement protein2.3e-0427.43Show/hide
Query:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI
        K  +++++G+V++ +K   R    +  I + L D+R+    DC++   + NL  G F F V+P   +SL+   L   LS   LI+ FE    + +G + +
Subjt:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI

Query:  VLSCRTCYKLSSS
         ++    Y L++S
Subjt:  VLSCRTCYKLSSS

P15631 Movement protein1.4e-0425Show/hide
Query:  MNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCY
        ++I ++Q+ +K+T  K   +  + L LRDNR+    +  +A+   NL  G   F+V   + LSL D  L   + ++        + EG+    +S R  Y
Subjt:  MNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCY

Query:  KLSSSSFGPEALIESPVGKTVFFQTQIRESNDVVQKVTKWEHVQLPPN
         LS+S    E   +  +     F   +   + V  K+TK + +++ P+
Subjt:  KLSSSSFGPEALIESPVGKTVFFQTQIRESNDVVQKVTKWEHVQLPPN

Q00966 Movement protein3.0e-0427.43Show/hide
Query:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI
        K  +++++G+V++ +K   R    +  I + L D+R+    DC++   + NL  G F F V+P   +SL+   L   LS   LI+ FE    + +G + +
Subjt:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI

Query:  VLSCRTCYKLSSS
         ++    Y L++S
Subjt:  VLSCRTCYKLSSS

Q02968 Movement protein1.8e-0427.43Show/hide
Query:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI
        K  +++++G+V++ +K   R    +  I + L D+R+    DC++   + NL  G F F V+P   +SL+   L   LS   LI+ FE    + +G + +
Subjt:  KYYNIMNIGSVQLGVKTTTRKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQ---LPEGSQPI

Query:  VLSCRTCYKLSSS
         ++    Y L++S
Subjt:  VLSCRTCYKLSSS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACAATTTTTCAGGTCTTGCAGCTCCAAATTTGGAGGCGATCATGCCTTGGAAGGTGAAGAATACGTAGAGAAAGGAAACCATTTGGTGAAATGGGAGATGCCAAA
AATCCCCACTTCAAAAATCTACGAAAAAACAACAAAGCCCTTCACTTTATTCTCTTCTTCAGATCGTTCCATAAGAACCTCAGAAGGAAAAGTCAGCTTTGGAAATGGTG
GAGGAGCCTTCAAATTGTTTGATGAAGCCCCTTCTTCCTATGGTGGCAAAGGAAAATACTACAACATTATGAACATAGGCTCAGTTCAATTGGGAGTCAAAACCACAACA
AGAAAAATCCCTTCAAACGCCTCCATAATTCTGTGCCTTCGTGACAACAGAGTGGAGAAATTCCCAGATTGTGTTGTGGCCTTGGTGGAATCAAATTTGGGGGATGGCCC
ATTTTACTTCAATGTTTTTCCAAACATAAATTTGTCTCTGTCTGATTATTCACTGACTAATGTTTTGAGTATCCACGTTTTGATCAATGGGTTTGAGCAGCTCCCAGAAG
GGTCTCAGCCAATTGTTTTGAGCTGTAGAACATGTTACAAGTTAAGTTCTAGCAGCTTTGGACCAGAGGCATTGATTGAGAGTCCTGTCGGGAAAACTGTGTTTTTTCAG
ACACAGATTCGAGAGTCGAATGATGTTGTTCAAAAGGTGACAAAATGGGAGCACGTTCAGCTGCCACCTAATTGGCCTCCTCAGCTGCAGAAAAGAAAGAGGCCGGTATT
GGTTTCTGGTGAAAATGGAACAGCTCGAGTGGTTTTTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCACAATTTTTCAGGTCTTGCAGCTCCAAATTTGGAGGCGATCATGCCTTGGAAGGTGAAGAATACGTAGAGAAAGGAAACCATTTGGTGAAATGGGAGATGCCAAA
AATCCCCACTTCAAAAATCTACGAAAAAACAACAAAGCCCTTCACTTTATTCTCTTCTTCAGATCGTTCCATAAGAACCTCAGAAGGAAAAGTCAGCTTTGGAAATGGTG
GAGGAGCCTTCAAATTGTTTGATGAAGCCCCTTCTTCCTATGGTGGCAAAGGAAAATACTACAACATTATGAACATAGGCTCAGTTCAATTGGGAGTCAAAACCACAACA
AGAAAAATCCCTTCAAACGCCTCCATAATTCTGTGCCTTCGTGACAACAGAGTGGAGAAATTCCCAGATTGTGTTGTGGCCTTGGTGGAATCAAATTTGGGGGATGGCCC
ATTTTACTTCAATGTTTTTCCAAACATAAATTTGTCTCTGTCTGATTATTCACTGACTAATGTTTTGAGTATCCACGTTTTGATCAATGGGTTTGAGCAGCTCCCAGAAG
GGTCTCAGCCAATTGTTTTGAGCTGTAGAACATGTTACAAGTTAAGTTCTAGCAGCTTTGGACCAGAGGCATTGATTGAGAGTCCTGTCGGGAAAACTGTGTTTTTTCAG
ACACAGATTCGAGAGTCGAATGATGTTGTTCAAAAGGTGACAAAATGGGAGCACGTTCAGCTGCCACCTAATTGGCCTCCTCAGCTGCAGAAAAGAAAGAGGCCGGTATT
GGTTTCTGGTGAAAATGGAACAGCTCGAGTGGTTTTTCAATGA
Protein sequenceShow/hide protein sequence
MSQFFRSCSSKFGGDHALEGEEYVEKGNHLVKWEMPKIPTSKIYEKTTKPFTLFSSSDRSIRTSEGKVSFGNGGGAFKLFDEAPSSYGGKGKYYNIMNIGSVQLGVKTTT
RKIPSNASIILCLRDNRVEKFPDCVVALVESNLGDGPFYFNVFPNINLSLSDYSLTNVLSIHVLINGFEQLPEGSQPIVLSCRTCYKLSSSSFGPEALIESPVGKTVFFQ
TQIRESNDVVQKVTKWEHVQLPPNWPPQLQKRKRPVLVSGENGTARVVFQ