; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g1598 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g1598
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHD domain-containing protein
Genome locationMC08:23717882..23720844
RNA-Seq ExpressionMC08g1598
SyntenyMC08g1598
Gene Ontology termsGO:0046856 - phosphatidylinositol dephosphorylation (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0043812 - phosphatidylinositol-4-phosphate phosphatase activity (molecular function)
InterPro domainsIPR003607 - HD/PDEase domain
IPR006674 - HD domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606900.1 hypothetical protein SDJN03_00242, partial [Cucurbita argyrosperma subsp. sororia]9.91e-14391.96Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAE+LVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSS  DSMEIVELAALLHDIGDYKYLRDP+EEK+VE+FLEEEGIE+NKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK +YSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKR LH+PAIRPRTSLSK+ YMNK+EQTTVNHFHEKLLKIKD+MKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWD KA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

XP_022153187.1 uncharacterized protein LOC111020741 [Momordica charantia]6.06e-15399.55Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWDGKA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

XP_022949582.1 uncharacterized protein LOC111452893 [Cucurbita moschata]2.96e-14492.86Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAE+LVEMAMGGNDASHDPSHVWRVRDLALSLA+EEGLSS  DSMEIVELAALLHDIGDYKYLRDP+EEK+VENFLEEEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK +YSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKR LHDPAIRPRTSLSK+ YMNK+EQTTVNHFHEKLLKIKD+MKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWD KA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

XP_022998496.1 uncharacterized protein LOC111493111 [Cucurbita maxima]3.46e-14392.86Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAE+LVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSS  DSMEIVELAALLHDIGDYKYLRDP+EEK+VENFLEEEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK +YSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKRVLHDPAI PRTSLSK+ YMNK+EQTTVNHFHEKLLKIK +MKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWD KA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

XP_023523543.1 uncharacterized protein LOC111787737 [Cucurbita pepo subsp. pepo]1.03e-14493.3Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAE+LVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSS  DSMEIVELAALLHDIGDYKYLRDP+EEK+VENFLEEEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK +YSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKR LHDPAIRPRTSLSK+ YMNK+EQTTVNHFHEKLLKIKD+MKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWD KA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

TrEMBL top hitse value%identityAlignment
A0A1S3CFI0 uncharacterized protein YpgQ3.94e-14291.96Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MA+RETVKKAE+LVE+AMGGNDASHDPSHVWRVRDLALSLA+EEGLSS+ DSMEIVELAALLHDIGDYKYLRD +EEK+VENFL EEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK EYSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKRVLHDPAI PRT LSK+ YMNK+EQTTVNHFHEKLLKIKDLMKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWDGKA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

A0A5D3BWV0 Metal-dependent phosphohydrolase3.94e-14291.96Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MA+RETVKKAE+LVE+AMGGNDASHDPSHVWRVRDLALSLA+EEGLSS+ DSMEIVELAALLHDIGDYKYLRD +EEK+VENFL EEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK EYSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKRVLHDPAI PRT LSK+ YMNK+EQTTVNHFHEKLLKIKDLMKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWDGKA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

A0A6J1DI90 uncharacterized protein LOC1110207412.93e-15399.55Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWDGKA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

A0A6J1GCI5 uncharacterized protein LOC1114528931.43e-14492.86Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAE+LVEMAMGGNDASHDPSHVWRVRDLALSLA+EEGLSS  DSMEIVELAALLHDIGDYKYLRDP+EEK+VENFLEEEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK +YSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKR LHDPAIRPRTSLSK+ YMNK+EQTTVNHFHEKLLKIKD+MKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWD KA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

A0A6J1KAC3 uncharacterized protein LOC1114931111.67e-14392.86Show/hide
Query:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI
        MARRETVKKAE+LVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSS  DSMEIVELAALLHDIGDYKYLRDP+EEK+VENFLEEEGIEENKKQ+ILAI
Subjt:  MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAI

Query:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR
        IKGMGFKEEIAGLSK +YSPEFGVVQDADRLDAIGAIG IARCFTFGGSKKRVLHDPAI PRTSLSK+ YMNK+EQTTVNHFHEKLLKIK +MKTKAGQR
Subjt:  IKGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQR

Query:  RAEKRHKFMEEFLKEFYDEWDGKA
        RAEKRHKFMEEFLKEFYDEWD KA
Subjt:  RAEKRHKFMEEFLKEFYDEWDGKA

SwissProt top hitse value%identityAlignment
P46144 Uncharacterized protein YedJ4.9e-0829.38Show/hide
Query:  DASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIV------ENFLEEEGIEENKKQRILAI---IKGMGFKEEIAG
        DA+HD  H  RV   A  LA ++ +      M ++  A   HDI          +   +         L EE  E+   ++I A+   I    F  +IA 
Subjt:  DASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIV------ENFLEEEGIEENKKQRILAI---IKGMGFKEEIAG

Query:  LSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRRAEKRHKFMEEF
        L     + E  +VQDADRL+A+GAIG +AR F   G+    L D           Q     D++  ++HF  KLLK+   M+T  G++ A+    F+ EF
Subjt:  LSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRRAEKRHKFMEEF

Query:  LKEFYDEWDGK
        + +   E  G+
Subjt:  LKEFYDEWDGK

P54168 Uncharacterized protein YpgQ3.0e-2136.2Show/hide
Query:  VKKAEDL---VEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENF--LEEEGIEENKKQRILAII
        +K+AE +   V+  +    + HD  HV RV DLA  + E+E        + IVE AAL+HD+ D K L D     + E +  L   G+ +    R++ II
Subjt:  VKKAEDL---VEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENF--LEEEGIEENKKQRILAII

Query:  KGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRR
          M F++    L+K   S E   VQDADRLDAIGA+G IAR F F G+K   L+                  DEQ+   HF  KLL++KD+M T   +  
Subjt:  KGMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRR

Query:  AEKRHKFMEEFLKEFYDEWDG
        AE+RH FM +F+++   +  G
Subjt:  AEKRHKFMEEFLKEFYDEWDG

Q5UR59 Uncharacterized protein L8037.8e-0628.28Show/hide
Query:  EMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGD------------YKYLRDPAEEKIVENFLEEEGIEENKKQRILAIIK
        E     N A HD  H   VR+ A+   + E +S+S      VE AA+LHD+ D             KY+ D    K+    +  +   +  KQ I+ +I 
Subjt:  EMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGD------------YKYLRDPAEEKIVENFLEEEGIEENKKQRILAIIK

Query:  GMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLH-DPAIRPRTSLSKQEYMNKD----------EQTTVNHFHEKLLKI
         +   +   G    E S    + +DADRL+AIG IG I RC  +    K   + D   R +TS     + N+D            + ++H+++KLL I
Subjt:  GMGFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLH-DPAIRPRTSLSKQEYMNKD----------EQTTVNHFHEKLLKI

Arabidopsis top hitse value%identityAlignment
AT1G17330.1 Metal-dependent phosphohydrolase1.9e-9577.06Show/hide
Query:  ETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAIIKGM
        +T++KAE+LVE AM GNDASHD  HVWRVRDLALS+A EEGLSS+ DSMEIVELAALLHDIGDYKY+RDP+EEK+VENFL++EGIEE KK +IL II GM
Subjt:  ETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAIIKGM

Query:  GFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRRAEK
        GFK+E+AG++  E  PEFGVVQDADRLDAIGAIG IARCFTFGGS+ RVLHDP I+PRT L+K++Y+ ++EQTT+NHFHEKLLK+K LMKT+AG+RRAEK
Subjt:  GFKEEIAGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRRAEK

Query:  RHKFMEEFLKEFYDEWDG
        RHKFMEE+LKEFY+EWDG
Subjt:  RHKFMEEFLKEFYDEWDG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGAAGAGAAACAGTGAAGAAGGCGGAGGACCTGGTGGAGATGGCGATGGGCGGGAACGACGCGTCGCACGATCCTTCTCACGTCTGGAGAGTTCGAGACCTTGC
TCTCTCTCTGGCTGAAGAAGAAGGCCTCTCTTCCAGCATCGACTCCATGGAAATCGTCGAACTTGCTGCGCTCCTTCATGATATAGGAGATTACAAGTACTTGAGAGACC
CAGCTGAGGAAAAAATTGTGGAGAATTTTCTTGAGGAAGAGGGAATAGAGGAGAACAAGAAACAAAGGATATTAGCAATCATAAAGGGCATGGGCTTCAAGGAAGAGATT
GCCGGGCTTTCAAAAGCTGAATATTCTCCGGAGTTTGGGGTGGTTCAAGATGCTGATCGTCTAGATGCAATTGGTGCTATTGGTAGAATTGCCCGATGCTTCACTTTTGG
TGGAAGCAAGAAGCGAGTGCTGCACGATCCTGCAATTCGTCCTCGAACGAGTTTATCAAAACAAGAGTACATGAACAAAGACGAGCAGACTACCGTGAACCACTTTCACG
AGAAGCTATTGAAAATAAAGGATTTGATGAAAACAAAGGCTGGACAAAGGAGGGCAGAGAAAAGGCACAAATTCATGGAGGAATTTCTGAAGGAATTTTATGATGAATGG
GATGGAAAAGCT
mRNA sequenceShow/hide mRNA sequence
TTTAGAAATTATCCTTCAAAGAGAGAGAAAAAGAATTCAAAGGATGAAATATCAGTAGCCGCCAGAGCCCGGTGGGTCTTGACTCTTTCTGTTCCATCTCCAGTTAGCCG
CGGCGGCGCGTGAGCGACGAACCGGAGAAGAGGGAAATGGCGAGAAGAGAAACAGTGAAGAAGGCGGAGGACCTGGTGGAGATGGCGATGGGCGGGAACGACGCGTCGCA
CGATCCTTCTCACGTCTGGAGAGTTCGAGACCTTGCTCTCTCTCTGGCTGAAGAAGAAGGCCTCTCTTCCAGCATCGACTCCATGGAAATCGTCGAACTTGCTGCGCTCC
TTCATGATATAGGAGATTACAAGTACTTGAGAGACCCAGCTGAGGAAAAAATTGTGGAGAATTTTCTTGAGGAAGAGGGAATAGAGGAGAACAAGAAACAAAGGATATTA
GCAATCATAAAGGGCATGGGCTTCAAGGAAGAGATTGCCGGGCTTTCAAAAGCTGAATATTCTCCGGAGTTTGGGGTGGTTCAAGATGCTGATCGTCTAGATGCAATTGG
TGCTATTGGTAGAATTGCCCGATGCTTCACTTTTGGTGGAAGCAAGAAGCGAGTGCTGCACGATCCTGCAATTCGTCCTCGAACGAGTTTATCAAAACAAGAGTACATGA
ACAAAGACGAGCAGACTACCGTGAACCACTTTCACGAGAAGCTATTGAAAATAAAGGATTTGATGAAAACAAAGGCTGGACAAAGGAGGGCAGAGAAAAGGCACAAATTC
ATGGAGGAATTTCTGAAGGAATTTTATGATGAATGGGATGGAAAAGCT
Protein sequenceShow/hide protein sequence
MARRETVKKAEDLVEMAMGGNDASHDPSHVWRVRDLALSLAEEEGLSSSIDSMEIVELAALLHDIGDYKYLRDPAEEKIVENFLEEEGIEENKKQRILAIIKGMGFKEEI
AGLSKAEYSPEFGVVQDADRLDAIGAIGRIARCFTFGGSKKRVLHDPAIRPRTSLSKQEYMNKDEQTTVNHFHEKLLKIKDLMKTKAGQRRAEKRHKFMEEFLKEFYDEW
DGKA