; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g1650 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g1650
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHistone-lysine N-methyltransferase EZA1 isoform X3
Genome locationMC08:24467283..24468659
RNA-Seq ExpressionMC08g1650
SyntenyMC08g1650
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606937.1 hypothetical protein SDJN03_00279, partial [Cucurbita argyrosperma subsp. sororia]8.71e-11270.16Show/hide
Query:  QPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC
        +PSF FHLF VNLLGLLLPLSFLLL RLSSA YL A    PPPL LSLILYLNSPLL++LV+ VIVSALLHSLTGKS L    PG +SQPRLYAAWI LC
Subjt:  QPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC

Query:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA--
        T QVCVGV IEGS+S+GL DA V RV+GG WSR+LFFLGLHE+VVHWT TVVKPVVDDTV+GE+RKERWFE  AT +SFSGLWWWRLRDEAEALVV A  
Subjt:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA--

Query:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR
                    D  GW LYYV VA+GIAK+V+S+A F    +F +++
Subjt:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR

XP_008454283.1 PREDICTED: uncharacterized protein LOC103494729 [Cucumis melo]3.62e-11269.17Show/hide
Query:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAAL-PWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWI
        +L  SF FHLF +NLLGLLLPLS LLL RLSSA YL AL PW PP  +LSLILY+NSPLL++LVSFVI+S LLHSLTGKS LP KLPGP+SQPRLY AWI
Subjt:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAAL-PWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWI

Query:  FLCTLQVCVGVVIEGSISVGLND-AAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV
        FLCTLQVCVGV IEGS+S GLND  +   V+GG W R+LFF GLHEAVVHWT  VVKPVVDDT+YGE+RKE+WFE AAT VS  GLWWWRLRDEAE LVV
Subjt:  FLCTLQVCVGVVIEGSISVGLND-AAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV

Query:  AA-------------DFAGWWLYYVTVAIGIAKIVR-SLAWFGRNFVFKKQRK
         A             D +GW LYY+TV IGIAKIV+  + WFG  FV +K  K
Subjt:  AA-------------DFAGWWLYYVTVAIGIAKIVR-SLAWFGRNFVFKKQRK

XP_022949038.1 uncharacterized protein LOC111452502 [Cucurbita moschata]5.28e-11370.56Show/hide
Query:  QPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC
        +PSF FHLF VNLLGLLLPLSFLLL RLSSA YL A    PPPL LSLILYLNSPLL++LV+FVIVSALLHSLTGKS L    PG +SQPRLYAAWI LC
Subjt:  QPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC

Query:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA--
        T QVCVGV IEGS+S+GL DA V RV+GG WSR+LFFLGLHE+VVHWT TVVKPVVDDTV+GE+RKERWFE  AT +SFSGLWWWRLRDEAEALVV A  
Subjt:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA--

Query:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR
                    D  GW LYYV VA+GIAK+V+S+A F    +F +++
Subjt:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR

XP_023524601.1 uncharacterized protein LOC111788496 [Cucurbita pepo subsp. pepo]1.75e-11169.88Show/hide
Query:  LQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFL
        L+PSF FHLF VNLLGLLLPLSFLLL RLSSA YL A    PPPL LSLILYLNSPLL++LV+FVIVSALLHSLTGKS L    PG +SQPRLYAAWI L
Subjt:  LQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFL

Query:  CTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA-
        CT QVCVGV IEGS+S+GL+D  V RV+GG WSR+LFFLGLHE+VVHWT TVVKPVVDDTV+GE RKERWFE  AT +SF GLWWWRLRDEAEALVV A 
Subjt:  CTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA-

Query:  -------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR
                     D  GW LYYV VA+GIAK+V S+A F     F +++
Subjt:  -------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR

XP_038904328.1 uncharacterized protein LOC120090682 [Benincasa hispida]6.50e-11771.2Show/hide
Query:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIF
        +L PSF FHLF +NLLGLLLPLSFLLL RLSS  YL  L     PL+LSLILY+NSPLL++LVSFVIVS L HSLTGKSALP KLPGP+SQPRLY AWIF
Subjt:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIF

Query:  LCTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA
        LCTLQVCVGV IEGS+S GLN AA   ++GG W R+LFF GLHEAVVHWT  VVKPVVDDTV+GE+RKE+WFE AAT VS  GLWWWRLRDEAEALVV A
Subjt:  LCTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA

Query:  -------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQRK
                     D +GW LYY+TVAIGIAKIVR +AWFG  FV  K  K
Subjt:  -------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQRK

TrEMBL top hitse value%identityAlignment
A0A0A0KWP5 Uncharacterized protein1.01e-11169.17Show/hide
Query:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAAL-PWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWI
        +L  SF FHLF +NLLGLLLPLS LLL RLSSA YL AL PW PP  +LSLILY+NSPLL++LVSFVI+S LLHSLTGKS LP KLPGP+SQPRLY AWI
Subjt:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAAL-PWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWI

Query:  FLCTLQVCVGVVIEGSISVGLND-AAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV
        FLCTLQVCVGV IEGS+S GLND  +   V+GG W R+LFFLGLHEAVVHWT  VVKPVVDDT+YGE R E+WFE AAT VS  GLWWWRLRDEAE LVV
Subjt:  FLCTLQVCVGVVIEGSISVGLND-AAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV

Query:  AA-------------DFAGWWLYYVTVAIGIAKIVR-SLAWFGRNFVFKKQRK
         A             D +GW LYY+TV IGIAKIV+  + WFG +FV K   K
Subjt:  AA-------------DFAGWWLYYVTVAIGIAKIVR-SLAWFGRNFVFKKQRK

A0A1S3BZH0 uncharacterized protein LOC1034947291.75e-11269.17Show/hide
Query:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAAL-PWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWI
        +L  SF FHLF +NLLGLLLPLS LLL RLSSA YL AL PW PP  +LSLILY+NSPLL++LVSFVI+S LLHSLTGKS LP KLPGP+SQPRLY AWI
Subjt:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAAL-PWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWI

Query:  FLCTLQVCVGVVIEGSISVGLND-AAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV
        FLCTLQVCVGV IEGS+S GLND  +   V+GG W R+LFF GLHEAVVHWT  VVKPVVDDT+YGE+RKE+WFE AAT VS  GLWWWRLRDEAE LVV
Subjt:  FLCTLQVCVGVVIEGSISVGLND-AAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV

Query:  AA-------------DFAGWWLYYVTVAIGIAKIVR-SLAWFGRNFVFKKQRK
         A             D +GW LYY+TV IGIAKIV+  + WFG  FV +K  K
Subjt:  AA-------------DFAGWWLYYVTVAIGIAKIVR-SLAWFGRNFVFKKQRK

A0A6J1GAY8 uncharacterized protein LOC1114525022.56e-11370.56Show/hide
Query:  QPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC
        +PSF FHLF VNLLGLLLPLSFLLL RLSSA YL A    PPPL LSLILYLNSPLL++LV+FVIVSALLHSLTGKS L    PG +SQPRLYAAWI LC
Subjt:  QPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC

Query:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA--
        T QVCVGV IEGS+S+GL DA V RV+GG WSR+LFFLGLHE+VVHWT TVVKPVVDDTV+GE+RKERWFE  AT +SFSGLWWWRLRDEAEALVV A  
Subjt:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA--

Query:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR
                    D  GW LYYV VA+GIAK+V+S+A F    +F +++
Subjt:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQR

A0A6J1J235 uncharacterized protein LOC1114820027.44e-11169.6Show/hide
Query:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIF
        +L+P F FHL  VNLL LLLPLS LLL RLSSA YLA LP  PP L+LSLILY+ SPLL +LVSFV+VSALLHSLTGKSALP KLP P+SQPRLY  WIF
Subjt:  VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIF

Query:  LCTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA
        LCTLQVCVGV IEGS+S GLN AA   V+GG W R+LFF GLHEAVVHWT  VVKPVVDDTV+GE+RKERWFE AAT VS  G+WWWRLRDEA+ALVV A
Subjt:  LCTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA

Query:  DF-------------AGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQRK
        +              A W LYY+ VAIGIAKIV S+AW  R  V KK  K
Subjt:  DF-------------AGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKKQRK

A0A6J1K8H9 uncharacterized protein LOC1114932124.89e-11169.72Show/hide
Query:  LQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFL
        L+P F FHLF VNLLGLLLPLSFLLL RLSSA YL A    PPPL+LSLI YLNSPLL++LV+FVIVSALLHSLTGKS L A  PG +SQPRLYAAWI L
Subjt:  LQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFL

Query:  CTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA-
        CT QVCVGV IEGS+S+GL+D  V R++GG WSR+LFFLGLHE+VVHWT  VVKPVVDDTV+GE++KERWFE  AT +SF GLWWWRLRDEAEALVV A 
Subjt:  CTLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA-

Query:  -------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVF-KKQRK
                     DF GW LYYV VA+GIAKIV+S+A F    +F +KQ K
Subjt:  -------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVF-KKQRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02570.1 unknown protein3.8e-3839.92Show/hide
Query:  FHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLCTLQVC
        F + +++LL LL+PLSFL L RLS +   A +  +    V SL+   +  +LY ++S +IVS L+H L+GK          +    LY  WI L  +Q C
Subjt:  FHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLCTLQVC

Query:  VGVVIEGSISVGLN---DAAVDRVKGGQW--SRVLFFLGLHEAVVHWTSTVVKPVVDDTVYG-EARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA-
        V   IEG++S  ++   D +       +W   RV+FFLGLHE ++ W   VVKPV+DDTV+G    +ERW E+A   V+F  +WWWRLRDE E+LVV A 
Subjt:  VGVVIEGSISVGLN---DAAVDRVKGGQW--SRVLFFLGLHEAVVHWTSTVVKPVVDDTVYG-EARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA-

Query:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWF
                    DF  WW+YY+ V IG+ KI +   +F
Subjt:  ------------DFAGWWLYYVTVAIGIAKIVRSLAWF

AT1G02575.1 unknown protein5.3e-3238Show/hide
Query:  FHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLCTLQVC
        F + +++ L LLLPLSFL L RLS   Y ++ P T    V S+I   +  +LY ++  +IV  L+HSL+GK          +    LY  WI L   Q C
Subjt:  FHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLCTLQVC

Query:  V-GVVIEGSISVGLN-DAAVDRVKGGQWS--RVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA---
          G+    S ++ +N D  +      +W   RVLFFLGLHE ++ W   VVKPVVD+T+YG   +ERW E+A   V+F  +WWWRLRDE E+LVV     
Subjt:  V-GVVIEGSISVGLN-DAAVDRVKGGQWS--RVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAA---

Query:  -----------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFV--FKKQRK
                   +F  W +YY+ V IG+ KI +    F        K+ RK
Subjt:  -----------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFV--FKKQRK

AT2G47360.1 unknown protein8.7e-3537.55Show/hide
Query:  FHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPP-----LVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC
        F L T  LL LLLPLSFLLL RLSSA +L +L  + P       V SL L  N  ++Y +VS + V  L+  LT K             P +  AW+ L 
Subjt:  FHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPP-----LVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLC

Query:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVY----GEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV
         +Q+ VG+ +E +IS GL    +   +    SR++FF GLHE ++ W   +V+PVVD+T+     G+ R+E   E+ A  VS   LWWW+LRDE EALV 
Subjt:  TLQVCVGVVIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVY----GEARKERWFEKAATMVSFSGLWWWRLRDEAEALVV

Query:  AA-------------------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKK
         A                         DF  WWLYY+ V IG+ +I++   WFG   +F++
Subjt:  AA-------------------------DFAGWWLYYVTVAIGIAKIVRSLAWFGRNFVFKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GTTCTTCAACCTTCCTTCTTCTTTCATCTCTTCACCGTCAACCTCCTCGGCCTCTTGCTTCCTCTCTCCTTTCTCCTCCTCGGCCGCCTCTCCTCCGCCGGATATCTCGC
CGCACTGCCATGGACGCCGCCGCCCTTGGTTCTCTCTCTCATCCTCTATCTAAACTCTCCTCTTCTTTACATTCTTGTCTCGTTCGTCATCGTCTCCGCCCTCCTCCACT
CCCTCACCGGGAAATCCGCTCTCCCCGCTAAGCTCCCGGGTCCTATCTCCCAACCACGCCTCTACGCTGCTTGGATTTTCCTATGCACCCTACAAGTCTGTGTCGGTGTT
GTAATCGAGGGAAGTATATCGGTCGGCCTCAACGATGCAGCGGTCGACCGTGTCAAGGGCGGCCAATGGAGTAGGGTATTGTTCTTCTTGGGACTGCACGAGGCGGTGGT
GCATTGGACGAGTACTGTGGTGAAGCCGGTGGTGGATGACACCGTGTACGGGGAAGCTCGGAAGGAAAGGTGGTTTGAGAAGGCGGCGACCATGGTGAGCTTCAGCGGCC
TATGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGTGGTTGCGGCGGACTTTGCCGGTTGGTGGTTGTATTACGTCACCGTCGCCATCGGAATTGCTAAGATTGTC
AGGTCTCTTGCTTGGTTTGGTCGGAATTTTGTCTTTAAAAAACAACGTAAA
mRNA sequenceShow/hide mRNA sequence
GTTCTTCAACCTTCCTTCTTCTTTCATCTCTTCACCGTCAACCTCCTCGGCCTCTTGCTTCCTCTCTCCTTTCTCCTCCTCGGCCGCCTCTCCTCCGCCGGATATCTCGC
CGCACTGCCATGGACGCCGCCGCCCTTGGTTCTCTCTCTCATCCTCTATCTAAACTCTCCTCTTCTTTACATTCTTGTCTCGTTCGTCATCGTCTCCGCCCTCCTCCACT
CCCTCACCGGGAAATCCGCTCTCCCCGCTAAGCTCCCGGGTCCTATCTCCCAACCACGCCTCTACGCTGCTTGGATTTTCCTATGCACCCTACAAGTCTGTGTCGGTGTT
GTAATCGAGGGAAGTATATCGGTCGGCCTCAACGATGCAGCGGTCGACCGTGTCAAGGGCGGCCAATGGAGTAGGGTATTGTTCTTCTTGGGACTGCACGAGGCGGTGGT
GCATTGGACGAGTACTGTGGTGAAGCCGGTGGTGGATGACACCGTGTACGGGGAAGCTCGGAAGGAAAGGTGGTTTGAGAAGGCGGCGACCATGGTGAGCTTCAGCGGCC
TATGGTGGTGGCGGCTGAGGGATGAGGCGGAGGCGCTGGTGGTTGCGGCGGACTTTGCCGGTTGGTGGTTGTATTACGTCACCGTCGCCATCGGAATTGCTAAGATTGTC
AGGTCTCTTGCTTGGTTTGGTCGGAATTTTGTCTTTAAAAAACAACGTAAA
Protein sequenceShow/hide protein sequence
VLQPSFFFHLFTVNLLGLLLPLSFLLLGRLSSAGYLAALPWTPPPLVLSLILYLNSPLLYILVSFVIVSALLHSLTGKSALPAKLPGPISQPRLYAAWIFLCTLQVCVGV
VIEGSISVGLNDAAVDRVKGGQWSRVLFFLGLHEAVVHWTSTVVKPVVDDTVYGEARKERWFEKAATMVSFSGLWWWRLRDEAEALVVAADFAGWWLYYVTVAIGIAKIV
RSLAWFGRNFVFKKQRK