; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g26370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g26370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr9:19720084..19721551
RNA-Seq ExpressionMoc09g26370
SyntenyMoc09g26370
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.8e-9297.65Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE

XP_022155229.1 uncharacterized protein LOC111022371 [Momordica charantia]4.3e-8295.51Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEY LR PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRG
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]7.8e-9297.65Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]2.1e-9298.24Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKEHFPRGRKVGTLVTDKLLLE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.8e-13984.33Show/hide
Query:  MSSSFISDLGSDEDLARRLDSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS  S+L  + DLARRL+S+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFISDLGSDEDLARRLDSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE---LRWKVELLKKEEDRRKAQLRAAHAITKGLEKE
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLE   L +   +   E  R  ++L        G++++
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE---LRWKVELLKKEEDRRKAQLRAAHAITKGLEKE

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138263.8e-9297.65Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE

A0A6J1DPM7 uncharacterized protein LOC1110223712.1e-8295.51Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEY LR PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRG
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG

A0A6J1DWD2 uncharacterized protein LOC1110246803.8e-9297.65Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE

A0A6J1DWF1 uncharacterized protein LOC1110251081.0e-9298.24Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSI+GWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKEHFPRGRKVGTLVTDKLLLE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE

A0A6J1DXS5 uncharacterized protein LOC1110255021.9e-13984.33Show/hide
Query:  MSSSFISDLGSDEDLARRLDSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS  S+L  + DLARRL+S+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFISDLGSDEDLARRLDSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSI+GWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE---LRWKVELLKKEEDRRKAQLRAAHAITKGLEKE
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLE   L +   +   E  R  ++L        G++++
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLE---LRWKVELLKKEEDRRKAQLRAAHAITKGLEKE

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic2.8e-0727.7Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA

Query:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQAFFDTLKY
          + +  L    E +R+ K +  R+Y+   KG   I   P+  E +   +F+ + E    ++       V TR+G     L  + P+P+   + F  L  
Subjt:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQAFFDTLKY

Query:  YK----EHFPRGR
         K    +HF R R
Subjt:  YK----EHFPRGR

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.7e-0722.86Show/hide
Query:  RLDSELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R++++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLDSELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYASGEW-LAKDE
         F     +A +Q+       I   A L  L AR       L V+ +       ++  K G+ Y+ + +G   +  GP+    W+  +FYA  +  L +D 
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYASGEW-LAKDE

Query:  SGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLELRWK-VELLK-KEEDRRKAQLRAA
        S    F +        ++R +   ++   D  +  +E  P                + WK V  LK K++++RK Q  +A
Subjt:  SGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLELRWK-VELLK-KEEDRRKAQLRAA

AT2G15420.1 myosin heavy chain-related4.0e-0927.89Show/hide
Query:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGPTS-IEGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKE----HFPRG
        + PG +Y  A K    IV G  S I GW R++F+      + +     F D  T   +   +  V +    F D +   +E    H+P G
Subjt:  KKPGRFYMCARKGAGGIVKGPTS-IEGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKE----HFPRG

AT3G42060.1 myosin heavy chain-related2.7e-0527.06Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE
        SR    + G        PE +   IPE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE

Query:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IEGWVRKWFYASGEWLAKDESGRSFFDV
        ++D+D L     +  I  K  R  +CA    G  I  G TS +  W + +F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IEGWVRKWFYASGEWLAKDESGRSFFDV

AT5G38190.1 INVOLVED IN: biological_process unknown1.8e-0624.21Show/hide
Query:  RLDSELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R D++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLDSELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA
         F     +A +Q+       I   A L  L AR       L V+ +       ++  K G+ Y+ + +G   +   P+    W+  +FYA
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTATCAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGACTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGT
GCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCGAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCA
GGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTTCTTCGACACGTTGAAATATTACAAG
GAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGCTGAGGTGGAAGGTCGAGCTGCTGAAGAAAGAAGAGGACAGACGC
AAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAG
GAGGAGGAGCTGAAGCATGCGACTGTTGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGAT
GGATTTGCCAAAGACTTCTCTTACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGG
TATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGCTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAG
GATCAGGTCAGTACCACTCAAGAGGGCACTCCTCAAGCAGGCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTATCAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGACTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGT
GCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCGAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCA
GGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTTCTTCGACACGTTGAAATATTACAAG
GAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGCTGAGGTGGAAGGTCGAGCTGCTGAAGAAAGAAGAGGACAGACGC
AAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAG
GAGGAGGAGCTGAAGCATGCGACTGTTGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGAT
GGATTTGCCAAAGACTTCTCTTACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGG
TATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGCTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAG
GATCAGGTCAGTACCACTCAAGAGGGCACTCCTCAAGCAGGCACTTAG
Protein sequenceShow/hide protein sequence
MSSSFISDLGSDEDLARRLDSELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPL
HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIEGWVRKWFYASGEWLAKDES
GRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLELRWKVELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAK
EEELKHATVELETVKERLSNGALLEESFRQHPDFDGFAKDFSYAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALLDKYVRDLDSDYSDLEE
DQVSTTQEGTPQAGT