; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr7:7804614..7809769
RNA-Seq ExpressionMoc07g10170
SyntenyMoc07g10170
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]8.5e-11385.77Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM------
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAM      
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM------

Query:  -----------------SSEPATPAVAGPASEDPAPVIELESSGGP
                         SS+P TPAV GPASEDPAPVIELESSGGP
Subjt:  -----------------SSEPATPAVAGPASEDPAPVIELESSGGP

XP_022155229.1 uncharacterized protein LOC111022371 [Momordica charantia]7.2e-8093.59Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEY LR PLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLR RDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRG
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.0e-10296.35Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]4.2e-10495.92Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMSS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL M S
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMSS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.6e-16286.8Show/hide
Query:  MSSFFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSS  SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSFFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE EL  VDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM-----------
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAM           
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM-----------

Query:  ------------SSEPATPAVAGPASEDPAPVIELESSGGP
                    SS+PATPAV GPASEDPA VIELESSGGP
Subjt:  ------------SSEPATPAVAGPASEDPAPVIELESSGGP

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138264.1e-11385.77Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM------
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAM      
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM------

Query:  -----------------SSEPATPAVAGPASEDPAPVIELESSGGP
                         SS+P TPAV GPASEDPAPVIELESSGGP
Subjt:  -----------------SSEPATPAVAGPASEDPAPVIELESSGGP

A0A6J1DPM7 uncharacterized protein LOC1110223713.5e-8093.59Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEY LR PLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLR RDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRG
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRG

A0A6J1DWD2 uncharacterized protein LOC1110246805.0e-10396.35Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251082.0e-10495.92Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE ELL VDQLLACFEAKRIAKKPGRFYMC RKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMSS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL M S
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMSS

A0A6J1DXS5 uncharacterized protein LOC1110255023.7e-16286.8Show/hide
Query:  MSSFFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSS  SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSFFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIF LAILFWLRARDSEE EL  VDQLLACFEAKRIAKKPGRFYMC RKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM-----------
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAM           
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM-----------

Query:  ------------SSEPATPAVAGPASEDPAPVIELESSGGP
                    SS+PATPAV GPASEDPA VIELESSGGP
Subjt:  ------------SSEPATPAVAGPASEDPAPVIELESSGGP

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic3.4e-0829.44Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSE-E
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+      +  +L IL  +R+ +SE E
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSE-E

Query:  VELLYVDQLLACFEAKRIAK-KPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQAFFDTLK
        + L ++   L   E +R+ K +  R+Y+ P KG   I   P+  + +   +F+ + E    ++       V TR+G     L  + P+P+   + F  L 
Subjt:  VELLYVDQLLACFEAKRIAK-KPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQAFFDTLK

Query:  YYK----EHFPRGR
          K    +HF R R
Subjt:  YYK----EHFPRGR

Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.3e-0723.68Show/hide
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA
         F     +A +Q+       I   A L  L AR    + +  +++L +     ++  K G+ Y+   +G   +  GP+  + W+  +FYA
Subjt:  EFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA

AT2G15420.1 myosin heavy chain-related1.8e-0726.84Show/hide
Query:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAI   L A    E++  + ++        R+ 
Subjt:  PENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIA

Query:  KKPGRFYMCPRKGAGGIVKGPTS-IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKE----HFPRG
        + PG +Y    K    IV G  S I GW R++F+      + +     F D  T   +   +  V +    F D +   +E    H+P G
Subjt:  KKPGRFYMCPRKGAGGIVKGPTS-IKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKE----HFPRG

AT5G38190.1 INVOLVED IN: biological_process unknown1.1e-0624.58Show/hide
Query:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPA

Query:  QVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA
        Q+       I   A L  L AR    + +  +++L +     ++  K G+ Y+   +G   +   P+  + W+  +FYA
Subjt:  QVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTCCTCCATGTGTCCAGGGTATTCTTTCCCCCAAACATTGGCCCCCTCTCCGTTTGGTCCGATCTCGACCTGGCGGAGAAGTTCATTCGATTTGCTTCG
GACACGTGGCGACTTCCCATTCGTGGGAAAACATCACTGTTGCGGTGCATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATTTTCGGGAGGATCCCAG
CCGCTCGTTGATTACACGTGTACGCTCGAACCCTCGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGCAGTTGCCATGTCGTCCTTTTTTAGCAGCGAC
TTAGGATCCGATGAGGATTTAGCTCGTAGGTTGGAGTCCGAGCTCGAGGAGATAGAAAATTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCG
GGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAG
GGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTC
TTCCGAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGTTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGTC
GAGCTGTTGTACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCCCAAGGAAAGGCGCAGGCGGTATA
GTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCC
ACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTTCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAG
GTCGGAACCTTGGTTACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCC
ATGAGTTCGGAACCTGCCACTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTAGAGTCTTCTGGGGGTCCTAAGTCCATAACTGTT
CTTCATATTGCACCTCGTACCCTCAGGTTCAAAGGCTTGATCATTCTGAACTTTTCACATCGCCCCATTGCCTTGAAGGATAATAACGCTTCAGGTGCTCCGAGG
TTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATATGCCCTCTTCAAGGGTTAGACATTTCAATAGAGGCAGGGAAAAGCCGCGTCGGTACAACGCT
CCACCTCGGACCACGAACCGAGCTGCTTGCCTTGCCAACCTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCACAATCGGGTCCATCCAT
GAGGACTCTGGAGCGTCGATCTCCATCAGATCTGGCTCTGAGATCGAGGGATTACCTAAGATCTCGACGGGGACCGACCTTGCCAGAGGAACAAGCTCGAGCTCC
TCAGTGGGTGCGGCAAACTCCCTCCTCGGCAGGTCGGCCTTGAACTCGAGCGTCCCATCCCTACTGACGAGAGTTTCGAGGGCGCAGACCGATGAGCCTTTGAGT
GCGGAGGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGTCCTCCATGTGTCCAGGGTATTCTTTCCCCCAAACATTGGCCCCCTCTCCGTTTGGTCCGATCTCGACCTGGCGGAGAAGTTCATTCGATTTGCTTCG
GACACGTGGCGACTTCCCATTCGTGGGAAAACATCACTGTTGCGGTGCATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATTTTCGGGAGGATCCCAG
CCGCTCGTTGATTACACGTGTACGCTCGAACCCTCGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGCAGTTGCCATGTCGTCCTTTTTTAGCAGCGAC
TTAGGATCCGATGAGGATTTAGCTCGTAGGTTGGAGTCCGAGCTCGAGGAGATAGAAAATTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCG
GGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAG
GGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTC
TTCCGAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGTTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGTC
GAGCTGTTGTACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCCCAAGGAAAGGCGCAGGCGGTATA
GTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCC
ACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTTCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAG
GTCGGAACCTTGGTTACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCC
ATGAGTTCGGAACCTGCCACTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTAGAGTCTTCTGGGGGTCCTAAGTCCATAACTGTT
CTTCATATTGCACCTCGTACCCTCAGGTTCAAAGGCTTGATCATTCTGAACTTTTCACATCGCCCCATTGCCTTGAAGGATAATAACGCTTCAGGTGCTCCGAGG
TTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATATGCCCTCTTCAAGGGTTAGACATTTCAATAGAGGCAGGGAAAAGCCGCGTCGGTACAACGCT
CCACCTCGGACCACGAACCGAGCTGCTTGCCTTGCCAACCTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCACAATCGGGTCCATCCAT
GAGGACTCTGGAGCGTCGATCTCCATCAGATCTGGCTCTGAGATCGAGGGATTACCTAAGATCTCGACGGGGACCGACCTTGCCAGAGGAACAAGCTCGAGCTCC
TCAGTGGGTGCGGCAAACTCCCTCCTCGGCAGGTCGGCCTTGAACTCGAGCGTCCCATCCCTACTGACGAGAGTTTCGAGGGCGCAGACCGATGAGCCTTTGAGT
GCGGAGGCATAG
Protein sequenceShow/hide protein sequence
MAVLHVSRVFFPPNIGPLSVWSDLDLAEKFIRFASDTWRLPIRGKTSLLRCIYRRNIQIFRRFGFSGGSQPLVDYTCTLEPSVGRSLPSLSLSNAVAMSSFFSSD
LGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFL
FRTGLAPAQVAPNGWGVIFVLAILFWLRARDSEEVELLYVDQLLACFEAKRIAKKPGRFYMCPRKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP
TRFGNLVSIRPVPELTQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMSSEPATPAVAGPASEDPAPVIELESSGGPKSITV
LHIAPRTLRFKGLIILNFSHRPIALKDNNASGAPRFHGCARTSPFRSANMPSSRVRHFNRGREKPRRYNAPPRTTNRAACLANLLRSLGSCGELPLMKSTIGSIH
EDSGASISIRSGSEIEGLPKISTGTDLARGTSSSSSVGAANSLLGRSALNSSVPSLLTRVSRAQTDEPLSAEA