CuGenDBv2

Moc03g19950 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g19950
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr3:13454588..13459828
RNA-Seq Expression: Moc03g19950
Synteny: Moc03g19950
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
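The genome location is given in chr:start..end form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the location can be split into its parts and the span length computed. The helper name parse_location is hypothetical.

```python
import re

def parse_location(loc):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr3:13454588..13459828")
# Span length, assuming 1-based inclusive coordinates (an assumption).
span = end - start + 1
print(chrom, start, end, span)  # chr3 13454588 13459828 5241
```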


Homology
GenBank top hits | e value | % identity
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 3.6e-139 | 96.91
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL
         VKRKSKGRAHALEAAQSSKP TP V GPASEDPAPVIELESSGGPSREKRPRDQTEA+
Subjt:  SVKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL

XP_022155229.1 uncharacterized protein LOC111022371 [Momordica charantia] | 1.6e-83 | 97.44
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEY LR PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 4.7e-107 | 100
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | 8.8e-106 | 97.94
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 6.2e-192 | 96.59
Query:  MSSSFSSNLGSDLARRLESELEEVENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLARRLES+LEE+EN R+SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLARRLESELEEVENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL
        GRAHALEAAQSSKPATP V GPASEDPA VIELESSGGPSREKRPRDQTEA+
Subjt:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL

TrEMBL top hits | e value | % identity
A0A6J1CR42 uncharacterized protein LOC111013826 | 1.7e-139 | 96.91
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL
         VKRKSKGRAHALEAAQSSKP TP V GPASEDPAPVIELESSGGPSREKRPRDQTEA+
Subjt:  SVKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL

A0A6J1DPM7 uncharacterized protein LOC111022371 | 7.8e-84 | 97.44
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEY LR PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLR RDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG

A0A6J1DWD2 uncharacterized protein LOC111024680 | 2.3e-107 | 100
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC111025108 | 4.3e-106 | 97.94
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502 | 3.0e-192 | 96.59
Query:  MSSSFSSNLGSDLARRLESELEEVENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSS SSNL SDLARRLES+LEE+EN R+SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSFSSNLGSDLARRLESELEEVENFRLSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL
        GRAHALEAAQSSKPATP V GPASEDPA VIELESSGGPSREKRPRDQTEA+
Subjt:  GRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEAL

SwissProt top hits | e value | % identity
P0C2F6 Putative ribonuclease H protein At1g65750 | 3.7e-06 | 45.45
Query:  NLVKWSWLASPHKQGGLGIGSLRLRNNALLMKWLWRFTVERDSLWRRVIATIYGV
        +LVKWS + SP K+GGLG+ + +  N AL+ K  WR   E++SLW  V+   Y V
Subjt:  NLVKWSWLASPHKQGGLGIGSLRLRNNALLMKWLWRFTVERDSLWRRVIATIYGV

Q9LEX8 Uncharacterized protein At3g60930, chloroplastic | 1.1e-05 | 26.73
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA

Query:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKY
          + +  L    E +R+ K +  R+Y+   KG   I   P+  + +   +F+ + E    ++       V TR+G     L  + P+P+   ++F  L  
Subjt:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKY

Query:  YK
         K
Subjt:  YK

Arabidopsis top hits | e value | % identity
AT2G15420.1 myosin heavy chain-related | 1.7e-06 | 25.39
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG
        + PG +Y  A K    IV G  S I GW R++F+                 +W    E      D P  F  L +I  + EL    + T  + + R  R 
Subjt:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG

Query:  RKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPVVAGPASED---PAPVIE---LESSGG--PSR
        R +G ++           +    +  +E S   +E  +      +   +S GR  A E+A            P +ED      V+    L S GG  PS+
Subjt:  RKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPVVAGPASED---PAPVIE---LESSGG--PSR

Query:  EKRPRDQTE--ALGPPRSP
        ++  RD  E  +   P++P
Subjt:  EKRPRDQTE--ALGPPRSP

AT3G42060.1 myosin heavy chain-related | 5.5e-05 | 26.47
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE
        SR    + G        PE +   +PE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAE

Query:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASGEWLAKDESGRSFFDV
        ++D+D L     +  I  K  R  +CA    G  I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASGEWLAKDESGRSFFDV
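The % identity values listed for each hit can be checked against the middle line of the corresponding alignment, where a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch. The sketch below does this for the P0C2F6 alignment in the SwissProt top hits, assuming identity is computed as identical columns divided by alignment length (the usual BLAST convention; the page does not state how its values were derived). The helper name percent_identity is hypothetical.

```python
def percent_identity(query, midline):
    """Return (identical columns, % identity) for one alignment block.

    A letter in the midline marks an identical residue; '+' and ' '
    mark non-identical columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for m in midline if m.isalpha())
    return identical, 100.0 * identical / len(query)

# Query and midline copied from the P0C2F6 alignment.
query   = "NLVKWSWLASPHKQGGLGIGSLRLRNNALLMKWLWRFTVERDSLWRRVIATIYGV"
midline = "+LVKWS + SP K+GGLG+ + +  N AL+ K  WR   E++SLW  V+   Y V"

identical, pct = percent_identity(query, midline)
print(identical, len(query), round(pct, 2))  # 25 55 45.45
```

The result agrees with the 45.45 reported for this hit. For alignments that span several blocks, the counts would need to be summed over all blocks before dividing.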


Sequences
CDS sequence
ATGGGGAATTTGGTGAAGTGGTCTTGGTTGGCTTCGCCTCATAAACAGGGGGGCCTTGGAATTGGCTCATTGAGGCTAAGGAACAATGCTCTATTAATGAAATGG
TTATGGCGGTTCACAGTGGAAAGAGATAGCCTTTGGAGACGGGTGATAGCCACTATTTACGGTGTGGAGTTTTTTGGATGGTGGACAAAACCATCAGATTATGAG
AGGACCCATGGATTGATTCTGCCCCTTTCCACCACTTTCCCAGATTTGTTTGCATTATCCTCCAAGAAAGGTGCGGCTATTGGAAATGGAAGGGACATTTTATGG
TGGAAAGTAGACCCATCCGACCGTTTCACAGTCAATTCAGCCTTCATGGCTCTCACGTCCCCTTCTCTTAGACTTAATTCAGCTACAGCAAACTTGATTTGGAAT
TTTAGAGTCCCGAAGGAGATTTCTCTCTTTGCATTCCTAGAAGAATTGAGGATTTCATTGCTGAAGGTTATGGAGGAAGGCAGCTCAAGGGCAAAGCGTGGGTTC
TTTGGAATTGTGCGGCTCGGGCTCTTTCATGGTCGATATGGAAGGAATGGAATAATTGCAGCTCGAACTCGGCCTCCGGACCGACCTGAACGCTTGGGCGGACCT
GCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGCCAGAAATCATCATCATCCCTGATCGTGGGGTCATACCTT
ACGTCCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTT
AGCAGCAACTTAGGATCTGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGGTAGAAAACTTTAGACTCTCCGATGACGGGGAGGATAGTGACGCCTCCACT
TCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCCATCCCTGAGAACATCCTCCTCAGGCTTCCGGAG
GAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTT
CTCTTTCGGACTGGGTTGGCTCCAGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAG
GCCGAGCTGTTGGACGTAGATCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGT
ATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTC
CCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCATCCTTCGACACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGG
AAGGTCGGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTT
GCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCCGTCGTGGCA
GGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGCTCGGGCCGCCG
AGATCACCACCACCAGCGTTGACAGATGCTAAGTATGAGGGCCGAGCTGAACCTGGCCGAGGTCCGATCTACCGGGAAGCTCGGTGGGGGCCGAGGTCCGCCCAA
GTATTCAGATCGGTCCGGAGGCCGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAA
mRNA sequence
ATGGGGAATTTGGTGAAGTGGTCTTGGTTGGCTTCGCCTCATAAACAGGGGGGCCTTGGAATTGGCTCATTGAGGCTAAGGAACAATGCTCTATTAATGAAATGG
TTATGGCGGTTCACAGTGGAAAGAGATAGCCTTTGGAGACGGGTGATAGCCACTATTTACGGTGTGGAGTTTTTTGGATGGTGGACAAAACCATCAGATTATGAG
AGGACCCATGGATTGATTCTGCCCCTTTCCACCACTTTCCCAGATTTGTTTGCATTATCCTCCAAGAAAGGTGCGGCTATTGGAAATGGAAGGGACATTTTATGG
TGGAAAGTAGACCCATCCGACCGTTTCACAGTCAATTCAGCCTTCATGGCTCTCACGTCCCCTTCTCTTAGACTTAATTCAGCTACAGCAAACTTGATTTGGAAT
TTTAGAGTCCCGAAGGAGATTTCTCTCTTTGCATTCCTAGAAGAATTGAGGATTTCATTGCTGAAGGTTATGGAGGAAGGCAGCTCAAGGGCAAAGCGTGGGTTC
TTTGGAATTGTGCGGCTCGGGCTCTTTCATGGTCGATATGGAAGGAATGGAATAATTGCAGCTCGAACTCGGCCTCCGGACCGACCTGAACGCTTGGGCGGACCT
GCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGCCAGAAATCATCATCATCCCTGATCGTGGGGTCATACCTT
ACGTCCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTT
AGCAGCAACTTAGGATCTGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGGTAGAAAACTTTAGACTCTCCGATGACGGGGAGGATAGTGACGCCTCCACT
TCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCCATCCCTGAGAACATCCTCCTCAGGCTTCCGGAG
GAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTT
CTCTTTCGGACTGGGTTGGCTCCAGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAG
GCCGAGCTGTTGGACGTAGATCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGT
ATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTC
CCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCATCCTTCGACACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGG
AAGGTCGGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTT
GCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCCGTCGTGGCA
GGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGCTCGGGCCGCCG
AGATCACCACCACCAGCGTTGACAGATGCTAAGTATGAGGGCCGAGCTGAACCTGGCCGAGGTCCGATCTACCGGGAAGCTCGGTGGGGGCCGAGGTCCGCCCAA
GTATTCAGATCGGTCCGGAGGCCGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAA
Protein sequence
MGNLVKWSWLASPHKQGGLGIGSLRLRNNALLMKWLWRFTVERDSLWRRVIATIYGVEFFGWWTKPSDYERTHGLILPLSTTFPDLFALSSKKGAAIGNGRDILW
WKVDPSDRFTVNSAFMALTSPSLRLNSATANLIWNFRVPKEISLFAFLEELRISLLKVMEEGSSRAKRGFFGIVRLGLFHGRYGRNGIIAARTRPPDRPERLGGP
AQKGEHSDDQVSIGRIPSLVRGQKSSSSLIVGSYLTSPEFLEFDLKAARTLGRSVSSLSLSNVVAMSSSFSSNLGSDLARRLESELEEVENFRLSDDGEDSDAST
SGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE
AELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGR
KVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPVVAGPASEDPAPVIELESSGGPSREKRPRDQTEALGPP
RSPPPALTDAKYEGRAEPGRGPIYREARWGPRSAQVFRSVRRPSSSCNLKYTVVHILA