; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g04480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g04480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr3:3342285..3343596
RNA-Seq ExpressionMoc03g04480
SyntenyMoc03g04480
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]7.1e-8182.3Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES  +   VP      ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPN
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN

Query:  SELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEAREEAPPK-RRKKKK
        SELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAVVGPA EDPAPVIELESS GPSREKRPRDQT+AVD         PLGEE REE P K RRKKKK
Subjt:  SELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEAREEAPPK-RRKKKK

Query:  AISPSEVGA
          SP EVGA
Subjt:  AISPSEVGA

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]9.1e-13792Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEA
        GVKRKSKGRAHALEAAQSSKP TPAVVGPA EDPAPVIELESSGGPSREKRPRDQT+AVDAQTEAA   PLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.4e-9793.75Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]2.7e-9691.75Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]5.3e-18594.33Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPS+IPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVD
        GRAHALEAAQSSKPATPAVVGPA EDPA VIELESSGGPSREKRPRDQT+AVD
Subjt:  GRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVD

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092983.4e-8182.3Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES  +   VP      ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPN
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN

Query:  SELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEAREEAPPK-RRKKKK
        SELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAVVGPA EDPAPVIELESS GPSREKRPRDQT+AVD         PLGEE REE P K RRKKKK
Subjt:  SELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEAREEAPPK-RRKKKK

Query:  AISPSEVGA
          SP EVGA
Subjt:  AISPSEVGA

A0A6J1CR42 uncharacterized protein LOC1110138264.4e-13792Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEA
        GVKRKSKGRAHALEAAQSSKP TPAVVGPA EDPAPVIELESSGGPSREKRPRDQT+AVDAQTEAA   PLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVDAQTEAAGALPLGEEA

A0A6J1DWD2 uncharacterized protein LOC1110246806.9e-9893.75Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251081.3e-9691.75Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLLACFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255022.5e-18594.33Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPS+IPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +VDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVD
        GRAHALEAAQSSKPATPAVVGPA EDPA VIELESSGGPSREKRPRDQT+AVD
Subjt:  GRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSREKRPRDQTKAVD

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic8.6e-0528.21Show/hide
Query:  SKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E  
Subjt:  SKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA

Query:  ELLNVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE
          + +  L    E +R+ K +  R+Y+   KG   I   P+  + +   +F+ + E
Subjt:  ELLNVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE

Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related1.9e-0731.34Show/hide
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    ++ D         R+ 
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFY
        + PG +Y  A K    IV G  S I GW R++F+
Subjt:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTATTAGTAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGATGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGCTTGGAATACCCTTCTAAGATACCTGAGCACTACCTCGGATCCCTTCGCAGGGGGTTCGCTATCCCTGAAAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTT
CTCTTCCGGACAGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGAACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGG
GGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAACTGCTGCTTGAGTCCGGGCTGCTAGA
TTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATG
CTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGTAGGGCCTGCCTTGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGG
GAGAAGCGCCCCAGGGATCAGACCAAGGCGGTGGACGCCCAGACCGAGGCGGCGGGCGCCCTGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCCGAAGCGCAGGAA
GAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTATTAGTAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGATGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGCTTGGAATACCCTTCTAAGATACCTGAGCACTACCTCGGATCCCTTCGCAGGGGGTTCGCTATCCCTGAAAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTT
CTCTTCCGGACAGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGAACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGG
GGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAACTGCTGCTTGAGTCCGGGCTGCTAGA
TTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATG
CTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGTAGGGCCTGCCTTGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGG
GAGAAGCGCCCCAGGGATCAGACCAAGGCGGTGGACGCCCAGACCGAGGCGGCGGGCGCCCTGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCCGAAGCGCAGGAA
GAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGA
Protein sequenceShow/hide protein sequence
MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSKIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEF
LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG
NLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPALEDPAPVIELESSGGPSR
EKRPRDQTKAVDAQTEAAGALPLGEEAREEAPPKRRKKKKAISPSEVGA