CuGenDBv2

Moc07g18090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g18090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr7:12879531..12881649
RNA-Seq Expression: Moc07g18090
Synteny: Moc07g18090
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 1.4e-131 | 92.28
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFAS
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMV  FAS
Subjt:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV
         VKRKSKGRAHALEAAQ+SKP TPAVVGPASEDPAPVIELESSGGPSR + P+     V
Subjt:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 3.8e-105 | 97.92
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | 6.5e-105 | 96.91
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLL CFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 3.3e-181 | 91.53
Query:  MSSSFSSNLGFDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGED+DASTSGQGLEYPSRIPEHYLGSLR+GFAIPENILLR+PEEG RADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGFDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +VDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFASNVKRK
        SG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMV GFAS VKRK
Subjt:  SGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFASNVKRK

Query:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV
        SKGRAHALEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSR + P+     V
Subjt:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 2.6e-154 | 67.1
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASG+WLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVYGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPS--------------------RGE
         VR IE+SRPNSELAMV GF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S                    RGE
Subjt:  AVRPIESSRPNSELAMVYGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPS--------------------RGE

Query:  AP-----------QGSDRGV-GRLALGRGERVDDPEARMGGTSNVTARFRVEPLSSGVRDQVSRISGASLDRYLRRASKFVSDPGSVLQRTIDYAAKAFV
        +P             S+ G  G L     + VDDPEARM GTSNV  RF +EP SSGV+DQVSRIS   LDRYLRRASKFVSDPGSVLQRTID  A+AF+
Subjt:  AP-----------QGSDRGV-GRLALGRGERVDDPEARMGGTSNVTARFRVEPLSSGVRDQVSRISGASLDRYLRRASKFVSDPGSVLQRTIDYAAKAFV

Query:  ASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVDTKAELLKKEEDKRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQ
        ASI   + VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEVD K +LLKKE +K KA LRAAHAITKGLEKEKFQLLKEKDD+ Q
Subjt:  ASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVDTKAELLKKEEDKRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQ

Query:  ALKAKDKELKNATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL
         L+ KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L
Subjt:  ALKAKDKELKNATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CR42 uncharacterized protein LOC111013826 | 6.7e-132 | 92.28
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFAS
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMV  FAS
Subjt:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV
         VKRKSKGRAHALEAAQ+SKP TPAVVGPASEDPAPVIELESSGGPSR + P+     V
Subjt:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV

A0A6J1DWD2 uncharacterized protein LOC111024680 | 1.8e-105 | 97.92
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DWF1 uncharacterized protein LOC111025108 | 3.1e-105 | 96.91
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELL+VDQLL CFEAKRIAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM
        KWFYASG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502 | 1.6e-181 | 91.53
Query:  MSSSFSSNLGFDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGED+DASTSGQGLEYPSRIPEHYLGSLR+GFAIPENILLR+PEEG RADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGFDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL +VDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFASNVKRK
        SG+WLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMV GFAS VKRK
Subjt:  SGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFASNVKRK

Query:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV
        SKGRAHALEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSR + P+     V
Subjt:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGV

A0A6J1DZB3 uncharacterized protein LOC111025665 | 1.3e-154 | 67.1
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASG+WLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGKWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVYGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPS--------------------RGE
         VR IE+SRPNSELAMV GF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S                    RGE
Subjt:  AVRPIESSRPNSELAMVYGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPS--------------------RGE

Query:  AP-----------QGSDRGV-GRLALGRGERVDDPEARMGGTSNVTARFRVEPLSSGVRDQVSRISGASLDRYLRRASKFVSDPGSVLQRTIDYAAKAFV
        +P             S+ G  G L     + VDDPEARM GTSNV  RF +EP SSGV+DQVSRIS   LDRYLRRASKFVSDPGSVLQRTID  A+AF+
Subjt:  AP-----------QGSDRGV-GRLALGRGERVDDPEARMGGTSNVTARFRVEPLSSGVRDQVSRISGASLDRYLRRASKFVSDPGSVLQRTIDYAAKAFV

Query:  ASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVDTKAELLKKEEDKRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQ
        ASI   + VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEVD K +LLKKE +K KA LRAAHAITKGLEKEKFQLLKEKDD+ Q
Subjt:  ASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVDTKAELLKKEEDKRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQ

Query:  ALKAKDKELKNATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL
         L+ KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L
Subjt:  ALKAKDKELKNATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G32010.1 myosin heavy chain-related | 5.1e-07 | 24.21
Query:  RLESELEEIENFRFSDDGEDNDASTSGQGLEY------PSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R+ ++ +   N    D+ E  D + SG+  +       P+      +G       +P  + +RIP +  R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDNDASTSGQGLEY------PSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
         F     +A +Q+       I   A L  L AR       L+V+ +       ++  K G+ Y+ + +G   +  GP+  + W+  +FYA
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA


Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATTCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAATGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAAGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTCAGGATTCCGGAGGAGGGGGGGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGAACGTAGACCAGCTCCTCGTGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGC
GCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGAAATGGCTTGCAAAGGACGAGTCA
GGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAG
GAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCC
TCAAGGCCGAACTCCGAATTAGCCATGGTTTACGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAAAATTCGAAA
CCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGGAGAAGCGCCCCAGGGATCA
GACCGAGGTGTTGGACGCCTTGCCCTTGGGCGAGGAGAACGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCAATGTGACGGCACGGTTCAGAGTTGAG
CCGTTAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGGCGCAAGTTTGGACCGCTACCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTT
CTGCAGAGGACCATCGACTACGCCGCTAAGGCGTTTGTTGCTTCCATTCAATCGACTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGG
GAAAAGGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAT
ACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAAACGCAAGGCCCAGCTCCGAGCTGCCCACGCTATCACCAAGGGTTTGGAGAAGGAGAAGTTCCAACTGCTC
AAGGAGAAGGACGACATGCTCCAGGCGCTTAAAGCGAAGGATAAGGAGCTGAAGAATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCGTCTCAGCAATGGAGTC
CTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATG
CCTGACCTTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATTCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAATGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAAGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTCAGGATTCCGGAGGAGGGGGGGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATAGTGAAGAGGCCGAGCTGTTGAACGTAGACCAGCTCCTCGTGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGC
GCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGAAATGGCTTGCAAAGGACGAGTCA
GGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAG
GAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCC
TCAAGGCCGAACTCCGAATTAGCCATGGTTTACGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAAAATTCGAAA
CCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGGAGAAGCGCCCCAGGGATCA
GACCGAGGTGTTGGACGCCTTGCCCTTGGGCGAGGAGAACGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCAATGTGACGGCACGGTTCAGAGTTGAG
CCGTTAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGGCGCAAGTTTGGACCGCTACCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTT
CTGCAGAGGACCATCGACTACGCCGCTAAGGCGTTTGTTGCTTCCATTCAATCGACTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGG
GAAAAGGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAT
ACCAAGGCCGAGCTGCTGAAGAAGGAAGAAGACAAACGCAAGGCCCAGCTCCGAGCTGCCCACGCTATCACCAAGGGTTTGGAGAAGGAGAAGTTCCAACTGCTC
AAGGAGAAGGACGACATGCTCCAGGCGCTTAAAGCGAAGGATAAGGAGCTGAAGAATGCGACTGCCGAGCTGGAGACGGCAAAGGAGCGTCTCAGCAATGGAGTC
CTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATG
CCTGACCTTTAG
Protein sequence
MSSSFSSNLGFDEDLARRLESELEEIENFRFSDDGEDNDASTSGQGLEYPSRIPEHYLGSLRKGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYGLRLPL
HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLNVDQLLVCFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGKWLAKDES
GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVYGFASNVKRKSKGRAHALEAAQNSK
PATPAVVGPASEDPAPVIELESSGGPSRGEAPQGSDRGVGRLALGRGERVDDPEARMGGTSNVTARFRVEPLSSGVRDQVSRISGASLDRYLRRASKFVSDPGSV
LQRTIDYAAKAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVDTKAELLKKEEDKRKAQLRAAHAITKGLEKEKFQLL
KEKDDMLQALKAKDKELKNATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDL