; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr9:8155171..8157005
RNA-Seq ExpressionMoc09g09700
SyntenyMoc09g09700
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]3.9e-10984.65Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEAREEAPLKRRR
        AVR IESSRPNSELAMVCGFAS VKRKSKG+AHALE AQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEE REE PLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEAREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFR+EPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.1e-13893.09Show/hide
Query:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEY LRLP+HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK IAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IE SRPNS LAMVC FAS
Subjt:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGEEA
        GVKRKSKGRAHALE AQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE A
Subjt:  GVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGEEA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]1.9e-10397.4Show/hide
Query:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEY LRLP+HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK IAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSEL
Subjt:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.0e-18996.32Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYL SLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEY LR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLR

Query:  LPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR
        LP+HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAS 
Subjt:  LPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALE AQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]8.4e-12869.07Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEARE
         VRLIE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  E R 
Subjt:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEARE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVMQRTIDYAAEAF
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSV+QRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVMQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKTQL
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ K  L
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKTQL

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.9e-10984.65Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYAS EWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEAREEAPLKRRR
        AVR IESSRPNSELAMVCGFAS VKRKSKG+AHALE AQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEE REE PLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEAREEAPLKRRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ
        KKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFR+EPSSSGVRDQ
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138265.1e-13993.09Show/hide
Query:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEY LRLP+HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK IAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFAS
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IE SRPNS LAMVC FAS
Subjt:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGEEA
        GVKRKSKGRAHALE AQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA       PPLGE A
Subjt:  GVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA-------PPLGEEA

A0A6J1DWD2 uncharacterized protein LOC1110246809.1e-10497.4Show/hide
Query:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEY LRLP+HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK IAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL
        KWFYAS EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSEL
Subjt:  KWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC1110255024.9e-19096.32Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYL SLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEY LR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLR

Query:  LPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR
        LP+HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAS 
Subjt:  LPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASR

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        GRAHALE AQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  GRAHALEPAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256654.1e-12869.07Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+AS EWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEARE
         VRLIE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  E R 
Subjt:  AVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLGEEARE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVMQRTIDYAAEAF
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSV+QRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVMQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKTQL
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ K  L
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKTQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related8.1e-0424.84Show/hide
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIA
        PE +   +PE  +R  + PEG++ L+   F E  L  P+  F+  +  R  +A +Q++         L IL       +EE  ++D+D         +  
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYSLRLPIHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIA

Query:  KKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASREWLAKDESGRSFFDV
        K   R  +CA    G  I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  KKPGRFYMCARKGAG-GIVKGPTS-IKGWVRKWFYASREWLAKDESGRSFFDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCCATTCACCCTTTTGTCCAAGAATTT
CTCTTCCGGACTGGGTTAGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGGACGTAGATCAGCTCCTCGCGTGCTTCGAGGCGAAAATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGG
GGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCAGGGAATGGCTCGCGAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGAC
TGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCG
GCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGCCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTG
ATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCT
GAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGGGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGG
GCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGG
GCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTATGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTGGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCT
GGATGGAAGGGAAGTTCTGGCAGCAAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGG
AGGCTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGACCCAACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACAGCCTCAGACTTCCCATTCACCCTTTTGTCCAAGAATTT
CTCTTCCGGACTGGGTTAGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGGACGTAGATCAGCTCCTCGCGTGCTTCGAGGCGAAAATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGG
GGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCAGGGAATGGCTCGCGAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGAC
TGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCG
GCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGCCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTG
ATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCT
GAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGGGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGG
GCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGG
GCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTATGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTGGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCT
GGATGGAAGGGAAGTTCTGGCAGCAAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGG
AGGCTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGACCCAACTCTGA
Protein sequenceShow/hide protein sequence
MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYSLRLPIHPFVQEF
LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKMIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASREWLAKDESGRSFFDVPTRFG
NLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELAMVCGFASGVKRKSKGRAHALEPAQSSKPATPAVAGPASEDPAPV
IELESSGGPSREKRPRDQTEAVDAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRR
ASKFVSDPGSVMQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKTQL