CuGenDBv2

Moc11g18560 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g18560
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr11:14162442..14165351
RNA-Seq Expression: Moc11g18560
Synteny: Moc11g18560
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
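
As an illustration of how the genome location field can be read, here is a minimal Python sketch that splits `chr11:14162442..14165351` into its parts and computes the length of the span. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr11:14162442..14165351")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole gene model on the chromosome, so it can be longer than the CDS listed further down.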


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value: 2.0e-110; % identity: 84.65)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWL KDES              V+IRPVPELTQASFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSR NS+LAMVCGFAS+VKRKSKG+AHA EAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQ
        KKKK  SP EVGA GVLPASFADRVDDP ARMGGT DVT RFRV+PSS+GVRDQ
Subjt:  KKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 1.5e-116; % identity: 82.78)
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQV P                           +DQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFAS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE SR NS LAMVC FAS
Subjt:  KWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFAS

Query:  SVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHA EAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 2.6e-113; % identity: 83.64)
Query:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLEAREKE---------------
        G   + A+ R++PSS+GVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVL AREKE               
Subjt:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLEAREKE---------------

Query:  -------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD
               EVE LKAEVE+QAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKEL+HATAELETA ERLSNGVLLEE+FRQHPDFD
Subjt:  -------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRDLDSDYSDLEEDQVGT
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RYAEKWASGPGG PGPQALVDQYVRDLDSDYSD EEDQVG+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRDLDSDYSDLEEDQVGT

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 4.9e-165; % identity: 86.48)
Query:  MSSSFSSNLGSDEDLARSLESELEEIENFRLSDDGEDSDASTSGQDLEYPSRIPKHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLAR LES+LEEIEN R+SDDGEDSDASTSGQ LEYPSRIP+HYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARSLESELEEIENFRLSDDGEDSDASTSGQDLEYPSRIPKHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQV P                           +DQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFASSVKRK
        SGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIESSR NS+LAMVCGFAS VKRK
Subjt:  SGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFASSVKRK

Query:  SKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHA EAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 3.1e-183; % identity: 68.49)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWL KDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SR NS+LAMVCGF  SVKRKSKGRAHA +    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKK  S SE GA G LP S AD VDDP ARM GTS+V  RF ++PSS+GV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLEAREKE---------------------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ
        +ASI  A+ VKAELDGRE L A+E+E                     EV+IL+AEV+ + +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q
Subjt:  VASIQSALAVKAELDGREVLEAREKE---------------------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ

Query:  ALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRD
         LE KD  +   T EL+   ERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+EKWASGP G P PQ+LVD+YVR+
Subjt:  ALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRD

Query:  LDSDYSDLEED
        LDSDYSD+EE+
Subjt:  LDSDYSDLEED

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value: 9.8e-111; % identity: 84.65)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWL KDES              V+IRPVPELTQASFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSR NS+LAMVCGFAS+VKRKSKG+AHA EAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQ
        KKKK  SP EVGA GVLPASFADRVDDP ARMGGT DVT RFRV+PSS+GVRDQ
Subjt:  KKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 7.0e-117; % identity: 82.78)
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQV P                           +DQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFAS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE SR NS LAMVC FAS
Subjt:  KWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFAS

Query:  SVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHA EAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 1.2e-113; % identity: 83.64)
Query:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLEAREKE---------------
        G   + A+ R++PSS+GVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVL AREKE               
Subjt:  GTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLEAREKE---------------

Query:  -------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD
               EVE LKAEVE+QAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKEL+HATAELETA ERLSNGVLLEE+FRQHPDFD
Subjt:  -------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRDLDSDYSDLEEDQVGT
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RYAEKWASGPGG PGPQALVDQYVRDLDSDYSD EEDQVG+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRDLDSDYSDLEEDQVGT

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 2.4e-165; % identity: 86.48)
Query:  MSSSFSSNLGSDEDLARSLESELEEIENFRLSDDGEDSDASTSGQDLEYPSRIPKHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLAR LES+LEEIEN R+SDDGEDSDASTSGQ LEYPSRIP+HYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARSLESELEEIENFRLSDDGEDSDASTSGQDLEYPSRIPKHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQV P                           +DQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVPP---------------------------MDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFASSVKRK
        SGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIESSR NS+LAMVCGFAS VKRK
Subjt:  SGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSKLAMVCGFASSVKRK

Query:  SKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHA EAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 1.5e-183; % identity: 68.49)
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWL KDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SR NS+LAMVCGF  SVKRKSKGRAHA +    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRSNSKLAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKK  S SE GA G LP S AD VDDP ARM GTS+V  RF ++PSS+GV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLEAREKE---------------------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ
        +ASI  A+ VKAELDGRE L A+E+E                     EV+IL+AEV+ + +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q
Subjt:  VASIQSALAVKAELDGREVLEAREKE---------------------EVEILKAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ

Query:  ALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRD
         LE KD  +   T EL+   ERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+EKWASGP G P PQ+LVD+YVR+
Subjt:  ALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRD

Query:  LDSDYSDLEED
        LDSDYSD+EE+
Subjt:  LDSDYSDLEED

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G15420.1 myosin heavy chain-related (e value: 2.7e-04; % identity: 28.91)
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVP---------------------PMDQLLACFEAKRIAKKPGRF
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+                        D         R+ + PG +
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVP---------------------PMDQLLACFEAKRIAKKPGRF

Query:  YMCARKGAGGIVKGPTS-IKGWVRKWFY
        Y  A K    IV G  S I GW R++F+
Subjt:  YMCARKGAGGIVKGPTS-IKGWVRKWFY
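
The alignments above follow the usual BLAST-style layout, where the middle line repeats a residue at identical positions, shows `+` for a conservative substitution, and is blank otherwise. The following Python sketch counts identities in a single alignment block under that assumption; `block_identity` is a hypothetical helper, and the result for one block is not expected to reproduce the % identity reported for the whole hit.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    # Trailing spaces of the midline may have been trimmed when the page
    # was saved, so pad it back to the query length before comparing.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


# Last block of the XP_022159252.1 alignment shown above.
identical, columns = block_identity("LDSDYSDLEED", "LDSDYSD+EE+")
print(f"{identical}/{columns} identical positions")
```

Summing the two counts over every block of a hit, and then dividing, gives an identity fraction for that hit as displayed on the page.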


Sequences
CDS sequence
ATGGGTGGTTGTCCCGCTAATCTCGCAACGGATACACCCGGTAATCTCGGGATCGTCGGTTACACCCGGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGAT
CTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGAC
TTAGCTCGTAGCTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGACTCTCCGATGACGGGGAGGATAGTGACGCCTCCACCTCAGGTCAGGATTTGGAATAC
CCTTCTAGGATACCCAAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAAT
CCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCT
CCGGCTCAAGTGCCCCCAATGGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGC
GGTATAGTTAAGGGGCCTACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCACAAAGGACGAGTCAGGTCGTTCCTTCTTCGAC
GTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTTCCGAAGGGT
AGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGTCGAACTCCAAA
CTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTTTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCTGTG
GCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCC
CTGCCGTTGGGCGAGGAGGTGAGGGAGGAAGTTCCTCTGAAGCGAAGGAGGAAGAAAAAGAAGGCGATCTCCCCCTCTGAGGTCGGAGCTTGCGGGGTCTTGCCT
GCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGGTGAGGGAC
CAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCC
GCTGAGGCGTTTGTTGCTTCCATTCAATCAGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGAAGCGAGGGAGAAAGAGGAAGTGGAGATTTTG
AAGGCCGAGGTGGAGACCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAG
AAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAATGAGCGT
CTCAGCAATGGAGTCCTACTGGAGGAGTCGTTTAGGCAGCATCCTGACTTCGATGGATTTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGC
ATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCAACCCTGGCCCCCAAGCG
TTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCTGATCTCGAAGAGGACCAGGTCGGCACCGCATAG
mRNA sequence
ATGGGTGGTTGTCCCGCTAATCTCGCAACGGATACACCCGGTAATCTCGGGATCGTCGGTTACACCCGGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGAT
CTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGAC
TTAGCTCGTAGCTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGACTCTCCGATGACGGGGAGGATAGTGACGCCTCCACCTCAGGTCAGGATTTGGAATAC
CCTTCTAGGATACCCAAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAAT
CCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCT
CCGGCTCAAGTGCCCCCAATGGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGC
GGTATAGTTAAGGGGCCTACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCACAAAGGACGAGTCAGGTCGTTCCTTCTTCGAC
GTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTTCCGAAGGGT
AGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGTCGAACTCCAAA
CTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTTTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCTGTG
GCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCC
CTGCCGTTGGGCGAGGAGGTGAGGGAGGAAGTTCCTCTGAAGCGAAGGAGGAAGAAAAAGAAGGCGATCTCCCCCTCTGAGGTCGGAGCTTGCGGGGTCTTGCCT
GCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGGTGAGGGAC
CAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCC
GCTGAGGCGTTTGTTGCTTCCATTCAATCAGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGAAGCGAGGGAGAAAGAGGAAGTGGAGATTTTG
AAGGCCGAGGTGGAGACCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAG
AAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAATGAGCGT
CTCAGCAATGGAGTCCTACTGGAGGAGTCGTTTAGGCAGCATCCTGACTTCGATGGATTTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGC
ATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCAACCCTGGCCCCCAAGCG
TTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCTGATCTCGAAGAGGACCAGGTCGGCACCGCATAG
Protein sequence
MGGCPANLATDTPGNLGIVGYTRSYLTFPEFLEFDLKAARTLGRSVSSLSLSNVVAMSSSFSSNLGSDEDLARSLESELEEIENFRLSDDGEDSDASTSGQDLEY
PSRIPKHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVPPMDQLLACFEAKRIAKKPGRFYMCARKGAG
GIVKGPTSIKGWVRKWFYASGEWLTKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIESSRSNSK
LAMVCGFASSVKRKSKGRAHAFEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKAISPSEVGACGVLP
ASFADRVDDPAARMGGTSDVTARFRVQPSSAGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLEAREKEEVEIL
KAEVETQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKG
IASDMPDLQIDLSGLKKRYAEKWASGPGGNPGPQALVDQYVRDLDSDYSDLEEDQVGTA
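
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below does this in plain Python, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGGGTGGTTGTCCCGCTAATCTCGCAACG"))
print(translate("GGCACCGCATAG"))
```

Passing the full CDS, with its line breaks, to `translate` and stripping the trailing `*` should give a string that can be compared directly with the protein sequence.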