; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g15380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g15380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr3:10280630..10282663
RNA-Seq ExpressionMoc03g15380
SyntenyMoc03g15380
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.8e-11697.88Show/hide
Query:  RPIEYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKF
        R I+YAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKF
Subjt:  RPIEYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKF

Query:  QLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG
        QLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG
Subjt:  QLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG

Query:  PQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPPAGS
        PQALVDQYVRDLDSDYSDPEEDQVGSTQEGA P GS
Subjt:  PQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPPAGS

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]5.8e-9896.74Show/hide
Query:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR
        MFEYGLRLPLH FVQEF FRTGLAPAQV PNGWGVIFALAILFWLRA DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]4.2e-9695.11Show/hide
Query:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR
        MFEYGLRLPLH FVQEF FRTGLAPAQV PNGWGVIFALAILFWLRA DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAVRPIE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.7e-14578.08Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVRKWFYA
        LRLPLH FVQEF FRTGLAPAQV PNGWGVIFALAILFWLRA DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  LRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIEYAAE----AFVASIQSALAVK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE +      A V    S   VK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIEYAAE----AFVASIQSALAVK

Query:  AELDGREVLAAREKEEFSAALEAASST--MKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQ
         +  GR           + ALEAA S+      ++   SE   L  E+ES     +++  R Q +
Subjt:  AELDGREVLAAREKEEFSAALEAASST--MKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQ

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.2e-10646.31Show/hide
Query:  MCARKGAGGIVKGLTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGLTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIEYA--------------------------------------------------------------------------------------------
         VR IE +                                                                                            
Subjt:  AVRPIEYA--------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------AEAFV
                                                                                                       AEAF+
Subjt:  -----------------------------------------------------------------------------------------------AEAFV

Query:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKFQLLKEKDDMLQ
        ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ +A LRAAHAIT+GLE+EKFQLLKEKDD+ Q
Subjt:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKFQLLKEKDDMLQ

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDPEED--------QVGSTQEGAP
        LDSDYSD EE+        +VG+TQE  P
Subjt:  LDSDYSDPEED--------QVGSTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1D971 uncharacterized protein LOC1110185382.3e-11697.88Show/hide
Query:  RPIEYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKF
        R I+YAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKF
Subjt:  RPIEYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKF

Query:  QLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG
        QLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG
Subjt:  QLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG

Query:  PQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPPAGS
        PQALVDQYVRDLDSDYSDPEEDQVGSTQEGA P GS
Subjt:  PQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPPAGS

A0A6J1DWD2 uncharacterized protein LOC1110246802.8e-9896.74Show/hide
Query:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR
        MFEYGLRLPLH FVQEF FRTGLAPAQV PNGWGVIFALAILFWLRA DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE

A0A6J1DWF1 uncharacterized protein LOC1110251082.0e-9695.11Show/hide
Query:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR
        MFEYGLRLPLH FVQEF FRTGLAPAQV PNGWGVIFALAILFWLRA DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FP+GRKVGTLVTD+LLLESGLLDYNPAVRPIE
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIE

A0A6J1DXS5 uncharacterized protein LOC1110255028.1e-14678.08Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVRKWFYA
        LRLPLH FVQEF FRTGLAPAQV PNGWGVIFALAILFWLRA DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  LRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIEYAAE----AFVASIQSALAVK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFP+GRKVGTLVTDELLLESGLLDYNPAVRPIE +      A V    S   VK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNPAVRPIEYAAE----AFVASIQSALAVK

Query:  AELDGREVLAAREKEEFSAALEAASST--MKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQ
         +  GR           + ALEAA S+      ++   SE   L  E+ES     +++  R Q +
Subjt:  AELDGREVLAAREKEEFSAALEAASST--MKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQ

A0A6J1DZB3 uncharacterized protein LOC1110256655.7e-10746.31Show/hide
Query:  MCARKGAGGIVKGLTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FP+ RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGLTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPKGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIEYA--------------------------------------------------------------------------------------------
         VR IE +                                                                                            
Subjt:  AVRPIEYA--------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------AEAFV
                                                                                                       AEAF+
Subjt:  -----------------------------------------------------------------------------------------------AEAFV

Query:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKFQLLKEKDDMLQ
        ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ +A LRAAHAIT+GLE+EKFQLLKEKDD+ Q
Subjt:  ASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQAQLRAAHAITRGLEREKFQLLKEKDDMLQ

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDPEED--------QVGSTQEGAP
        LDSDYSD EE+        +VG+TQE  P
Subjt:  LDSDYSDPEED--------QVGSTQEGAP

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic7.9e-0531.06Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E+     +A +Q+T      +  + I    R+++SE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEA

Query:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKG
          + +  L    E +R+ K +  R+Y+   KG
Subjt:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKG

Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related9.2e-0932.84Show/hide
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL +F+ E+  R  +A +Q+T         LAIL       +E    +D D         R+ 
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGLTS-IKGWVRKWFY
        + PG +Y  A K    IV G  S I GW R++F+
Subjt:  KKPGRFYMCARKGAGGIVKGLTS-IKGWVRKWFY

AT3G42060.1 myosin heavy chain-related6.2e-0526.47Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAE
        SR    + G        PE +   +PE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++         L IL       +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILFWLRAWDSEEAE

Query:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGLTS-IKGWVRKWFYASGEWLAKDESGRSFFDV
        ++D+D L     +  I  K  R  +CA    G  I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  LLDVDQLLACFEAKRIAKKPGRFYMCARKGAG-GIVKGLTS-IKGWVRKWFYASGEWLAKDESGRSFFDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAACATAACCGCTGCGG
AAGATTTATCGTCGGAAAATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCT
CACTTTCTCTTTCGAACGTGGTTGTCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGACCTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAAC
TTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTT
CGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCA
GACTTCCCCTTCACTCTTTCGTCCAAGAATTTCCCTTCCGGACGGGGTTGGCTCCGGCTCAAGTGACCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTT
TGGCTACGAGCTTGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTG
TGCAAGGAAAGGTGCAGGCGGTATAGTTAAAGGGCTGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGATGAGTCAGGTC
GTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTTGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGCGCTTT
CCGAAGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGTTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATACGCCGCCGAGGCGTT
CGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCT
CCACCATGAAGGATGAGCTGCTGAAGGCTCATTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGTTGAAGAAGGAGGAGGACAGGCGCCAGGCC
CAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCT
GGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACT
TTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCT
GGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGG
CGCTCCCCCAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAACATAACCGCTGCGG
AAGATTTATCGTCGGAAAATTCAAATATTCCGACGCTTCGGATCTTAGGGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCT
CACTTTCTCTTTCGAACGTGGTTGTCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGACCTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAAC
TTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTT
CGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCA
GACTTCCCCTTCACTCTTTCGTCCAAGAATTTCCCTTCCGGACGGGGTTGGCTCCGGCTCAAGTGACCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTT
TGGCTACGAGCTTGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTG
TGCAAGGAAAGGTGCAGGCGGTATAGTTAAAGGGCTGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGATGAGTCAGGTC
GTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTTGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATACTACAAGGAGCGCTTT
CCGAAGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGTTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATACGCCGCCGAGGCGTT
CGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCT
CCACCATGAAGGATGAGCTGCTGAAGGCTCATTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGTTGAAGAAGGAGGAGGACAGGCGCCAGGCC
CAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCT
GGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACT
TTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCT
GGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGG
CGCTCCCCCAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MLAPSLSGPISTWQRSSFDLLWTRGDFLFVGKHNRCGRFIVGKFKYSDASDLREDPSRSLITRLEPLVGRSLPSLSLSNVVVMSSSFSSNLGSDEDLARRLESELEEIEN
FRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHSFVQEFPFRTGLAPAQVTPNGWGVIFALAILF
WLRAWDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGLTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERF
PKGRKVGTLVTDELLLESGLLDYNPAVRPIEYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRQA
QLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWAS
GPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPPAGS