CuGenDBv2

Moc04g10110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g10110
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr4:7503406..7505682
RNA-Seq Expression: Moc04g10110
Synteny: Moc04g10110
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
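
The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of splitting it apart is given here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval in bases.

    Assumption: coordinates are 1-based and inclusive; the record itself
    does not say which convention the database uses.
    """
    return abs(end - start) + 1


chrom, start, end = parse_location("chr4:7503406..7505682")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2277 bases on chr4.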


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 1.0e-110 | identity: 85.2%
Query:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKGA GIVKGPTSIKGW+RKWFYA GEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TP V G ASEDPAPVIELES  GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG
        KKKKT SPLEVGA GVLPASFADRVDDPEARMGGT DVT RFRV+PSS+G
Subjt:  KKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 5.8e-138 | identity: 92.31%
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMC RKGAGGIVKGPTSIKGW+R
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR

Query:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TP V G ASEDPAPVIELES GGPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDAL-------PLGE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | e value: 2.9e-105 | identity: 97.92%
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMC RKGAGGIVKGPTSIKGW+R
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR

Query:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 3.2e-160 | identity: 96.26%
Query:  RRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE
        RRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFE
Subjt:  RRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE

Query:  AKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDE
        AK+IAKKPGRFYMC RKGAGGIVKGPTSIKGW+RKWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDE
Subjt:  AKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDE

Query:  LLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVD
        LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSKGRAHALEAAQSSKPATP V G ASEDPA VIELES GGPSREKRPRDQTEAVD
Subjt:  LLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 9.9e-170 | identity: 63.77%
Query:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKG GGIVKGPTSIKGW+ KWF+A GEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTV--------AGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TPTV        +G +S  P PVIEL+  GG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTV--------AGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG----------------------------------------AF
        E PL+RRRKKKKT S  E GA G LP S AD VDDPEARM GTS+V  RF ++PSS+G                                        AF
Subjt:  EVPLKRRRKKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG----------------------------------------AF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE------LLKEEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AE      LLK+E ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE------LLKEEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEYATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGTPGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QALEAKDKELEYATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYFDLEED--------QVGTTQEGAP
        +LDSDY D+EE+        +VGTTQE  P
Subjt:  DLDSDYFDLEED--------QVGTTQEGAP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 5.0e-111 | identity: 85.2%
Query:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKGA GIVKGPTSIKGW+RKWFYA GEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TP V G ASEDPAPVIELES  GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG
        KKKKT SPLEVGA GVLPASFADRVDDPEARMGGT DVT RFRV+PSS+G
Subjt:  KKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG

A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 2.8e-138 | identity: 92.31%
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMC RKGAGGIVKGPTSIKGW+R
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR

Query:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TP V G ASEDPAPVIELES GGPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1DWD2 uncharacterized protein LOC111024680 | e value: 1.4e-105 | identity: 97.92%
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMC RKGAGGIVKGPTSIKGW+R
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMR

Query:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 1.5e-160 | identity: 96.26%
Query:  RRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE
        RRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFE
Subjt:  RRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE

Query:  AKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDE
        AK+IAKKPGRFYMC RKGAGGIVKGPTSIKGW+RKWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDE
Subjt:  AKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDE

Query:  LLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVD
        LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSKGRAHALEAAQSSKPATP V G ASEDPA VIELES GGPSREKRPRDQTEAVD
Subjt:  LLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 4.8e-170 | identity: 63.77%
Query:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MC RKG GGIVKGPTSIKGW+ KWF+A GEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTV--------AGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TPTV        +G +S  P PVIEL+  GG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTV--------AGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG----------------------------------------AF
        E PL+RRRKKKKT S  E GA G LP S AD VDDPEARM GTS+V  RF ++PSS+G                                        AF
Subjt:  EVPLKRRRKKKKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAG----------------------------------------AF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE------LLKEEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AE      LLK+E ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE------LLKEEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEYATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGTPGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QALEAKDKELEYATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDSDYFDLEED--------QVGTTQEGAP
        +LDSDY D+EE+        +VGTTQE  P
Subjt:  DLDSDYFDLEED--------QVGTTQEGAP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G15420.1 myosin heavy chain-related | e value: 3.9e-07 | identity: 30.6%
Query:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         ++ 
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIA

Query:  KKPGRFYMCVRKGAGGIVKGPTS-IKGWMRKWFY
        + PG +Y    K    IV G  S I GW R++F+
Subjt:  KKPGRFYMCVRKGAGGIVKGPTS-IKGWMRKWFY
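
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts those marks for a short fragment copied from the first GenBank hit. The helper name `count_matches` is hypothetical, and this is only an illustration of reading the midline, not the procedure the database used to compute its % identity column, which is reported over the full alignment.

```python
def count_matches(query, midline):
    """Count identical and positive columns in one alignment block.

    A column is identical when the midline repeats the query residue,
    and positive when it is identical or marked with '+'.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return identities, positives


# First 26 columns of the alignment against XP_022138041.1.
query = "MCVRKGAGGIVKGPTSIKGWMRKWFY"
midline = "MC RKGA GIVKGPTSIKGW+RKWFY"

identities, positives = count_matches(query, midline)
print(f"{identities}/{len(query)} identical, {positives}/{len(query)} positive")
```

Whitespace in the midline is significant, so the count is only meaningful when the alignment text has kept its original column spacing.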


Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGACTTAGCTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGC
TGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGG
CTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAG
CTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGTAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGG
ATGGATGAGGAAGTGGTTCTACGCTTTTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGAC
CAGTCCCTGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAG
TCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAA
AGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTACCGTGGCAGGGCTTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTTTG
GGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAA
AAGAAGACGATCTCCCCCTTGGAGGTCGGAGCTTGCGGGGTCTTGCCTGCGAGTTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGAC
GGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGG
AGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGCTGCTGAAGGAG
GAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACTAGGGGTTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCT
TGAAGCGAAGGACAAGGAGCTGGAGTATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAGCATCCTGACT
TCGATGGATTTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGG
TATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTTAGAGATCTGGACTCTGACTACTTTGATCTCGAAGAGGACCA
GGTCGGCACCACACAGGAGGGCGCTCCTCAGGCGGACTCTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGACTTAGCTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGC
TGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGG
CTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAG
CTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGTAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGG
ATGGATGAGGAAGTGGTTCTACGCTTTTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGAC
CAGTCCCTGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAG
TCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAA
AGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTACCGTGGCAGGGCTTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTTTG
GGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAA
AAGAAGACGATCTCCCCCTTGGAGGTCGGAGCTTGCGGGGTCTTGCCTGCGAGTTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGAC
GGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGG
AGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGCTGCTGAAGGAG
GAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACTAGGGGTTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCT
TGAAGCGAAGGACAAGGAGCTGGAGTATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAGCATCCTGACT
TCGATGGATTTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGG
TATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTTAGAGATCTGGACTCTGACTACTTTGATCTCGAAGAGGACCA
GGTCGGCACCACACAGGAGGGCGCTCCTCAGGCGGACTCTTAG
Protein sequence
MSSSFSSNLGSDEDLARRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQ
LLACFEAKKIAKKPGRFYMCVRKGAGGIVKGPTSIKGWMRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLE
SGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPTVAGLASEDPAPVIELESFGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKK
KKTISPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRVQPSSAGAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAELLKE
EEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEYATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKR
YAEKWASGPGGTPGPQALVDQYVRDLDSDYFDLEEDQVGTTQEGAPQADS
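
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch of that relationship, the code below translates a nucleotide string with the standard genetic code; that this gene uses the standard nuclear table is an assumption, and the `translate` helper is hypothetical rather than part of any database tooling. The two fragments are the first 30 and last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a CDS in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCGTCCTCTTTTAGCAGCAACTTAGGA"))  # start of the CDS
print(translate("GCGGACTCTTAG"))                    # end of the CDS
```

Pasting the full CDS into `translate` should reproduce the listed protein if the standard-table assumption holds.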