; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g09040 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g09040
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr7:6997302..6998769
RNA-Seq ExpressionMoc07g09040
SyntenyMoc07g09040
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.4e-8777.59Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNS
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDE               V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYN 
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNS

Query:  AVRPIESSRPNSEL---------------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEVREEAPLKWRR
        AVRPIESSRPNSEL               G +    A QSS+P TPAV GPASEDPAPVI+LESS GPSREKRPR QTEAVDVS LGEEVREE PLK RR
Subjt:  AVRPIESSRPNSEL---------------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEVREEAPLKWRR

Query:  KNKKTTSPLEVGARGALPASFVDRVDDPEARM
        K KKTTSPLEVGARG LPASF DRVDDPEARM
Subjt:  KNKKTTSPLEVGARGALPASFVDRVDDPEARM

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.7e-12083.58Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAELLDVDQLLACF AKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL--------
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIE SRPNS L        
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL--------

Query:  -------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEV
               G +    A QSS+P TPAV GPASEDPAPVI+LESSGGPSREKRPR QTEAVD  +   +V
Subjt:  -------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEV

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]3.3e-10395.85Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAELLDVDQLLACF AKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIESSRPNSELG
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]8.7e-10496.37Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAELLDVDQLLACF AKRIAKKPGR+YMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYN AVRPIESSRPNSELG
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.0e-16585.92Show/hide
Query:  MSSSFSSDLGSEEELARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLCRGFAIHENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  E +LARRLE +LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYL SL RGFAI ENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSEEELARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLCRGFAIHENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAEL DVDQLLACF AKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL-------------
        SGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIESSRPNSEL             
Subjt:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL-------------

Query:  --GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVD
          G +    A QSS+PATPAV GPASEDPA VI+LESSGGPSREKRPR QTEAVD
Subjt:  --GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVD

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.1e-8777.59Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNS
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDE               V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYN 
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNS

Query:  AVRPIESSRPNSEL---------------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEVREEAPLKWRR
        AVRPIESSRPNSEL               G +    A QSS+P TPAV GPASEDPAPVI+LESS GPSREKRPR QTEAVDVS LGEEVREE PLK RR
Subjt:  AVRPIESSRPNSEL---------------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEVREEAPLKWRR

Query:  KNKKTTSPLEVGARGALPASFVDRVDDPEARM
        K KKTTSPLEVGARG LPASF DRVDDPEARM
Subjt:  KNKKTTSPLEVGARGALPASFVDRVDDPEARM

A0A6J1CR42 uncharacterized protein LOC1110138263.2e-12083.58Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAELLDVDQLLACF AKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL--------
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIE SRPNS L        
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL--------

Query:  -------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEV
               G +    A QSS+P TPAV GPASEDPAPVI+LESSGGPSREKRPR QTEAVD  +   +V
Subjt:  -------GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVDVSSLGEEV

A0A6J1DWD2 uncharacterized protein LOC1110246801.6e-10395.85Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAELLDVDQLLACF AKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIESSRPNSELG
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG

A0A6J1DWF1 uncharacterized protein LOC1110251084.2e-10496.37Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAELLDVDQLLACF AKRIAKKPGR+YMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYN AVRPIESSRPNSELG
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELG

A0A6J1DXS5 uncharacterized protein LOC1110255025.1e-16685.92Show/hide
Query:  MSSSFSSDLGSEEELARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLCRGFAIHENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  E +LARRLE +LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYL SL RGFAI ENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSEEELARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLCRGFAIHENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAP+GWGVIFALAILFWLRARD+EEAEL DVDQLLACF AKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL-------------
        SGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYN AVRPIESSRPNSEL             
Subjt:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSEL-------------

Query:  --GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVD
          G +    A QSS+PATPAV GPASEDPA VI+LESSGGPSREKRPR QTEAVD
Subjt:  --GPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREKRPRGQTEAVD

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic4.6e-0727.98Show/hide
Query:  ILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAK-
        + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E    + +  L      +R+ K 
Subjt:  ILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAK-

Query:  KPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKYYK----EHFPRGR
        +  RYY+   KG   I   P+  + +   +F+ + E    ++L      V TR+G     L  + P+P+   ++F  L   K    +HF R R
Subjt:  KPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKYYK----EHFPRGR

Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related3.3e-0830.77Show/hide
Query:  PEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYM
        P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL        E    +D D         R+ + PG YY 
Subjt:  PEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYM

Query:  CARKGAGGIVKGPTS-IKGWVRKWFYASGEWLAKDELGRSFFD
         A K    IV G  S I GW R++F+      + + L   F D
Subjt:  CARKGAGGIVKGPTS-IKGWVRKWFYASGEWLAKDELGRSFFD

AT3G42060.1 myosin heavy chain-related4.9e-0427.63Show/hide
Query:  ENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAK
        E +   IPE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++ +       L IL        EE  ++D+D L     +  I  
Subjt:  ENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAK

Query:  KPGRYYMCARKGAG-GIVKGPTS-IKGWVRKWFYASGEWLAKDELGRSFFDV
        K  R  +CA    G  I  G TS ++ W + +F+A    ++ D+   S  ++
Subjt:  KPGRYYMCARKGAG-GIVKGPTS-IKGWVRKWFYASGEWLAKDELGRSFFDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTGGGATCCGAAGAGGAATTAGCTCGTAGGTTAGAGTTCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGACGCTTCCACCTCGGGTCAGGGTCTGGAATACCCTTCTAGGATACCCGAGCACTACCTCAGATCCCTTTGTAGGGGGTTCGCTATCCATGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAATTTCTCTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAGTGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGGGACAACGAAGA
GGCCGAGCTGCTGGACGTAGACCAGCTCCTCGCGTGCTTCGTAGCGAAAAGGATAGCTAAGAAGCCCGGTCGGTACTATATGTGCGCAAGGAAAGGCGCAGGAGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTTAGGTCGTTCCTTCTTTGACGTTCCCACTAGG
TTTGGGAACCTAGTATCAATCCGACCGGTCCCCGAGCTTACTCAAGCCTCCTTCGACACCTTGAAGTATTACAAGGAGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTT
GGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACTCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGGGCCGAGCCCATGCTCTC
GAGCCACCCAGAGTTCGGAACCCGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAGATCCAGCCCCAGTGATCAAGCTGGAGTCTTCTGGGGGTCCTTCGCGGGAGAAG
CGCCCAAGGGGTCAGACCGAGGCGGTGGACGTCTCGTCGTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGTGGAGGAGGAAGAATAAGAAGACCACCTCTCCCTT
GGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGTAGATCGGGTGGACGATCCTGAAGCCAGGATGGTTATAAGCATGTGTGCTGATTTTGAAGCATGGATGGTTC
ACATGCCTATTTTGAAGCATCTGTGCTGTGCATGTGATTTCATGTCGATGATGATTATAAGAGATTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTGGGATCCGAAGAGGAATTAGCTCGTAGGTTAGAGTTCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGACGCTTCCACCTCGGGTCAGGGTCTGGAATACCCTTCTAGGATACCCGAGCACTACCTCAGATCCCTTTGTAGGGGGTTCGCTATCCATGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAATTTCTCTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAGTGGGTGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGGGACAACGAAGA
GGCCGAGCTGCTGGACGTAGACCAGCTCCTCGCGTGCTTCGTAGCGAAAAGGATAGCTAAGAAGCCCGGTCGGTACTATATGTGCGCAAGGAAAGGCGCAGGAGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTTAGGTCGTTCCTTCTTTGACGTTCCCACTAGG
TTTGGGAACCTAGTATCAATCCGACCGGTCCCCGAGCTTACTCAAGCCTCCTTCGACACCTTGAAGTATTACAAGGAGCACTTTCCGAGGGGCAGGAAGGTCGGAACCTT
GGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACTCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGGGCCGAGCCCATGCTCTC
GAGCCACCCAGAGTTCGGAACCCGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAGATCCAGCCCCAGTGATCAAGCTGGAGTCTTCTGGGGGTCCTTCGCGGGAGAAG
CGCCCAAGGGGTCAGACCGAGGCGGTGGACGTCTCGTCGTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGTGGAGGAGGAAGAATAAGAAGACCACCTCTCCCTT
GGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGTAGATCGGGTGGACGATCCTGAAGCCAGGATGGTTATAAGCATGTGTGCTGATTTTGAAGCATGGATGGTTC
ACATGCCTATTTTGAAGCATCTGTGCTGTGCATGTGATTTCATGTCGATGATGATTATAAGAGATTGTTGA
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSEEELARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLCRGFAIHENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQ
EFLFRTGLAPAQVAPSGWGVIFALAILFWLRARDNEEAELLDVDQLLACFVAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTR
FGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNSAVRPIESSRPNSELGPSPCSRATQSSEPATPAVAGPASEDPAPVIKLESSGGPSREK
RPRGQTEAVDVSSLGEEVREEAPLKWRRKNKKTTSPLEVGARGALPASFVDRVDDPEARMVISMCADFEAWMVHMPILKHLCCACDFMSMMIIRDC