; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr9:7601022..7607339
RNA-Seq ExpressionMoc09g09220
SyntenyMoc09g09220
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]8.9e-11279.85Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRA+DSEEAELLDVD LLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKE FPR                                     AMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFAS

Query:  NVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAV-------DVLPLGE
         VKRKSKGR+HALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKR RDQTEAV       DV PLGE
Subjt:  NVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAV-------DVLPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.0e-9986.19Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKN
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAE FVASIQSALAV+AELDGRE LAAREKEEFSAALE ASSTMK+
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKN

Query:  ELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFD
        ELLKAHSEVE LKAEVE +AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KER SNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ
        GFAKDFSDAGFKFLMKGIASD+ DLQID  GLK+RYAE+
Subjt:  GFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.0e-13674.93Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENF---------------------------------RGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN                                  RGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENF---------------------------------RGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRA+DSEEAEL DVD LLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKE FPR                                     AMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKRK

Query:  SKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVD
        SKGR+HALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKR RDQTEAVD
Subjt:  SKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]1.9e-9869.87Show/hide
Query:  MVCGFASNVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVREEVPLKRRRKKKKTTSPLEVGARG
        MVCGFAS+VKRKSKGR+HA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKR RDQTEAVD LPLGEEVREEVPLKRRRKKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVREEVPLKRRRKKKKTTSPLEVGARG

Query:  ALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAR
         LP SFADRVDDPE RMGGTSDVTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE FVASIQSALAV+AELDGRE LAAR
Subjt:  ALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAR

Query:  EKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKER
        EKEEFS                                                                        ALEAK++EL+HATAELET KER
Subjt:  EKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKER

Query:  FSNGALLEESFR
         SNG LLEESFR
Subjt:  FSNGALLEESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]7.7e-15665.42Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-----------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL  A+FDTLK+YK+HFPR                       
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-----------------------

Query:  --------------AMVCGFASNVKRKSKGRSHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVRE
                      AMVCGF  +VKRKSKGR+HAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DV PL  EVR 
Subjt:  --------------AMVCGFASNVKRKSKGRSHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEF
        E PL+RRRKKKKT+S  E GARG LPTS AD VDDPE RM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE F
Subjt:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEF

Query:  VASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ V+AELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ
        Q LE K+  +   T EL+ +KER +NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+  LQID  GLKK+Y+E+
Subjt:  QALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138264.3e-11279.85Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRA+DSEEAELLDVD LLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKE FPR                                     AMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFAS

Query:  NVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAV-------DVLPLGE
         VKRKSKGR+HALEAAQSSKP TPAVVGPASEDPAPVIELESSGGPSREKR RDQTEAV       DV PLGE
Subjt:  NVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAV-------DVLPLGE

A0A6J1D971 uncharacterized protein LOC1110185385.0e-10086.19Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKN
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAE FVASIQSALAV+AELDGRE LAAREKEEFSAALE ASSTMK+
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKN

Query:  ELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFD
        ELLKAHSEVE LKAEVE +AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KER SNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ
        GFAKDFSDAGFKFLMKGIASD+ DLQID  GLK+RYAE+
Subjt:  GFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ

A0A6J1DXS5 uncharacterized protein LOC1110255021.9e-13674.93Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENF---------------------------------RGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN                                  RGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENF---------------------------------RGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRA+DSEEAEL DVD LLACFEAKRIAKKP RFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELT ASFDTLKYYKE FPR                                     AMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKRK

Query:  SKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVD
        SKGR+HALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKR RDQTEAVD
Subjt:  SKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256069.4e-9969.87Show/hide
Query:  MVCGFASNVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVREEVPLKRRRKKKKTTSPLEVGARG
        MVCGFAS+VKRKSKGR+HA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKR RDQTEAVD LPLGEEVREEVPLKRRRKKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVREEVPLKRRRKKKKTTSPLEVGARG

Query:  ALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAR
         LP SFADRVDDPE RMGGTSDVTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE FVASIQSALAV+AELDGRE LAAR
Subjt:  ALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAR

Query:  EKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKER
        EKEEFS                                                                        ALEAK++EL+HATAELET KER
Subjt:  EKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKER

Query:  FSNGALLEESFR
         SNG LLEESFR
Subjt:  FSNGALLEESFR

A0A6J1DZB3 uncharacterized protein LOC1110256653.7e-15665.42Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-----------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL  A+FDTLK+YK+HFPR                       
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPR-----------------------

Query:  --------------AMVCGFASNVKRKSKGRSHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVRE
                      AMVCGF  +VKRKSKGR+HAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DV PL  EVR 
Subjt:  --------------AMVCGFASNVKRKSKGRSHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRLRDQTEAVDVLPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEF
        E PL+RRRKKKKT+S  E GARG LPTS AD VDDPE RM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE F
Subjt:  EVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEEF

Query:  VASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ V+AELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ
        Q LE K+  +   T EL+ +KER +NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+  LQID  GLKK+Y+E+
Subjt:  QALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related4.6e-0522.96Show/hide
Query:  IPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRI
        +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +Q+       + ++     L+   +     L V+ +       ++
Subjt:  IPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRI

Query:  AKKPSRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
          K  + Y+ + +G   +  GP+  + W+  +FYA
Subjt:  AKKPSRFYMCARKGAGGIVKGPTSIKGWVRKWFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAATTGGGGTGTGGAGGAAGAGTAGGCCTGACAACCGATCTAGGGGCCCAACAACCACAACCGCTGGTCGGAGTGGTTCTAATGATCTTGTCTCCTCCCACTGC
CTATAAAAGGGACACTCCACCTCTTTACTCAACTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACGACGATCAAGTCAGTA
TAGGTCGGATTCCCAGTTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCACTCGACTTGCTTTGG
ACGCGTGGCGACTTCCTATTCGTGGGAAAACATAACCGTTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGGATCCTAGCCGCTC
GTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGG
ATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAAGAGATAGAAAACTTTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGAC
AATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGAACTGGGTTGGCTCC
AGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCAGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCACCTCC
TCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTAGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGG
GTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCGACCAGT
CCCCGAGCTTACGCCAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGCCATGGTTTGCGGATTCGCAAGCAACGTGAAGCGCAAGTCCAAGGGCC
GATCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGACCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGT
CCTTCGAGGGAGAAACGCCTCAGGGATCAGACCGAGGCGGTGGACGTCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAA
GACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGTCAGGATGGGCGGGACGTCCGACGTGACAGCAC
GGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCG
GGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGAGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGGAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGC
GAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGAATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGG
TCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAAGAGAAGTTCCAACTACTCAAGGAG
AAGGATGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTTTCAGCAATGGAGCCCTATTGGAGGA
ATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGATATTCTTGACCTTCAGATCG
ATTTCGGTGGTCTAAAGAAGAGGTATGCTGAGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAAATTGGGGTGTGGAGGAAGAGTAGGCCTGACAACCGATCTAGGGGCCCAACAACCACAACCGCTGGTCGGAGTGGTTCTAATGATCTTGTCTCCTCCCACTGC
CTATAAAAGGGACACTCCACCTCTTTACTCAACTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACGACGATCAAGTCAGTA
TAGGTCGGATTCCCAGTTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCACTCGACTTGCTTTGG
ACGCGTGGCGACTTCCTATTCGTGGGAAAACATAACCGTTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGGATCCTAGCCGCTC
GTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGG
ATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAAGAGATAGAAAACTTTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGAC
AATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGAACTGGGTTGGCTCC
AGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCAGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCACCTCC
TCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTAGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGG
GTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCGACCAGT
CCCCGAGCTTACGCCAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGCCATGGTTTGCGGATTCGCAAGCAACGTGAAGCGCAAGTCCAAGGGCC
GATCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGACCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGT
CCTTCGAGGGAGAAACGCCTCAGGGATCAGACCGAGGCGGTGGACGTCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAA
GACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGTCAGGATGGGCGGGACGTCCGACGTGACAGCAC
GGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCG
GGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGAGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGGAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGC
GAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGAATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGG
TCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAAGAGAAGTTCCAACTACTCAAGGAG
AAGGATGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTTTCAGCAATGGAGCCCTATTGGAGGA
ATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGATATTCTTGACCTTCAGATCG
ATTTCGGTGGTCTAAAGAAGAGGTATGCTGAGCAGTAG
Protein sequenceShow/hide protein sequence
MMKLGCGGRVGLTTDLGAQQPQPLVGVVLMILSPPTAYKRDTPPLYSTRTRPPDRSEYLGGPAQKGEHDDQVSIGRIPSLVRGYSLPQTLAPSLSGPISTWQRSSLDLLW
TRGDFLFVGKHNRCGGFIVGIFKYSDASDLREDPSRSLITRLEPLVGRSLPSLSLSNVVAMSSSFSSDLGSDEDLARRLESELEEIENFRGFAIPENILLRIPEEGERAD
NPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRAQDSEEAELLDVDHLLACFEAKRIAKKPSRFYMCARKGAGGIVKGPTSIKGW
VRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTPASFDTLKYYKEHFPRAMVCGFASNVKRKSKGRSHALEAAQSSKPATPAVVGPASEDPAPVIELESSGG
PSREKRLRDQTEAVDVLPLGEEVREEVPLKRRRKKKKTTSPLEVGARGALPTSFADRVDDPEVRMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDP
GSVLQRTIDYAAEEFVASIQSALAVEAELDGREALAAREKEEFSAALEAASSTMKNELLKAHSEVEILKAEVEVKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKE
KDDMLQALEAKEEELKHATAELETVKERFSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDILDLQIDFGGLKKRYAEQ