; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g20280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g20280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr7:14593700..14595403
RNA-Seq ExpressionMoc07g20280
SyntenyMoc07g20280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.4e-10580.84Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEAREE
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAV GPAS+DP PVIELESS GPSREKRPR+QTEAVD         PLGEE REE
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEAREE

Query:  TPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQ
         PLKRRRKKKK  SP EVGA  VLPAS+ADRVDDP ARMGGT DVT RFR+EPSS GVR+Q
Subjt:  TPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.1e-14294.18Show/hide
Query:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR
        MFEYG+RLPLHPFVQE LF TGLAPAQV PNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGR YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEA
        GVKRKSKGRAHALEAAQSSKPPTPAV GPAS+DP PVIELESS GPSREKRPR+QTEAVDAQTEAAD PPLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]3.0e-10396.88Show/hide
Query:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR
        MFEYG+RLPLHPFVQE LF TGLAPAQV PNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGR YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.8e-18694.33Show/hide
Query:  MSSSISSNLGSDLARRSESELEEIEHFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGIR
        MSSSISSNL SDLARR ES+LEEIE+ RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYG+R
Subjt:  MSSSISSNLGSDLARRSESELEEIEHFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGIR

Query:  LPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQE LF TGLAPAQV PNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGR YMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVD
        GRAHALEAAQSSKP TPAV GPAS+DP  VIELESS GPSREKRPR+QTEAVD
Subjt:  GRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]8.3e-11469.28Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPP
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  PTPVIEL+ S G S EKR RE++EA+D         P
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPP

Query:  LGEEAREETPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI
        L  E R E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR +RRASKFVS PGSVLQRTI
Subjt:  LGEEAREETPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI

Query:  DYAAEAFVASIQSALVIKAELDGREALAAREK
        D  AEAF+ASI  A+++KAELDGREALAA+E+
Subjt:  DYAAEAFVASIQSALVIKAELDGREALAAREK

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092986.9e-10680.84Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEAREE
        AVRPIESSRPNSELAMVCGFAS VKRKSKG+AHALEAAQSSKP TPAV GPAS+DP PVIELESS GPSREKRPR+QTEAVD         PLGEE REE
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEAREE

Query:  TPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQ
         PLKRRRKKKK  SP EVGA  VLPAS+ADRVDDP ARMGGT DVT RFR+EPSS GVR+Q
Subjt:  TPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQ

A0A6J1CR42 uncharacterized protein LOC1110138265.4e-14394.18Show/hide
Query:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR
        MFEYG+RLPLHPFVQE LF TGLAPAQV PNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGR YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEA
        GVKRKSKGRAHALEAAQSSKPPTPAV GPAS+DP PVIELESS GPSREKRPR+QTEAVDAQTEAAD PPLGE A
Subjt:  GVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEA

A0A6J1DWD2 uncharacterized protein LOC1110246801.4e-10396.88Show/hide
Query:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR
        MFEYG+RLPLHPFVQE LF TGLAPAQV PNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGR YMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC1110255022.3e-18694.33Show/hide
Query:  MSSSISSNLGSDLARRSESELEEIEHFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGIR
        MSSSISSNL SDLARR ES+LEEIE+ RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERAD+PPEGWVTLYFKMFEYG+R
Subjt:  MSSSISSNLGSDLARRSESELEEIEHFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGIR

Query:  LPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQE LF TGLAPAQV PNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGR YMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSK

Query:  GRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVD
        GRAHALEAAQSSKP TPAV GPAS+DP  VIELESS GPSREKRPR+QTEAVD
Subjt:  GRAHALEAAQSSKPPTPAVAGPASDDPTPVIELESSVGPSREKRPREQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256654.0e-11469.28Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPP
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  PTPVIEL+ S G S EKR RE++EA+D         P
Subjt:  AVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAV--------AGPASDDPTPVIELESSVGPSREKRPREQTEAVDAQTEAADAPP

Query:  LGEEAREETPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI
        L  E R E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR +RRASKFVS PGSVLQRTI
Subjt:  LGEEAREETPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTI

Query:  DYAAEAFVASIQSALVIKAELDGREALAAREK
        D  AEAF+ASI  A+++KAELDGREALAA+E+
Subjt:  DYAAEAFVASIQSALVIKAELDGREALAAREK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related2.3e-0530.6Show/hide
Query:  PENILLRLPEEGERADHPPEGWVTLYFKMF-EYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   G+  PL  F+ E      +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRLPEEGERADHPPEGWVTLYFKMF-EYGIRLPLHPFVQELLFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRIYMCARKGAGGIVKGPTS-IKGWVRKWFY
        + PG  Y  A K    IV G  S I GW R++F+
Subjt:  KKPGRIYMCARKGAGGIVKGPTS-IKGWVRKWFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCCATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGTCAGAGTCCGAGCTCGAAGAGATAGAACACTTTAGAATCTCCGATGACGGGGAGGATAGCGA
TGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTATTTCAAAATGTTTGAGTACGGCATCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTG
CTCTTCTGGACTGGGTTGGCTCCGGCTCAAGTGGGCCCCAATGGGTGGGGTGTCATCTTCGCTTTGGCCATCCTCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGAATCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATTGTTAAGG
GGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGAC
TGACGAACTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCTAGGCCGAACTCTGAACTCGCCATGGTTTGCGGATTTGCAAGCG
GCGTGAAGCGCAAGTCCAAGGGCCGAGCTCATGCTCTTGAGGCTGCCCAGAGTTCCAAACCGCCCACCCCTGCCGTGGCAGGGCCTGCCTCGGACGATCCAACCCCGGTG
ATCGAGCTGGAGTCTTCTGTGGGTCCCTCGAGGGAGAAGCGCCCTAGGGAGCAGACCGAGGCAGTGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCATTGGGCGAGGA
GGCGAGGGAGGAAACCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTGGGCTGATCGGGTGG
ACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACTCGCATCTCGGCTGCGAGT
TTGGACCGTTGCATAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGC
TCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCCATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGTCAGAGTCCGAGCTCGAAGAGATAGAACACTTTAGAATCTCCGATGACGGGGAGGATAGCGA
TGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
CGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTATTTCAAAATGTTTGAGTACGGCATCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTG
CTCTTCTGGACTGGGTTGGCTCCGGCTCAAGTGGGCCCCAATGGGTGGGGTGTCATCTTCGCTTTGGCCATCCTCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGAATCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATTGTTAAGG
GGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGAC
TGACGAACTGCTGCTCGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATCGAATCCTCTAGGCCGAACTCTGAACTCGCCATGGTTTGCGGATTTGCAAGCG
GCGTGAAGCGCAAGTCCAAGGGCCGAGCTCATGCTCTTGAGGCTGCCCAGAGTTCCAAACCGCCCACCCCTGCCGTGGCAGGGCCTGCCTCGGACGATCCAACCCCGGTG
ATCGAGCTGGAGTCTTCTGTGGGTCCCTCGAGGGAGAAGCGCCCTAGGGAGCAGACCGAGGCAGTGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCATTGGGCGAGGA
GGCGAGGGAGGAAACCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTGGGCTGATCGGGTGG
ACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACTCGCATCTCGGCTGCGAGT
TTGGACCGTTGCATAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGC
TCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAATAG
Protein sequenceShow/hide protein sequence
MSSSISSNLGSDLARRSESELEEIEHFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGIRLPLHPFVQEL
LFWTGLAPAQVGPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRIYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG
NLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASGVKRKSKGRAHALEAAQSSKPPTPAVAGPASDDPTPV
IELESSVGPSREKRPREQTEAVDAQTEAADAPPLGEEAREETPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAAS
LDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREK