; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g10170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g10170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptioncaffeic acid 3-O-methyltransferase-like
Genome locationchr2:7197743..7199939
RNA-Seq ExpressionMoc02g10170
SyntenyMoc02g10170
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]5.4e-2830.46Show/hide
Query:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL
        G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E + L VDQLLA    K   +  GR+Y+ AR+  G +++ P+S K W   WF 
Subjt:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL

Query:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMA
         SGEWL +D  G     VP  FG++  + PVP +   S++ +K  + R  +    G  LV++  L  SGL  ++          P  R ++  +    +A
Subjt:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMA

Query:  SKFLALSSTKRQRASASIAPPAPPQVRHRTSSADVPAPGLANLAKLQALQAQPLQTVAGGEGSSTPLVRKKRARDMLPSEEVNSRSLP-PPIVVDSEEGS
              S  KR+    + A  A         S+  P P +   A         L++  G         R+KR RD   + +  + +   PP+   ++   
Subjt:  SKFLALSSTKRQRASASIAPPAPPQVRHRTSSADVPAPGLANLAKLQALQAQPLQTVAGGEGSSTPLVRKKRARDMLPSEEVNSRSLP-PPIVVDSEEGS

Query:  LH
        LH
Subjt:  LH

XP_022154107.1 caffeic acid 3-O-methyltransferase-like [Momordica charantia]1.9e-3385.26Show/hide
Query:  PVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMASKFLALSSTKRQRASASIAPPAPP
        PV RI+NKSWEII+RAQGR LKYALWGEALV NSHLFSSGLS HDQPDE KKLPKPRQ S QKRKAPE MASKFLALSSTKRQRASA   PPA P
Subjt:  PVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMASKFLALSSTKRQRASASIAPPAPP

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]4.9e-2940.59Show/hide
Query:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL
        G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E + L VDQLLA    K   +  GR+Y+ AR+  G +++ P+S K W   WF 
Subjt:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL

Query:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGL
         SGEWL +D  G     VP  FG++  + PVP +   S++ +K  + R  +    G  LV++  L  SGL
Subjt:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.2e-2738.86Show/hide
Query:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL
        G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E + L VDQLLA    K   +  GR+Y+ AR+    +++ P+S K W   WF 
Subjt:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL

Query:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKR-----AQGRRLKYALWGEALVSNSHLFSSGL
         SGEWL +D  G     VP  FG++  + PVP +   S++ +K       +GR++        LV++  L  SGL
Subjt:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKR-----AQGRRLKYALWGEALVSNSHLFSSGL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.9e-3839.55Show/hide
Query:  PSSCSVGRLAELRKTYSIPNSVELRVHNIGEHLDDPASGFVSFYPQMFR-GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDEDR
        PS      L  LR+ ++IP ++ LR+   GE  D+P  G+V+ Y +MF  G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E  
Subjt:  PSSCSVGRLAELRKTYSIPNSVELRVHNIGEHLDDPASGFVSFYPQMFR-GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDEDR

Query:  L-TVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFLVSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRL
        L  VDQLLA    K   +  GR+Y+ AR+  G +++ P+S K W   WF  SGEWL +D  G     VP  FG++  + PVP +   S++ +K  + R  
Subjt:  L-TVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFLVSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRL

Query:  KYALWGEALVSNSHLFSSGL
        +    G  LV++  L  SGL
Subjt:  KYALWGEALVSNSHLFSSGL

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138262.6e-2830.46Show/hide
Query:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL
        G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E + L VDQLLA    K   +  GR+Y+ AR+  G +++ P+S K W   WF 
Subjt:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL

Query:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMA
         SGEWL +D  G     VP  FG++  + PVP +   S++ +K  + R  +    G  LV++  L  SGL  ++          P  R ++  +    +A
Subjt:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMA

Query:  SKFLALSSTKRQRASASIAPPAPPQVRHRTSSADVPAPGLANLAKLQALQAQPLQTVAGGEGSSTPLVRKKRARDMLPSEEVNSRSLP-PPIVVDSEEGS
              S  KR+    + A  A         S+  P P +   A         L++  G         R+KR RD   + +  + +   PP+   ++   
Subjt:  SKFLALSSTKRQRASASIAPPAPPQVRHRTSSADVPAPGLANLAKLQALQAQPLQTVAGGEGSSTPLVRKKRARDMLPSEEVNSRSLP-PPIVVDSEEGS

Query:  LH
        LH
Subjt:  LH

A0A6J1DKS4 caffeic acid 3-O-methyltransferase-like9.3e-3485.26Show/hide
Query:  PVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMASKFLALSSTKRQRASASIAPPAPP
        PV RI+NKSWEII+RAQGR LKYALWGEALV NSHLFSSGLS HDQPDE KKLPKPRQ S QKRKAPE MASKFLALSSTKRQRASA   PPA P
Subjt:  PVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRKAPEGMASKFLALSSTKRQRASASIAPPAPP

A0A6J1DWD2 uncharacterized protein LOC1110246802.4e-2940.59Show/hide
Query:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL
        G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E + L VDQLLA    K   +  GR+Y+ AR+  G +++ P+S K W   WF 
Subjt:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL

Query:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGL
         SGEWL +D  G     VP  FG++  + PVP +   S++ +K  + R  +    G  LV++  L  SGL
Subjt:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGL

A0A6J1DWF1 uncharacterized protein LOC1110251085.8e-2838.86Show/hide
Query:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL
        G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E + L VDQLLA    K   +  GR+Y+ AR+    +++ P+S K W   WF 
Subjt:  GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDE-DRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFL

Query:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKR-----AQGRRLKYALWGEALVSNSHLFSSGL
         SGEWL +D  G     VP  FG++  + PVP +   S++ +K       +GR++        LV++  L  SGL
Subjt:  VSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKR-----AQGRRLKYALWGEALVSNSHLFSSGL

A0A6J1DXS5 uncharacterized protein LOC1110255024.8e-3839.55Show/hide
Query:  PSSCSVGRLAELRKTYSIPNSVELRVHNIGEHLDDPASGFVSFYPQMFR-GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDEDR
        PS      L  LR+ ++IP ++ LR+   GE  D+P  G+V+ Y +MF  G+RLPLHPF+Q FL    LAPAQ++PNGW  +    +L+W+  RD +E  
Subjt:  PSSCSVGRLAELRKTYSIPNSVELRVHNIGEHLDDPASGFVSFYPQMFR-GVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWV-LRDEDEDR

Query:  L-TVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFLVSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRL
        L  VDQLLA    K   +  GR+Y+ AR+  G +++ P+S K W   WF  SGEWL +D  G     VP  FG++  + PVP +   S++ +K  + R  
Subjt:  L-TVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKNWKECWFLVSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRL

Query:  KYALWGEALVSNSHLFSSGL
        +    G  LV++  L  SGL
Subjt:  KYALWGEALVSNSHLFSSGL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G30230.1 myosin heavy chain-related9.9e-0426.59Show/hide
Query:  NPYVEDSSANPNIASTSGNPNLGPVAELPPSSCSVGRLAELRKTYSIPNSV--ELRVHNIGEHLDDPASGFVSFYPQMFRGVRL--PLHPFIQRFLSAVN
        +P  ++  +N +I  T  N   GP      S C++  L  L+  Y I   +  E  +H   E  +DP  G++  Y   F+G  L  PL   +  +L A+ 
Subjt:  NPYVEDSSANPNIASTSGNPNLGPVAELPPSSCSVGRLAELRKTYSIPNSV--ELRVHNIGEHLDDPASGFVSFYPQMFRGVRL--PLHPFIQRFLSAVN

Query:  LAPAQLSPNGWSTVLGAYVLWWVLRDEDEDRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKN
        +A  QL+PN   T+ G       +  E    + V +L    T+++S++  G      RR      RN S ++N
Subjt:  LAPAQLSPNGWSTVLGAYVLWWVLRDEDEDRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQLIRNPSSNKN

AT5G32590.1 myosin heavy chain-related2.8e-0628.42Show/hide
Query:  NPYVEDSSANPNIASTSGNPNLGPVAELPPSSCSVGRLAELRKTYSIPNSV--ELRVHNIGEHLDDPASGFVSFYPQMFRGVRL--PLHPFIQRFLSAVN
        +P  +D  ++ N+  T  N   GP  +   S+C++  L  L +   IP  V  +L+     E  +D   GF   Y   F+G  L  PL   + R+L+A+ 
Subjt:  NPYVEDSSANPNIASTSGNPNLGPVAELPPSSCSVGRLAELRKTYSIPNSV--ELRVHNIGEHLDDPASGFVSFYPQMFRGVRL--PLHPFIQRFLSAVN

Query:  LAPAQLSPNGWSTVLGAYVLWWVLRDEDEDRLTVDQLLATHTVKASNEGEGRY--YLSARRNVGQLIRNPSSNKNWKECWFLV
        +A  QL+PN   T+LG       +  E    + V +L     V+ S++  G +  Y +A RN+  +   P+ ++N    WFLV
Subjt:  LAPAQLSPNGWSTVLGAYVLWWVLRDEDEDRLTVDQLLATHTVKASNEGEGRY--YLSARRNVGQLIRNPSSNKNWKECWFLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCGTGCTGTCCTATAAAAGGCGAATTCAGGTACTCTCTCCCTCTTTCTTTAGATAGTTCTTTCAAGTCCTCCCAGTCAGGTCTGGCGATGAGTAGCTCGTCTAG
GGACTCGTCGTCTGAGTCGTCTAGTTCAGACGACGTTTTAGGAGATCCGACTTTAGCTCCTACCTCAAGGAACCCTTACGTCGAGGATAGCTCGGCGAACCCCAACATAG
CCTCAACTTCGGGAAACCCTAACCTCGGTCCTGTAGCTGAACTCCCTCCTTCATCCTGCTCGGTAGGTAGGTTGGCCGAGCTCCGCAAAACTTACTCAATCCCCAACTCT
GTAGAGCTTAGGGTGCATAACATCGGGGAACACCTCGACGACCCTGCTTCGGGCTTCGTCAGTTTTTACCCCCAGATGTTCCGGGGGGTTAGACTTCCACTCCATCCTTT
TATACAAAGGTTCCTTAGCGCAGTTAATCTTGCCCCAGCTCAACTCTCCCCTAACGGATGGTCTACCGTGCTCGGAGCCTACGTCCTGTGGTGGGTCCTCCGAGACGAGG
ATGAGGACAGGCTTACGGTGGACCAGCTCCTAGCGACCCATACCGTCAAAGCTTCCAACGAGGGAGAGGGCCGGTACTACCTCTCAGCTCGGCGGAACGTAGGTCAGCTG
ATCAGGAACCCCTCCTCTAACAAAAATTGGAAGGAGTGTTGGTTCCTCGTCTCTGGCGAGTGGCTTCTGAGGGATGGAGGGGGTGAGCCGACCTGCCAAGTCCCAGTAGA
GTTCGGGGACGTAGCTCTTCTGTCCCCCGTCCCGAGGATCAACAACAAGTCTTGGGAGATCATCAAACGAGCCCAAGGTCGGAGACTAAAGTACGCGCTGTGGGGTGAAG
CACTTGTCTCAAACTCCCACCTCTTCTCCAGCGGCCTCAGTCCTCACGATCAACCAGACGAGCTCAAGAAGCTCCCCAAACCTCGACAACGCAGCTTGCAGAAGCGCAAA
GCTCCTGAAGGAATGGCTTCCAAATTCCTCGCACTCAGCAGTACAAAGCGGCAGCGGGCCTCGGCCTCGATTGCTCCGCCGGCGCCCCCCCAAGTTCGTCATCGCACAAG
TAGCGCCGACGTCCCGGCTCCAGGGCTGGCCAACTTGGCGAAGCTTCAGGCCCTCCAAGCTCAGCCCCTGCAGACTGTCGCTGGGGGCGAGGGATCGTCCACTCCTCTGG
TGAGGAAGAAGCGAGCGAGGGATATGCTTCCCTCCGAGGAGGTCAACTCCCGATCTCTGCCTCCCCCCATCGTGGTCGACTCAGAGGAGGGGTCGCTACATGGTCGGGGT
CATGGCGCCGAGCTCCCACTCCCTTTCAGGCTCGCTTTCTCCCGCCTCAGTATTCACGAGGCCGCGACGAAAGTGAGGTCCTCGAGCTCTCACTTGGCAGAACGAGCTAT
TCAGGCGCCCGAGGACACTGCGGAGGCGGTGAGAAAGACCATAGCTCTTGCTGCAGAGCTGCACAATCATACCTGCCTCTCGGCGTCGATGATGAGCTTGGAGCTCGGAG
GGCGGGACTTGAAGGAGGAGGAGACGAATGCCTCCTACGAGCGGCGGATTGCCGAGCTTCAGGCCGAGGTTGAGCAGGCTCGAGCCGACAAGGAGGCCGAGGCAGATAGG
TCGAGGGCCTTGATAGCTCAAGCTCTCACTGACCTGATGAATGCCCGGAGCCGAGCCGAGACTGCAGAGAAGGGCTGGAATGAGGCTCAACAGAACTTGGCGAGCTGGGC
CTACTTGGAGAAGAGAATCAAAGAATTTCCTGAGTTTGACGGCCTAGCTCGCGACATGGTAGACAAGGGCTTCTACTACGCCGTCAACAGGCTTGACCTCGCTTGCTCGG
AGGATACCATCGTCGCTGTCGAGCAAGAGTACGAGTCCATCTGGGTGGACCGCCCTGCTGAGGCCAGCAAAGAGGCCGAGGCTGCTGGTGCTGAAGCAGGGCTCGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCGTGCTGTCCTATAAAAGGCGAATTCAGGTACTCTCTCCCTCTTTCTTTAGATAGTTCTTTCAAGTCCTCCCAGTCAGGTCTGGCGATGAGTAGCTCGTCTAG
GGACTCGTCGTCTGAGTCGTCTAGTTCAGACGACGTTTTAGGAGATCCGACTTTAGCTCCTACCTCAAGGAACCCTTACGTCGAGGATAGCTCGGCGAACCCCAACATAG
CCTCAACTTCGGGAAACCCTAACCTCGGTCCTGTAGCTGAACTCCCTCCTTCATCCTGCTCGGTAGGTAGGTTGGCCGAGCTCCGCAAAACTTACTCAATCCCCAACTCT
GTAGAGCTTAGGGTGCATAACATCGGGGAACACCTCGACGACCCTGCTTCGGGCTTCGTCAGTTTTTACCCCCAGATGTTCCGGGGGGTTAGACTTCCACTCCATCCTTT
TATACAAAGGTTCCTTAGCGCAGTTAATCTTGCCCCAGCTCAACTCTCCCCTAACGGATGGTCTACCGTGCTCGGAGCCTACGTCCTGTGGTGGGTCCTCCGAGACGAGG
ATGAGGACAGGCTTACGGTGGACCAGCTCCTAGCGACCCATACCGTCAAAGCTTCCAACGAGGGAGAGGGCCGGTACTACCTCTCAGCTCGGCGGAACGTAGGTCAGCTG
ATCAGGAACCCCTCCTCTAACAAAAATTGGAAGGAGTGTTGGTTCCTCGTCTCTGGCGAGTGGCTTCTGAGGGATGGAGGGGGTGAGCCGACCTGCCAAGTCCCAGTAGA
GTTCGGGGACGTAGCTCTTCTGTCCCCCGTCCCGAGGATCAACAACAAGTCTTGGGAGATCATCAAACGAGCCCAAGGTCGGAGACTAAAGTACGCGCTGTGGGGTGAAG
CACTTGTCTCAAACTCCCACCTCTTCTCCAGCGGCCTCAGTCCTCACGATCAACCAGACGAGCTCAAGAAGCTCCCCAAACCTCGACAACGCAGCTTGCAGAAGCGCAAA
GCTCCTGAAGGAATGGCTTCCAAATTCCTCGCACTCAGCAGTACAAAGCGGCAGCGGGCCTCGGCCTCGATTGCTCCGCCGGCGCCCCCCCAAGTTCGTCATCGCACAAG
TAGCGCCGACGTCCCGGCTCCAGGGCTGGCCAACTTGGCGAAGCTTCAGGCCCTCCAAGCTCAGCCCCTGCAGACTGTCGCTGGGGGCGAGGGATCGTCCACTCCTCTGG
TGAGGAAGAAGCGAGCGAGGGATATGCTTCCCTCCGAGGAGGTCAACTCCCGATCTCTGCCTCCCCCCATCGTGGTCGACTCAGAGGAGGGGTCGCTACATGGTCGGGGT
CATGGCGCCGAGCTCCCACTCCCTTTCAGGCTCGCTTTCTCCCGCCTCAGTATTCACGAGGCCGCGACGAAAGTGAGGTCCTCGAGCTCTCACTTGGCAGAACGAGCTAT
TCAGGCGCCCGAGGACACTGCGGAGGCGGTGAGAAAGACCATAGCTCTTGCTGCAGAGCTGCACAATCATACCTGCCTCTCGGCGTCGATGATGAGCTTGGAGCTCGGAG
GGCGGGACTTGAAGGAGGAGGAGACGAATGCCTCCTACGAGCGGCGGATTGCCGAGCTTCAGGCCGAGGTTGAGCAGGCTCGAGCCGACAAGGAGGCCGAGGCAGATAGG
TCGAGGGCCTTGATAGCTCAAGCTCTCACTGACCTGATGAATGCCCGGAGCCGAGCCGAGACTGCAGAGAAGGGCTGGAATGAGGCTCAACAGAACTTGGCGAGCTGGGC
CTACTTGGAGAAGAGAATCAAAGAATTTCCTGAGTTTGACGGCCTAGCTCGCGACATGGTAGACAAGGGCTTCTACTACGCCGTCAACAGGCTTGACCTCGCTTGCTCGG
AGGATACCATCGTCGCTGTCGAGCAAGAGTACGAGTCCATCTGGGTGGACCGCCCTGCTGAGGCCAGCAAAGAGGCCGAGGCTGCTGGTGCTGAAGCAGGGCTCGGCTAG
Protein sequenceShow/hide protein sequence
MPPCCPIKGEFRYSLPLSLDSSFKSSQSGLAMSSSSRDSSSESSSSDDVLGDPTLAPTSRNPYVEDSSANPNIASTSGNPNLGPVAELPPSSCSVGRLAELRKTYSIPNS
VELRVHNIGEHLDDPASGFVSFYPQMFRGVRLPLHPFIQRFLSAVNLAPAQLSPNGWSTVLGAYVLWWVLRDEDEDRLTVDQLLATHTVKASNEGEGRYYLSARRNVGQL
IRNPSSNKNWKECWFLVSGEWLLRDGGGEPTCQVPVEFGDVALLSPVPRINNKSWEIIKRAQGRRLKYALWGEALVSNSHLFSSGLSPHDQPDELKKLPKPRQRSLQKRK
APEGMASKFLALSSTKRQRASASIAPPAPPQVRHRTSSADVPAPGLANLAKLQALQAQPLQTVAGGEGSSTPLVRKKRARDMLPSEEVNSRSLPPPIVVDSEEGSLHGRG
HGAELPLPFRLAFSRLSIHEAATKVRSSSSHLAERAIQAPEDTAEAVRKTIALAAELHNHTCLSASMMSLELGGRDLKEEETNASYERRIAELQAEVEQARADKEAEADR
SRALIAQALTDLMNARSRAETAEKGWNEAQQNLASWAYLEKRIKEFPEFDGLARDMVDKGFYYAVNRLDLACSEDTIVAVEQEYESIWVDRPAEASKEAEAAGAEAGLG