CuGenDBv2

Cmc02g0056871 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0056871
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Protein of unknown function (DUF962)
Genome location: CMiso1.1chr02:23070560..23073663
RNA-Seq Expression: Cmc02g0056871
Synteny: Cmc02g0056871
Gene Ontology terms:
  GO:0046521 - sphingoid catabolic process (biological process)
  GO:0005783 - endoplasmic reticulum (cellular component)
  GO:0016020 - membrane (cellular component)
InterPro domains: IPR009305 - 2-hydroxy-palmitic acid dioxygenase Mpo1-like
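
The genome location string above follows a `sequence:start..end` pattern. As a minimal sketch, it could be split into its parts as below. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something the record states.

```python
import re

# Pattern for "sequence:start..end" (layout inferred from the record above)
LOCATION_RE = re.compile(r"^(?P<seq>.+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(location):
    """Split a location string into (sequence name, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group("seq"), int(match.group("start")), int(match.group("end"))

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seq, start, end = parse_location("CMiso1.1chr02:23070560..23073663")
print(seq, start, end, span_length(start, end))
```

Under that assumption the gene spans 3,104 bp on CMiso1.1chr02.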


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004138174.1 uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis sativus] | e-value: 6.0e-108 | identity: 98.48%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GAS+IANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKE KEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

XP_008453216.1 PREDICTED: uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis melo] | e-value: 8.4e-110 | identity: 100%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

XP_022933403.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita moschata] | e-value: 1.1e-101 | identity: 91.41%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTL+YAA YV+FDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GAS+I+N+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKEKKEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

XP_023531593.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita pepo subsp. pepo] | e-value: 1.8e-99 | identity: 90.26%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTL+YAA YV+FDK+AGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKE
        GAS+I+N+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKEKK+
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKE

XP_038880038.1 2-hydroxy-palmitic acid dioxygenase MPO1 [Benincasa hispida] | e-value: 2.0e-103 | identity: 93.43%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIH+LFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTL+YAA YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKEKK+KLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LP54 Uncharacterized protein | e-value: 2.9e-108 | identity: 98.48%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GAS+IANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKE KEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

A0A1S3BV50 uncharacterized endoplasmic reticulum membrane protein YGL010W | e-value: 4.1e-110 | identity: 100%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

A0A5A7UQ13 Putative endoplasmic reticulum membrane protein | e-value: 4.1e-110 | identity: 100%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

A0A6J1F4N0 uncharacterized endoplasmic reticulum membrane protein C16E8.02 | e-value: 5.4e-102 | identity: 91.41%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTL+YAA YV+FDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GAS+I+N+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKEKKEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

A0A6J1IBP8 uncharacterized endoplasmic reticulum membrane protein C16E8.02 | e-value: 2.5e-99 | identity: 88.89%
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ +AFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTL YAA YV+FDK+AGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
        GAS+I+N+LGYSQTWKVVLAAQLFCWTNQ IGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFS+SVQAKI+ADI+EWKE KEKLS
Subjt:  GASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS

SwissProt top hits (e-value, % identity, alignment)
O13737 2-hydroxy-palmitic acid dioxygenase mpo1 | e-value: 3.5e-18 | identity: 34.81%
Query:  LEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSM-AALLCFVCWVGASYIAN
        L + ++FY AYHSNP+NI IH + +  +  T+L+ L+      T+  S       L +N      L Y  +YV  D   G + + +L    ++  S +  
Subjt:  LEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSM-AALLCFVCWVGASYIAN

Query:  KLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIR
            S   +      + CW  QFIGHGVFEKR PALLDNL Q+  +AP F  LE         P+ G+  SV +KI+A+I+
Subjt:  KLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIR

P25338 2-hydroxy-palmitic acid dioxygenase MPO1 | e-value: 1.9e-16 | identity: 32.79%
Query:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWVGASY
        GL DL  Q  FY  YH NP N+ IH +FV  I F+    L+    + +I          L       F++ Y   Y+     AG +  LL        + 
Subjt:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWVGASY

Query:  IANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI
        I +++    T+K  L      W  QF+GHGVFEKR PAL+DNL Q+ ++AP+F++ E L  L       GF   ++A ++ D+
Subjt:  IANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI

Arabidopsis top hits (e-value, % identity, alignment)
AT1G18720.1 Protein of unknown function (DUF962) | e-value: 8.0e-66 | identity: 62.81%
Query:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLY-FTPSFYTIPKSPCGFDHGLVL------NFGFFFTLMYAAYYVVFDKRAGSMAALLCFV
        GLFDLEK FAFYGAYHSNP+NI IH++FVWPIFF+ L+ L+  TP F     S  GF   L L      N GF F L+YA +Y+  DK++G +AAL+CF 
Subjt:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLY-FTPSFYTIPKSPCGFDHGLVL------NFGFFFTLMYAAYYVVFDKRAGSMAALLCFV

Query:  CWVGASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEK
        CWVG+S++A +LG S   KV LA+QL CWT QF+GHGVFEKRAPALLDNL QAFLMAPFFV+LEVLQS+F YEPYPGF A V AK+++DI+E++ KK+K
Subjt:  CWVGASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEK

AT1G74440.1 Protein of unknown function (DUF962) | e-value: 5.2e-65 | identity: 60%
Query:  KTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCG------FDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCF
        + GL DLEK FAFYGAYHSNP+NI IH LFVWP  F +L++LY TP    +  S  G      FD  L L+ GF  T+ YA +Y+  DK++G +AALLCF
Subjt:  KTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCG------FDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCF

Query:  VCWVGASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEK
         CW+G+S++A +LG+S T KV +A+QL CWT QF+GHG+FEKRAPALLDNL QAFLM PFFV+LEVLQS+F YEPYPGF A V +KI++ I+EW+EKK++
Subjt:  VCWVGASYIANKLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEK
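
The % identity values listed with the hits above can be checked against the alignment blocks. This assumes the usual BLAST-style convention: the unlabeled middle line repeats the residue where query and subject are identical, and shows "+" (similar) or a space (mismatch or gap) elsewhere. The sketch below is illustrative only. The function name `percent_identity` is hypothetical and is not part of CuGenDB or BLAST.

```python
def percent_identity(blocks):
    """Percent identity over one or more alignment blocks.

    blocks: iterable of (query_line, midline) string pairs, with the
    "Query:" label and leading padding already removed.
    """
    identical = 0
    aligned = 0
    for query, midline in blocks:
        # Trailing spaces in the midline are often lost in copied text,
        # so pad it back to the query length before counting.
        midline = midline.ljust(len(query))[:len(query)]
        # Letters in the midline mark identical positions;
        # "+" and spaces do not count.
        identical += sum(1 for m in midline if m.isalpha())
        aligned += len(query)
    if aligned == 0:
        raise ValueError("no aligned positions")
    return 100.0 * identical / aligned

# Short fragment taken from one of the alignments above
print(round(percent_identity([("GASYIANKLG", "GAS+IANKLG")]), 2))
```

For instance, 195 identical positions out of 198 aligned gives 98.48%, which is consistent with the value reported for XP_004138174.1.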


Sequences
CDS sequence
ATGGGAAAGACTGGATTGTTTGATCTGGAGAAACAGTTCGCCTTCTATGGCGCTTATCACAGCAACCCAATGAACATTTTCATCCACGTTTTGTTTGTGTGGCCAATTTT
CTTCACCAGTCTCATGTATTTGTATTTCACCCCTTCATTCTACACTATTCCCAAATCCCCTTGTGGGTTTGACCACGGCTTGGTTTTGAACTTCGGATTTTTTTTCACTT
TAATGTATGCCGCATATTATGTTGTTTTTGATAAGAGAGCTGGGTCCATGGCTGCTTTGCTTTGTTTCGTTTGTTGGGTTGGAGCAAGCTACATCGCTAATAAACTTGGT
TATTCTCAGACTTGGAAGGTTGTACTGGCAGCTCAGTTGTTCTGTTGGACCAATCAGTTTATAGGCCATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGATAATCT
TGCACAAGCTTTTTTAATGGCTCCGTTCTTTGTAGTTTTAGAGGTTCTTCAAAGTTTATTCAAATATGAACCATATCCGGGGTTTAGTGCGAGTGTGCAAGCAAAGATAA
AAGCAGATATAAGAGAATGGAAAGAAAAGAAGGAAAAGTTGTCATGA
mRNA sequence
GGACCATTCACAATTTTGTCAATTATATAATGTTTCCAACCGAATTACTCCACTTTAAACTGCTTATCCTCTCGAATTGTTATTGAGATTAATTTAGCTGACGATCTTAA
CCAACCCAATTAATATATCTTTCTCTAGACTCTAATCGTCCTTTATTTTCTTACCCTCATAATCTCTCCGGTCATCTTACCTTACACGCCTCGCCTCTTCTGAGAAAATT
TTCAGGTGATCGTTCCGTTTCGTATTTGGGTTGTTTGTCAGAAGATTTCATTATGGGAAAGACTGGATTGTTTGATCTGGAGAAACAGTTCGCCTTCTATGGCGCTTATC
ACAGCAACCCAATGAACATTTTCATCCACGTTTTGTTTGTGTGGCCAATTTTCTTCACCAGTCTCATGTATTTGTATTTCACCCCTTCATTCTACACTATTCCCAAATCC
CCTTGTGGGTTTGACCACGGCTTGGTTTTGAACTTCGGATTTTTTTTCACTTTAATGTATGCCGCATATTATGTTGTTTTTGATAAGAGAGCTGGGTCCATGGCTGCTTT
GCTTTGTTTCGTTTGTTGGGTTGGAGCAAGCTACATCGCTAATAAACTTGGTTATTCTCAGACTTGGAAGGTTGTACTGGCAGCTCAGTTGTTCTGTTGGACCAATCAGT
TTATAGGCCATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGATAATCTTGCACAAGCTTTTTTAATGGCTCCGTTCTTTGTAGTTTTAGAGGTTCTTCAAAGTTTA
TTCAAATATGAACCATATCCGGGGTTTAGTGCGAGTGTGCAAGCAAAGATAAAAGCAGATATAAGAGAATGGAAAGAAAAGAAGGAAAAGTTGTCATGATTTGTATTGGC
TAGACTTCAAAACAACCCTGCCTTGGATTTCCCCTTGTAAAAATAGTGTGGAAACTAACTACTTCAGTTGAAAAACCTTTCTATGATATTATTAAACATGTTTCTGCTTT
TCAACCTTGAGTTGCCATTCTACTATGATGTAAAACAGAATACATTAAAAGAATTAGGAGGACAAAGGTGTGAAGACTTACGTGTTAAATAAGAACTCCAAATGAATGGA
CACTGAAAATTTAGATATAACTTTCATTCCTTTCTGCATTATTACAATGGATTGATTTCCAAATAACTATTACCTGGCTCTGGTTTCTCCACTCCTTTCTGTTGCATTGA
TTTTCTGGTCTTTATCATTTCATTCCATTGACCAGAACTTGCGTACATATTTGCAAGAAGAACATAATCACTACTATGATCTGGCTCTATCTCTAGCACATGGCTACTTA
CTCTCTCCCCGAGTTTAACGTTTCCATGCATCTGACAGGCTGCGAGCAATGTTCTCCATATGACAGCATTGCACTCCATTGGCATGCTCTTTATTAGCTGATAAGCTTCT
TCTACAAATCCAGCTCGTCCCAGAATATCCACCATGGATCCATAATGCTTAAGTGTGGGTTGGATATTGAAGTGTTTGGTCATAAGATCAAAATACCTCCTCCCTTCCTC
CACCTTCCCTCCATGATTACAAGCACACAATACTGCCAAGAAAGTAACAGCATCAGGAGTCTCCACCTTCTCGGTTAACATGTTTGAGAAGAGTGTCAGTGCATCCTCTG
CATCACCATGTGTTGCTAATCCCATGATCATTGTGTTCCATGTTACTACGTTCTTGCCGCTCATTGCGTTAAACATCTCGCGGGCGTAATCAACCGCTCCACATTTGGCG
TACATGTCGATCAACGAATTGAAAACAGCTGTAGTTTTTCCCCTATCGTTACTGTTGACATGGGAATGAATCCACCTCCCAAAGTCCAGTGCACCCAATGCAGAACATGC
TGAAACCGTCACAACCAGTGTGGCTTCGTCTGGCTCTACGCCACTCTGCAACATTTGAAGAAACAAGTCAAGTGCTTCGTTGTACATTCCACAAGAGACATGACAGTCAA
TTACGGCATTCCAAGCCACTAAATCTGTTTTGGGCAATTCATCAAACAGGTTTCGTGCTATATTGACGTCTTTTAACCTGCCATACATATGTATAAGCGTGTTCCTCACA
TACACATGAGAATCAAGGCCAAGTTTCAGAATATTAACATGCAACTGTTTCCCCAACATAATTGAACCCAACTGCCCGGTAATCTTCAGCAAGAAAGAAAAAGTGAAATT
ATCTGCCTCTATTCCCTTCTCTAACATCCTCTTGTAGAACTCAAACGCCATTAGCAGTTTCCTAATCCTTCCAAATCCCCTGATCATTGTATTCCAAAGAAACCCATCTG
GGTTTTCGATTCTGTCAAAAACATCAACAGCGTAATTCATGTCTCCATGATCCGAAACCGCACAAAAATCAATGAGTTTGCCAATAACAAAGAGATTCTGATCGAAACCC
AATCGAATAATACGGGCATGGAGTTGATTCAAGTGTTTCAAAGTGGAACATTGCTTGAAAAGAAACATTAAATTTTGCTCCTTAGCAAAATAGCCGCCGTTGGGTATTGT
ATCAGTGGCTGTTTTTGGACCCGAATTCCAAATGAACCTGGGAACAACCAACTGAAATTTTGAAAATTTGATCTTACCATGGGAACAC
Protein sequence
MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFFFTLMYAAYYVVFDKRAGSMAALLCFVCWVGASYIANKLG
YSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKEKKEKLS
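
The CDS and protein sequences above should be related by translation. As a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base, the relationship can be checked as follows. The helper name `translate` is hypothetical. For real work an established library would normally be preferable.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS (DNA, frame 0) up to the first stop codon."""
    # Remove whitespace and line breaks, then normalise case
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks unknown or ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above
print(translate("ATGGGAAAGACTGGATTGTTTGATCTGGAG"))
```

Pasting the full CDS (line breaks included) into `translate` should give a sequence that can be compared directly with the protein sequence listed above.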