; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G016100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G016100
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF962)
Genome locationCG_Chr06:29387330..29388771
RNA-Seq ExpressionClCG06G016100
SyntenyClCG06G016100
Gene Ontology termsGO:0046521 - sphingoid catabolic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009305 - 2-hydroxy-palmitic acid dioxygenase Mpo1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138174.1 uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis sativus]4.5e-10393.94Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TLMYA  YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GASFIAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKE KEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

XP_008453216.1 PREDICTED: uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis melo]5.9e-10393.43Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF  TLMYA  YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKEKKEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

XP_022933403.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita moschata]1.0e-10292.42Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+  FDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TL+YA SYV+FDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKEKKEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

XP_023531593.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita pepo subsp. pepo]2.5e-10192.31Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+  FDLERHFAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TL+YA SYV+FDKKAGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKE
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKEKK+
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKE

XP_038880038.1 2-hydroxy-palmitic acid dioxygenase MPO1 [Benincasa hispida]1.1e-10696.97Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGKSGLFDLERHFAFYGAYHSNPVNIFIH+LFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TL+YA SYVVFDKKAGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKK+K+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

TrEMBL top hitse value%identityAlignment
A0A0A0LP54 Uncharacterized protein2.2e-10393.94Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TLMYA  YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GASFIAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKE KEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

A0A1S3BV50 uncharacterized endoplasmic reticulum membrane protein YGL010W2.8e-10393.43Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF  TLMYA  YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKEKKEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

A0A5A7UQ13 Putative endoplasmic reticulum membrane protein2.8e-10393.43Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF  TLMYA  YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKEKKEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

A0A6J1F4N0 uncharacterized endoplasmic reticulum membrane protein C16E8.024.8e-10392.42Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+  FDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TL+YA SYV+FDK+AGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKEKKEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

A0A6J1IBP8 uncharacterized endoplasmic reticulum membrane protein C16E8.024.5e-10190.91Show/hide
Query:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV
        MGK+  FDLERH+AFYGAYHSNPVNIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFL TL YA SYV+FDKKAGSMAALLCFVCWV
Subjt:  MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQ IGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFS+SVQAKI+ADI+EWKE KEK+S
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS

SwissProt top hitse value%identityAlignment
O13737 2-hydroxy-palmitic acid dioxygenase mpo13.5e-1836.46Show/hide
Query:  LERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSM-AALLCFVCWVGASFIAN
        L R ++FY AYHSNPVNI IH + +  +  T+L+ L+      T+  S       L +N   L  L Y   YV  D   G + + +L    ++  S +  
Subjt:  LERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSM-AALLCFVCWVGASFIAN

Query:  RLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIE
            S   +      + CW  QFIGHGVFEKR PALLDNL Q+  +AP F  LE         P+ G+  SV +KI+A+I+
Subjt:  RLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIE

P25338 2-hydroxy-palmitic acid dioxygenase MPO17.4e-1633.15Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWVGASF
        GL DL     FY  YH NP N+ IH +FV  I F+    L+    + +I          L      L ++ Y   Y+     AG +  LL        + 
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWVGASF

Query:  IANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIE
        I +R+    T+K  L      W  QF+GHGVFEKR PAL+DNL Q+ ++AP+F++ E L  L       GF   ++A ++ D+E
Subjt:  IANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIE

Arabidopsis top hitse value%identityAlignment
AT1G18720.1 Protein of unknown function (DUF962)4.7e-6663.82Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLY-FTPSFYTIPKSPCGFDHGLVL------NFGFLSTLMYAGSYVVFDKKAGSMAALLCFV
        GLFDLE+HFAFYGAYHSNP+NI IH++FVWPIFF+ L+ L+  TP F     S  GF   L L      N GF+  L+YA  Y+  DKK+G +AAL+CF 
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLY-FTPSFYTIPKSPCGFDHGLVL------NFGFLSTLMYAGSYVVFDKKAGSMAALLCFV

Query:  CWVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEK
        CWVG+SF+A RLG S   KV LA+QL CWT QF+GHGVFEKRAPALLDNL QAFLMAPFFV+LEVLQS+F YEPYPGF A V AK+++DI+E++ KK+K
Subjt:  CWVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEK

AT1G74440.1 Protein of unknown function (DUF962)1.4e-6562.12Show/hide
Query:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCG------FDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVC
        GL DLE+HFAFYGAYHSNP+NI IH LFVWP  F +L++LY TP    +  S  G      FD  L L+ GF  T+ YA  Y+  DKK+G +AALLCF C
Subjt:  GLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCG------FDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVC

Query:  WVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEK
        W+G+SF+A RLG+S T KV +A+QL CWT QF+GHG+FEKRAPALLDNL QAFLM PFFV+LEVLQS+F YEPYPGF A V +KI++ I+EW+EKK++
Subjt:  WVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGTCTGGATTGTTTGATCTGGAGAGGCACTTCGCCTTCTATGGCGCCTATCACAGCAACCCAGTCAACATTTTCATTCATGTTCTGTTTGTTTGGCCGATTTT
CTTCACCAGTCTCATGTATTTGTATTTCACACCTTCATTCTACACTATTCCCAAATCCCCTTGTGGGTTTGACCATGGCTTGGTTTTGAACTTCGGATTTCTCTCCACTT
TAATGTATGCTGGATCTTATGTGGTTTTCGATAAGAAAGCAGGGTCCATGGCCGCTTTGCTTTGTTTCGTTTGTTGGGTTGGAGCAAGCTTCATCGCTAATAGGCTTGGT
TATTCTCAAACTTGGAAGGTTGTACTGGCAGCTCAGTTGTTCTGTTGGACCAATCAGTTTATAGGACATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGACAATCT
CGCACAAGCTTTTCTAATGGCTCCATTCTTTGTAGTTTTGGAGGTTCTTCAAAGTTTATTCAAATATGAACCATATCCAGGGTTTAGTGCGAGTGTGCAAGCAAAGATAA
AAGCAGATATAGAAGAATGGAAGGAAAAGAAGGAAAAGATGTCATAG
mRNA sequenceShow/hide mRNA sequence
TTTAACCAAACCAATTAAGGTATCCTTCTCTGATAGTTATTTATTTTGTAACCCTCATAATCTCTCGGTCGTCTTCTGAGAAAATTTTCCCGCAATCGTTTCGTTTCGTA
TTTGGGTTGTCTGTCTATTTTCTCAGAAGATTTAACTATGGGGAAGTCTGGATTGTTTGATCTGGAGAGGCACTTCGCCTTCTATGGCGCCTATCACAGCAACCCAGTCA
ACATTTTCATTCATGTTCTGTTTGTTTGGCCGATTTTCTTCACCAGTCTCATGTATTTGTATTTCACACCTTCATTCTACACTATTCCCAAATCCCCTTGTGGGTTTGAC
CATGGCTTGGTTTTGAACTTCGGATTTCTCTCCACTTTAATGTATGCTGGATCTTATGTGGTTTTCGATAAGAAAGCAGGGTCCATGGCCGCTTTGCTTTGTTTCGTTTG
TTGGGTTGGAGCAAGCTTCATCGCTAATAGGCTTGGTTATTCTCAAACTTGGAAGGTTGTACTGGCAGCTCAGTTGTTCTGTTGGACCAATCAGTTTATAGGACATGGAG
TATTTGAGAAACGAGCACCGGCTTTGTTAGACAATCTCGCACAAGCTTTTCTAATGGCTCCATTCTTTGTAGTTTTGGAGGTTCTTCAAAGTTTATTCAAATATGAACCA
TATCCAGGGTTTAGTGCGAGTGTGCAAGCAAAGATAAAAGCAGATATAGAAGAATGGAAGGAAAAGAAGGAAAAGATGTCATAGGTTGAATTGCCTAGACTTCGAAAACA
ACTCGTGTCTTGGATTACCTCTTGTAAAAATAGTGTGAAAACTAACTATTTCACTTGAAAAATCTATGATATTATTATACATGTTCTGCATTTCAACCTTGATTTGCCAT
TCTACTATGATGTGTGACAGAATATATTAAAAGAATTAAGAGGACACAGGTGTGAGGACTTACATGTTAAATAAGAACTCCAAACGAATGCACA
Protein sequenceShow/hide protein sequence
MGKSGLFDLERHFAFYGAYHSNPVNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLSTLMYAGSYVVFDKKAGSMAALLCFVCWVGASFIANRLG
YSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIEEWKEKKEKMS