; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G02390 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G02390
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of unknown function (DUF962)
Genome locationChr1:1518796..1521777
RNA-Seq ExpressionCSPI01G02390
SyntenyCSPI01G02390
Gene Ontology termsGO:0046521 - sphingoid catabolic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009305 - 2-hydroxy-palmitic acid dioxygenase Mpo1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138174.1 uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis sativus]8.5e-11099.49Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFIAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

XP_008453216.1 PREDICTED: uncharacterized endoplasmic reticulum membrane protein YGL010W [Cucumis melo]2.7e-10897.98Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

XP_022933403.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita moschata]7.7e-10392.42Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTL+YAA YV+FDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

XP_022972978.1 uncharacterized endoplasmic reticulum membrane protein C16E8.02 [Cucurbita maxima]7.2e-10190.4Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ +AFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTL YAA YV+FDK+AGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQ IGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFS+SVQAKI+ADI+EWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

XP_038880038.1 2-hydroxy-palmitic acid dioxygenase MPO1 [Benincasa hispida]1.4e-10494.44Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGK+GLFDLE+ FAFYGAYHSNP+NIFIH+LFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTL+YAA YVVFDK+AGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI EWKE K+KLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

TrEMBL top hitse value%identityAlignment
A0A0A0LP54 Uncharacterized protein4.1e-11099.49Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFIAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

A0A1S3BV50 uncharacterized endoplasmic reticulum membrane protein YGL010W1.3e-10897.98Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

A0A5A7UQ13 Putative endoplasmic reticulum membrane protein1.3e-10897.98Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGF FTLMYAAYYVVFDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GAS+IAN+LGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

A0A6J1F4N0 uncharacterized endoplasmic reticulum membrane protein C16E8.023.7e-10392.42Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ FAFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTL+YAA YV+FDKRAGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFSASVQAKI+ADI+EWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

A0A6J1IBP8 uncharacterized endoplasmic reticulum membrane protein C16E8.023.5e-10190.4Show/hide
Query:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV
        MGKT  FDLE+ +AFYGAYHSNP+NIFIHVLFVWPIFFT+LMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTL YAA YV+FDK+AGSMAALLCFVCWV
Subjt:  MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWV

Query:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS
        GASFI+NRLGYSQTWKVVLAAQLFCWTNQ IGHGVFEKRAPALLDNLAQAFLMAPFFV LEVLQ+LFKYEPYPGFS+SVQAKI+ADI+EWKE KEKLS
Subjt:  GASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS

SwissProt top hitse value%identityAlignment
O13737 2-hydroxy-palmitic acid dioxygenase mpo19.4e-1935.36Show/hide
Query:  LEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSM-AALLCFVCWVGASFIAN
        L + ++FY AYHSNP+NI IH + +  +  T+L+ L+      T+  S       L +N   L  L Y  +YV  D   G + + +L    ++  S +  
Subjt:  LEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSM-AALLCFVCWVGASFIAN

Query:  RLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIR
            S   +      + CW  QFIGHGVFEKR PALLDNL Q+  +AP F  LE         P+ G+  SV +KI+A+I+
Subjt:  RLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIR

P25338 2-hydroxy-palmitic acid dioxygenase MPO11.4e-1733.88Show/hide
Query:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWVGASF
        GL DL  Q  FY  YH NP N+ IH +FV  I F+    L+    + +I          L      LF++ Y   Y+     AG +  LL        + 
Subjt:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWVGASF

Query:  IANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI
        I +R+    T+K  L      W  QF+GHGVFEKR PAL+DNL Q+ ++AP+F++ E L  L       GF   ++A ++ D+
Subjt:  IANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADI

Arabidopsis top hitse value%identityAlignment
AT1G18720.1 Protein of unknown function (DUF962)9.5e-6763.32Show/hide
Query:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLY-FTPSFYTIPKSPCGFDHGLVL------NFGFLFTLMYAAYYVVFDKRAGSMAALLCFV
        GLFDLEK FAFYGAYHSNP+NI IH++FVWPIFF+ L+ L+  TP F     S  GF   L L      N GF+F L+YA +Y+  DK++G +AAL+CF 
Subjt:  GLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLY-FTPSFYTIPKSPCGFDHGLVL------NFGFLFTLMYAAYYVVFDKRAGSMAALLCFV

Query:  CWVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEK
        CWVG+SF+A RLG S   KV LA+QL CWT QF+GHGVFEKRAPALLDNL QAFLMAPFFV+LEVLQS+F YEPYPGF A V AK+++DI+E++  K+K
Subjt:  CWVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEK

AT1G74440.1 Protein of unknown function (DUF962)1.0e-6560.5Show/hide
Query:  KTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCG------FDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCF
        + GL DLEK FAFYGAYHSNP+NI IH LFVWP  F +L++LY TP    +  S  G      FD  L L+ GF  T+ YA +Y+  DK++G +AALLCF
Subjt:  KTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCG------FDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCF

Query:  VCWVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEK
         CW+G+SF+A RLG+S T KV +A+QL CWT QF+GHG+FEKRAPALLDNL QAFLM PFFV+LEVLQS+F YEPYPGF A V +KI++ I+EW+E K++
Subjt:  VCWVGASFIANRLGYSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGACTGGATTGTTTGATCTGGAGAAACAGTTCGCCTTCTATGGCGCTTATCACAGCAACCCAATGAACATTTTCATCCACGTTTTGTTTGTGTGGCCAATCTT
CTTCACCAGTCTCATGTATTTGTATTTCACCCCTTCATTCTACACAATTCCCAAATCCCCTTGTGGGTTTGACCACGGCTTGGTTTTGAACTTCGGATTTCTTTTCACTT
TAATGTATGCCGCATATTATGTTGTTTTCGATAAGAGAGCTGGGTCCATGGCTGCTTTGCTTTGTTTCGTTTGTTGGGTTGGAGCAAGCTTTATCGCTAATAGACTTGGT
TATTCTCAGACTTGGAAGGTTGTACTGGCCGCTCAGTTGTTCTGTTGGACCAATCAGTTTATAGGCCATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGACAATCT
TGCACAAGCTTTTCTAATGGCTCCATTCTTTGTAGTTTTAGAGGTTCTTCAAAGTTTATTCAAATATGAACCATATCCGGGGTTTAGTGCGAGTGTGCAAGCAAAGATAA
AAGCAGATATAAGAGAATGGAAAGAAACGAAGGAAAAACTGTCATAG
mRNA sequenceShow/hide mRNA sequence
GAAATAATAAAAAACAAAAAAAAAAAAATTAAAGAAACATGAATTTCAAAGTTTCTAAAAAAAAAAATAAAGGACCATTCACAAATTTCGTCAATTATTTAATATTTCCA
ACCGAAACTCCACTTTAAACAGTTTATCATCTCGAATTGTTATTGAGATTAATTTTGCTGACGATCTTAACCAACCCAATTAATATATCTTTCTTTAATGATCATTTATC
TTCTTACCCTCATAATCTCTCCGGTCATCTTACCTTACACGCCTCACCTCTTCTGAGAAAATTTTCCGGCGATCGTTCCGTTTCGTATTTGGGTTGTTTGTCAGGAGATT
TCATTATGGGGAAGACTGGATTGTTTGATCTGGAGAAACAGTTCGCCTTCTATGGCGCTTATCACAGCAACCCAATGAACATTTTCATCCACGTTTTGTTTGTGTGGCCA
ATCTTCTTCACCAGTCTCATGTATTTGTATTTCACCCCTTCATTCTACACAATTCCCAAATCCCCTTGTGGGTTTGACCACGGCTTGGTTTTGAACTTCGGATTTCTTTT
CACTTTAATGTATGCCGCATATTATGTTGTTTTCGATAAGAGAGCTGGGTCCATGGCTGCTTTGCTTTGTTTCGTTTGTTGGGTTGGAGCAAGCTTTATCGCTAATAGAC
TTGGTTATTCTCAGACTTGGAAGGTTGTACTGGCCGCTCAGTTGTTCTGTTGGACCAATCAGTTTATAGGCCATGGAGTATTTGAGAAACGAGCACCGGCTTTGTTAGAC
AATCTTGCACAAGCTTTTCTAATGGCTCCATTCTTTGTAGTTTTAGAGGTTCTTCAAAGTTTATTCAAATATGAACCATATCCGGGGTTTAGTGCGAGTGTGCAAGCAAA
GATAAAAGCAGATATAAGAGAATGGAAAGAAACGAAGGAAAAACTGTCATAGATTGAATTGGCTAGACTTCAAAACAACCCTGCCTTGGATTTCCCCTTGTAAAAATAGT
GTGGAAACTAACTACTTCAGTTGTAAAATCTTTCTATGATATTATTAAACATGTTTCTGCTTTTCAACCTTGAGTTGCCATTCTACTATGATGGATAACAGAATACATTA
AAAGAATTAGGAAGACAAAGGAGTAAAAACTTACTTGTTAAATAAGAACTCCAAATGAATGGACACTGAAAATTTAGATATAACTTTCGTCCCTTTCTGCATCATATTAC
AATGGATTGATTTCCAAATAACTATTACCTGGCTCTGGTTTCTCCACTCCTTTCCGGTGCATTGATTTCCTGGTCTTTATCATTTCATTCCATTGACCAGAACTTGCGTA
CATATTTGCAAGAAGAACATAATCACTACTATGATCTGCCACTATCTCTAGCACATGGCTACTTACTCTCTCCCCGAGTTCAACGTTTCCATGCATCTGACAGGCTGCGA
GCAATGTTCTCCATATGACAGCATTGCACTCCATTGGCATGCTCTTTATTAGCTGATAAGCTTCTTCTACAAATCCAGCTCGTCCCAGAATATCCACCATGGATCCATAA
TGCTTAAGTGTGGGTTGGATATTGAAGTGTTTGGTCATAAGATCAAAATATCTCCTCCCTTCCTCCACCTTCCCTCCATAATTACAAGCACACAATACTGCCAAGAAAGT
AACACCATCAGGAGTCTCCACCCTCTCGGCTAACATGTTTGAGAAGAGTGTCAGTGCATCCTCTGCATCACCATGTGTTGCTAATCCCATGATCATTGTGTTCCACGTTA
CTATGTTCTTGCCGCTCACTGCGTTAAACATCTCGCGGGCGTATTCAACCGCTCCACATTTGGCGTACATGTCGATCAATGAATTGAAAACAGCTATAGTTTTTCCCCTA
TTGTTACTGTTGACATGGGAATGAACCCATCTCCCACAGTCCAGTGCACCCAATGCAGAACATGCTGAAATCGTCACAACCAGTGTGGCTTCGTCTGGCTCGACGCCACT
CTGCAACATTTGAACAAACAAGTCAAGTGCTTCGTTGTACATTCCACAAGAGACATGACAGTCAATTACGGCATTCCAAGCCACTAAATCTGTTTTGGGCAATTCATCAA
ACAGGTTGCGTGCTATATTGACGTCTTTTAACCTGCCATACATATGTATAAGCGTGTTCCTCACATACACATGAGAATCAAGGCCAAGTTTCAGAATATTAACATGCAAC
TGTTTCCCCAACATAATTGAACCCAACTGCCCAGTCATCTTCAGCAAGAAAGAAAAAGTGAAATTATCCGCCGCTATTCCCTTCTCTAACATCCTCTTGTAGAACTCAAA
CGCCATTAGCAGTTTACGATTTCTTCCAAATCCCCTGATCATTGTATTCCAAAGAAACCCATCTGCGTTTTCGATTCTGTCAAAAACAACAACAGCGTAATTCATGTCTC
CATGATCCGAAACCGCACAAAAATCAATGAGTTTGCCAATAACAAAGAGATTCTGATCGAAACCCAATCGAAT
Protein sequenceShow/hide protein sequence
MGKTGLFDLEKQFAFYGAYHSNPMNIFIHVLFVWPIFFTSLMYLYFTPSFYTIPKSPCGFDHGLVLNFGFLFTLMYAAYYVVFDKRAGSMAALLCFVCWVGASFIANRLG
YSQTWKVVLAAQLFCWTNQFIGHGVFEKRAPALLDNLAQAFLMAPFFVVLEVLQSLFKYEPYPGFSASVQAKIKADIREWKETKEKLS