; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPIUnG00800 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPIUnG00800
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
Genome locationScaffold000138:50704..54878
RNA-Seq ExpressionCSPIUnG00800
SyntenyCSPIUnG00800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147362.2 uncharacterized protein LOC101217609 isoform X2 [Cucumis sativus]6.6e-13388.05Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEK------EKKEGLLNTDTLAIVDNLHKPPVSKILKSSLEVLEN
        DKSGVHNIGLTKATNRVVP HRRAKARGALLQDSEDDNEKERSLQSTFEEK        ++   +     I    H+ P  KI K ++E  +N
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEK------EKKEGLLNTDTLAIVDNLHKPPVSKILKSSLEVLEN

XP_008460879.1 PREDICTED: uncharacterized protein LOC103499622 isoform X1 [Cucumis melo]3.2e-13593.41Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEWVNRGSLSL QFLFHVY+P+PS LRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMATMSLGLFNSLKEKECSL+KEQEHSLQL TMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
        DKS VHNIG TK TNRVVPAHRRAK RGALLQDSEDDNE+ERSLQSTFEEKEKKEGLLNTDTLAIVD L K P
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP

XP_011649429.1 uncharacterized protein LOC101217609 isoform X1 [Cucumis sativus]2.6e-14599.63Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
        DKSGVHNIGLTKATNRVVP HRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP

XP_038902923.1 uncharacterized protein LOC120089505 isoform X1 [Benincasa hispida]2.2e-12888.41Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEW+N+GSL L QFLFHVY+PNPS LRFL TDFHSNTWESTKSA QLEDMRDDIGIGGAFSEFV+YIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMAT+S GLFNSLK KECSL+KEQEHSLQLTTMISTEKEKNENIQTQL QYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSK
        DKS V+NIGLTKATNRVVPAHRRAK RGALLQDSEDDNE E SLQSTFEEKEKK  LLNTDTLA VD   K P  K
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSK

XP_038902924.1 uncharacterized protein LOC120089505 isoform X2 [Benincasa hispida]8.4e-12889.01Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEW+N+GSL L QFLFHVY+PNPS LRFL TDFHSNTWESTKSA QLEDMRDDIGIGGAFSEFV+YIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMAT+S GLFNSLK KECSL+KEQEHSLQLTTMISTEKEKNENIQTQL QYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
        DKS V+NIGLTKATNRVVPAHRRAK RGALLQDSEDDNE E SLQSTFEEKEKK  LLNTDTLA VD   K P
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP

TrEMBL top hitse value%identityAlignment
A0A0A0LK82 Uncharacterized protein1.3e-14599.63Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
        DKSGVHNIGLTKATNRVVP HRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP

A0A1S3CDX2 uncharacterized protein LOC103499622 isoform X11.5e-13593.41Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEWVNRGSLSL QFLFHVY+P+PS LRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMATMSLGLFNSLKEKECSL+KEQEHSLQL TMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
        DKS VHNIG TK TNRVVPAHRRAK RGALLQDSEDDNE+ERSLQSTFEEKEKKEGLLNTDTLAIVD L K P
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP

A0A5A7TGG1 Uncharacterized protein1.5e-13593.41Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEWVNRGSLSL QFLFHVY+P+PS LRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMATMSLGLFNSLKEKECSL+KEQEHSLQL TMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP
        DKS VHNIG TK TNRVVPAHRRAK RGALLQDSEDDNE+ERSLQSTFEEKEKKEGLLNTDTLAIVD L K P
Subjt:  DKSGVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPP

A0A6J1L4C1 uncharacterized protein LOC111499771 isoform X28.2e-10574.55Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        ME+ DFA IFG+P +VEW+NR SL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS 
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
         DGAA  KL AQKSKGMPVFS+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMISTEKEK E+IQ+QLGQY KKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKS-GVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSKIL
        DKS G H IGLTK TNR VPAHRRAK RGALLQDSEDDNE+E SL+ST E+KEK++ L NT+  A VD  HK P   ++
Subjt:  DKS-GVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSKIL

A0A6J1L735 uncharacterized protein LOC111499771 isoform X12.2e-10575.45Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        ME+ DFA IFG+P +VEW+NR SL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS 
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
         DGAA  KL AQKSKGMPVFS+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMISTEKEK E+IQ+QLGQY KKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKS-GVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSK
        DKS G H IGLTK TNR VPAHRRAK RGALLQDSEDDNE+E SL+ST E+KEK++ L NT+  A VD  HK P  K
Subjt:  DKS-GVHNIGLTKATNRVVPAHRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G64010.1 unknown protein1.1e-3740.08Show/hide
Query:  QDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDG
        + F PIFG+    E  + GS  L + LFHVY+ +  +L   VTDF S  W +  S  QL+DMRD +GIGG++SEFVDY VAS+K  +V+L +   S  +G
Subjt:  QDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDG

Query:  AASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSPDKS
          + +L++QK+KGMP  ++ LTK+V+S+A+EAMA +SL LF + K K+       +  +  +   + EK+K +    QL +Y +K  +   + +N  D  
Subjt:  AASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSPDKS

Query:  GVHNIGLTKATNRV--VPAHRRAKARGALLQDSEDDN
           +       N V  VPAHRR + RGALLQDSE+++
Subjt:  GVHNIGLTKATNRV--VPAHRRAKARGALLQDSEDDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTTCAAGATTTTGCGCCAATTTTTGGTAAACCCACCAGAGTGGAGTGGGTAAACAGGGGTTCACTTTCTTTGCTCCAATTTTTGTTCCATGTTTATTCTCCAAA
TCCTTCGCACCTCAGATTTCTTGTTACTGATTTTCATTCTAACACTTGGGAATCCACCAAATCAGCTTTCCAGCTTGAGGATATGAGAGATGACATTGGAATTGGAGGGG
CTTTTTCAGAGTTTGTTGATTATATTGTTGCATCTATGAAATTTGGAGATGTAAGGCTTTGTATGGAAGGACAATCGGGGAAAGATGGTGCAGCATCTGTCAAACTAATT
GCTCAGAAATCAAAGGGAATGCCTGTATTTTCCATTTCTCTCACAAAACTTGTTGACTCTGCTGCTGCTGAAGCTATGGCAACTATGTCCTTGGGGCTTTTTAACTCATT
AAAAGAAAAGGAGTGTTCACTTATGAAAGAACAAGAGCACTCCCTTCAGTTGACAACCATGATATCAACTGAAAAGGAAAAGAACGAAAATATCCAAACTCAGCTCGGGC
AATATAGAAAGAAACAGAAGTTACAAAATATGAATGCCTCAAATTCTCCAGATAAATCTGGCGTACATAATATTGGCTTGACAAAAGCCACCAATCGTGTGGTGCCAGCA
CACCGGAGGGCGAAAGCTAGGGGTGCCCTTCTGCAAGACTCTGAAGATGACAATGAAAAAGAGCGCTCCCTTCAGTCGACATTCGAGGAAAAGGAAAAGAAAGAGGGGTT
GCTAAATACAGATACCTTGGCCATTGTAGACAATCTTCATAAGCCTCCTGTATCCAAGATTCTTAAGAGCTCACTGGAAGTTCTTGAAAACTGTTGTGAACAAGTTGTTC
GAGTTCACGGACGAAACACATTCGGGAGTGCAGACATAATGATGGCATCGGTCAAAACTCTCTGA
mRNA sequenceShow/hide mRNA sequence
GGGGCAGCCCAAAGAACGGTTGTGACCTCCATTGCAGAAACAGAAGAACAAAGCTCATTTGGTTGTAAAACCAATAGAACGAATCGAAAGCCTCCCTTGAGAGTTATACA
GTTGAAAATGGAGCTTCAAGATTTTGCGCCAATTTTTGGTAAACCCACCAGAGTGGAGTGGGTAAACAGGGGTTCACTTTCTTTGCTCCAATTTTTGTTCCATGTTTATT
CTCCAAATCCTTCGCACCTCAGATTTCTTGTTACTGATTTTCATTCTAACACTTGGGAATCCACCAAATCAGCTTTCCAGCTTGAGGATATGAGAGATGACATTGGAATT
GGAGGGGCTTTTTCAGAGTTTGTTGATTATATTGTTGCATCTATGAAATTTGGAGATGTAAGGCTTTGTATGGAAGGACAATCGGGGAAAGATGGTGCAGCATCTGTCAA
ACTAATTGCTCAGAAATCAAAGGGAATGCCTGTATTTTCCATTTCTCTCACAAAACTTGTTGACTCTGCTGCTGCTGAAGCTATGGCAACTATGTCCTTGGGGCTTTTTA
ACTCATTAAAAGAAAAGGAGTGTTCACTTATGAAAGAACAAGAGCACTCCCTTCAGTTGACAACCATGATATCAACTGAAAAGGAAAAGAACGAAAATATCCAAACTCAG
CTCGGGCAATATAGAAAGAAACAGAAGTTACAAAATATGAATGCCTCAAATTCTCCAGATAAATCTGGCGTACATAATATTGGCTTGACAAAAGCCACCAATCGTGTGGT
GCCAGCACACCGGAGGGCGAAAGCTAGGGGTGCCCTTCTGCAAGACTCTGAAGATGACAATGAAAAAGAGCGCTCCCTTCAGTCGACATTCGAGGAAAAGGAAAAGAAAG
AGGGGTTGCTAAATACAGATACCTTGGCCATTGTAGACAATCTTCATAAGCCTCCTGTATCCAAGATTCTTAAGAGCTCACTGGAAGTTCTTGAAAACTGTTGTGAACAA
GTTGTTCGAGTTCACGGACGAAACACATTCGGGAGTGCAGACATAATGATGGCATCGGTCAAAACTCTCTGA
Protein sequenceShow/hide protein sequence
MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAASVKLI
AQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSPDKSGVHNIGLTKATNRVVPA
HRRAKARGALLQDSEDDNEKERSLQSTFEEKEKKEGLLNTDTLAIVDNLHKPPVSKILKSSLEVLENCCEQVVRVHGRNTFGSADIMMASVKTL