; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G17154 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G17154
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionUnknown protein
Genome locationctg2699:360596..370465
RNA-Seq ExpressionCucsat.G17154
SyntenyCucsat.G17154
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147362.2 uncharacterized protein LOC101217609 isoform X2 [Cucumis sativus]4.35e-208100Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTNLFMISARQKSPSMSCQHITGQEHEVPFCKITKMTMEGKKNKTVSKC
        DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTNLFMISARQKSPSMSCQHITGQEHEVPFCKITKMTMEGKKNKTVSKC
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTNLFMISARQKSPSMSCQHITGQEHEVPFCKITKMTMEGKKNKTVSKC

XP_008460879.1 PREDICTED: uncharacterized protein LOC103499622 isoform X1 [Cucumis melo]6.48e-16083.62Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEWVNRGSLSL QFLFHVY+P+PS LRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMATMSLGLFNSLKEKECSL+KEQEHSLQL TMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKIT
        DKS VHNIG TK TNRVVP HRRAK RGALLQDSEDDNE+ERSLQSTFEEK           L ++   QKSP  S        H++   KIT
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKIT

XP_011649429.1 uncharacterized protein LOC101217609 isoform X1 [Cucumis sativus]4.14e-17088.44Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKITK
        DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEK           L ++    K P+ S        H++   KITK
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKITK

XP_038902923.1 uncharacterized protein LOC120089505 isoform X1 [Benincasa hispida]8.46e-15479.8Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEW+N+GSL L QFLFHVY+PNPS LRFL TDFHSNTWESTKSA QLEDMRDDIGIGGAFSEFV+YIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMAT+S GLFNSLK KECSL+KEQEHSLQLTTMISTEKEKNENIQTQL QYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKITKMTM
        DKS V+NIGLTKATNRVVP HRRAK RGALLQDSEDDNE E SLQSTFEEK           L  +   QKSP+          H++   K+TK  M
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKITKMTM

XP_038902924.1 uncharacterized protein LOC120089505 isoform X2 [Benincasa hispida]1.42e-15480.47Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEW+N+GSL L QFLFHVY+PNPS LRFL TDFHSNTWESTKSA QLEDMRDDIGIGGAFSEFV+YIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMAT+S GLFNSLK KECSL+KEQEHSLQLTTMISTEKEKNENIQTQL QYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKITKMTM
        DKS V+NIGLTKATNRVVP HRRAK RGALLQDSEDDNE E SLQSTFEEK           L  +   QKSP  S  H  G        K+TK  M
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKITKMTM

TrEMBL top hitse value%identityAlignment
A0A0A0LK82 Uncharacterized protein1.41e-17093.09Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSM
        DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEK           L ++    K PSM
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSM

A0A1S3CDX2 uncharacterized protein LOC103499622 isoform X13.14e-16083.62Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEWVNRGSLSL QFLFHVY+P+PS LRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMATMSLGLFNSLKEKECSL+KEQEHSLQL TMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKIT
        DKS VHNIG TK TNRVVP HRRAK RGALLQDSEDDNE+ERSLQSTFEEK           L ++   QKSP  S        H++   KIT
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKIT

A0A5A7TGG1 Uncharacterized protein3.14e-16083.62Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        MELQDFAPIFG+PTRVEWVNRGSLSL QFLFHVY+P+PS LRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
        KDGAA VKLIAQKSKGMPVFSISLTKL+DSAA+EAMATMSLGLFNSLKEKECSL+KEQEHSLQL TMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKIT
        DKS VHNIG TK TNRVVP HRRAK RGALLQDSEDDNE+ERSLQSTFEEK           L ++   QKSP  S        H++   KIT
Subjt:  DKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTN---------LFMISARQKSPSMSCQHITGQEHEVPFCKIT

A0A6J1L4C1 uncharacterized protein LOC111499771 isoform X22.94e-12569.18Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        ME+ DFA IFG+P +VEW+NR SL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS 
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
         DGAA  KL AQKSKGMPVFS+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMISTEKEK E+IQ+QLGQY KKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKS-GVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTNLFMISARQKSPSMSCQHITGQE---HEVPFCKITKMTM
        DKS G H IGLTK TNR VP HRRAK RGALLQDSEDDNE+E SL+ST E+K     +     S ++   H +  +   H++   K+T   M
Subjt:  DKS-GVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTNLFMISARQKSPSMSCQHITGQE---HEVPFCKITKMTM

A0A6J1L739 uncharacterized protein LOC111499771 isoform X51.52e-12577.78Show/hide
Query:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG
        ME+ DFA IFG+P +VEW+NR SL+   FLFHV++PNPSHLRF VTDFHSNTWESTKS  QL DMRD+IGIGGA SEFVDYI+ S+KFGDVRL +E QS 
Subjt:  MELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSG

Query:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP
         DGAA  KL AQKSKGMPVFS+SLTKL D AA+EA+AT+SLGLFNSLK KECSL+KEQE SLQLTTMISTEKEK E+IQ+QLGQY KKQKLQNMNASNSP
Subjt:  KDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSP

Query:  DKS-GVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEK
        DKS G H IGLTK TNR VP HRRAK RGALLQDSEDDNE+E SL+ST E+K
Subjt:  DKS-GVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G64010.1 unknown protein2.7e-3739.66Show/hide
Query:  QDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDG
        + F PIFG+    E  + GS  L + LFHVY+ +  +L   VTDF S  W +  S  QL+DMRD +GIGG++SEFVDY VAS+K  +V+L +   S  +G
Subjt:  QDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDIGIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDG

Query:  AASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSPDKS
          + +L++QK+KGMP  ++ LTK+V+S+A+EAMA +SL LF + K K+       +  +  +   + EK+K +    QL +Y +K  +   + +N  D  
Subjt:  AASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQTQLGQYRKKQKLQNMNASNSPDKS

Query:  GVHNIGLTKATNRV--VPVHRRAKARGALLQDSEDDN
           +       N V  VP HRR + RGALLQDSE+++
Subjt:  GVHNIGLTKATNRV--VPVHRRAKARGALLQDSEDDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATTGTTGGGGCAGCCCAAAGAACGGTTGTGACCTCCATTGCAGAAACAGAAGAACAAAGCTCATTTGGTTGTAAAACCAATAGAACGAATCGAAAGCCTCCCTTGAGAGT
TATACAGTTGAAAATGGAGCTTCAAGATTTTGCGCCAATTTTTGGTAAACCCACCAGAGTGGAGTGGGTAAACAGGGGTTCACTTTCTTTGCTCCAATTTTTGTTCCATG
TTTATTCTCCAAATCCTTCGCACCTCAGATTTCTTGTTACTGATTTTCATTCTAACACTTGGGAATCCACCAAATCAGCTTTCCAGCTTGAGGATATGAGAGATGACATT
GGAATTGGAGGGGCTTTTTCAGAGTTTGTTGATTATATTGTTGCATCTATGAAATTTGGAGATGTAAGGCTTTGTATGGAAGGACAATCGGGGAAAGATGGTGCAGCATC
TGTCAAACTAATTGCTCAGAAATCAAAGGGAATGCCTGTATTTTCCATTTCTCTCACAAAACTTGTTGACTCTGCTGCTGCTGAAGCTATGGCAACTATGTCCTTGGGGC
TTTTTAACTCATTAAAAGAAAAGGAGTGTTCACTTATGAAAGAACAAGAGCACTCCCTTCAGTTGACAACCATGATATCAACTGAAAAGGAAAAGAACGAAAATATCCAA
ACTCAGCTCGGGCAATATAGAAAGAAACAGAAGTTACAAAATATGAATGCCTCAAATTCTCCTGATAAATCTGGCGTACATAATATTGGCTTGACAAAAGCCACCAATCG
TGTGGTGCCAGTACACCGGAGGGCGAAAGCTAGGGGTGCCCTTCTGCAAGACTCTGAAGATGACAATGAAAAAGAGCGCTCCCTTCAGTCGACATTCGAGGAAAAGACAA
ATCTGTTCATGATATCGGCTCGACAAAAATCACCAAGCATGTCGTGCCAACACATCACAGGGCAAGAACACGAGGTGCCCTTCTGCAAGATAACGAAGATGACGATGGAA
GGTAAAAAAAACAAGACAGTTTCAAAATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATTGTTGGGGCAGCCCAAAGAACGGTTGTGACCTCCATTGCAGAAACAGAAGAACAAAGCTCATTTGGTTGTAAAACCAATAGAACGAATCGAAAGCCTCCCTTGAGAGT
TATACAGTTGAAAATGGAGCTTCAAGATTTTGCGCCAATTTTTGGTAAACCCACCAGAGTGGAGTGGGTAAACAGGGGTTCACTTTCTTTGCTCCAATTTTTGTTCCATG
TTTATTCTCCAAATCCTTCGCACCTCAGATTTCTTGTTACTGATTTTCATTCTAACACTTGGGAATCCACCAAATCAGCTTTCCAGCTTGAGGATATGAGAGATGACATT
GGAATTGGAGGGGCTTTTTCAGAGTTTGTTGATTATATTGTTGCATCTATGAAATTTGGAGATGTAAGGCTTTGTATGGAAGGACAATCGGGGAAAGATGGTGCAGCATC
TGTCAAACTAATTGCTCAGAAATCAAAGGGAATGCCTGTATTTTCCATTTCTCTCACAAAACTTGTTGACTCTGCTGCTGCTGAAGCTATGGCAACTATGTCCTTGGGGC
TTTTTAACTCATTAAAAGAAAAGGAGTGTTCACTTATGAAAGAACAAGAGCACTCCCTTCAGTTGACAACCATGATATCAACTGAAAAGGAAAAGAACGAAAATATCCAA
ACTCAGCTCGGGCAATATAGAAAGAAACAGAAGTTACAAAATATGAATGCCTCAAATTCTCCTGATAAATCTGGCGTACATAATATTGGCTTGACAAAAGCCACCAATCG
TGTGGTGCCAGTACACCGGAGGGCGAAAGCTAGGGGTGCCCTTCTGCAAGACTCTGAAGATGACAATGAAAAAGAGCGCTCCCTTCAGTCGACATTCGAGGAAAAGACAA
ATCTGTTCATGATATCGGCTCGACAAAAATCACCAAGCATGTCGTGCCAACACATCACAGGGCAAGAACACGAGGTGCCCTTCTGCAAGATAACGAAGATGACGATGGAA
GGTAAAAAAAACAAGACAGTTTCAAAATGTTGA
Protein sequenceShow/hide protein sequence
IVGAAQRTVVTSIAETEEQSSFGCKTNRTNRKPPLRVIQLKMELQDFAPIFGKPTRVEWVNRGSLSLLQFLFHVYSPNPSHLRFLVTDFHSNTWESTKSAFQLEDMRDDI
GIGGAFSEFVDYIVASMKFGDVRLCMEGQSGKDGAASVKLIAQKSKGMPVFSISLTKLVDSAAAEAMATMSLGLFNSLKEKECSLMKEQEHSLQLTTMISTEKEKNENIQ
TQLGQYRKKQKLQNMNASNSPDKSGVHNIGLTKATNRVVPVHRRAKARGALLQDSEDDNEKERSLQSTFEEKTNLFMISARQKSPSMSCQHITGQEHEVPFCKITKMTME
GKKNKTVSKC