; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21002 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21002
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionNudix hydrolase domain-containing protein
Genome locationctg910:56624..61869
RNA-Seq ExpressionCucsat.G21002
SyntenyCucsat.G21002
Gene Ontology termsNA
InterPro domainsIPR015797 - NUDIX hydrolase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437661.1 PREDICTED: uncharacterized protein LOC103483001 [Cucumis melo]6.35e-18392.12Show/hide
Query:  MPPAPPPPP--VPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEW
        MPPAPPPPP  +PPPQPQPISNLTHLNKS AALPDFFLAALSLFAF SS SS SKSFKFPAFSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEW
Subjt:  MPPAPPPPP--VPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE
        ELGSIL DSD S LVRIVPDSY+LKIEERDSVSYPGL A YVLHSMD+RVEGLPDGDFCTVE+EEYVNSE+TNIAD AVSVKKHFWKW   E
Subjt:  ELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE

XP_011654586.1 uncharacterized protein LOC101208896 isoform X1 [Cucumis sativus]5.92e-21099.67Show/hide
Query:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLEKSHSHLSPTL
        GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDV VEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLEKSHSHLSPTL
Subjt:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLEKSHSHLSPTL

Query:  I
        I
Subjt:  I

XP_011654587.1 uncharacterized protein LOC101208896 isoform X3 [Cucumis sativus]7.22e-19999.65Show/hide
Query:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW
        GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDV VEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW
Subjt:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW

XP_031741555.1 uncharacterized protein LOC101208896 isoform X2 [Cucumis sativus]1.13e-19899.65Show/hide
Query:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW
        GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDV VEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW
Subjt:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW

XP_038874473.1 uncharacterized protein LOC120067121 isoform X2 [Benincasa hispida]1.12e-16184.27Show/hide
Query:  PVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISM---EFPNSNSKSLTFTSPQSLSEWLQPRLPS
        P P P PQP SNL HLNKS  ALPDFFLAALSLF F SS SSSSKSFKFPAFSIQLNPRRF KIPS SM   +FPNS S SLTFTSPQSLSEWL+PRLPS
Subjt:  PVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISM---EFPNSNSKSLTFTSPQSLSEWLQPRLPS

Query:  HSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILG
         SFASWGV PGTKN+HNLWLEISQGETSLADSNPPIRTLHVLSLRI+DNHHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSI+G
Subjt:  HSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILG

Query:  DSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLEK
        DSD SQ+VRIVPDSY++KIEER+SVSYPGL A YVLHSMDV VEGLP+G+FCTVEEEEY NSE+T+IAD AVSVKKHFWKW +  K
Subjt:  DSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLEK

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ6 Uncharacterized protein2.08e-19998.62Show/hide
Query:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE
        GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDV VEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKW   E
Subjt:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE

A0A1S3AUP2 uncharacterized protein LOC1034830013.08e-18392.12Show/hide
Query:  MPPAPPPPP--VPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEW
        MPPAPPPPP  +PPPQPQPISNLTHLNKS AALPDFFLAALSLFAF SS SS SKSFKFPAFSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEW
Subjt:  MPPAPPPPP--VPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE
        ELGSIL DSD S LVRIVPDSY+LKIEERDSVSYPGL A YVLHSMD+RVEGLPDGDFCTVE+EEYVNSE+TNIAD AVSVKKHFWKW   E
Subjt:  ELGSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE

A0A6J1CCN3 uncharacterized protein LOC1110095086.55e-13673.36Show/hide
Query:  PPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNP-RRFFKIPSISMEFPNSNSK--SLTFTSPQSLSEWLQPR
        PPPP+PPP   PISNLTHLNKS   LPDF+LAALSLF F    SSSSKSFKFP    Q NP RRF KIPS+S+  P+  ++  +  F SPQSLS+WL PR
Subjt:  PPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNP-RRFFKIPSISMEFPNSNSK--SLTFTSPQSLSEWLQPR

Query:  LPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGS
        LPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIRT+ V+SLRI+D H+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGS
Subjt:  LPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGS

Query:  ILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHA-VSVKKHFWKWAKLE
        I+GD D  ++VRIVP+SY++KIEER+SVSYPGL A YVLHSMDV VEGLPD +FCTVEEEEY  SE+T IA  A VSVKKHFWKW   +
Subjt:  ILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHA-VSVKKHFWKWAKLE

A0A6J1E2U8 uncharacterized protein LOC1114303197.06e-14777.24Show/hide
Query:  PPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNS------KSLTFTSPQSLSEWLQ
        PPPVPPPQ  PIS+L HL +S   LPDFFLAALSLF FLS  SSSS+SFKFP   IQ NPRRF K PS+S   PNS        +   FTSPQSLS+WL+
Subjt:  PPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNS------KSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPS SFASWGV PGTKN+HNLWLE+S+GETSLADSNPPIRT+ VLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE
        GSILGDSD S++V+IVPDSY++KIEER+S SYPGL A YVLHSMDV VEGLP  DFCTVEEEEYVNSE+T+IAD AVSVKKHFWKW  L+
Subjt:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE

A0A6J1IA21 uncharacterized protein LOC1114729794.49e-14375.52Show/hide
Query:  PPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNS------KSLTFTSPQSLSEWLQ
        PPPVPPPQ  PIS+L HL +S   LPDFFLAALSLF FLS  SSSS+SFK P   IQ NPRRF K PS+S   PNS        +   F SPQSLS+WL+
Subjt:  PPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNS------KSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPS SFASWGV PGTKN+HNLWLE+S+GETSLADS PPIRT+ VLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE
        GSILGDSD S++V+IVPDSY++KIEER+S SYPGL A YVLHSMDV VEGLP  DFCTVEEEEY NSE+++IAD AVSVKKHFWKW  L+
Subjt:  GSILGDSDYSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G24460.1 unknown protein1.9e-7151.84Show/hide
Query:  PPPPPVPPPQPQPISNLTHLNKS-KAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRL
        P PP  P    + I+N    N +  +ALPD FLAA+SL  FL SS     S     FS  LNPRR   I ++S   P     +  F +PQSLS+WL+ RL
Subjt:  PPPPPVPPPQPQPISNLTHLNKS-KAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRL

Query:  PSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSI
        PS SFA+WGV PGTKN+HNLWLE+S GETSLADS PP+RT++V+++R+I  + R+L+E+HQ+LSDG++R R RPLSEKMKP E+P+ AV+RA++EELGSI
Subjt:  PSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSI

Query:  L-GDSD-YSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEY------VNSEDTNIADHAVSVKKHFWKWAKLEKSHS
          GD D   Q ++I+P +Y  ++EER+S+SYPGL A Y LHS++  VEGLP+ DFCT EE+EY       +S +T  A +AV+VK+H+WKW   +   S
Subjt:  L-GDSD-YSQLVRIVPDSYRLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEY------VNSEDTNIADHAVSVKKHFWKWAKLEKSHS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCAGCCCCACCTCCACCTCCAGTTCCACCCCCACAACCGCAGCCCATCTCCAATCTTACTCACCTCAACAAATCTAAGGCAGCTCTTCCCGACTTTTTCCTCGC
GGCTCTATCGCTTTTCGCTTTCCTCTCTTCTTCTTCTTCTTCCTCCAAATCATTCAAATTTCCTGCTTTCTCTATTCAATTAAACCCTCGCCGTTTTTTCAAGATACCTT
CCATTTCCATGGAATTCCCCAACTCCAACTCCAAATCCTTGACCTTCACCTCTCCTCAATCCCTCTCCGAATGGCTTCAACCTCGCCTACCTTCCCATTCTTTTGCTTCT
TGGGGCGTGATCCCTGGCACCAAGAACCTCCATAACCTCTGGCTCGAGATCTCCCAAGGAGAAACTTCCCTTGCCGATTCCAACCCTCCCATCCGCACCCTTCATGTCCT
TTCTCTTCGAATTATTGATAATCATCACCGACTTCTTCTTGAATCCCACCAGCAGCTCTCTGATGGTACCCTACGGAATCGAAATCGACCCCTGTCCGAGAAAATGAAGC
CCAATGAGACTCCTGAATCTGCCGTCTACCGGGCTGTCCAAGAAGAGCTCGGTTCCATTCTTGGCGATTCCGATTATTCTCAACTTGTTAGGATTGTTCCAGATTCCTAT
AGATTGAAGATTGAGGAGCGCGACTCAGTTTCCTACCCTGGTTTGTCGGCTAGTTACGTTTTGCATTCCATGGATGTTAGGGTGGAAGGTTTACCCGATGGAGACTTCTG
CACTGTGGAAGAGGAGGAATACGTAAACTCTGAGGACACAAACATTGCGGACCACGCTGTGTCCGTCAAGAAGCATTTTTGGAAATGGGCAAAATTGGAGAAGTCCCATT
CCCATCTCTCACCAACTTTAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCACCAGCCCCACCTCCACCTCCAGTTCCACCCCCACAACCGCAGCCCATCTCCAATCTTACTCACCTCAACAAATCTAAGGCAGCTCTTCCCGACTTTTTCCTCGC
GGCTCTATCGCTTTTCGCTTTCCTCTCTTCTTCTTCTTCTTCCTCCAAATCATTCAAATTTCCTGCTTTCTCTATTCAATTAAACCCTCGCCGTTTTTTCAAGATACCTT
CCATTTCCATGGAATTCCCCAACTCCAACTCCAAATCCTTGACCTTCACCTCTCCTCAATCCCTCTCCGAATGGCTTCAACCTCGCCTACCTTCCCATTCTTTTGCTTCT
TGGGGCGTGATCCCTGGCACCAAGAACCTCCATAACCTCTGGCTCGAGATCTCCCAAGGAGAAACTTCCCTTGCCGATTCCAACCCTCCCATCCGCACCCTTCATGTCCT
TTCTCTTCGAATTATTGATAATCATCACCGACTTCTTCTTGAATCCCACCAGCAGCTCTCTGATGGTACCCTACGGAATCGAAATCGACCCCTGTCCGAGAAAATGAAGC
CCAATGAGACTCCTGAATCTGCCGTCTACCGGGCTGTCCAAGAAGAGCTCGGTTCCATTCTTGGCGATTCCGATTATTCTCAACTTGTTAGGATTGTTCCAGATTCCTAT
AGATTGAAGATTGAGGAGCGCGACTCAGTTTCCTACCCTGGTTTGTCGGCTAGTTACGTTTTGCATTCCATGGATGTTAGGGTGGAAGGTTTACCCGATGGAGACTTCTG
CACTGTGGAAGAGGAGGAATACGTAAACTCTGAGGACACAAACATTGCGGACCACGCTGTGTCCGTCAAGAAGCATTTTTGGAAATGGGCAAAATTGGAGAAGTCCCATT
CCCATCTCTCACCAACTTTAATTTGA
Protein sequenceShow/hide protein sequence
MPPAPPPPPVPPPQPQPISNLTHLNKSKAALPDFFLAALSLFAFLSSSSSSSKSFKFPAFSIQLNPRRFFKIPSISMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFAS
WGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILGDSDYSQLVRIVPDSY
RLKIEERDSVSYPGLSASYVLHSMDVRVEGLPDGDFCTVEEEEYVNSEDTNIADHAVSVKKHFWKWAKLEKSHSHLSPTLI