; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G000060 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G000060
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionNudix hydrolase domain-containing protein
Genome locationCG_Chr01:60516..62749
RNA-Seq ExpressionClCG01G000060
SyntenyClCG01G000060
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011654586.1 uncharacterized protein LOC101208896 isoform X1 [Cucumis sativus]3.1e-10681.42Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETS
        +ALSLF F   SSSSSSSKSFKFPAFSI    RRF KIPS+ + +FP+S S    F+SPQSLSEWL+PRLPS SFASWGV PGTKN+HNLWLEIS+GETS
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETS

Query:  LADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVSYP
        LADSNPPIRTLHVLSLRIID HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSILGDSD S++VRIVPDSY++KIEER+SVSYP
Subjt:  LADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVSYP

Query:  GLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW
        GL A YVLHSMDVWVEGLP+G+FCTVEE+EY NSE T+IA  AVSVKKHFWKW
Subjt:  GLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW

XP_038874472.1 uncharacterized protein LOC120067121 isoform X1 [Benincasa hispida]8.4e-12089.02Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE
        +ALSLFVFF   SSSSSSKSFKFPAFSI    RRFLKIP  SMPLSQFP+SKS    F+SPQSLSEWLKPRLPSDSFASWGV PGTKNVHNLWLEIS+GE
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE

Query:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS
        TSLADSNPPIRTLHVLSLRI+D HHRVL+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+GDSDCS+IVRIVPDSY+MKIEERNSVS
Subjt:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS

Query:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW
        YPGLPACYVLHSMDVWVEGLPEGEFCTVEE+EYGNSE+TSIA +AVSVKKHFWKW
Subjt:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW

XP_038874473.1 uncharacterized protein LOC120067121 isoform X2 [Benincasa hispida]8.4e-12089.02Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE
        +ALSLFVFF   SSSSSSKSFKFPAFSI    RRFLKIP  SMPLSQFP+SKS    F+SPQSLSEWLKPRLPSDSFASWGV PGTKNVHNLWLEIS+GE
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE

Query:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS
        TSLADSNPPIRTLHVLSLRI+D HHRVL+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+GDSDCS+IVRIVPDSY+MKIEERNSVS
Subjt:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS

Query:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW
        YPGLPACYVLHSMDVWVEGLPEGEFCTVEE+EYGNSE+TSIA +AVSVKKHFWKW
Subjt:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW

XP_038874474.1 uncharacterized protein LOC120067121 isoform X3 [Benincasa hispida]6.4e-12088.67Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE
        +ALSLFVFF   SSSSSSKSFKFPAFSI    RRFLKIP  SMPLSQFP+SKS    F+SPQSLSEWLKPRLPSDSFASWGV PGTKNVHNLWLEIS+GE
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE

Query:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS
        TSLADSNPPIRTLHVLSLRI+D HHRVL+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+GDSDCS+IVRIVPDSY+MKIEERNSVS
Subjt:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS

Query:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWT
        YPGLPACYVLHSMDVWVEGLPEGEFCTVEE+EYGNSE+TSIA +AVSVKKHFWKW+
Subjt:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWT

XP_038874475.1 uncharacterized protein LOC120067121 isoform X4 [Benincasa hispida]8.4e-12089.02Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE
        +ALSLFVFF   SSSSSSKSFKFPAFSI    RRFLKIP  SMPLSQFP+SKS    F+SPQSLSEWLKPRLPSDSFASWGV PGTKNVHNLWLEIS+GE
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIP--SMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGE

Query:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS
        TSLADSNPPIRTLHVLSLRI+D HHRVL+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+GDSDCS+IVRIVPDSY+MKIEERNSVS
Subjt:  TSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVS

Query:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW
        YPGLPACYVLHSMDVWVEGLPEGEFCTVEE+EYGNSE+TSIA +AVSVKKHFWKW
Subjt:  YPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKW

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ6 Uncharacterized protein8.8e-10780.78Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETS
        +ALSLF F   SSSSSSSKSFKFPAFSI    RRF KIPS+ + +FP+S S    F+SPQSLSEWL+PRLPS SFASWGV PGTKN+HNLWLEIS+GETS
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETS

Query:  LADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVSYP
        LADSNPPIRTLHVLSLRIID HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSILGDSD S++VRIVPDSY++KIEER+SVSYP
Subjt:  LADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVSYP

Query:  GLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTA
        GL A YVLHSMDVWVEGLP+G+FCTVEE+EY NSE T+IA  AVSVKKHFWKW +
Subjt:  GLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTA

A0A1S3AUP2 uncharacterized protein LOC1034830012.6e-10680Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETS
        +ALSLF FF   SSSS SKSFKFPAFSI    RRFLKIPS  + +FP+S S    F+SPQSLSEWL+PRLPS SFASWGV PGTKN+HNLWLEIS+GETS
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS----FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETS

Query:  LADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVSYP
        LADSNPPIR LHVLSLRIID HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSIL DSDCS +VRIVPDSY++KIEER+SVSYP
Subjt:  LADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSVSYP

Query:  GLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTA
        GLPACYVLHSMD+ VEGLP+G+FCTVE++EY NSE+T+IA +AVSVKKHFWKW +
Subjt:  GLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTA

A0A6J1CCN3 uncharacterized protein LOC1110095081.4e-10479.15Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI-----RRFLKIPSMPLSQFPSSKS------FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEG
        +ALSLFVFF     SSSSKSFKFP         RRFLKIPSM LS  P  K+      F+SPQSLS+WL PRLPSDSFASWGVKPGTKNVHNLWLEISEG
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI-----RRFLKIPSMPLSQFPSSKS------FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEG

Query:  ETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSV
        ETSLADSNPPIRT+ V+SLRI+DKH+RVL+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+GD DC EIVRIVP+SY+MKIEERNSV
Subjt:  ETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERNSV

Query:  SYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAE-AVSVKKHFWKWTA
        SYPGLPACYVLHSMDVWVEGLP+ EFCTVEE+EY  SE+T IA + AVSVKKHFWKW +
Subjt:  SYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAE-AVSVKKHFWKWTA

A0A6J1E2U8 uncharacterized protein LOC1114303192.6e-10678.93Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS---------FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEIS
        +ALSLFVF     SSSSS+SFKFP   I    RRFLK PSM  S   S ++         F+SPQSLS+WLKPRLPSDSFASWGVKPGTKNVHNLWLE+S
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS---------FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEIS

Query:  EGETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERN
        EGETSLADSNPPIRT+ VLSLRIID H R+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIV+IVPDSY+MKIEERN
Subjt:  EGETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERN

Query:  SVSYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTAV
        S SYPGLPACYVLHSMDV VEGLP+ +FCTVEE+EY NSE+TSIA EAVSVKKHFWKW ++
Subjt:  SVSYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTAV

A0A6J1IA21 uncharacterized protein LOC1114729798.2e-10577.78Show/hide
Query:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS---------FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEIS
        +ALSLFVF     SSSSS+SFK P   I    RRFLK PSM  S   S ++         F+SPQSLS+WLKPRLPSDSFASWGVKPGTKNVHNLWLE+S
Subjt:  SALSLFVFFSSSSSSSSSKSFKFPAFSI----RRFLKIPSMPLSQFPSSKS---------FSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEIS

Query:  EGETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERN
        EGETSLADS PPIRT+ VLSLRIID H R+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIV+IVPDSY+MKIEERN
Subjt:  EGETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILGDSDCSEIVRIVPDSYQMKIEERN

Query:  SVSYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTAV
        S SYPGLPACYVLHSMDV VEGLP+ +FCTVEE+EY NSE++SIA EAVSVKKHFWKW ++
Subjt:  SVSYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G24460.1 unknown protein2.6e-7154.41Show/hide
Query:  SALSLFVFFSSSSS--SSSSKSFKFPAFSIRRFL-----KIPSMPLSQFPSSKSFSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSL
        +A+SL   +SS     S     F FP    RR +     + P  P    P ++ F++PQSLS+WL+ RLPSDSFA+WGVKPGTKNVHNLWLE+S+GETSL
Subjt:  SALSLFVFFSSSSS--SSSSKSFKFPAFSIRRFL-----KIPSMPLSQFPSSKSFSSPQSLSEWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSL

Query:  ADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL-GDSD-CSEIVRIVPDSYQMKIEERNSVSY
        ADS PP+RT++V+++R+I K+ R+L+E+HQ+LSDG++R R RPLSEKMKP E+P+ AV+RA+KEELGSI  GD D   + ++I+P +Y  ++EERNS+SY
Subjt:  ADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL-GDSD-CSEIVRIVPDSYQMKIEERNSVSY

Query:  PGLPACYVLHSMDVWVEGLPEGEFCTVEED-EYGNSEKTSI----AAEAVSVKKHFWKWTA
        PGLPA Y LHS++  VEGLPE +FCT E++ E G+S K S+    A  AV+VK+H+WKW +
Subjt:  PGLPACYVLHSMDVWVEGLPEGEFCTVEED-EYGNSEKTSI----AAEAVSVKKHFWKWTA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTAATTGAGTCGATTTTCGTTGAGCTCTTAGAACCAGAAGTTCGTCTGTATTGCGATTTGCCGAAGCCGTCTTCCGCTTGTGTGTTCCTGTGGGTATTG
TGGTATCAGCTGCTTTGTTTCTACATCGTTGATATGCCATCAGCTTCGGCTCTCTCACTTTTCGTTTTCTTCTCTTCTTCTTCCTCCTCCTCCTCCTCCAAATCC
TTCAAATTTCCTGCTTTCTCTATTCGCCGTTTTCTGAAGATACCCTCCATGCCCCTCTCACAATTTCCCAGCTCCAAATCCTTCTCCTCTCCTCAATCCCTCTCC
GAATGGCTTAAACCTCGCCTCCCTTCCGATTCTTTTGCTTCTTGGGGCGTAAAGCCTGGCACCAAGAACGTTCACAACCTCTGGCTCGAGATCTCCGAAGGAGAA
ACTTCCCTTGCCGATTCCAACCCTCCCATCCGCACCCTTCATGTCCTTTCTCTTCGAATTATTGATAAACATCACCGAGTTCTCATCGAATCCCACCAGCAACTC
TCTGATGGCACCCTACGGAATCGAAATCGACCCTTGTCTGAGAAAATGAAGCCCAATGAGACCCCTGAATCTGCCGTCTACCGGGCTGTCAAAGAAGAGCTCGGT
TCCATCCTTGGAGATTCCGATTGTTCTGAAATTGTGAGGATCGTTCCTGATTCCTATCAAATGAAGATTGAGGAGCGCAACTCGGTTTCCTACCCTGGTTTGCCG
GCTTGTTACGTTTTGCATTCCATGGATGTTTGGGTGGAAGGTTTACCTGAGGGAGAGTTCTGCACTGTGGAGGAGGATGAATACGGAAATTCTGAGAAGACAAGC
ATTGCGGCCGAGGCTGTGTCCGTCAAGAAGCATTTTTGGAAATGGACAGCAGTTTATTTTAACAACAAAAGAAATGAGAACTCACTACCAAGAATACATGGACCA
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTTAATTGAGTCGATTTTCGTTGAGCTCTTAGAACCAGAAGTTCGTCTGTATTGCGATTTGCCGAAGCCGTCTTCCGCTTGTGTGTTCCTGTGGGTATTG
TGGTATCAGCTGCTTTGTTTCTACATCGTTGATATGCCATCAGCTTCGGCTCTCTCACTTTTCGTTTTCTTCTCTTCTTCTTCCTCCTCCTCCTCCTCCAAATCC
TTCAAATTTCCTGCTTTCTCTATTCGCCGTTTTCTGAAGATACCCTCCATGCCCCTCTCACAATTTCCCAGCTCCAAATCCTTCTCCTCTCCTCAATCCCTCTCC
GAATGGCTTAAACCTCGCCTCCCTTCCGATTCTTTTGCTTCTTGGGGCGTAAAGCCTGGCACCAAGAACGTTCACAACCTCTGGCTCGAGATCTCCGAAGGAGAA
ACTTCCCTTGCCGATTCCAACCCTCCCATCCGCACCCTTCATGTCCTTTCTCTTCGAATTATTGATAAACATCACCGAGTTCTCATCGAATCCCACCAGCAACTC
TCTGATGGCACCCTACGGAATCGAAATCGACCCTTGTCTGAGAAAATGAAGCCCAATGAGACCCCTGAATCTGCCGTCTACCGGGCTGTCAAAGAAGAGCTCGGT
TCCATCCTTGGAGATTCCGATTGTTCTGAAATTGTGAGGATCGTTCCTGATTCCTATCAAATGAAGATTGAGGAGCGCAACTCGGTTTCCTACCCTGGTTTGCCG
GCTTGTTACGTTTTGCATTCCATGGATGTTTGGGTGGAAGGTTTACCTGAGGGAGAGTTCTGCACTGTGGAGGAGGATGAATACGGAAATTCTGAGAAGACAAGC
ATTGCGGCCGAGGCTGTGTCCGTCAAGAAGCATTTTTGGAAATGGACAGCAGTTTATTTTAACAACAAAAGAAATGAGAACTCACTACCAAGAATACATGGACCA
TGA
Protein sequenceShow/hide protein sequence
MALIESIFVELLEPEVRLYCDLPKPSSACVFLWVLWYQLLCFYIVDMPSASALSLFVFFSSSSSSSSSKSFKFPAFSIRRFLKIPSMPLSQFPSSKSFSSPQSLS
EWLKPRLPSDSFASWGVKPGTKNVHNLWLEISEGETSLADSNPPIRTLHVLSLRIIDKHHRVLIESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
SILGDSDCSEIVRIVPDSYQMKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGEFCTVEEDEYGNSEKTSIAAEAVSVKKHFWKWTAVYFNNKRNENSLPRIHGP