CuGenDBv2

Cucsat.G6142 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G6142
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Protein of unknown function (DUF1685)
Genome location: ctg1425:47115..49813
RNA-Seq Expression: Cucsat.G6142
Synteny: Cucsat.G6142
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
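The genome location field can be split into a contig name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return contig, start, end

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

contig, start, end = parse_location("ctg1425:47115..49813")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 2699 bp on ctg1425, which is longer than the listed CDS and so presumably includes non-coding sequence.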


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0033089.1 uncharacterized protein E6C27_scaffold269G002160 [Cucumis melo var. makuwa]; e value 7.09e-103; % identity 72.27
Query:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
        MS F+SQISP SAIEGDPEAQTLHFQSEFSSEHDRDESV PTYWNNTKKNQILLEGFVEASD+DNLTRTKSLTDDDLE+LKGCVDLGFAFCYDEIPELCN
Subjt:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN

Query:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED
        TLPALELCYSM+QKFMDEHQK PENPLPES+DSVSGPIPNWKISSP                                                GDHPED
Subjt:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED

Query:  VKARLKYWAQAVACTVRLCN
        VKARLKYWAQAVACT   C+
Subjt:  VKARLKYWAQAVACTVRLCN

KAG7013888.1 hypothetical protein SDJN02_24057, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 4.21e-99; % identity 69.47
Query:  SPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTK-------KNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDE
        S FESQISPESAI GD EAQ LHFQSEF+S  DR+ESV PT WNNTK       KNQILLEGFVE SDE+NLTRTKSLTDDDLE+LKGCVDLGFAFCYDE
Subjt:  SPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTK-------KNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDE

Query:  IPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCS
        IPELCNTLPALELCYSM+QKF+DEHQKVPE+ LP+  DSVS PIPNWKISSPG+PF SILNL                                      
Subjt:  IPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCS

Query:  GDHPEDVKARLKYWAQAVACTVRLCN
        GDHPEDVKARLK+WAQAVACTVRLCN
Subjt:  GDHPEDVKARLKYWAQAVACTVRLCN

XP_008445710.1 PREDICTED: uncharacterized protein LOC103488660 [Cucumis melo]; e value 4.28e-108; % identity 74.09
Query:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
        MS F+SQISP SAIEGDPEAQTLHFQSEFSSEHDRDESV PTYWNNTKKNQILLEGFVEASD+DNLTRTKSLTDDDLE+LKGCVDLGFAFCYDEIPELCN
Subjt:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN

Query:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED
        TLPALELCYSM+QKFMDEHQK PENPLPES+DSVSGPIPNWKISSP                                                GDHPED
Subjt:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED

Query:  VKARLKYWAQAVACTVRLCN
        VKARLKYWAQAVACTVRLCN
Subjt:  VKARLKYWAQAVACTVRLCN

XP_031743911.1 uncharacterized protein LOC101222799 [Cucumis sativus]; e value 4.23e-115; % identity 78.18
Query:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
        MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
Subjt:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN

Query:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED
        TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSP                                                GDHPED
Subjt:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED

Query:  VKARLKYWAQAVACTVRLCN
        VKARLKYWAQAVACTVRLCN
Subjt:  VKARLKYWAQAVACTVRLCN

XP_038885613.1 uncharacterized protein LOC120075933 [Benincasa hispida]; e value 6.63e-96; % identity 67.97
Query:  MSPF----ESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKK-------NQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFA
        M+PF    ESQISPESA+EGDPEAQ LHFQSEFSSE D DESV  T W NTKK       NQILLEGFVEASDEDNLTRTKSLTDDDLE+LKGCVDLGFA
Subjt:  MSPF----ESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKK-------NQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFA

Query:  FCYDEIPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFE
        FCYDEIPELCNTLPALELCYSM+QKFMDEHQKVPEN  PES+DSVS PIPNWKISSP                                           
Subjt:  FCYDEIPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFE

Query:  IVLCSGDHPEDVKARLKYWAQAVACTVRLCN
             GDHPEDVKARLKYWAQAVACTVRLCN
Subjt:  IVLCSGDHPEDVKARLKYWAQAVACTVRLCN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K8B9 Uncharacterized protein; e value 2.05e-115; % identity 78.18
Query:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
        MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
Subjt:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN

Query:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED
        TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSP                                                GDHPED
Subjt:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED

Query:  VKARLKYWAQAVACTVRLCN
        VKARLKYWAQAVACTVRLCN
Subjt:  VKARLKYWAQAVACTVRLCN

A0A1S3BE79 uncharacterized protein LOC103488660; e value 2.07e-108; % identity 74.09
Query:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
        MS F+SQISP SAIEGDPEAQTLHFQSEFSSEHDRDESV PTYWNNTKKNQILLEGFVEASD+DNLTRTKSLTDDDLE+LKGCVDLGFAFCYDEIPELCN
Subjt:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN

Query:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED
        TLPALELCYSM+QKFMDEHQK PENPLPES+DSVSGPIPNWKISSP                                                GDHPED
Subjt:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED

Query:  VKARLKYWAQAVACTVRLCN
        VKARLKYWAQAVACTVRLCN
Subjt:  VKARLKYWAQAVACTVRLCN

A0A5A7SV80 Uncharacterized protein; e value 3.43e-103; % identity 72.27
Query:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN
        MS F+SQISP SAIEGDPEAQTLHFQSEFSSEHDRDESV PTYWNNTKKNQILLEGFVEASD+DNLTRTKSLTDDDLE+LKGCVDLGFAFCYDEIPELCN
Subjt:  MSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCN

Query:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED
        TLPALELCYSM+QKFMDEHQK PENPLPES+DSVSGPIPNWKISSP                                                GDHPED
Subjt:  TLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPED

Query:  VKARLKYWAQAVACTVRLCN
        VKARLKYWAQAVACT   C+
Subjt:  VKARLKYWAQAVACTVRLCN

A0A6J1EVL6 uncharacterized protein LOC111436373; e value 1.93e-90; % identity 65.04
Query:  SPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTK-------KNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDE
        S FESQISPE AI GD EAQ LHFQSEF+S  DR+ESV PT WNNTK       KNQILLEGFVE SDE+NLTRTKSLTD+DLE+LKGCVDLGFAFCYDE
Subjt:  SPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTK-------KNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDE

Query:  IPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCS
        IPELCNTLPALELCYSM+QKF+DEHQKVPE+ LP+  DSVS PIPNWKISSP                                                
Subjt:  IPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCS

Query:  GDHPEDVKARLKYWAQAVACTVRLCN
        GDHPEDVKARLK+WAQAVACTVRLCN
Subjt:  GDHPEDVKARLKYWAQAVACTVRLCN

A0A6J1L190 uncharacterized protein LOC111498175; e value 1.40e-90; % identity 62.55
Query:  DRNRRISKIPRKKKMSPF----ESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKK-----NQILLEGFVEASDEDNLTRTKSLTDDDL
        + NRRI K+   +KMSP     ESQISPESAI GDPE Q  H++ +FSSE D D SV  T WNNTKK     NQILLEGFVEASD++NLTRTKSLTDDDL
Subjt:  DRNRRISKIPRKKKMSPF----ESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKK-----NQILLEGFVEASDEDNLTRTKSLTDDDL

Query:  EDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFM
        E+LKGCVDLGFAFCYDEIPELCNTLPALELCYSM+QKFMD+HQKVPE+  PES+DS S PIPNWKISSP                               
Subjt:  EDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFM

Query:  NFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
                         GDHPEDVKARLKYWAQAVACTVRLCN
Subjt:  NFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G05870.1 Protein of unknown function (DUF1685); e value 5.5e-40; % identity 49.73
Query:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS
        KK+Q+LLEG+VE +        +D+LTR+KSLTDDDLEDL+GC+DLGF F YDEIPELCNTLPALELCYSM+QKF+D+ Q K PE    E   S     +
Subjt:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS

Query:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
         PI NWKISSP                                                GD+P+DVKARLKYWAQAVACTV+LC+
Subjt:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN

AT1G05870.2 Protein of unknown function (DUF1685); e value 5.5e-40; % identity 49.73
Query:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS
        KK+Q+LLEG+VE +        +D+LTR+KSLTDDDLEDL+GC+DLGF F YDEIPELCNTLPALELCYSM+QKF+D+ Q K PE    E   S     +
Subjt:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS

Query:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
         PI NWKISSP                                                GD+P+DVKARLKYWAQAVACTV+LC+
Subjt:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN

AT1G05870.3 Protein of unknown function (DUF1685); e value 5.5e-40; % identity 49.73
Query:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS
        KK+Q+LLEG+VE +        +D+LTR+KSLTDDDLEDL+GC+DLGF F YDEIPELCNTLPALELCYSM+QKF+D+ Q K PE    E   S     +
Subjt:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS

Query:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
         PI NWKISSP                                                GD+P+DVKARLKYWAQAVACTV+LC+
Subjt:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN

AT1G05870.4 Protein of unknown function (DUF1685); e value 5.5e-40; % identity 49.73
Query:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS
        KK+Q+LLEG+VE +        +D+LTR+KSLTDDDLEDL+GC+DLGF F YDEIPELCNTLPALELCYSM+QKF+D+ Q K PE    E   S     +
Subjt:  KKNQILLEGFVEAS-------DEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQ-KVPENPLPESMDS----VS

Query:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
         PI NWKISSP                                                GD+P+DVKARLKYWAQAVACTV+LC+
Subjt:  GPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN

AT2G31560.1 Protein of unknown function (DUF1685); e value 1.2e-39; % identity 49.72
Query:  KKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDE----HQKVPENPLPESMDSVSGPIPNWKI
        KK+Q+LLEG+    D+D+LTR KSLTDDDLE+LKGC+DLGF F YDEIPELCNTLPALELCYSM+QKF+D+    H K  E        + + PI NWKI
Subjt:  KKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCVDLGFAFCYDEIPELCNTLPALELCYSMNQKFMDE----HQKVPENPLPESMDSVSGPIPNWKI

Query:  SSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
        SSP                                                GD P+DVKARLKYWAQ VACTVRLC+
Subjt:  SSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCSGDHPEDVKARLKYWAQAVACTVRLCN
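In the alignment blocks above, the Query and Subjt rows are printed identically, so the middle row appears to be the only place where matches are recorded: a letter for an identical residue, `+` for a conservative substitution, and a space otherwise. The sketch below counts these for a single block, assuming that reading of the middle row; `midline_stats` is a hypothetical helper, and a per-block count is not expected to reproduce the page's % identity, which is reported for the whole alignment.

```python
def midline_stats(query_row, midline):
    """Return (identical, positive, length) for one alignment block.

    ASSUMPTION: letters in the middle row mark identities and '+' marks
    conservative substitutions, as in standard BLAST pairwise output.
    The middle row is padded in case trailing spaces were stripped.
    """
    length = len(query_row)
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = identical + padded.count("+")
    return identical, positive, length

# Last block of the KAA0033089.1 alignment, with the 'Query:  ' prefix removed.
query_row = "VKARLKYWAQAVACTVRLCN"
midline = "VKARLKYWAQAVACT   C+"
identical, positive, length = midline_stats(query_row, midline)
print(identical, positive, length)
```

For this 20-residue block the count is 16 identities and 17 positives.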


Sequences
CDS sequence
CCTAACGCGTTTGGCCGAAGAACCAAAATTACAAACGACCGAAACCGCCGAATCTCAAAAATTCCAAGAAAAAAAAAAATGAGTCCCTTCGAATCTCAGATTTCTCCTGA
ATCCGCCATTGAAGGGGACCCAGAAGCTCAAACCCTCCATTTCCAATCTGAATTCAGCTCTGAACACGATCGCGACGAGTCAGTATTTCCTACCTACTGGAACAACACCA
AGAAGAACCAGATCTTGCTCGAGGGTTTTGTCGAGGCTTCGGATGAGGATAATTTGACTAGAACAAAAAGCTTAACAGATGACGATCTTGAGGATCTCAAAGGATGTGTG
GATCTAGGGTTTGCGTTTTGCTATGACGAGATTCCTGAGCTTTGTAACACCTTGCCGGCCCTCGAACTCTGTTATTCCATGAACCAGAAGTTTATGGATGAACACCAAAA
GGTTCCGGAAAATCCTTTACCCGAGTCGATGGATTCTGTTTCCGGTCCCATTCCGAATTGGAAGATCTCTAGTCCTGGTAAGCCGTTCTGTTCAATTCTTAATCTCAATA
TCGCTTGTAAAATTGAAAAGGAATTTCTCTTTGTGTGGTTAACTGTTCTTCAGTTTATGAACTTTTTGTTTAGGTTGAGTATGATTCAGTTTGAAATTGTTCTATGTTCA
GGTGATCATCCAGAAGATGTTAAGGCCAGGCTCAAATACTGGGCGCAAGCAGTGGCTTGTACTGTCAGATTATGCAACTGA
mRNA sequence
CCTAACGCGTTTGGCCGAAGAACCAAAATTACAAACGACCGAAACCGCCGAATCTCAAAAATTCCAAGAAAAAAAAAAATGAGTCCCTTCGAATCTCAGATTTCTCCTGA
ATCCGCCATTGAAGGGGACCCAGAAGCTCAAACCCTCCATTTCCAATCTGAATTCAGCTCTGAACACGATCGCGACGAGTCAGTATTTCCTACCTACTGGAACAACACCA
AGAAGAACCAGATCTTGCTCGAGGGTTTTGTCGAGGCTTCGGATGAGGATAATTTGACTAGAACAAAAAGCTTAACAGATGACGATCTTGAGGATCTCAAAGGATGTGTG
GATCTAGGGTTTGCGTTTTGCTATGACGAGATTCCTGAGCTTTGTAACACCTTGCCGGCCCTCGAACTCTGTTATTCCATGAACCAGAAGTTTATGGATGAACACCAAAA
GGTTCCGGAAAATCCTTTACCCGAGTCGATGGATTCTGTTTCCGGTCCCATTCCGAATTGGAAGATCTCTAGTCCTGGTAAGCCGTTCTGTTCAATTCTTAATCTCAATA
TCGCTTGTAAAATTGAAAAGGAATTTCTCTTTGTGTGGTTAACTGTTCTTCAGTTTATGAACTTTTTGTTTAGGTTGAGTATGATTCAGTTTGAAATTGTTCTATGTTCA
GGTGATCATCCAGAAGATGTTAAGGCCAGGCTCAAATACTGGGCGCAAGCAGTGGCTTGTACTGTCAGATTATGCAACTGA
Protein sequence
PNAFGRRTKITNDRNRRISKIPRKKKMSPFESQISPESAIEGDPEAQTLHFQSEFSSEHDRDESVFPTYWNNTKKNQILLEGFVEASDEDNLTRTKSLTDDDLEDLKGCV
DLGFAFCYDEIPELCNTLPALELCYSMNQKFMDEHQKVPENPLPESMDSVSGPIPNWKISSPGKPFCSILNLNIACKIEKEFLFVWLTVLQFMNFLFRLSMIQFEIVLCS
GDHPEDVKARLKYWAQAVACTVRLCN
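The protein sequence above appears to be a direct translation of the listed CDS from its first base, which would explain why it begins with PNAF rather than with a methionine. The sketch below checks this on the first 90 nt of the CDS only, assuming the standard genetic code; `translate`, `CODON_TABLE` and `CDS_PREFIX` are illustrative names, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest), paired with the usual amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(nucleotides):
    """Translate in frame 1; a trailing partial codon is ignored."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 90 nt of the CDS listed above, written in blocks of 10.
CDS_PREFIX = (
    "CCTAACGCGT" "TTGGCCGAAG" "AACCAAAATT"
    "ACAAACGACC" "GAAACCGCCG" "AATCTCAAAA"
    "ATTCCAAGAA" "AAAAAAAAAT" "GAGTCCCTTC"
)

peptide = translate(CDS_PREFIX)
first_met = peptide.find("M")  # 0-based index of the first in-frame Met
print(peptide, first_met)
```

The translated prefix matches the first 30 residues of the listed protein, and the first in-frame methionine falls at index 26 (0-based), where the homology alignments begin with MSPF.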