CuGenDBv2

Cucsat.G12800 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G12800
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: HNHc domain-containing protein
Genome location: ctg1838:4551147..4556581
RNA-Seq Expression: Cucsat.G12800
Synteny: Cucsat.G12800
Gene Ontology terms:
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domains:
IPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology
GenBank top hits (e value, % identity, alignment)
XP_004141149.1 uncharacterized protein LOC101207660 [Cucumis sativus]; e value: 2.51e-198; % identity: 98.23
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
        MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]; e value: 3.25e-187; % identity: 94.33
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
        MAQFTAH+R+KLLLNGDGLP GSESKDRFRYKLRSVR RRFPLS PSSST SSTSALRK TQHAAEVRVGVRDESV GGDD +VGFDYDYE ESDDLACF
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia]; e value: 7.60e-168; % identity: 86.67
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPS---SSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDL
        MAQFT   R+KLLLNGDG+PFGSESKDR RYKLRSVR  R PLS  S   S + SS SALRK  QH AE+RVGVRDESV+  D  +VG D DYE ESDDL
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPS---SSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESL
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SL
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESL

Query:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        TIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]; e value: 1.40e-163; % identity: 84.72
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRP--RRFPLSRPSSS----TTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELES
        MAQFTAH+R+KLLLNGDG+PFGSE KDR R+KLRSVR   RR PLS  SS+    ++SS SALRK  Q     RVGVR ESV+G DD ++  DYDYE ES
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRP--RRFPLSRPSSS----TTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLR     VPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH

Query:  ESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ESLTIDHVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]; e value: 1.75e-173; % identity: 89.08
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRP--RRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLA
        MAQFTAH+R+KLLLNGDG+PFGSE KDRFRYKLR VR   RR PLS PS ST SS SALRK  QHAAEVRVGVR ESV+G DD +V  DYDYE E+DDLA
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRP--RRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT

Query:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        IDHVLPISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LG99 HNHc domain-containing protein; e value: 1.22e-198; % identity: 98.23
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
        MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A1S3CNK0 uncharacterized protein LOC103502982; e value: 1.57e-187; % identity: 94.33
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
        MAQFTAH+R+KLLLNGDGLP GSESKDRFRYKLRSVR RRFPLS PSSST SSTSALRK TQHAAEVRVGVRDESV GGDD +VGFDYDYE ESDDLACF
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease; e value: 1.57e-187; % identity: 94.33
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF
        MAQFTAH+R+KLLLNGDGLP GSESKDRFRYKLRSVR RRFPLS PSSST SSTSALRK TQHAAEVRVGVRDESV GGDD +VGFDYDYE ESDDLACF
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1DFU6 uncharacterized protein LOC111020122; e value: 3.68e-168; % identity: 86.67
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPS---SSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDL
        MAQFT   R+KLLLNGDG+PFGSESKDR RYKLRSVR  R PLS  S   S + SS SALRK  QH AE+RVGVRDESV+  D  +VG D DYE ESDDL
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPS---SSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESL
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLR     VPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SL
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESL

Query:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        TIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC111439170; e value: 6.78e-164; % identity: 84.72
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRP--RRFPLSRPSSS----TTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELES
        MAQFTAH+R+KLLLNGDG+PFGSE KDR R+KLRSVR   RR PLS  SS+    ++SS SALRK  Q     RVGVR ESV+G DD ++  DYDYE ES
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRP--RRFPLSRPSSS----TTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLR     VPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH

Query:  ESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ESLTIDHVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G23840.1 HNH endonuclease; e value: 1.6e-91; % identity: 62.98
Query:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKL------RSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYE---
        MA F+A  RLKLL + DGL FG +S+D+FR  L        + P R    +  +   SS S   K      ++R     E     D+D   +D+D +   
Subjt:  MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKL------RSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYE---

Query:  LESDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQY
        LE+DD L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTV+SP+GSFYIPAVLR     VPHLLQVVKRRR+KNSLSRKNIL RD+YTCQY
Subjt:  LESDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQY

Query:  CSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        CSS E+LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  CSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS


Sequences
CDS sequence
ATGGCGCAATTCACTGCACACACTCGACTTAAGTTGCTGCTCAACGGAGACGGTTTACCATTCGGTTCAGAATCCAAAGATCGGTTCAGATACAAGCTCAGATCAGTCCG
ACCTCGGAGGTTTCCTCTCTCTCGTCCCTCTTCTTCTACTACTTCCTCTACTTCAGCTTTGAGGAAACCCACTCAGCATGCTGCCGAGGTGCGTGTTGGTGTGAGGGATG
AGAGCGTTAATGGCGGTGACGACGACGTTGTTGGTTTTGACTATGACTACGAGCTTGAGTCTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGG
CCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGTTTGGAGTTCATGGAAAAGGCTGATGTTTTGGAATACTATGACCAGACAGTGAATTCTCCAAGTGGATCCTTCTA
TATACCAGCAGTCTTGCGGCATCTCTGTTTTCAGGTTCCCCATTTATTGCAAGTTGTTAAGAGGAGAAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGG
ACAATTACACTTGTCAGTATTGTTCATCACACGAGAGTTTGACCATTGACCATGTTTTGCCCATATCCCGGGGAGGAGAATGGACATGGGAAAACCTGGTTGCTGCCTGT
GTAAAATGCAATTCAAAGAAAGGCCAAAAAACCGTAGAAGAAGCAAATATGAAGCTGAAAAAAACTCCCAAGGCTCCAAAAGATTATGATATACTTGCCATTCCTCTAAC
AAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGGAACTCCTGAAGAATGGCGTCAATATCTGTCAAGTGAGCAATGA
mRNA sequence
ATGGCGCAATTCACTGCACACACTCGACTTAAGTTGCTGCTCAACGGAGACGGTTTACCATTCGGTTCAGAATCCAAAGATCGGTTCAGATACAAGCTCAGATCAGTCCG
ACCTCGGAGGTTTCCTCTCTCTCGTCCCTCTTCTTCTACTACTTCCTCTACTTCAGCTTTGAGGAAACCCACTCAGCATGCTGCCGAGGTGCGTGTTGGTGTGAGGGATG
AGAGCGTTAATGGCGGTGACGACGACGTTGTTGGTTTTGACTATGACTACGAGCTTGAGTCTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGG
CCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGTTTGGAGTTCATGGAAAAGGCTGATGTTTTGGAATACTATGACCAGACAGTGAATTCTCCAAGTGGATCCTTCTA
TATACCAGCAGTCTTGCGGCATCTCTGTTTTCAGGTTCCCCATTTATTGCAAGTTGTTAAGAGGAGAAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGG
ACAATTACACTTGTCAGTATTGTTCATCACACGAGAGTTTGACCATTGACCATGTTTTGCCCATATCCCGGGGAGGAGAATGGACATGGGAAAACCTGGTTGCTGCCTGT
GTAAAATGCAATTCAAAGAAAGGCCAAAAAACCGTAGAAGAAGCAAATATGAAGCTGAAAAAAACTCCCAAGGCTCCAAAAGATTATGATATACTTGCCATTCCTCTAAC
AAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGGAACTCCTGAAGAATGGCGTCAATATCTGTCAAGTGAGCAATGA
Protein sequence
MAQFTAHTRLKLLLNGDGLPFGSESKDRFRYKLRSVRPRRFPLSRPSSSTTSSTSALRKPTQHAAEVRVGVRDESVNGGDDDVVGFDYDYELESDDLACFRGLVLDISYR
PVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRHLCFQVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAAC
VKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ