; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000831 (gene) of Snake gourd v1 genome

Gene IDTan0000831
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHNHc domain-containing protein
Genome locationLG10:23125454..23131114
RNA-Seq ExpressionTan0000831
SyntenyTan0000831
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia]7.3e-14292.58Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFTAHSRVKLLLNGDGVPFGSE KDR R KL+S+RTLKRR PLSGAS STGLSPSSSS SALRKSAQ     R+GV GESV GDDAI+D+D DYEFES
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]1.6e-13689.79Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVC-GDDAIVDLDCDYEFE
        MAQFTAHSRVKLLLNGDG+P GSESKDRFRYKL+SVR   RR PLS  S+ST      SS SALRKS QH AE+R+GV  ESV  GDDAIV  D DYEFE
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVC-GDDAIVDLDCDYEFE

Query:  SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
        SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
Subjt:  SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT

Query:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia]3.4e-13990.81Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFT  +RVKLLLNGDGVPFGSESKDR RYKL+SV    RRIPLS  +ASTG+SPS+SS SALRKSAQH AEMR+GV  ESV  D AIV LDCDYEFES
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]1.5e-14292.93Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFTAHSRVKLLLNGDGVPFGSE KDR R+KL+SVRTLKRR PLSGAS STGLSPSSSS SALRKSAQ     R+GV GESV GDDAI+D+D DYEFES
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]1.6e-14492.93Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFTAHSRVKLLLNGDGVPFGSE KDRFRYKL+ VRTLKRRIPLSG       SPS+SS SALRKSAQH AE+R+GV GESV GDDAIVDLD DYEFE+
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLPISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hitse value%identityAlignment
A0A1S3CNK0 uncharacterized protein LOC1035029827.6e-13789.79Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVC-GDDAIVDLDCDYEFE
        MAQFTAHSRVKLLLNGDG+P GSESKDRFRYKL+SVR   RR PLS  S+ST      SS SALRKS QH AE+R+GV  ESV  GDDAIV  D DYEFE
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVC-GDDAIVDLDCDYEFE

Query:  SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
        SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
Subjt:  SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT

Query:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease7.6e-13789.79Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVC-GDDAIVDLDCDYEFE
        MAQFTAHSRVKLLLNGDG+P GSESKDRFRYKL+SVR   RR PLS  S+ST      SS SALRKS QH AE+R+GV  ESV  GDDAIV  D DYEFE
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVC-GDDAIVDLDCDYEFE

Query:  SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
        SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT
Subjt:  SDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLT

Query:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  IDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1DFU6 uncharacterized protein LOC1110201221.6e-13990.81Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFT  +RVKLLLNGDGVPFGSESKDR RYKL+SV    RRIPLS  +ASTG+SPS+SS SALRKSAQH AEMR+GV  ESV  D AIV LDCDYEFES
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC1114391707.1e-14392.93Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFTAHSRVKLLLNGDGVPFGSE KDR R+KL+SVRTLKRR PLSGAS STGLSPSSSS SALRKSAQ     R+GV GESV GDDAI+D+D DYEFES
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC1114707257.1e-14392.93Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
        MAQFTAHSRVKLLLNGDGVPFGSE KDR R+KL+SVRTLKRR PLSGAS STGLSPSSSS SALRKSAQ     R+GV GESV GDDAI+D+D DYEFES
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES

Query:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
        DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTI
Subjt:  DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        DHVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23840.1 HNH endonuclease3.9e-9362.9Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIP--LSGASASTGLSPSSSSPSALRKSAQ-HGAEMRLGVTGESVCGDDAIVDLDCDYE
        MA F+A  R+KLL + DG+ FG +S+D+FR  L         +P  +S       +  S S P   +K       E  L +  ++   D+   D D D  
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIP--LSGASASTGLSPSSSSPSALRKSAQ-HGAEMRLGVTGESVCGDDAIVDLDCDYE

Query:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES
           D L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS E+
Subjt:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTTTCCGTGCGCATCAGCACCCAAATGGCCCAATTCACTGCACACAGTCGGGTTAAGTTGCTGCTCAACGGAGACGGAGTCCCATTCGGTTCAGAATCAAAAGA
TCGATTCAGATACAAGCTCAAATCAGTACGAACCCTTAAGCGAAGAATCCCTTTATCTGGTGCCTCTGCCTCCACTGGACTTTCCCCTTCTTCTTCCTCGCCTTCAGCTT
TGAGGAAATCCGCTCAGCATGGTGCGGAGATGCGTCTTGGTGTGACGGGTGAGAGCGTTTGCGGTGACGACGCCATTGTTGATCTTGACTGTGATTACGAGTTTGAGAGT
GACGATCTGGCTTGCTTCAGAGGTCTTGTCTTGGATATTTCCTACAGGCCAGTCAACGTTGTTTGTTGGAAGCGTGCTATTTGTTTGGAATTCATGGAGAAGGCTGATGT
ATTGGAATACTATGACCAGACCGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGTCTTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAATTAAGA
ACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCACATGAGAGTTTGACCATTGACCATGTTTTGCCCATATCCCGGGGTGGA
GAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCAAAGAAAGGTCAAAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCC
AAAAGATTATGATATTCTTGCCATTCCTCTAACAAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGGGACCCCTGAAGAATGGCGCCAATATTTGTCAAGTGAGCAAT
AA
mRNA sequenceShow/hide mRNA sequence
AGCAATTTTTGGAGAAGGAAGATATGAAGTGAAATGTGCGGGAAAAATGTGTGGTGATGGACTTTTCCGTGCGCATCAGCACCCAAATGGCCCAATTCACTGCACACAGT
CGGGTTAAGTTGCTGCTCAACGGAGACGGAGTCCCATTCGGTTCAGAATCAAAAGATCGATTCAGATACAAGCTCAAATCAGTACGAACCCTTAAGCGAAGAATCCCTTT
ATCTGGTGCCTCTGCCTCCACTGGACTTTCCCCTTCTTCTTCCTCGCCTTCAGCTTTGAGGAAATCCGCTCAGCATGGTGCGGAGATGCGTCTTGGTGTGACGGGTGAGA
GCGTTTGCGGTGACGACGCCATTGTTGATCTTGACTGTGATTACGAGTTTGAGAGTGACGATCTGGCTTGCTTCAGAGGTCTTGTCTTGGATATTTCCTACAGGCCAGTC
AACGTTGTTTGTTGGAAGCGTGCTATTTGTTTGGAATTCATGGAGAAGGCTGATGTATTGGAATACTATGACCAGACCGTGAGTTCTCCAAGTGGATCCTTCTATATACC
AGCAGTCTTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAATTAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATT
GTTCATCACATGAGAGTTTGACCATTGACCATGTTTTGCCCATATCCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCAAAGAAA
GGTCAAAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATTCTTGCCATTCCTCTAACAAGTACCGCAATAAAGATGTT
GAAACTGAGAAAGGGGACCCCTGAAGAATGGCGCCAATATTTGTCAAGTGAGCAATAACCTGTATTTATAATGGCACTCGTAAATTCTTCTTTGCACATATGCACAAATT
GCACATAACATATGTACTTGGTTCTTCTAAGACTCTTATGGCACCTTGAAAGTAATTTATTCTATCACTAGTTCCCA
Protein sequenceShow/hide protein sequence
MDFSVRISTQMAQFTAHSRVKLLLNGDGVPFGSESKDRFRYKLKSVRTLKRRIPLSGASASTGLSPSSSSPSALRKSAQHGAEMRLGVTGESVCGDDAIVDLDCDYEFES
DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPISRGG
EWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ