; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025081 (gene) of Chayote v1 genome

Gene IDSed0025081
OrganismSechium edule (Chayote v1)
DescriptionHNHc domain-containing protein
Genome locationLG12:2914418..2919877
RNA-Seq ExpressionSed0025081
SyntenySed0025081
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia]4.6e-13087.14Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL
        MAQFTAHSRVKLLLNGD G PFGSEPK RSR KLRS+RTLKRR      +STG+SPS  SSS SALRKSAQ       + SGDDAI+D+DYDYEFESDDL
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSF IPAVLRVPHLLQVVKRRR+  +LSRKNILYRD YTCQYCSSHESLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LP+SRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]2.4e-12382.17Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD-----------SGDDAIVDLDYDYE
        MAQFTAHSRVKLLLNGD G P GSE K R RYKLRSVR  +  +          +PSS +SS SALRKS QH A+            GDDAIV  DYDYE
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD-----------SGDDAIVDLDYDYE

Query:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHES
        FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSF IPAVLRVPHLLQVVKRRR+ N+LSRKNILYRD YTCQYCSSHES
Subjt:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LTIDHVLPISRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia]4.2e-12383.51Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD----------SGDDAIVDLDYDYEF
        MAQFT  +RVKLLLNGD G PFGSE K R RYKLRSVR      I L  ASTGISPS  +SS SALRKSAQH A+          S D AIV LD DYEF
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD----------SGDDAIVDLDYDYEF

Query:  ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESL
        ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSF IPAVLRVPHLLQVVKRRR+ N+LSRKNILYRD YTCQYCSSH+SL
Subjt:  ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESL

Query:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        TIDHVLPISRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]9.2e-13187.5Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL
        MAQFTAHSRVKLLLNGD G PFGSEPK RSR+KLRSVRTLKRR      +STG+SPS  SSS SALRKSAQ       + SGDDAI+D+DYDYEFESDDL
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSF IPAVLRVPHLLQVVKRRR+  +LSRKNILYRD YTCQYCSSHESLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LP+SRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]1.7e-12986.32Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD----------SGDDAIVDLDYDYEF
        MAQFTAHSRVKLLLNGD G PFGSEPK R RYKLR VRTLKRR I L G      PS  +SS SALRKSAQH A+          SGDDAIVDLDYDYEF
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD----------SGDDAIVDLDYDYEF

Query:  ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESL
        E+DDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSF IPAVLRVPHLLQVVKRRR+ N+LSRKNILYRD YTCQYCSSHESL
Subjt:  ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESL

Query:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        TIDHVLPISRGGEWTWENLVAACV+CNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hitse value%identityAlignment
A0A1S3CNK0 uncharacterized protein LOC1035029821.2e-12382.17Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD-----------SGDDAIVDLDYDYE
        MAQFTAHSRVKLLLNGD G P GSE K R RYKLRSVR  +  +          +PSS +SS SALRKS QH A+            GDDAIV  DYDYE
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD-----------SGDDAIVDLDYDYE

Query:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHES
        FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSF IPAVLRVPHLLQVVKRRR+ N+LSRKNILYRD YTCQYCSSHES
Subjt:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LTIDHVLPISRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease1.2e-12382.17Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD-----------SGDDAIVDLDYDYE
        MAQFTAHSRVKLLLNGD G P GSE K R RYKLRSVR  +  +          +PSS +SS SALRKS QH A+            GDDAIV  DYDYE
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD-----------SGDDAIVDLDYDYE

Query:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHES
        FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSF IPAVLRVPHLLQVVKRRR+ N+LSRKNILYRD YTCQYCSSHES
Subjt:  FESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LTIDHVLPISRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1DFU6 uncharacterized protein LOC1110201222.0e-12383.51Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD----------SGDDAIVDLDYDYEF
        MAQFT  +RVKLLLNGD G PFGSE K R RYKLRSVR      I L  ASTGISPS  +SS SALRKSAQH A+          S D AIV LD DYEF
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPAD----------SGDDAIVDLDYDYEF

Query:  ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESL
        ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSF IPAVLRVPHLLQVVKRRR+ N+LSRKNILYRD YTCQYCSSH+SL
Subjt:  ESDDLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESL

Query:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        TIDHVLPISRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  TIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC1114391704.5e-13187.5Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL
        MAQFTAHSRVKLLLNGD G PFGSEPK RSR+KLRSVRTLKRR      +STG+SPS  SSS SALRKSAQ       + SGDDAI+D+DYDYEFESDDL
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSF IPAVLRVPHLLQVVKRRR+  +LSRKNILYRD YTCQYCSSHESLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LP+SRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC1114707254.5e-13187.5Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL
        MAQFTAHSRVKLLLNGD G PFGSEPK RSR+KLRSVRTLKRR      +STG+SPS  SSS SALRKSAQ       + SGDDAI+D+DYDYEFESDDL
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQH-----PADSGDDAIVDLDYDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSF IPAVLRVPHLLQVVKRRR+  +LSRKNILYRD YTCQYCSSHESLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        LP+SRGGEWTWENLVAACVKCNSKKG KTVEEANMKLKKTPKAPK+YDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23840.1 HNH endonuclease7.4e-8657.88Show/hide
Query:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPADSGDDAIVDLDY------------DY
        MA F+A  R+KLL + D G  FG + + + R  L           ++ G  + + P   S      R  + +        I DLD             ++
Subjt:  MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPADSGDDAIVDLDY------------DY

Query:  EFESDD--------LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYT
        +F+ DD        L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSF IPAVLRVPHLLQVVKRRRV N+LSRKNIL RD YT
Subjt:  EFESDD--------LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYT

Query:  CQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        CQYCSS E+LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KG KT +EA+MKL K PK PK+YDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  CQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS

AT3G47490.1 HNH endonuclease9.2e-0439.62Show/hide
Query:  NILYRDKYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKT
        NI++R    C  C  H+    DH++P S+GG+ T EN      K N  KG+KT
Subjt:  NILYRDKYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKKGHKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCAATTCACCGCACACAGTCGGGTTAAGCTGCTGCTCAACGGAGACGGAGGGGCTCCATTCGGTTCAGAGCCTAAATGTCGATCGAGATACAAGCTCAGATCAGT
ACGAACCCTAAAGCGAAGAATAATCCATTTATGTGGTGCCTCCACTGGAATTTCACCTTCTTCTCCTTCATCCTCGCCTTCAGCTTTGAGGAAATCTGCTCAGCATCCTG
CGGATAGCGGCGACGACGCCATTGTTGATCTTGACTATGATTACGAGTTTGAGAGTGACGATCTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTC
AATGTTGTTTGTTGGAAGCGTGCTATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTACGATCAGACTGTAAGTTCTCCAAGTGGATCCTTCAATATACC
AGCAGTTTTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAGTCAGTAACACTTTGAGTCGTAAAAACATCCTGTATCGGGACAAATACACTTGCCAGTATT
GTTCATCACATGAGAGTTTGACCATTGATCATGTTTTGCCCATATCCCGGGGTGGAGAATGGACATGGGAAAATCTGGTCGCAGCCTGCGTAAAATGCAATTCAAAGAAA
GGTCATAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGAATATGATATACTTGCCATTCCTCTAACCAGTACCGCAATTAAGATGTT
GAAACTGAGAAAGGGGACCCCTGAAGAATGGCGTCAATATTTGTCAAGTGAGCAATGA
mRNA sequenceShow/hide mRNA sequence
AAAGTTTATGGAAAATGAAGATATGAAGTTGAAATGTGCGGGAAAAATATGTGGTGAAGGACTTTTCCGTGTCCATGTAATGTTCTTTTCTTCTTCCTTTCGAAGTTCAA
TATCTTCATCTCCATTGATTGCTCACAGGAGCCAATCTCAGCTCAGAAATGGCGCAATTCACCGCACACAGTCGGGTTAAGCTGCTGCTCAACGGAGACGGAGGGGCTCC
ATTCGGTTCAGAGCCTAAATGTCGATCGAGATACAAGCTCAGATCAGTACGAACCCTAAAGCGAAGAATAATCCATTTATGTGGTGCCTCCACTGGAATTTCACCTTCTT
CTCCTTCATCCTCGCCTTCAGCTTTGAGGAAATCTGCTCAGCATCCTGCGGATAGCGGCGACGACGCCATTGTTGATCTTGACTATGATTACGAGTTTGAGAGTGACGAT
CTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTCAATGTTGTTTGTTGGAAGCGTGCTATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGA
ATACTACGATCAGACTGTAAGTTCTCCAAGTGGATCCTTCAATATACCAGCAGTTTTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAGTCAGTAACACTT
TGAGTCGTAAAAACATCCTGTATCGGGACAAATACACTTGCCAGTATTGTTCATCACATGAGAGTTTGACCATTGATCATGTTTTGCCCATATCCCGGGGTGGAGAATGG
ACATGGGAAAATCTGGTCGCAGCCTGCGTAAAATGCAATTCAAAGAAAGGTCATAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGA
ATATGATATACTTGCCATTCCTCTAACCAGTACCGCAATTAAGATGTTGAAACTGAGAAAGGGGACCCCTGAAGAATGGCGTCAATATTTGTCAAGTGAGCAATGACATG
TATTTATAATGGCACTTGTAAATTCTTCTTTGCACATATTGTACAAATTGCACATATCATAATCATTACATACTTGATTGTTAGATTCTTATGCATCTTGAAAGTAATTT
GTTCTATCATAATTACCATATGGC
Protein sequenceShow/hide protein sequence
MAQFTAHSRVKLLLNGDGGAPFGSEPKCRSRYKLRSVRTLKRRIIHLCGASTGISPSSPSSSPSALRKSAQHPADSGDDAIVDLDYDYEFESDDLACFRGLVLDISYRPV
NVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFNIPAVLRVPHLLQVVKRRRVSNTLSRKNILYRDKYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVKCNSKK
GHKTVEEANMKLKKTPKAPKEYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ