CuGenDBv2

MC05g0952 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0952
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: HNHc domain-containing protein
Genome location: MC05:10076715..10082606
RNA-Seq Expression: MC05g0952
Synteny: MC05g0952
Gene Ontology terms:
  GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
  GO:0004519 - endonuclease activity (molecular function)
InterPro domains:
  IPR003615 - HNH nuclease
  IPR029471 - HNH endonuclease 5
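The genome location above follows a `seqid:start..end` pattern. A minimal sketch of splitting it into parts is given here; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("MC05:10076715..10082606")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`.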


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia] (e value 2.65e-169, identity 88.3%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD
        MAQFT  +RVKLLLNGDGVPFGSE KDR R KLRS+R    R PLS A+STG+SPS+SSASALRKSAQ     RVGVR ESVS D AI+ +D DYEFESD
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        HVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo] (e value 3.25e-172, identity 90%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR--RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSD-DGAIVGLDCDYEFESDDL
        MAQFT  +RVKLLLNGDG+P GSESKDR RYKLRSVR  R PLSA S+    S SS SALRKS QH AE+RVGVRDESV+  D AIVG D DYEFESDDL
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR--RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSD-DGAIVGLDCDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia] (e value 1.68e-197, identity 100%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACF
        MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACF
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPI
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPI

Query:  SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
Subjt:  SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata] (e value 3.23e-170, identity 88.65%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD
        MAQFT  +RVKLLLNGDGVPFGSE KDR R+KLRSVR    R PLS A+STG+SPS+SSASALRKSAQ     RVGVR ESVS D AI+ +D DYEFESD
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        HVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida] (e value 7.11e-174, identity 90.04%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDD
        MAQFT  +RVKLLLNGDGVPFGSE KDR RYKLR VR    RIPLS    G SPS SSASALRKSAQH AE+RVGVR ESVS D AIV LD DYEFE+DD
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        VLPISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CNK0 uncharacterized protein LOC103502982 (e value 1.58e-172, identity 90%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR--RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSD-DGAIVGLDCDYEFESDDL
        MAQFT  +RVKLLLNGDG+P GSESKDR RYKLRSVR  R PLSA S+    S SS SALRKS QH AE+RVGVRDESV+  D AIVG D DYEFESDDL
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR--RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSD-DGAIVGLDCDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

A0A5D3E2H2 HNH endonuclease (e value 1.58e-172, identity 90%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR--RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSD-DGAIVGLDCDYEFESDDL
        MAQFT  +RVKLLLNGDG+P GSESKDR RYKLRSVR  R PLSA S+    S SS SALRKS QH AE+RVGVRDESV+  D AIVG D DYEFESDDL
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR--RIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSD-DGAIVGLDCDYEFESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

A0A6J1DFU6 uncharacterized protein LOC111020122 (e value 8.16e-198, identity 100%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACF
        MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACF
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPI
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPI

Query:  SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
Subjt:  SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

A0A6J1F2H9 uncharacterized protein LOC111439170 (e value 1.56e-170, identity 88.65%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD
        MAQFT  +RVKLLLNGDGVPFGSE KDR R+KLRSVR    R PLS A+STG+SPS+SSASALRKSAQ     RVGVR ESVS D AI+ +D DYEFESD
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        HVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

A0A6J1IAG4 uncharacterized protein LOC111470725 (e value 1.56e-170, identity 88.65%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD
        MAQFT  +RVKLLLNGDGVPFGSE KDR R+KLRSVR    R PLS A+STG+SPS+SSASALRKSAQ     RVGVR ESVS D AI+ +D DYEFESD
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVR----RIPLS-AASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTID

Query:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
        HVLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  HVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G23840.1 HNH endonuclease (e value 9.1e-92, identity 60.36%)
Query:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSV----RRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYE-FESD
        MA F+   R+KLL + DG+ FG +S+D+ R  L         +P+  +         S+ +     + + ++    ++  + +D      D D +  E+D
Subjt:  MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSV----RRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYE-FESD

Query:  D-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTI
        D L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTV+SP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS ++LTI
Subjt:  D-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTI

Query:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        DHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  DHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS

AT3G47490.1 HNH endonuclease (e value 5.5e-04, identity 41.51%)
Query:  NILYRDNYTCQYCSSHDSLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT
        NI++R    C  C  HD    DH++P S+GG+ T EN      K N  KG KT
Subjt:  NILYRDNYTCQYCSSHDSLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT
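The % identity figures can be related to the alignment blocks through the middle line of each block. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The function name `midline_identity` is hypothetical and this is not necessarily how the database computes its values. It is applied to the AT3G47490.1 alignment above.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity: identical columns divided by alignment length.

    Letters on the midline are counted as identical positions; '+' and
    spaces are not. The alignment length is taken from the query line,
    which includes any '-' gap columns.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


query = "NILYRDNYTCQYCSSHDSLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKT"
midline = "NI++R    C  C  HD    DH++P S+GG+ T EN      K N  KG KT"

print(round(midline_identity(query, midline), 2))
```

Dividing by the query line length rather than the midline length keeps the result stable even if trailing spaces of the middle line were trimmed during copying.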


Sequences
CDS sequence
ATGGCCCAATTCACCACACTCAATCGTGTAAAGTTGCTGCTGAACGGAGATGGAGTGCCGTTCGGTTCTGAATCCAAAGACCGATTGAGATACAAGCTCAGATCAGTACG
AAGAATTCCTTTATCTGCTGCCTCCACTGGAATTTCTCCTTCTGCGTCCTCTGCTTCAGCTTTGAGGAAATCCGCTCAACATGTTGCGGAGATGCGTGTTGGTGTGAGGG
ATGAGAGCGTTAGCGATGACGGCGCCATTGTTGGTCTTGACTGTGACTACGAGTTTGAGAGCGACGATTTGGCTTGTTTCAGAGGCCTGGTCTTGGATATTTCGTACAGG
CCAGTCAATGTTGTTTGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTGTTGGAATACTATGACCAGACTGTGAATTCTCCAAGTGGATCCTTCTA
TATACCAGCAGTCTTAAGGGTTCCTCATTTATTGCAAGTTGTAAAAAGAAGAAGAATCAAGAACTCTTTAAGTCGTAAAAACATACTTTATAGGGACAATTACACTTGTC
AGTATTGTTCATCACATGATAGTTTGACGATTGATCATGTTTTACCCATATCCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTGAAATGCAATTCA
AAGAAAGGTCAAAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATCCCTCTAACAAGTACTGCAATAAA
GATGTTGAAACTGAGAAAGGGGACCCCTGAAGAATGGCGTCAATATCTGTCGAATGAGCAATGA
mRNA sequence
GCACGTGCTAGTTAATTACACGACAAAATCCAGGACTCCAATTTTCTGCCGTGACGAGCAATTTTTGGAAAAGGAAGATATGAAGTAAAATGTGCGGGAAAAATATGTGG
TTACGGCCTTTTCCCTGAGCTTCTGATGTTCTTTTCTTCTGCTTCTCAATCACTGGGCTTTGATTTGTTCATCTGATGCACAAACTCGACTTTGATTTCCACTGACTGCT
CGTAGAGAAAATTCGAGTTCCAGAAAATGGCCCAATTCACCACACTCAATCGTGTAAAGTTGCTGCTGAACGGAGATGGAGTGCCGTTCGGTTCTGAATCCAAAGACCGA
TTGAGATACAAGCTCAGATCAGTACGAAGAATTCCTTTATCTGCTGCCTCCACTGGAATTTCTCCTTCTGCGTCCTCTGCTTCAGCTTTGAGGAAATCCGCTCAACATGT
TGCGGAGATGCGTGTTGGTGTGAGGGATGAGAGCGTTAGCGATGACGGCGCCATTGTTGGTCTTGACTGTGACTACGAGTTTGAGAGCGACGATTTGGCTTGTTTCAGAG
GCCTGGTCTTGGATATTTCGTACAGGCCAGTCAATGTTGTTTGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTGTTGGAATACTATGACCAGACT
GTGAATTCTCCAAGTGGATCCTTCTATATACCAGCAGTCTTAAGGGTTCCTCATTTATTGCAAGTTGTAAAAAGAAGAAGAATCAAGAACTCTTTAAGTCGTAAAAACAT
ACTTTATAGGGACAATTACACTTGTCAGTATTGTTCATCACATGATAGTTTGACGATTGATCATGTTTTACCCATATCCCGGGGTGGAGAATGGACATGGGAAAATCTGG
TTGCTGCCTGTGTGAAATGCAATTCAAAGAAAGGTCAAAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCC
ATCCCTCTAACAAGTACTGCAATAAAGATGTTGAAACTGAGAAAGGGGACCCCTGAAGAATGGCGTCAATATCTGTCGAATGAGCAATGACATGTATTTATAGTGGCACT
TGTAAATTCCTCTTTGCACATATCCCATGAATTGCACATATCATTTATGTACTTAATTCTTCCAACACTCTTATGGCACCTTAGAAGTAATTTATTCTATCTCTAATTCC
ATGGGTACAAGGAACCGACCCAACCCGAAAAATTGCCATGAACCAATTCAGTCGTTCTCTTTCTG
Protein sequence
MAQFTTLNRVKLLLNGDGVPFGSESKDRLRYKLRSVRRIPLSAASTGISPSASSASALRKSAQHVAEMRVGVRDESVSDDGAIVGLDCDYEFESDDLACFRGLVLDISYR
PVNVVCWKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPISRGGEWTWENLVAACVKCNS
KKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSNEQ
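The CDS and protein sequences above can be checked against each other by translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and an in-frame CDS; the `translate` helper is hypothetical and only the first and last codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order and '*' marking stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last six codons of the CDS shown above.
print(translate("ATGGCCCAATTCACCACACTC"))
print(translate("CTGTCGAATGAGCAATGA"))
```

Passing the full CDS, joined into a single string without line breaks, allows the result to be compared with the protein sequence listed in this record.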