; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019513 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019513
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionHNHc domain-containing protein
Genome locationChr04:22679398..22684381
RNA-Seq ExpressionHG10019513
SyntenyHG10019513
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia]2.2e-14090.78Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R KLRS+RTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_004141149.1 uncharacterized protein LOC101207660 [Cucumis sativus]2.2e-14092.83Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA
        MAQFTAHTR+KLLLNGDG+PFGSE KDRFRYKLRSVR   RR PLS PSSS TSSTSALRK TQHAAEVRVGVR ESV+ GDD VV  DYDYE E+DDLA
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL

Query:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        PISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]4.8e-14393.53Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC
        MAQFTAH+RVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PSSSTSSTSALRKSTQHAAEVRVGVR ESV+ GDDA+V  DYDYEFE+DDLAC
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]4.4e-14191.13Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]3.3e-15297.83Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETDDLACF
        MAQFTAH+RVKLLLNGDGVPFGSEPKDRFRYKLR VRTLKRRIPLSGPS STSS SALRKS QHAAEVRVGVRGESVSGDDA+VDLDYDYEFETDDLACF
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hitse value%identityAlignment
A0A0A0LG99 HNHc domain-containing protein1.1e-14092.83Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA
        MAQFTAHTR+KLLLNGDG+PFGSE KDRFRYKLRSVR   RR PLS PSSS TSSTSALRK TQHAAEVRVGVR ESV+ GDD VV  DYDYE E+DDLA
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL

Query:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        PISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A1S3CNK0 uncharacterized protein LOC1035029822.3e-14393.53Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC
        MAQFTAH+RVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PSSSTSSTSALRKSTQHAAEVRVGVR ESV+ GDDA+V  DYDYEFE+DDLAC
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease2.3e-14393.53Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC
        MAQFTAH+RVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PSSSTSSTSALRKSTQHAAEVRVGVR ESV+ GDDA+V  DYDYEFE+DDLAC
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC1114391702.2e-14191.13Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC1114707252.2e-14191.13Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23840.1 HNH endonuclease4.8e-9362.9Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRS-------VRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYE-F
        MA F+A  R+KLL + DG+ FG + +D+FR  L         V     R+       S+ S    RK  +         +   +  D+   D D D +  
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRS-------VRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYE-F

Query:  ETDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES
        ETDD L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS E+
Subjt:  ETDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  LTIDHVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAATTCACTGCACACACTCGGGTTAAGTTGCTGCTCAACGGTGACGGAGTTCCATTCGGTTCAGAACCGAAAGATCGATTCAGATACAAGCTCAGATCA
GTACGAACCCTCAAACGCAGGATTCCCCTGTCTGGTCCCTCCTCTTCTACATCTTCTACTTCAGCTTTGAGGAAATCCACCCAGCATGCTGCGGAGGTGCGTGTT
GGTGTGAGGGGTGAGAGCGTTAGCGGTGACGATGCCGTTGTTGATCTTGACTATGACTACGAATTTGAGACTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTG
GATATTTCCTACAGGCCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTTTTGGAATACTATGACCAGACTGTGAGT
TCTCCAAGTGGATCCTTCTATATACCAGCAGTCTTACGGGTTCCCCATTTATTGCAAGTTGTTAAGAGGAGGAGAATCAAGAACTCTTTGAGTCGTAAAAACATA
CTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCACATGAGAGTTTGACCATTGACCATGTTCTGCCCATATCCCGGGGTGGAGAATGGACATGGGAAAAC
CTGGTTGCTGCCTGTGTACAATGCAATTCAAAGAAAGGCCAAAAAACAGTAGAAGAAGCAAATATGAAGCTGAAGAAAACTCCTAAGGCCCCTAAAGATTATGAT
ATACTTGCCATTCCTCTAACAAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGGGACTCCTGAAGAATGGCGTCAATATCTATCAAGTGAGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCAATTCACTGCACACACTCGGGTTAAGTTGCTGCTCAACGGTGACGGAGTTCCATTCGGTTCAGAACCGAAAGATCGATTCAGATACAAGCTCAGATCA
GTACGAACCCTCAAACGCAGGATTCCCCTGTCTGGTCCCTCCTCTTCTACATCTTCTACTTCAGCTTTGAGGAAATCCACCCAGCATGCTGCGGAGGTGCGTGTT
GGTGTGAGGGGTGAGAGCGTTAGCGGTGACGATGCCGTTGTTGATCTTGACTATGACTACGAATTTGAGACTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTG
GATATTTCCTACAGGCCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTTTTGGAATACTATGACCAGACTGTGAGT
TCTCCAAGTGGATCCTTCTATATACCAGCAGTCTTACGGGTTCCCCATTTATTGCAAGTTGTTAAGAGGAGGAGAATCAAGAACTCTTTGAGTCGTAAAAACATA
CTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCACATGAGAGTTTGACCATTGACCATGTTCTGCCCATATCCCGGGGTGGAGAATGGACATGGGAAAAC
CTGGTTGCTGCCTGTGTACAATGCAATTCAAAGAAAGGCCAAAAAACAGTAGAAGAAGCAAATATGAAGCTGAAGAAAACTCCTAAGGCCCCTAAAGATTATGAT
ATACTTGCCATTCCTCTAACAAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGGGACTCCTGAAGAATGGCGTCAATATCTATCAAGTGAGCAATGA
Protein sequenceShow/hide protein sequence
MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETDDLACFRGLVL
DISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWEN
LVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ