CuGenDBv2

CaUC10G191510 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC10G191510
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: HNHc domain-containing protein
Genome location: Ciama_Chr10:25804791..25810628
RNA-Seq Expression: CaUC10G191510
Synteny: CaUC10G191510
Gene Ontology terms:
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domains:
IPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5
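The genome location field uses a `seqid:start..end` layout. The following is a minimal sketch of how such a string might be split into its parts; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for "seqid:start..end", e.g. "Ciama_Chr10:25804791..25810628".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> tuple[str, int, int]:
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Ciama_Chr10:25804791..25810628")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 5838 bases on Ciama_Chr10.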


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-141; % identity: 91.49)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD
        MAQFTAHSRVKLLLNGDGVPFGSEPKDR R KLRS+RTLKRR PLSG      SPS+SS SALRKSA+     RVGVRGESVSGDDAI+D+DYDYEFE+D
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_004141149.1 uncharacterized protein LOC101207660 [Cucumis sativus] (e value: 2.1e-138; % identity: 91.04)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPS-TSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLA
        MAQFTAH+R+KLLLNGDG+PFGSE KDRFRYKLRSVR   RR PLS PS S TSSTSALRK  +HAAEVRVGVR ESV+ GDD +V  DYDYE E+DDLA
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPS-TSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL

Query:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        PISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo] (e value: 5.3e-142; % identity: 93.17)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLAC
        MAQFTAHSRVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PS STSSTSALRKS +HAAEVRVGVR ESV+ GDDAIV  DYDYEFE+DDLAC
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata] (e value: 4.0e-142; % identity: 91.84)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD
        MAQFTAHSRVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG      SPS+SS SALRKSA+     RVGVRGESVSGDDAI+D+DYDYEFE+D
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida] (e value: 1.0e-153; % identity: 98.92)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETDDLACF
        MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLR VRTLKRRIPLSGPSPSTSS SALRKSA+HAAEVRVGVRGESVSGDDAIVDLDYDYEFETDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LG99 HNHc domain-containing protein (e value: 1.0e-138; % identity: 91.04)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPS-TSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLA
        MAQFTAH+R+KLLLNGDG+PFGSE KDRFRYKLRSVR   RR PLS PS S TSSTSALRK  +HAAEVRVGVR ESV+ GDD +V  DYDYE E+DDLA
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPS-TSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL

Query:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        PISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A1S3CNK0 uncharacterized protein LOC103502982 (e value: 2.5e-142; % identity: 93.17)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLAC
        MAQFTAHSRVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PS STSSTSALRKS +HAAEVRVGVR ESV+ GDDAIV  DYDYEFE+DDLAC
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease (e value: 2.5e-142; % identity: 93.17)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLAC
        MAQFTAHSRVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PS STSSTSALRKS +HAAEVRVGVR ESV+ GDDAIV  DYDYEFE+DDLAC
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVS-GDDAIVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC111439170 (e value: 1.9e-142; % identity: 91.84)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD
        MAQFTAHSRVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG      SPS+SS SALRKSA+     RVGVRGESVSGDDAI+D+DYDYEFE+D
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC111470725 (e value: 1.9e-142; % identity: 91.84)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD
        MAQFTAHSRVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG      SPS+SS SALRKSA+     RVGVRGESVSGDDAI+D+DYDYEFE+D
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGP-----SPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G23840.1 HNH endonuclease (e value: 2.8e-93; % identity: 63.25)
Query:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRS-------VRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYE-F
        MA F+A  R+KLL + DG+ FG + +D+FR  L         V     R+       S+ S    RK  R         +   +  D+   D D D +  
Subjt:  MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRS-------VRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYE-F

Query:  ETDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES
        ETDD L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS E+
Subjt:  ETDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  LTIDHVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS


Sequences
CDS sequence
ATGGCCCAATTCACTGCACACAGTCGGGTTAAGTTGCTGCTCAACGGCGACGGAGTTCCATTCGGTTCAGAACCCAAAGATCGATTCAGATACAAGCTCAGATCAGTACG
AACCCTCAAACGCAGGATCCCCTTGTCTGGTCCCTCCCCTTCTACATCCTCTACTTCAGCTTTGAGGAAATCCGCTCGGCATGCTGCGGAGGTGCGTGTTGGTGTGAGGG
GTGAGAGCGTTAGCGGTGACGACGCCATTGTTGATCTTGACTACGACTACGAGTTTGAGACTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGG
CCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTCTTGGAATACTATGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTA
TATACCAGCAGTCTTACGGGTTCCCCATTTATTGCAAGTTGTTAAGAGGAGGAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTC
AGTATTGTTCATCACATGAGAGTTTGACCATTGACCATGTTTTGCCCATATCCCGGGGGGGAGAATGGACATGGGAAAACCTGGTTGCTGCCTGTGTACAATGCAATTCA
AAGAAAGGCCAAAAAACCGTAGAAGAAGCAAATATGAAGCTGAAAAAAACTCCTAAGGCCCCTAAAGATTATGATATACTTGCCATTCCTCTAACAAGTACCGCAATAAA
GATGTTGAAACTGAGAAAGGGGACCCCTGAAGAATGGCGTCAATATCTATCAAGTGAGCAATGA
mRNA sequence
GCTGATAAAGCAAATTTTGGAAACCGAAGATATGAAGTGAAATGTGCGGGAAAAATATGTGGTGATGGATTTTTCCGTGCGCATGTAATGTTCGTTTCGTCTTCCCTCAA
CTACTGGGCTTCGATTTCTTCATCCGACGAGCAGCAATCTCGATTTCTATTTCCATTGACTGCTTGCCGGACCCAATTTCAGTTCAGCAATGGCCCAATTCACTGCACAC
AGTCGGGTTAAGTTGCTGCTCAACGGCGACGGAGTTCCATTCGGTTCAGAACCCAAAGATCGATTCAGATACAAGCTCAGATCAGTACGAACCCTCAAACGCAGGATCCC
CTTGTCTGGTCCCTCCCCTTCTACATCCTCTACTTCAGCTTTGAGGAAATCCGCTCGGCATGCTGCGGAGGTGCGTGTTGGTGTGAGGGGTGAGAGCGTTAGCGGTGACG
ACGCCATTGTTGATCTTGACTACGACTACGAGTTTGAGACTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTTAACGTTGTTTGTTGG
AAACGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTCTTGGAATACTATGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGTCTTACGGGT
TCCCCATTTATTGCAAGTTGTTAAGAGGAGGAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCACATGAGA
GTTTGACCATTGACCATGTTTTGCCCATATCCCGGGGGGGAGAATGGACATGGGAAAACCTGGTTGCTGCCTGTGTACAATGCAATTCAAAGAAAGGCCAAAAAACCGTA
GAAGAAGCAAATATGAAGCTGAAAAAAACTCCTAAGGCCCCTAAAGATTATGATATACTTGCCATTCCTCTAACAAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGG
GACCCCTGAAGAATGGCGTCAATATCTATCAAGTGAGCAATGACCGTGTATTTAAGTGGTACTTGTAAATTCTTCTTTGCACATATTGCACAAATTCTACGTAATACTTG
GTTCTTCTAAGACTTCTATGGCAACTTCTAAATGGCTTAATCTATCATTAGTTCTCAC
Protein sequence
MAQFTAHSRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSPSTSSTSALRKSARHAAEVRVGVRGESVSGDDAIVDLDYDYEFETDDLACFRGLVLDISYR
PVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVQCNS
KKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
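
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code applies to this gene; the helper name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i : i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCCCAATTCACTGCACACAGTCGGGTT"  # first 30 nt of the CDS
cds_end = "AGTGAGCAATGA"                      # last 12 nt of the CDS
print(translate(cds_start))  # MAQFTAHSRV
print(translate(cds_end))    # SEQ
```

Both fragments agree with the start (MAQFTAHSRV) and end (SEQ) of the protein sequence given above; translating the full 834-nt CDS the same way would be expected to give the complete 277-residue protein.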