; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030671 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030671
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHNHc domain-containing protein
Genome locationscaffold11:25924469..25929769
RNA-Seq ExpressionSpg030671
SyntenySpg030671
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia]1.6e-13892.42Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR   KLRS+RTLKRR PLSGASS  LSPS+SSA ALRKSA RV VRGESVSGDDAI+D+DYDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]3.6e-12786.88Show/hide
Query:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVS-GDDAIVDLDYDYEFESD
        MAQFTAHSRVKLLLNGDG+P     +DR  YKLRSVR   RR PLS  SS    S SS  ALRKS      +RV VR ESV+ GDDAIV  DYDYEFESD
Subjt:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVS-GDDAIVDLDYDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia]3.5e-13087.9Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVSGDDAIVDLDYDYEFESDD
        MAQFT  +RVKLLLNGDGVPF    +DRL YKLRSV    RRIPLS AS+ +SPSASSA ALRKSA     +RV VR ESVS D AIV LD DYEFESDD
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVSGDDAIVDLDYDYEFESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH

Query:  VLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        VLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  VLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]3.1e-13992.78Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR  +KLRSVRTLKRR PLSGASS  LSPS+SSA ALRKSA RV VRGESVSGDDAI+D+DYDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]8.5e-13791.1Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVSGDDAIVDLDYDYEFESDD
        MAQFTAHSRVKLLLNGDGVPF    +DR  YKLR VRTLKRRIPLSG     SPS SSA ALRKSA     +RV VRGESVSGDDAIVDLDYDYEFE+DD
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVSGDDAIVDLDYDYEFESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH

Query:  VLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        VLPI RGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  VLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hitse value%identityAlignment
A0A1S3CNK0 uncharacterized protein LOC1035029821.7e-12786.88Show/hide
Query:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVS-GDDAIVDLDYDYEFESD
        MAQFTAHSRVKLLLNGDG+P     +DR  YKLRSVR   RR PLS  SS    S SS  ALRKS      +RV VR ESV+ GDDAIV  DYDYEFESD
Subjt:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVS-GDDAIVDLDYDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease1.7e-12786.88Show/hide
Query:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVS-GDDAIVDLDYDYEFESD
        MAQFTAHSRVKLLLNGDG+P     +DR  YKLRSVR   RR PLS  SS    S SS  ALRKS      +RV VR ESV+ GDDAIV  DYDYEFESD
Subjt:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVS-GDDAIVDLDYDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1DFU6 uncharacterized protein LOC1110201221.7e-13087.9Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVSGDDAIVDLDYDYEFESDD
        MAQFT  +RVKLLLNGDGVPF    +DRL YKLRSV    RRIPLS AS+ +SPSASSA ALRKSA     +RV VR ESVS D AIV LD DYEFESDD
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSA-----LRVSVRGESVSGDDAIVDLDYDYEFESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDH

Query:  VLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        VLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  VLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC1114391701.5e-13992.78Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR  +KLRSVRTLKRR PLSGASS  LSPS+SSA ALRKSA RV VRGESVSGDDAI+D+DYDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC1114707251.5e-13992.78Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR  +KLRSVRTLKRR PLSGASS  LSPS+SSA ALRKSA RV VRGESVSGDDAI+D+DYDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASS-ALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23840.1 HNH endonuclease2.2e-9061.19Show/hide
Query:  MAQFTAHSRVKLLLNGDGVPFEDRLSYKLRSVRTLKRRIPLSGASSALSP-------------SASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYE-
        MA F+A  R+KLL + DG      LS+ + S    ++ + ++G  S L P             S+ S P  RK    +    +++  D+   + D+D + 
Subjt:  MAQFTAHSRVKLLLNGDGVPFEDRLSYKLRSVRTLKRRIPLSGASSALSP-------------SASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYE-

Query:  --FESDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSS
           E+DD L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS
Subjt:  --FESDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSS

Query:  HESLTIDHVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
         E+LTIDHV+P+ RGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  HESLTIDHVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAATTCACCGCACACAGTCGGGTTAAGTTGCTGCTCAACGGAGACGGAGTGCCATTTGAAGATCGATTGAGTTACAAGCTCAGATCAGTGCGAACCCTTAAGCG
TAGAATCCCTTTATCTGGTGCCTCCTCTGCACTTTCCCCTTCTGCATCCTCTGCTCCAGCTTTGAGGAAATCCGCTCTGCGTGTTAGTGTGAGAGGTGAGAGCGTTAGCG
GTGACGACGCCATTGTTGATCTTGACTACGATTACGAGTTTGAGAGTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTCAACGTTGTT
TGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTACGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGTCTT
AAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGGATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCAC
ATGAGAGTTTGACCATTGACCATGTTTTGCCCATATGCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCAAAGAAAGGTCAGAAA
ACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATTCCTCTAACGAGTACCGCAATAAAGATGTTGAAACTGAG
AAAGGGGACCCCTGAAGAATGGCGTCAATATCTGTCAAGTGAGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCAATTCACCGCACACAGTCGGGTTAAGTTGCTGCTCAACGGAGACGGAGTGCCATTTGAAGATCGATTGAGTTACAAGCTCAGATCAGTGCGAACCCTTAAGCG
TAGAATCCCTTTATCTGGTGCCTCCTCTGCACTTTCCCCTTCTGCATCCTCTGCTCCAGCTTTGAGGAAATCCGCTCTGCGTGTTAGTGTGAGAGGTGAGAGCGTTAGCG
GTGACGACGCCATTGTTGATCTTGACTACGATTACGAGTTTGAGAGTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTCAACGTTGTT
TGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTACGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGTCTT
AAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGGATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCAC
ATGAGAGTTTGACCATTGACCATGTTTTGCCCATATGCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCAAAGAAAGGTCAGAAA
ACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATTCCTCTAACGAGTACCGCAATAAAGATGTTGAAACTGAG
AAAGGGGACCCCTGAAGAATGGCGTCAATATCTGTCAAGTGAGCAATGA
Protein sequenceShow/hide protein sequence
MAQFTAHSRVKLLLNGDGVPFEDRLSYKLRSVRTLKRRIPLSGASSALSPSASSAPALRKSALRVSVRGESVSGDDAIVDLDYDYEFESDDLACFRGLVLDISYRPVNVV
CWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPICRGGEWTWENLVAACVKCNSKKGQK
TVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ