CuGenDBv2

Lag0022121 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022121
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: HNHc domain-containing protein
Genome location: chr7:18889294..18894263
RNA-Seq Expression: Lag0022121
Synteny: Lag0022121
Gene Ontology terms: NA
InterPro domains: IPR003615 - HNH nuclease; IPR029471 - HNH endonuclease 5
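The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of splitting it into parts and computing the span is given here; the helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the page does not state.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:18889294..18894263")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4970 bases on chr7.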


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-139; % identity: 92.42)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR   KLRS+RTLKRR PLSGASS+ LSPS+SSASALRKSA RV VRGESVSGDDAI+D++YDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo] (e value: 6.1e-127; % identity: 86.69)
Query:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVS-GDDAIVDLNYDYEFESDDLAC
        MAQFTAHSRVKLLLNGDG+P     +DR  YKLRSVR   RR PLS  SSS  S SA   S    + +RV VR ESV+ GDDAIV  +YDYEFESDDLAC
Subjt:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVS-GDDAIVDLNYDYEFESDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        I RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia] (e value: 1.9e-128; % identity: 87.59)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSA-----LRVSVRGESVSGDDAIVDLNYDYEFESD
        MAQFT  +RVKLLLNGDGVPF    +DRL YKLRSV    RRIPLS A+S+ +SPSASSASALRKSA     +RV VR ESVS D AIV L+ DYEFESD
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSA-----LRVSVRGESVSGDDAIVDLNYDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata] (e value: 2.8e-140; % identity: 92.78)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR  +KLRSVRTLKRR PLSGASS+ LSPS+SSASALRKSA RV VRGESVSGDDAI+D++YDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida] (e value: 1.9e-136; % identity: 90.78)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSA-----LRVSVRGESVSGDDAIVDLNYDYEFESD
        MAQFTAHSRVKLLLNGDGVPF    +DR  YKLR VRTLKRRIPLSG      SPS SSASALRKSA     +RV VRGESVSGDDAIVDL+YDYEFE+D
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSA-----LRVSVRGESVSGDDAIVDLNYDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPI RGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CNK0 uncharacterized protein LOC103502982 (e value: 3.0e-127; % identity: 86.69)
Query:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVS-GDDAIVDLNYDYEFESDDLAC
        MAQFTAHSRVKLLLNGDG+P     +DR  YKLRSVR   RR PLS  SSS  S SA   S    + +RV VR ESV+ GDDAIV  +YDYEFESDDLAC
Subjt:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVS-GDDAIVDLNYDYEFESDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        I RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease (e value: 3.0e-127; % identity: 86.69)
Query:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVS-GDDAIVDLNYDYEFESDDLAC
        MAQFTAHSRVKLLLNGDG+P     +DR  YKLRSVR   RR PLS  SSS  S SA   S    + +RV VR ESV+ GDDAIV  +YDYEFESDDLAC
Subjt:  MAQFTAHSRVKLLLNGDGVP----FEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVS-GDDAIVDLNYDYEFESDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        I RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1DFU6 uncharacterized protein LOC111020122 (e value: 9.2e-129; % identity: 87.59)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSA-----LRVSVRGESVSGDDAIVDLNYDYEFESD
        MAQFT  +RVKLLLNGDGVPF    +DRL YKLRSV    RRIPLS A+S+ +SPSASSASALRKSA     +RV VR ESVS D AIV L+ DYEFESD
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSA-----LRVSVRGESVSGDDAIVDLNYDYEFESD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLPI RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS+EQ
Subjt:  HVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC111439170 (e value: 1.4e-140; % identity: 92.78)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR  +KLRSVRTLKRR PLSGASS+ LSPS+SSASALRKSA RV VRGESVSGDDAI+D++YDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC111470725 (e value: 1.4e-140; % identity: 92.78)
Query:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF
        MAQFTAHSRVKLLLNGDGVPF    +DR  +KLRSVRTLKRR PLSGASS+ LSPS+SSASALRKSA RV VRGESVSGDDAI+D++YDYEFESDDLACF
Subjt:  MAQFTAHSRVKLLLNGDGVPF----EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTIDHVLP+
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
         RGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  CRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G23840.1 HNH endonuclease (e value: 1.6e-88; % identity: 61.79)
Query:  MAQFTAHSRVKLLLNGDGVPF-EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRG-ESVSGDDAIVDLNYDYEFESDD------
        MA F+A  R+KLL + DG+ F  D      +S+       PL     S L   A   S+      R  +R  ++   +  I + N +++F+ DD      
Subjt:  MAQFTAHSRVKLLLNGDGVPF-EDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRG-ESVSGDDAIVDLNYDYEFESDD------

Query:  --LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI
          L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS E+LTI
Subjt:  --LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTI

Query:  DHVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        DHV+P+ RGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  DHVLPICRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
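In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a rough sketch, percent identity for one alignment row can be estimated from that line alone. The function name `percent_identity` is hypothetical, and the sample strings are a short fragment taken from the first GenBank hit, so the result describes only that fragment and not the whole hit.

```python
def percent_identity(query_row, midline_row):
    """Percent of aligned columns whose midline character is a letter (an identity).

    Both arguments are the sequence portions of one alignment row, with the
    leading 'Query:' label and its padding already removed.
    """
    # Trailing blanks are often stripped from the midline; pad it back to full width.
    midline_row = midline_row.ljust(len(query_row))
    identities = sum(ch.isalpha() for ch in midline_row)
    return 100.0 * identities / len(query_row)


# Fragment of the last row of the first GenBank hit (10 columns, 1 mismatch).
print(percent_identity("CRGGEWTWEN", " RGGEWTWEN"))
```

For a full hit, the counts from every row would need to be summed before dividing, since the reported value covers the entire alignment length including gap columns.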


Sequences
CDS sequence
ATGGCCCAATTCACCGCACACAGTCGGGTTAAGTTGCTGCTCAACGGAGACGGAGTGCCATTTGAAGATCGATTGAGTTACAAGCTCAGATCAGTGCGAACCCTTAAGCG
TAGGATCCCTTTATCAGGTGCCTCCTCCTCTGCACTTTCCCCTTCTGCATCCTCTGCTTCAGCTTTGAGGAAATCCGCTCTGCGTGTTAGTGTGAGGGGTGAGAGCGTTA
GCGGTGACGACGCCATTGTTGATCTTAACTACGATTACGAGTTTGAGAGTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTCAACGTT
GTTTGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTACGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGT
CTTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCAT
CACATGAGAGTTTGACCATTGACCATGTTTTGCCCATATGCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCAAAGAAAGGTCAG
AAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATTCCTCTAACGAGTACCGCAATAAAGATGTTGAAACT
GAGAAAGGGGACCCCTGAAGAATGGCGTCAATATCTGTCAAGTGAGCAATGA
mRNA sequence
ATGGCCCAATTCACCGCACACAGTCGGGTTAAGTTGCTGCTCAACGGAGACGGAGTGCCATTTGAAGATCGATTGAGTTACAAGCTCAGATCAGTGCGAACCCTTAAGCG
TAGGATCCCTTTATCAGGTGCCTCCTCCTCTGCACTTTCCCCTTCTGCATCCTCTGCTTCAGCTTTGAGGAAATCCGCTCTGCGTGTTAGTGTGAGGGGTGAGAGCGTTA
GCGGTGACGACGCCATTGTTGATCTTAACTACGATTACGAGTTTGAGAGTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTCAACGTT
GTTTGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTACGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGT
CTTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCAT
CACATGAGAGTTTGACCATTGACCATGTTTTGCCCATATGCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCAAAGAAAGGTCAG
AAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATTCCTCTAACGAGTACCGCAATAAAGATGTTGAAACT
GAGAAAGGGGACCCCTGAAGAATGGCGTCAATATCTGTCAAGTGAGCAATGA
Protein sequence
MAQFTAHSRVKLLLNGDGVPFEDRLSYKLRSVRTLKRRIPLSGASSSALSPSASSASALRKSALRVSVRGESVSGDDAIVDLNYDYEFESDDLACFRGLVLDISYRPVNV
VCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPICRGGEWTWENLVAACVKCNSKKGQ
KTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
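The protein sequence above should correspond to a translation of the CDS. A minimal sketch for checking that correspondence is shown here, assuming the standard nuclear genetic code applies (the page does not state which translation table was used). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed above.
cds_start = "ATGGCCCAATTCACCGCACACAGTCGGGTT"
print(translate(cds_start))
```

The output for this prefix is `MAQFTAHSRV`, which agrees with the first ten residues of the listed protein. Pasting the full CDS, line breaks included, into `translate` would allow the same check over the whole sequence.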