; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G012840 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G012840
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHNHc domain-containing protein
Genome locationchr03:23670584..23676103
RNA-Seq ExpressionLsi03G012840
SyntenyLsi03G012840
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia]2.2e-14090.78Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R KLRS+RTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_004141149.1 uncharacterized protein LOC101207660 [Cucumis sativus]2.2e-14092.83Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA
        MAQFTAHTR+KLLLNGDG+PFGSE KDRFRYKLRSVR   RR PLS PSSS TSSTSALRK TQHAAEVRVGVR ESV+ GDD VV  DYDYE E+DDLA
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL

Query:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        PISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]4.8e-14393.53Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC
        MAQFTAH+RVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PSSSTSSTSALRKSTQHAAEVRVGVR ESV+ GDDA+V  DYDYEFE+DDLAC
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]4.4e-14191.13Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]3.3e-15297.83Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETDDLACF
        MAQFTAH+RVKLLLNGDGVPFGSEPKDRFRYKLR VRTLKRRIPLSGPS STSS SALRKS QHAAEVRVGVRGESVSGDDA+VDLDYDYEFETDDLACF
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETDDLACF

Query:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
        RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI
Subjt:  RGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPI

Query:  SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  SRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

TrEMBL top hitse value%identityAlignment
A0A0A0LG99 HNHc domain-containing protein1.1e-14092.83Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA
        MAQFTAHTR+KLLLNGDG+PFGSE KDRFRYKLRSVR   RR PLS PSSS TSSTSALRK TQHAAEVRVGVR ESV+ GDD VV  DYDYE E+DDLA
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSS-TSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLA

Query:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
        CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL
Subjt:  CFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVL

Query:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        PISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  PISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A1S3CNK0 uncharacterized protein LOC1035029822.3e-14393.53Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC
        MAQFTAH+RVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PSSSTSSTSALRKSTQHAAEVRVGVR ESV+ GDDA+V  DYDYEFE+DDLAC
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A5D3E2H2 HNH endonuclease2.3e-14393.53Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC
        MAQFTAH+RVKLLLNGDG+P GSE KDRFRYKLRSVR   RR PLS PSSSTSSTSALRKSTQHAAEVRVGVR ESV+ GDDA+V  DYDYEFE+DDLAC
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVS-GDDAVVDLDYDYEFETDDLAC

Query:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
        FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP
Subjt:  FRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLP

Query:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        ISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  ISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1F2H9 uncharacterized protein LOC1114391702.2e-14191.13Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

A0A6J1IAG4 uncharacterized protein LOC1114707252.2e-14191.13Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD
        MAQFTAH+RVKLLLNGDGVPFGSEPKDR R+KLRSVRTLKRR PLSG SS     S+SS SALRKS Q     RVGVRGESVSGDDA++D+DYDYEFE+D
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSS-----STSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETD

Query:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID
        DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSHESLTID
Subjt:  DLACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTID

Query:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
        HVLP+SRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ
Subjt:  HVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23840.1 HNH endonuclease4.8e-9362.9Show/hide
Query:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRS-------VRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYE-F
        MA F+A  R+KLL + DG+ FG + +D+FR  L         V     R+       S+ S    RK  +         +   +  D+   D D D +  
Subjt:  MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRS-------VRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYE-F

Query:  ETDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES
        ETDD L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCSS E+
Subjt:  ETDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHES

Query:  LTIDHVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS
        LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG PEEWRQYL+
Subjt:  LTIDHVLPISRGGEWTWENLVAACVQCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAATTCACTGCACACACTCGGGTTAAGTTGCTGCTCAACGGTGACGGAGTTCCATTCGGTTCAGAACCGAAAGATCGATTCAGATACAAGCTCAGATCAGTACG
AACCCTCAAACGCAGGATTCCCCTGTCTGGTCCCTCCTCTTCTACATCTTCTACTTCAGCTTTGAGGAAATCCACCCAGCATGCTGCGGAGGTGCGTGTTGGTGTGAGGG
GTGAGAGCGTTAGCGGTGACGATGCCGTTGTTGATCTTGACTATGACTACGAATTTGAGACTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGG
CCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTTTTGGAATACTATGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTA
TATACCAGCAGTCTTACGGGTTCCCCATTTATTGCAAGTTGTTAAGAGGAGGAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTC
AGTATTGTTCATCACATGAGAGTTTGACCATTGACCATGTTCTGCCCATATCCCGGGGTGGAGAATGGACATGGGAAAACCTGGTTGCTGCCTGTGTACAATGCAATTCA
AAGAAAGGCCAAAAAACAGTAGAAGAAGCAAATATGAAGCTGAAGAAAACTCCTAAGGCCCCTAAAGATTATGATATACTTGCCATTCCTCTAACAAGTACCGCAATAAA
GATGTTGAAACTGAGAAAGGGGACTCCTGAAGAATGGCGTCAATATCTATCAAGTGAGCAATGA
mRNA sequenceShow/hide mRNA sequence
TTGGAAACCGAAGATATGAAGTGAAATGTGCGGGAAAAATATGTGGTGATGGATTTTTCCGTGCGCATGTAATGTTCTTTTCGTCTTCTCCTCAACTACTGGGCTTCGAT
TTCTTCATCCGATGAGCACCAATCTCGATTTCCATTTCCATTAACAGCTTCCAGGAGCCAATTTCAGTTCACAAATGGCCCAATTCACTGCACACACTCGGGTTAAGTTG
CTGCTCAACGGTGACGGAGTTCCATTCGGTTCAGAACCGAAAGATCGATTCAGATACAAGCTCAGATCAGTACGAACCCTCAAACGCAGGATTCCCCTGTCTGGTCCCTC
CTCTTCTACATCTTCTACTTCAGCTTTGAGGAAATCCACCCAGCATGCTGCGGAGGTGCGTGTTGGTGTGAGGGGTGAGAGCGTTAGCGGTGACGATGCCGTTGTTGATC
TTGACTATGACTACGAATTTGAGACTGACGATTTGGCTTGCTTCAGAGGTCTCGTCTTGGATATTTCCTACAGGCCAGTTAACGTTGTTTGTTGGAAACGTGCAATTTGT
TTGGAGTTCATGGAGAAGGCTGATGTTTTGGAATACTATGACCAGACTGTGAGTTCTCCAAGTGGATCCTTCTATATACCAGCAGTCTTACGGGTTCCCCATTTATTGCA
AGTTGTTAAGAGGAGGAGAATCAAGAACTCTTTGAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTCAGTATTGTTCATCACATGAGAGTTTGACCATTGACC
ATGTTCTGCCCATATCCCGGGGTGGAGAATGGACATGGGAAAACCTGGTTGCTGCCTGTGTACAATGCAATTCAAAGAAAGGCCAAAAAACAGTAGAAGAAGCAAATATG
AAGCTGAAGAAAACTCCTAAGGCCCCTAAAGATTATGATATACTTGCCATTCCTCTAACAAGTACCGCAATAAAGATGTTGAAACTGAGAAAGGGGACTCCTGAAGAATG
GCGTCAATATCTATCAAGTGAGCAATGACCGTGTATTTAAGTGGTACTTTTAAATTCTTCTTTGCACATATTGCACAAATTCCATGTAATACTTGGTTCTTCTAAGACTT
CTATGGCACCTTCTTAAATAGCTTAATCTATCATTAGTTCCCATAACCATTTTGCATTTCTTTTTGCTGAAAATATTGCACTTCTTCTCTTTGTGTTCTACCAATCACTG
CATTGTCAGGACCTTTAAACTTCCATTTATAGTATTACATCAAATAGTTCATATCATTTTAACCTTATTATCCCTAGGAGCTACAAGTGTCAATACAATGTTTTGTGCCT
TCCAGAAACTGAAAGTGAACATTATTTTGCCTGTCAAAATACTCTTTATT
Protein sequenceShow/hide protein sequence
MAQFTAHTRVKLLLNGDGVPFGSEPKDRFRYKLRSVRTLKRRIPLSGPSSSTSSTSALRKSTQHAAEVRVGVRGESVSGDDAVVDLDYDYEFETDDLACFRGLVLDISYR
PVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHESLTIDHVLPISRGGEWTWENLVAACVQCNS
KKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPEEWRQYLSSEQ