; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018104 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018104
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHNHc domain-containing protein
Genome locationtig00153092:1574149..1598189
RNA-Seq ExpressionSgr018104
SyntenySgr018104
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR003615 - HNH nuclease
IPR029471 - HNH endonuclease 5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597719.1 hypothetical protein SDJN03_10899, partial [Cucurbita argyrosperma subsp. sororia]5.4e-12385.41Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD
        A  TA S    L NG      SEPKDR R +LRS+RTLKRR PLSGAS+ GLSPS+SSASA RKSAQ     RVG+R ESVS DDAI+D+  DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

XP_008465340.1 PREDICTED: uncharacterized protein LOC103502982 [Cucumis melo]3.6e-11983.63Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVS-DDDAIV--DLDYELESDD
        A  TA S    L NG      SE KDR RY+LRSVR   RR PLS  S+    S SS SA RKS QH AE+RVG+RDESV+  DDAIV  D DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVS-DDDAIV--DLDYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

XP_022152374.1 uncharacterized protein LOC111020122 [Momordica charantia]1.2e-12787.86Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIV--DLDYELESDDL
        A  T  +    L NG      SE KDRLRY+LRSV    RRIPLS AS G+SPSASSASA RKSAQHVAEMRVG+RDESVSDD AIV  D DYE ESDDL
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIV--DLDYELESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLSNEQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

XP_022932702.1 uncharacterized protein LOC111439170 [Cucurbita moschata]1.1e-12385.77Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD
        A  TA S    L NG      SEPKDR R++LRSVRTLKRR PLSGAS+ GLSPS+SSASA RKSAQ     RVG+R ESVS DDAI+D+  DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

XP_038905692.1 uncharacterized protein LOC120091663 [Benincasa hispida]2.3e-12686.79Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDDL
        A  TA S    L NG      SEPKDR RY+LR VRTLKRRIPLSG     SPS SSASA RKSAQH AE+RVG+R ESVS DDAIVDL  DYE E+DDL
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        LPISRGGEWTWENLVAACV+CNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

TrEMBL top hitse value%identityAlignment
A0A1S3CNK0 uncharacterized protein LOC1035029821.7e-11983.63Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVS-DDDAIV--DLDYELESDD
        A  TA S    L NG      SE KDR RY+LRSVR   RR PLS  S+    S SS SA RKS QH AE+RVG+RDESV+  DDAIV  D DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVS-DDDAIV--DLDYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

A0A5D3E2H2 HNH endonuclease1.7e-11983.63Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVS-DDDAIV--DLDYELESDD
        A  TA S    L NG      SE KDR RY+LRSVR   RR PLS  S+    S SS SA RKS QH AE+RVG+RDESV+  DDAIV  D DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVS-DDDAIV--DLDYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

A0A6J1DFU6 uncharacterized protein LOC1110201226.0e-12887.86Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIV--DLDYELESDDL
        A  T  +    L NG      SE KDRLRY+LRSV    RRIPLS AS G+SPSASSASA RKSAQHVAEMRVG+RDESVSDD AIV  D DYE ESDDL
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIV--DLDYELESDDL

Query:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
        ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTV+SPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV
Subjt:  ACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHV

Query:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLSNEQ
Subjt:  LPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

A0A6J1F2H9 uncharacterized protein LOC1114391705.2e-12485.77Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD
        A  TA S    L NG      SEPKDR R++LRSVRTLKRR PLSGAS+ GLSPS+SSASA RKSAQ     RVG+R ESVS DDAI+D+  DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

A0A6J1IAG4 uncharacterized protein LOC1114707255.2e-12485.77Show/hide
Query:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD
        A  TA S    L NG      SEPKDR R++LRSVRTLKRR PLSGAS+ GLSPS+SSASA RKSAQ     RVG+R ESVS DDAI+D+  DYE ESDD
Subjt:  ASMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASA-GLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDL--DYELESDD

Query:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH
        LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIK SLSRKNILYRDNYTCQYCSSH+SLTIDH
Subjt:  LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDH

Query:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ
        VLP+SRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTP EWRQYLS+EQ
Subjt:  VLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23840.1 HNH endonuclease1.3e-8780.75Show/hide
Query:  DYELESDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCS
        D  LE+DD L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQTVSSP+GSFYIPAVLRVPHLLQVVKRRR+KNSLSRKNIL RD+YTCQYCS
Subjt:  DYELESDD-LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCS

Query:  SHDSLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLS
        S ++LTIDHV+P+SRGGEWTW+NLVAAC +CNS+KGQKT +EA+MKL K PK PKDYDI+AIPLT+ AI+ML+  KG P EWRQYL+
Subjt:  SHDSLTIDHVLPISRGGEWTWENLVAACVKCNSKKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGATGCAGCAGAATCTAAAGCTTTACCATCGCCTAAGGAAACAGTGACAAGGAAGGATAAAAAAAGGGACAGGGAGACGAGTTCCTCGAAAAAAAAAACCACAAA
AGGAAAGAAGCAAAAGGGTAAAAAGAAGTCATCAAAAGGAGGTGACGCGAAGAAAGGGGATAGGAGCAAGGAAGAACACAAATCTCCAACAAAATCTAATGATGGCAAGA
TGAAGAAAGATCAAACAAACAAATTACCGATGATGGAGGACAAAATTCATAAAAGAATTCAAGAACTGATACAGATTATTCAGGTCAACACCACTCTTCAAGTTATGTTC
AGAGTAAAATTTCCAAAAGTCGCCGGTCTGGCTGTGAGTACTGCTGACAAGTCTGGCTGCCAGTTTGTCAAATTCCCAAAAATTGCGTGCATTGCACGAGATTTTCATGG
CCCCAAAACTGGGGCCTTCCCCTTTTTTCTCATTCTTCCAACACAGACCGTTTCTCTCTGTCACTCTCTCATCTCTCTGCTGCTGCTCTCACTCCCTCAGTCGCAAATTG
TCACTCTTCCAGCGCTGACTTCTCTGTTCGTCACTCTTCCCTGCCCGCTCCTCTCTTCTCGTTGCTGTAGAGATCATGATTGGAGACATCAGCATGCTACATATATATCT
CTATGGCGTCAACGTCGTAGTCGTTGTGCATCTGGTGGAATAATAAATGGGCCATCTGTGTCAGATGATTACGTATCTTGGTATAATGCAATTGCAAGAAAATATATCAC
ACAAGAAGGGGCTTATTACGACCATGAGTTCATTCATAACACTAGCAGACCCGATGTTCCTAACTTTGATAGGAGACGCGCTCATTGTAGACGCCCACAACCACCAGTTG
AAGATATTTCAGACCCAGATCTTCCTGTTGCTGAAGATCCGATTTCGATGACTCCCATGCACTCACCTTTTCTAATGATATCGACACTGGGAGTTCAAATTGACTTTGCA
TCAATGACTGCTCGTAGCAGCCAATTGCAGCTGAGAAATGGCCCAGTTCACCACACACAGTCGGAACCCAAAGATCGATTAAGATACCAGCTCAGATCAGTTCGAACCCT
CAAACGCAGAATCCCCTTATCTGGTGCCTCCGCTGGACTTTCCCCTTCTGCCTCCTCTGCTTCAGCTTCAAGGAAATCCGCTCAACATGTTGCGGAGATGCGTGTTGGTT
TGAGGGATGAGAGCGTTAGCGATGACGACGCCATTGTTGATCTTGACTACGAGCTCGAGAGTGATGATTTGGCTTGTTTCAGAGGTCTGGTCTTGGATATTTCCTACAGG
CCAGTCAATGTCGTTTGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTATGACCAGACTGTGAGTTCTCCAAGTGGATCATTCTA
TATACCAGCAGTATTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAATCAAGAACTCTTTAAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTC
AGTATTGTTCATCGCATGATAGTTTGACCATTGACCATGTTTTGCCTATATCCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCA
AAGAAAGGTCAAAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATCCCTCTAACAAGCACCGCAATAAA
AATGTTGAAACTGAGAAAGGGGACCCCTGCAGAATGGCGTCAATATCTGTCAAATGAGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGATAGATGCAGCAGAATCTAAAGCTTTACCATCGCCTAAGGAAACAGTGACAAGGAAGGATAAAAAAAGGGACAGGGAGACGAGTTCCTCGAAAAAAAAAACCACAAA
AGGAAAGAAGCAAAAGGGTAAAAAGAAGTCATCAAAAGGAGGTGACGCGAAGAAAGGGGATAGGAGCAAGGAAGAACACAAATCTCCAACAAAATCTAATGATGGCAAGA
TGAAGAAAGATCAAACAAACAAATTACCGATGATGGAGGACAAAATTCATAAAAGAATTCAAGAACTGATACAGATTATTCAGGTCAACACCACTCTTCAAGTTATGTTC
AGAGTAAAATTTCCAAAAGTCGCCGGTCTGGCTGTGAGTACTGCTGACAAGTCTGGCTGCCAGTTTGTCAAATTCCCAAAAATTGCGTGCATTGCACGAGATTTTCATGG
CCCCAAAACTGGGGCCTTCCCCTTTTTTCTCATTCTTCCAACACAGACCGTTTCTCTCTGTCACTCTCTCATCTCTCTGCTGCTGCTCTCACTCCCTCAGTCGCAAATTG
TCACTCTTCCAGCGCTGACTTCTCTGTTCGTCACTCTTCCCTGCCCGCTCCTCTCTTCTCGTTGCTGTAGAGATCATGATTGGAGACATCAGCATGCTACATATATATCT
CTATGGCGTCAACGTCGTAGTCGTTGTGCATCTGGTGGAATAATAAATGGGCCATCTGTGTCAGATGATTACGTATCTTGGTATAATGCAATTGCAAGAAAATATATCAC
ACAAGAAGGGGCTTATTACGACCATGAGTTCATTCATAACACTAGCAGACCCGATGTTCCTAACTTTGATAGGAGACGCGCTCATTGTAGACGCCCACAACCACCAGTTG
AAGATATTTCAGACCCAGATCTTCCTGTTGCTGAAGATCCGATTTCGATGACTCCCATGCACTCACCTTTTCTAATGATATCGACACTGGGAGTTCAAATTGACTTTGCA
TCAATGACTGCTCGTAGCAGCCAATTGCAGCTGAGAAATGGCCCAGTTCACCACACACAGTCGGAACCCAAAGATCGATTAAGATACCAGCTCAGATCAGTTCGAACCCT
CAAACGCAGAATCCCCTTATCTGGTGCCTCCGCTGGACTTTCCCCTTCTGCCTCCTCTGCTTCAGCTTCAAGGAAATCCGCTCAACATGTTGCGGAGATGCGTGTTGGTT
TGAGGGATGAGAGCGTTAGCGATGACGACGCCATTGTTGATCTTGACTACGAGCTCGAGAGTGATGATTTGGCTTGTTTCAGAGGTCTGGTCTTGGATATTTCCTACAGG
CCAGTCAATGTCGTTTGTTGGAAGCGTGCAATTTGTTTGGAGTTCATGGAGAAGGCTGATGTATTGGAATACTATGACCAGACTGTGAGTTCTCCAAGTGGATCATTCTA
TATACCAGCAGTATTAAGGGTTCCCCATTTATTGCAAGTTGTAAAGAGAAGAAGAATCAAGAACTCTTTAAGTCGTAAAAACATACTTTATCGGGACAATTACACTTGTC
AGTATTGTTCATCGCATGATAGTTTGACCATTGACCATGTTTTGCCTATATCCCGGGGTGGAGAATGGACATGGGAAAATCTGGTTGCTGCCTGTGTAAAATGCAATTCA
AAGAAAGGTCAAAAAACTGTAGAAGAAGCAAATATGAAGCTGAAAAAGACTCCCAAGGCCCCAAAAGATTATGATATACTTGCCATCCCTCTAACAAGCACCGCAATAAA
AATGTTGAAACTGAGAAAGGGGACCCCTGCAGAATGGCGTCAATATCTGTCAAATGAGCAATGA
Protein sequenceShow/hide protein sequence
MIDAAESKALPSPKETVTRKDKKRDRETSSSKKKTTKGKKQKGKKKSSKGGDAKKGDRSKEEHKSPTKSNDGKMKKDQTNKLPMMEDKIHKRIQELIQIIQVNTTLQVMF
RVKFPKVAGLAVSTADKSGCQFVKFPKIACIARDFHGPKTGAFPFFLILPTQTVSLCHSLISLLLLSLPQSQIVTLPALTSLFVTLPCPLLSSRCCRDHDWRHQHATYIS
LWRQRRSRCASGGIINGPSVSDDYVSWYNAIARKYITQEGAYYDHEFIHNTSRPDVPNFDRRRAHCRRPQPPVEDISDPDLPVAEDPISMTPMHSPFLMISTLGVQIDFA
SMTARSSQLQLRNGPVHHTQSEPKDRLRYQLRSVRTLKRRIPLSGASAGLSPSASSASASRKSAQHVAEMRVGLRDESVSDDDAIVDLDYELESDDLACFRGLVLDISYR
PVNVVCWKRAICLEFMEKADVLEYYDQTVSSPSGSFYIPAVLRVPHLLQVVKRRRIKNSLSRKNILYRDNYTCQYCSSHDSLTIDHVLPISRGGEWTWENLVAACVKCNS
KKGQKTVEEANMKLKKTPKAPKDYDILAIPLTSTAIKMLKLRKGTPAEWRQYLSNEQ