; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G009090 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G009090
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein HESO1
Genome locationchr04:9152456..9158199
RNA-Seq ExpressionLsi04G009090
SyntenyLsi04G009090
Gene Ontology termsGO:0060964 - regulation of gene silencing by miRNA (biological process)
GO:0071076 - RNA 3' uridylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0050265 - RNA uridylyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050366.1 protein HESO1 isoform X1 [Cucumis melo var. makuwa]3.4e-10680.54Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICP+TGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAFRMTHLRLTSV+QNQS ILNDLARPQI Q I++ SGSASAPAFN+GNYPPIRPQVHQAR  QP PW++HQFQN++PRFNMGNFPPIN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ
         P AGT QS+PPVQHK PKTKRIVSSPN LNVGE   PSK Y+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ

XP_008461703.1 PREDICTED: protein HESO1 isoform X1 [Cucumis melo]3.4e-10680.54Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICP+TGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAFRMTHLRLTSV+QNQS ILNDLARPQI Q I++ SGSASAPAFN+GNYPPIRPQVHQAR  QP PW++HQFQN++PRFNMGNFPPIN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ
         P AGT QS+PPVQHK PKTKRIVSSPN LNVGE   PSK Y+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ

XP_011654960.1 protein HESO1 isoform X1 [Cucumis sativus]1.4e-10781.89Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA+TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICPYTGQWL+IESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQLMRISEAFRMTHLRLTSV+QN+S ILNDLARPQISQ II+ SGSASAPAFN+ NY PIRPQVHQAR  QP PW++HQFQNN+PRFNMGNFP IN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQ
         P AGT+QS P VQHKTPKTKRIVSSPN LNVGEPSKTY+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQ

XP_011654962.1 protein HESO1 isoform X2 [Cucumis sativus]1.4e-10781.89Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA+TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICPYTGQWL+IESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQLMRISEAFRMTHLRLTSV+QN+S ILNDLARPQISQ II+ SGSASAPAFN+ NY PIRPQVHQAR  QP PW++HQFQNN+PRFNMGNFP IN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQ
         P AGT+QS P VQHKTPKTKRIVSSPN LNVGEPSKTY+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQ

XP_038890315.1 protein HESO1 [Benincasa hispida]6.8e-10781.1Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVES+IA+TCATNIA FKSRT+NRSSLSELFVSFL KF+DISSKASELGICPYTGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAF MTHLRLTSVHQNQS ILNDLARPQISQFII+PSGS+SAPAFNIGN+PPIRPQVHQA  TQP PW++HQFQNNVPRFNMGNFP I+PQ
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRS
         P  GTTQSRPPV+HKTPKTKR VS+PN L  GE   PSK YNGQGQQKWRPRS
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRS

TrEMBL top hitse value%identityAlignment
A0A0A0KLX5 Uncharacterized protein6.7e-10881.89Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA+TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICPYTGQWL+IESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQLMRISEAFRMTHLRLTSV+QN+S ILNDLARPQISQ II+ SGSASAPAFN+ NY PIRPQVHQAR  QP PW++HQFQNN+PRFNMGNFP IN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQ
         P AGT+QS P VQHKTPKTKRIVSSPN LNVGEPSKTY+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQ

A0A1S3CFC1 protein HESO1 isoform X11.6e-10680.54Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICP+TGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAFRMTHLRLTSV+QNQS ILNDLARPQI Q I++ SGSASAPAFN+GNYPPIRPQVHQAR  QP PW++HQFQN++PRFNMGNFPPIN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ
         P AGT QS+PPVQHK PKTKRIVSSPN LNVGE   PSK Y+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ

A0A1S3CGN1 protein HESO1 isoform X21.6e-10680.54Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICP+TGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAFRMTHLRLTSV+QNQS ILNDLARPQI Q I++ SGSASAPAFN+GNYPPIRPQVHQAR  QP PW++HQFQN++PRFNMGNFPPIN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ
         P AGT QS+PPVQHK PKTKRIVSSPN LNVGE   PSK Y+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ

A0A5A7U854 Protein HESO1 isoform X11.6e-10680.54Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICP+TGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAFRMTHLRLTSV+QNQS ILNDLARPQI Q I++ SGSASAPAFN+GNYPPIRPQVHQAR  QP PW++HQFQN++PRFNMGNFPPIN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ
         P AGT QS+PPVQHK PKTKRIVSSPN LNVGE   PSK Y+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ

A0A5D3BZS4 Protein HESO1 isoform X21.6e-10680.54Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI
        GVRAEVE+EIA TCATNIA FKSRT NRSSLSELFVSFLAKF+DISSKASELGICP+TGQWLEIESNMRWLPKTYAIFV          F      ARAI
Subjt:  GVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAI

Query:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ
        NARQL RISEAFRMTHLRLTSV+QNQS ILNDLARPQI Q I++ SGSASAPAFN+GNYPPIRPQVHQAR  QP PW++HQFQN++PRFNMGNFPPIN Q
Subjt:  NARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQ

Query:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ
         P AGT QS+PPVQHK PKTKRIVSSPN LNVGE   PSK Y+GQGQQKWRPRSQRQ
Subjt:  DPQAGTTQSRPPVQHKTPKTKRIVSSPNNLNVGE---PSKTYNGQGQQKWRPRSQRQ

SwissProt top hitse value%identityAlignment
Q5XET5 Protein HESO14.7e-2634.29Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKS---RTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDA
        GVR   E  IAQ  A NIA FKS   ++VNRSSLSEL VSF AKF+DI+ KA E G+CP+TG+W  I SN  WLPKTY++FV          F      A
Subjt:  GVRAEVESEIAQTCATNIAMFKS---RTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDA

Query:  RAINARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQ----FIISPSGSASAPAFNIGN-YPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMG
        R+++ R L RI++ F++T  RL S   N++ I+  L    I +     I  PS   +    N+ N +   RPQ  Q +      W +     N P     
Subjt:  RAINARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQ----FIISPSGSASAPAFNIGN-YPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMG

Query:  NFPPINPQDPQAGTTQS-------RPPV-------------QHKTP--------KTKRIVSSPNNLNVGEPSKTYNG-----------------------
        ++PP+    PQ   TQ+       +PPV             Q K+P        K     SS N  ++G+PS   NG                       
Subjt:  NFPPINPQDPQAGTTQS-------RPPV-------------QHKTP--------KTKRIVSSPNNLNVGEPSKTYNG-----------------------

Query:  QGQQKWRPRSQR
        QG Q WRPR ++
Subjt:  QGQQKWRPRSQR

Arabidopsis top hitse value%identityAlignment
AT2G39740.1 Nucleotidyltransferase family protein3.3e-2734.29Show/hide
Query:  GVRAEVESEIAQTCATNIAMFKS---RTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDA
        GVR   E  IAQ  A NIA FKS   ++VNRSSLSEL VSF AKF+DI+ KA E G+CP+TG+W  I SN  WLPKTY++FV          F      A
Subjt:  GVRAEVESEIAQTCATNIAMFKS---RTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDA

Query:  RAINARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQ----FIISPSGSASAPAFNIGN-YPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMG
        R+++ R L RI++ F++T  RL S   N++ I+  L    I +     I  PS   +    N+ N +   RPQ  Q +      W +     N P     
Subjt:  RAINARQLMRISEAFRMTHLRLTSVHQNQSFILNDLARPQISQ----FIISPSGSASAPAFNIGN-YPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMG

Query:  NFPPINPQDPQAGTTQS-------RPPV-------------QHKTP--------KTKRIVSSPNNLNVGEPSKTYNG-----------------------
        ++PP+    PQ   TQ+       +PPV             Q K+P        K     SS N  ++G+PS   NG                       
Subjt:  NFPPINPQDPQAGTTQS-------RPPV-------------QHKTP--------KTKRIVSSPNNLNVGEPSKTYNG-----------------------

Query:  QGQQKWRPRSQR
        QG Q WRPR ++
Subjt:  QGQQKWRPRSQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGTGAGGGCTGAGGTCGAGAGCGAAATTGCACAGACATGTGCTACCAACATAGCCATGTTCAAATCGAGAACAGTCAACAGAAGTTCTTTGTCTGAACTTTTTGT
GTCATTCCTTGCAAAGTTTGCAGATATAAGTTCAAAAGCATCAGAACTAGGAATTTGTCCATACACAGGGCAATGGTTGGAAATAGAAAGCAATATGAGATGGTTGCCAA
AAACATATGCAATATTTGTAATCTCTCTTTCCTATCTTCATGTCATATCATTCTTGCCATATGCACAAGATGCCAGGGCTATTAACGCGAGGCAATTGATGAGGATTTCT
GAAGCATTTCGGATGACTCATTTGAGGCTCACCTCAGTTCATCAGAATCAAAGTTTTATCCTAAATGATTTAGCCCGACCTCAAATATCGCAATTTATCATTAGCCCATC
TGGATCTGCTAGTGCCCCAGCGTTCAATATAGGAAATTACCCCCCAATTCGTCCACAGGTTCACCAAGCCAGAAGTACGCAACCCTGTCCATGGGTTCGACATCAGTTCC
AGAACAATGTTCCCAGGTTCAATATGGGAAACTTCCCACCTATCAATCCACAGGATCCTCAAGCTGGAACTACGCAGTCTCGCCCACCGGTTCAACACAAAACGCCGAAA
ACAAAACGTATAGTAAGCAGTCCTAACAATTTGAACGTGGGGGAGCCCTCAAAGACTTATAATGGTCAAGGCCAGCAAAAGTGGAGACCAAGATCTCAGAGACAGGAAAT
TGTACTCAACGCAGCACATCCTGCAATTTTCACGCACGCAAGAGGCGTTTTTCGCCAATCAGTATCCTATAGCTCGCATGGGGTCCTTCGGATGTATCACGCAGGATTCT
GCCGTGAAGAGCTCATTTCTAGAAATATAGGCATCAAATTGAATGCTCTGGAGCCTATCCGGAGGAGTATGCAAGCGAGCAGGCCATTTCGTCACGTTTCTGCCAAAACC
ATTTTCTGGCAGCCGAGTAATGCAAGGCTTCAGCTCAACATACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGTGAGGGCTGAGGTCGAGAGCGAAATTGCACAGACATGTGCTACCAACATAGCCATGTTCAAATCGAGAACAGTCAACAGAAGTTCTTTGTCTGAACTTTTTGT
GTCATTCCTTGCAAAGTTTGCAGATATAAGTTCAAAAGCATCAGAACTAGGAATTTGTCCATACACAGGGCAATGGTTGGAAATAGAAAGCAATATGAGATGGTTGCCAA
AAACATATGCAATATTTGTAATCTCTCTTTCCTATCTTCATGTCATATCATTCTTGCCATATGCACAAGATGCCAGGGCTATTAACGCGAGGCAATTGATGAGGATTTCT
GAAGCATTTCGGATGACTCATTTGAGGCTCACCTCAGTTCATCAGAATCAAAGTTTTATCCTAAATGATTTAGCCCGACCTCAAATATCGCAATTTATCATTAGCCCATC
TGGATCTGCTAGTGCCCCAGCGTTCAATATAGGAAATTACCCCCCAATTCGTCCACAGGTTCACCAAGCCAGAAGTACGCAACCCTGTCCATGGGTTCGACATCAGTTCC
AGAACAATGTTCCCAGGTTCAATATGGGAAACTTCCCACCTATCAATCCACAGGATCCTCAAGCTGGAACTACGCAGTCTCGCCCACCGGTTCAACACAAAACGCCGAAA
ACAAAACGTATAGTAAGCAGTCCTAACAATTTGAACGTGGGGGAGCCCTCAAAGACTTATAATGGTCAAGGCCAGCAAAAGTGGAGACCAAGATCTCAGAGACAGGAAAT
TGTACTCAACGCAGCACATCCTGCAATTTTCACGCACGCAAGAGGCGTTTTTCGCCAATCAGTATCCTATAGCTCGCATGGGGTCCTTCGGATGTATCACGCAGGATTCT
GCCGTGAAGAGCTCATTTCTAGAAATATAGGCATCAAATTGAATGCTCTGGAGCCTATCCGGAGGAGTATGCAAGCGAGCAGGCCATTTCGTCACGTTTCTGCCAAAACC
ATTTTCTGGCAGCCGAGTAATGCAAGGCTTCAGCTCAACATACCTTAA
Protein sequenceShow/hide protein sequence
MGVRAEVESEIAQTCATNIAMFKSRTVNRSSLSELFVSFLAKFADISSKASELGICPYTGQWLEIESNMRWLPKTYAIFVISLSYLHVISFLPYAQDARAINARQLMRIS
EAFRMTHLRLTSVHQNQSFILNDLARPQISQFIISPSGSASAPAFNIGNYPPIRPQVHQARSTQPCPWVRHQFQNNVPRFNMGNFPPINPQDPQAGTTQSRPPVQHKTPK
TKRIVSSPNNLNVGEPSKTYNGQGQQKWRPRSQRQEIVLNAAHPAIFTHARGVFRQSVSYSSHGVLRMYHAGFCREELISRNIGIKLNALEPIRRSMQASRPFRHVSAKT
IFWQPSNARLQLNIP