; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10005322 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10005322
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationChr07:1563839..1565767
RNA-Seq ExpressionHG10005322
SyntenyHG10005322
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031584.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]1.1e-12379.24Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREET
        MESNLDCFLHCTTP+        TEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITL+NGETLTQYYVPYLSAIQIFTG      FREET
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREET

Query:  ESGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQ
        ESGDGDTRDS SDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK             VPNLSYS+GKNYQ
Subjt:  ESGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQ

Query:  GFVHLLPHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP
        G VHL PHLS+SIIFISRYG G GGGGRNE+ ++E  +GRD G S+WGGDVQDA EP+GGGQ+ S  REA +AVERG VM KA EGPTP
Subjt:  GFVHLLPHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP

TYK07036.1 DUF789 domain-containing protein [Cucumis melo var. makuwa]5.7e-12580.57Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+        TEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITL+NGETLTQYYVPYLSAIQIFTG REETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQGFVHLL
        TRDS SDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK             VPNLSYS+GKNYQG VHL 
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQGFVHLL

Query:  PHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP
        PHLS+SIIFISRYG G GGGGRNE+ ++E  +GRD G S+WGGDVQDA EP+GGGQ+ S  REA +AVERG VM KA EGPTP
Subjt:  PHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP

XP_008455334.1 PREDICTED: uncharacterized protein LOC103495524 [Cucumis melo]1.3e-9293.64Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+VQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITL+NGETLTQYYVPYLSAIQIFTG REETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
        TRDS SDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK+  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

XP_011658743.1 uncharacterized protein LOC101214941 [Cucumis sativus]6.1e-9595.38Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+VQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
        TRDSYSDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK+  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

XP_038886925.1 uncharacterized protein LOC120077111 [Benincasa hispida]2.3e-9495.38Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+VQSQFLPKTEIRNLNRLWHPWEREKVEYFTL DLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFR ETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
        TRD YSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK+  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

TrEMBL top hitse value%identityAlignment
A0A0A0K709 Uncharacterized protein3.0e-9595.38Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+VQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
        TRDSYSDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK+  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

A0A1S3C1E3 uncharacterized protein LOC1034955246.2e-9393.64Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+VQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITL+NGETLTQYYVPYLSAIQIFTG REETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
        TRDS SDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK+  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

A0A5A7SKB4 DUF789 domain-containing protein5.2e-12479.24Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREET
        MESNLDCFLHCTTP+        TEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITL+NGETLTQYYVPYLSAIQIFTG      FREET
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREET

Query:  ESGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQ
        ESGDGDTRDS SDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK             VPNLSYS+GKNYQ
Subjt:  ESGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQ

Query:  GFVHLLPHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP
        G VHL PHLS+SIIFISRYG G GGGGRNE+ ++E  +GRD G S+WGGDVQDA EP+GGGQ+ S  REA +AVERG VM KA EGPTP
Subjt:  GFVHLLPHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP

A0A5D3C9P7 DUF789 domain-containing protein2.8e-12580.57Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        MESNLDCFLHCTTP+        TEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITL+NGETLTQYYVPYLSAIQIFTG REETESGDGD
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQGFVHLL
        TRDS SDCCSEESDSDKLWRW+G+GSTSSEDGGSEQEA LHLNDRLGYLYFQFFEKSTPYGRVPLLDK             VPNLSYS+GKNYQG VHL 
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDK-------------VPNLSYSHGKNYQGFVHLL

Query:  PHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP
        PHLS+SIIFISRYG G GGGGRNE+ ++E  +GRD G S+WGGDVQDA EP+GGGQ+ S  REA +AVERG VM KA EGPTP
Subjt:  PHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKGRDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP

A0A6J1GJG5 uncharacterized protein LOC111454841 isoform X11.0e-8789.02Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        M SNLDCFLHCTTP+VQSQFLPKTEIRNLNR+WHPWEREKVEYFTLSD+W C+DEWS YGAKVPITL+NGETLTQYYVPYLSAIQIFTGFR ETESGDG 
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
          DSYSDCCSEESDSDKLWRWDGTGS+SSEDGGSEQE+LLH NDRLGYLYFQFFEKSTPYGRVPLLDK+  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G03610.1 Protein of unknown function (DUF789)6.0e-5657.07Show/hide
Query:  ESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREETE
        +SNLD FLHC TP+V  Q LPKTEIR LNRLWHPWER+KVE+F LSDLW C+DEWSAYGA VPI + NGE+L QYYVPYLSAIQIFT        REE+E
Subjt:  ESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREETE

Query:  SGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQGFVHL
         G+ + RD +SD  S+ES S++               G E   LLH +DRLGYLY Q+FE+S PY RVPL+DK+  L+    + Y G + L
Subjt:  SGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQGFVHL

AT1G17830.1 Protein of unknown function (DUF789)5.1e-3143.58Show/hide
Query:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD
        + SNL+ FL   TP   S  L ++   +LN LW    ++++EYF LSDLW CFDE SAYG    + LNNGE++ QYYVPYLSAIQI+T           D
Subjt:  MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGD

Query:  TRDSYSDCCSEESDSDKLWRWDGTGSTS-----SEDGGSEQEALLHL-NDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS
          D  S+C S++S+ +KL R   +GS+      S+D G E +    L  D+LG + FQ+FE   P+ RVPL  KV  L+
Subjt:  TRDSYSDCCSEESDSDKLWRWDGTGSTS-----SEDGGSEQEALLHL-NDRLGYLYFQFFEKSTPYGRVPLLDKVPNLS

AT4G03420.1 Protein of unknown function (DUF789)2.3e-6061.17Show/hide
Query:  SNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG----FREETESGD
        SNLD FLHCTTP+V  Q L K EIR+LNR+WHPWER+KVE+F LSDLW C+DEWSAYGA VPI L+NGE+L QYYVPYLSAIQIFT      R   +S D
Subjt:  SNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG----FREETESGD

Query:  GDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQGFVHL
        G++RDS+SD  S+ES+SDKL       S  + D G E +ALLH NDRLGYLY Q+FE+S PY RVPL+DK+  L+    + Y G + L
Subjt:  GDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQGFVHL

AT4G28150.1 Protein of unknown function (DUF789)4.0e-5258.29Show/hide
Query:  ESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREETE
        ESNLD FL CTTP+V +  LPKT+I+NLN LW+P E + VEYF L D W CFDEWSAYGA VPI    GETL QYYVPYLSAIQIFT        REETE
Subjt:  ESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREETE

Query:  SGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQG
        SG     DS S+ CSEE      WRW+  G +SSE+G   QE L    DRLGY Y Q+FE+ TPY RVPL+DK+  L    G+ Y G
Subjt:  SGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQG

AT4G28150.2 Protein of unknown function (DUF789)1.1e-4957.75Show/hide
Query:  ESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREETE
        ESNLD FL CTTP+V +  LPK  I+NLN LW+P E + VEYF L D W CFDEWSAYGA VPI    GETL QYYVPYLSAIQIFT        REETE
Subjt:  ESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTG------FREETE

Query:  SGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQG
        SG     DS S+ CSEE      WRW+  G +SSE+G   QE L    DRLGY Y Q+FE+ TPY RVPL+DK+  L    G+ Y G
Subjt:  SGDGDTRDSYSDCCSEESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCCAATTTGGATTGTTTTCTCCATTGCACAACACCAATGGTTCAATCCCAATTCCTCCCAAAGACAGAAATCAGAAATCTCAATCGTCTATGGCATCCATGGGA
GAGAGAAAAAGTAGAATATTTCACTCTGAGTGATCTCTGGAAATGCTTCGATGAATGGAGTGCTTATGGAGCCAAAGTTCCAATCACTTTAAACAATGGAGAGACATTAA
CTCAATATTATGTCCCTTATCTCTCCGCCATACAAATTTTCACTGGTTTTAGGGAGGAAACAGAGTCTGGTGATGGAGATACAAGGGATTCGTACAGTGATTGTTGTAGT
GAAGAAAGCGATAGTGATAAACTATGGAGATGGGATGGAACTGGAAGTACTTCCTCTGAAGATGGAGGATCTGAACAAGAAGCACTTTTGCATCTAAATGATCGATTGGG
GTACCTTTACTTTCAGTTCTTTGAGAAATCAACTCCATATGGAAGAGTCCCTTTACTGGATAAGGTACCCAATTTATCATATTCCCATGGAAAGAACTATCAAGGATTTG
TCCACTTGCTTCCTCACTTATCACACTCTATCATCTTCATTTCAAGATATGGAGATGGACGAGGAGGAGGGGGGAGGAACGAAGAGGTGTCGGAGGAGGGAAGAAAGGGA
AGGGATATCGGTGTCAGCATTTGGGGTGGTGACGTACAAGATGCAGGGGAACCTATGGGTGGCGGGCAACAGCGGTCGGGACCAAGAGAGGCTGGTATCGCTGTCGAGCG
TGGCGGAGTCATGGCTAAAGCAGCTGAGGGTCCAACACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCCAATTTGGATTGTTTTCTCCATTGCACAACACCAATGGTTCAATCCCAATTCCTCCCAAAGACAGAAATCAGAAATCTCAATCGTCTATGGCATCCATGGGA
GAGAGAAAAAGTAGAATATTTCACTCTGAGTGATCTCTGGAAATGCTTCGATGAATGGAGTGCTTATGGAGCCAAAGTTCCAATCACTTTAAACAATGGAGAGACATTAA
CTCAATATTATGTCCCTTATCTCTCCGCCATACAAATTTTCACTGGTTTTAGGGAGGAAACAGAGTCTGGTGATGGAGATACAAGGGATTCGTACAGTGATTGTTGTAGT
GAAGAAAGCGATAGTGATAAACTATGGAGATGGGATGGAACTGGAAGTACTTCCTCTGAAGATGGAGGATCTGAACAAGAAGCACTTTTGCATCTAAATGATCGATTGGG
GTACCTTTACTTTCAGTTCTTTGAGAAATCAACTCCATATGGAAGAGTCCCTTTACTGGATAAGGTACCCAATTTATCATATTCCCATGGAAAGAACTATCAAGGATTTG
TCCACTTGCTTCCTCACTTATCACACTCTATCATCTTCATTTCAAGATATGGAGATGGACGAGGAGGAGGGGGGAGGAACGAAGAGGTGTCGGAGGAGGGAAGAAAGGGA
AGGGATATCGGTGTCAGCATTTGGGGTGGTGACGTACAAGATGCAGGGGAACCTATGGGTGGCGGGCAACAGCGGTCGGGACCAAGAGAGGCTGGTATCGCTGTCGAGCG
TGGCGGAGTCATGGCTAAAGCAGCTGAGGGTCCAACACCATGA
Protein sequenceShow/hide protein sequence
MESNLDCFLHCTTPMVQSQFLPKTEIRNLNRLWHPWEREKVEYFTLSDLWKCFDEWSAYGAKVPITLNNGETLTQYYVPYLSAIQIFTGFREETESGDGDTRDSYSDCCS
EESDSDKLWRWDGTGSTSSEDGGSEQEALLHLNDRLGYLYFQFFEKSTPYGRVPLLDKVPNLSYSHGKNYQGFVHLLPHLSHSIIFISRYGDGRGGGGRNEEVSEEGRKG
RDIGVSIWGGDVQDAGEPMGGGQQRSGPREAGIAVERGGVMAKAAEGPTP