; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007695 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007695
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionzinc finger CCCH domain-containing protein 16
Genome locationChr10:10282279..10293477
RNA-Seq ExpressionHG10007695
SyntenyHG10007695
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR045072 - MKRN-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588269.1 Zinc finger CCCH domain-containing protein 16, partial [Cucurbita argyrosperma subsp. sororia]3.1e-7479.5Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEFE LLHKPYVTP N A GN+SS SG+NSLSI PS+QN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV FSS  +NSSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
         TNFPST+A AV G FGSQ+PSQT+G+STLSGFNNSGITNA SN LSSPA+ T FP+TDA    GGQI  N QLVNKLQQENSSVDVSIW+KEKWVPGE+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

XP_008443667.1 PREDICTED: zinc finger CCCH domain-containing protein 16 isoform X1 [Cucumis melo]2.1e-7580.4Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFGM
        VERERNLLNSKLAEFEGLLHKPYVTPSN APGN+SSFSG+NS SILPSAQNNTPSLSSFSQLGASLN GFGARPSN PN +FGQ   FSSP+QNSS FGM
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFGM

Query:  TNFPSTSAVAVRGAFGSQLPSQTFGD-STLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
        TNFPSTS VAV GA G    SQTFG+ ST SGFNN+GI NA SN+ S A LTN P  +ANSS  GQIAPN QLVNKLQQENSSVDV IWMKEKWVPGE+
Subjt:  TNFPSTSAVAVRGAFGSQLPSQTFGD-STLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

XP_022144416.1 zinc finger CCCH domain-containing protein 16 [Momordica charantia]4.0e-7476.08Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNT-PSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEF+ LLHKPYVTP NH PGN+SSF G+NS S LPSAQN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV  SS ++ SSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNT-PSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNL-SSPAMLTNFPTTDANS---------SVGGQIAPNGQLVNKLQQENSSVDVSIWM
        + NFPSTSAVAV GA GSQLPSQTFG+STLSGFNNSG+ N  SNL SSPA+LTNFP TDA+S         S G QIA N QLV+ LQQENSSVDVSIW+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNL-SSPAMLTNFPTTDANS---------SVGGQIAPNGQLVNKLQQENSSVDVSIWM

Query:  KEKWVPGEV
        KEKWVPGE+
Subjt:  KEKWVPGEV

XP_023005674.1 zinc finger CCCH domain-containing protein 16 isoform X1 [Cucurbita maxima]3.1e-7479.5Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEFE LLHKPYVTP N A GN+SS SG+NSLSI PS+QN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV FSS  +NSSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
         TNFPST+A AV G FGSQ+PSQTFG+ST+SGFNNSGITNA SN LSSPA+ T FP+TDA    GGQI  N QLVNKLQQENSSVDVSIW+KEKWVPGE+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

XP_023005678.1 zinc finger CCCH domain-containing protein 16 isoform X2 [Cucurbita maxima]3.1e-7479.5Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEFE LLHKPYVTP N A GN+SS SG+NSLSI PS+QN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV FSS  +NSSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
         TNFPST+A AV G FGSQ+PSQTFG+ST+SGFNNSGITNA SN LSSPA+ T FP+TDA    GGQI  N QLVNKLQQENSSVDVSIW+KEKWVPGE+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

TrEMBL top hitse value%identityAlignment
A0A1S3B837 zinc finger CCCH domain-containing protein 16 isoform X25.6e-7478.89Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFGM
        VERERNLLNSKLAEFEGLLHKPYVTPSN APGN+SSFSG+NS SILPSAQNNTPSLSSFSQLGASLN GFGARPSN PN +FGQ   FSSP+QNSS FGM
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFGM

Query:  TNFPSTSAVAVRGAFGSQLPSQTFGD-STLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
        TNFPSTS VAV       + SQTFG+ ST SGFNN+GI NA SN+ S A LTN P  +ANSS  GQIAPN QLVNKLQQENSSVDV IWMKEKWVPGE+
Subjt:  TNFPSTSAVAVRGAFGSQLPSQTFGD-STLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

A0A1S3B9C9 zinc finger CCCH domain-containing protein 16 isoform X11.0e-7580.4Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFGM
        VERERNLLNSKLAEFEGLLHKPYVTPSN APGN+SSFSG+NS SILPSAQNNTPSLSSFSQLGASLN GFGARPSN PN +FGQ   FSSP+QNSS FGM
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFGM

Query:  TNFPSTSAVAVRGAFGSQLPSQTFGD-STLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
        TNFPSTS VAV GA G    SQTFG+ ST SGFNN+GI NA SN+ S A LTN P  +ANSS  GQIAPN QLVNKLQQENSSVDV IWMKEKWVPGE+
Subjt:  TNFPSTSAVAVRGAFGSQLPSQTFGD-STLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

A0A6J1CTC5 zinc finger CCCH domain-containing protein 161.9e-7476.08Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNT-PSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEF+ LLHKPYVTP NH PGN+SSF G+NS S LPSAQN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV  SS ++ SSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNT-PSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNL-SSPAMLTNFPTTDANS---------SVGGQIAPNGQLVNKLQQENSSVDVSIWM
        + NFPSTSAVAV GA GSQLPSQTFG+STLSGFNNSG+ N  SNL SSPA+LTNFP TDA+S         S G QIA N QLV+ LQQENSSVDVSIW+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNL-SSPAMLTNFPTTDANS---------SVGGQIAPNGQLVNKLQQENSSVDVSIWM

Query:  KEKWVPGEV
        KEKWVPGE+
Subjt:  KEKWVPGEV

A0A6J1KZY0 zinc finger CCCH domain-containing protein 16 isoform X11.5e-7479.5Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEFE LLHKPYVTP N A GN+SS SG+NSLSI PS+QN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV FSS  +NSSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
         TNFPST+A AV G FGSQ+PSQTFG+ST+SGFNNSGITNA SN LSSPA+ T FP+TDA    GGQI  N QLVNKLQQENSSVDVSIW+KEKWVPGE+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

A0A6J1L2U5 zinc finger CCCH domain-containing protein 16 isoform X21.5e-7479.5Show/hide
Query:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG
        VERERNLLNSKLAEFE LLHKPYVTP N A GN+SS SG+NSLSI PS+QN+  PSLSSFSQLGASLN GFGARPSNPPNN FGQPV FSS  +NSSAFG
Subjt:  VERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNN-TPSLSSFSQLGASLNMGFGARPSNPPNNLFGQPVPFSSPMQNSSAFG

Query:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV
         TNFPST+A AV G FGSQ+PSQTFG+ST+SGFNNSGITNA SN LSSPA+ T FP+TDA    GGQI  N QLVNKLQQENSSVDVSIW+KEKWVPGE+
Subjt:  MTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESN-LSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKEKWVPGEV

SwissProt top hitse value%identityAlignment
Q5Z807 Zinc finger CCCH domain-containing protein 461.5e-1033.49Show/hide
Query:  VERERNLLNSKLAEFEGLLH--KPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFG---ARPSNPPNNLFGQP--VPFSSPMQ
        VE ERNL N+KL EF  LL+  +P  TPS   P   S     N+ S   S  N  P  SSFSQ+GA+ N+G G     P  P ++ FG P   P ++P  
Subjt:  VERERNLLNSKLAEFEGLLH--KPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFG---ARPSNPPNNLFGQP--VPFSSPMQ

Query:  NSSAFGMTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNLSSPAMLTNFPTTDANSS--VGGQIAPNGQLVNKL--QQENSSVDVSIWM
         SS              V   FG+Q   Q FG      F +S          SPA        D  S   + G + P   +  +     +N + D SIW+
Subjt:  NSSAFGMTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNLSSPAMLTNFPTTDANSS--VGGQIAPNGQLVNKL--QQENSSVDVSIWM

Query:  KEKWVPGEV
        KEKW  GE+
Subjt:  KEKWVPGEV

Q9FWS3 Zinc finger CCCH domain-containing protein 161.0e-0829.85Show/hide
Query:  PKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPS--LSSFSQLGASLNMGFGARPSNPPNNLF-----------
        P    VERERNL NSK+AEFE  L  PY         N+S F+ +       S+Q N+PS   S F+Q  A  N   G   S+ P N F           
Subjt:  PKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPS--LSSFSQLGASLNMGFGARPSNPPNNLF-----------

Query:  ---------GQPVPFSSPMQNS----SAFGMTNFPSTSAVAVRGAFGS-QLPSQTFGDST-------LSGFNNSGITNAESNLSSPAMLTNFPTTDANSS
                 G P PF+S  Q S    +AF  TN    S+     AF S       F  +T        SGF  +  T  +     P     F TT  N++
Subjt:  ---------GQPVPFSSPMQNS----SAFGMTNFPSTSAVAVRGAFGS-QLPSQTFGDST-------LSGFNNSGITNAESNLSSPAMLTNFPTTDANSS

Query:  VGGQIAP-------------------------------NGQLVNKLQQENSSVDVSIWMKEKWVPGEV
        + GQ  P                               N     +LQ     VD SIW+KEKW PGE+
Subjt:  VGGQIAP-------------------------------NGQLVNKLQQENSSVDVSIWMKEKWVPGEV

Arabidopsis top hitse value%identityAlignment
AT1G75340.1 Zinc finger C-x8-C-x5-C-x3-H type family protein7.4e-1029.85Show/hide
Query:  PKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPS--LSSFSQLGASLNMGFGARPSNPPNNLF-----------
        P    VERERNL NSK+AEFE  L  PY         N+S F+ +       S+Q N+PS   S F+Q  A  N   G   S+ P N F           
Subjt:  PKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPS--LSSFSQLGASLNMGFGARPSNPPNNLF-----------

Query:  ---------GQPVPFSSPMQNS----SAFGMTNFPSTSAVAVRGAFGS-QLPSQTFGDST-------LSGFNNSGITNAESNLSSPAMLTNFPTTDANSS
                 G P PF+S  Q S    +AF  TN    S+     AF S       F  +T        SGF  +  T  +     P     F TT  N++
Subjt:  ---------GQPVPFSSPMQNS----SAFGMTNFPSTSAVAVRGAFGS-QLPSQTFGDST-------LSGFNNSGITNAESNLSSPAMLTNFPTTDANSS

Query:  VGGQIAP-------------------------------NGQLVNKLQQENSSVDVSIWMKEKWVPGEV
        + GQ  P                               N     +LQ     VD SIW+KEKW PGE+
Subjt:  VGGQIAP-------------------------------NGQLVNKLQQENSSVDVSIWMKEKWVPGEV

AT1G75340.2 Zinc finger C-x8-C-x5-C-x3-H type family protein3.0e-1131.65Show/hide
Query:  PKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPS--LSSFSQLGASLNMGFGARPSNPPNNLFG----QPVPFS
        P    VERERNL NSK+AEFE  L  PY         N+S F+ +       S+Q N+PS   S F+Q  A  N   G   S+ P N F     QP  FS
Subjt:  PKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPS--LSSFSQLGASLNMGFGARPSNPPNNLFG----QPVPFS

Query:  SPMQNSSAFGMTNFPSTSAVAVRGA-FG-------SQLPSQTFGDSTLSGFNNSGITNAES-NLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQEN
                 G + F +  +   + A FG       +   +  FG ST +   N+   N  + N + P      P  +  ++  G          +LQ   
Subjt:  SPMQNSSAFGMTNFPSTSAVAVRGA-FG-------SQLPSQTFGDSTLSGFNNSGITNAES-NLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQEN

Query:  SSVDVSIWMKEKWVPGEV
          VD SIW+KEKW PGE+
Subjt:  SSVDVSIWMKEKWVPGEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATTAAGAGTTGTAAAGTGAAGGATAATTTTTTCTGTGCTTGGAAAGAAGGCAATGGTTTCTTTGTTGAAGACTTGAGCAGAAACATCAAAGTTTTCCTTTCACC
AAGTATGTTAAGCTGGTTCGATGAGGCTTTGAATCTCCTACTATCCTTCCCAACCTACAAGTTCTTCTCGAGGAAATCAAGAATCGATCATGGAGTTATCCGCCTTGCTA
AGTTCTGGTCTTCAGGTGCTTGGTTTGTGGAGTGTGCAATTTGGCCTTCATCGGGTGGAAGAAAGAACATTCATATTCCTTCAGGTCTAGAGAAAGAAGAGGTGGCTATT
AGGCCTAAGGATAAAAATGGTATTGGTTACCGTCCAAAGAAATCCTTTGCAGAGGTCGTTCGCTCTTCGCCAAGCACCACTAAGGCCGAGTTACATCCAAAAGTTGTTGA
GAGGAATTCGTATTGGATTCGAAAGGATCATGATATTGTTGAATTAAATCTTCAAGAGTCTTTAGTTGTCACCAAATTGATGGCACATTACTCGTGGGGGAAAATTAAAG
CATCCTTTGAAGATCTTTTAAAGACAAATGTTTCCATTAGTCCGATTATGGATGATAAAGCTTTATTTCAGGTGAATATAGACCTGGGTTCCCTCATCACTATGGATAAG
TGGGCTTTCTATGATAAGTTTCATCTAAAATTTGAGCTTTGGTCCAAGAAAATTCATTCTTTTCCAGAGTTTATTAAGAGTCATGGAGGAAGAATCTGTATTAAAAATTT
GCCCCTACAGTACTGGAAGCGTTCGGTTTTTGAAGCAATTGGCACTCATTTAGATGGATTAATTGGAATCTCATCTCATTCCCTAAATTGTATTATTGCTTCCTCAGCTT
TGATTCATGTTAAGAGCAACTCTTGTGGCTTTATCCCTGCATCCCTTGAGATTACAGATTATGTTCTTGAGAATTTTAGGATCTTGCTTCAAGATCATGGTTATGCTGCA
TCGGAGATTACTCAACCATCAATTACCCCTGTCTTAAATCCGAATGCCTATTCAAATTCCATTGATCAAGACCGAATTCGAAAAGCCTTGGTAGATGAAGAGTTGGATTT
AAATTCAGACATTTTGGCGCCATCCATTTTAAATTCTCCAAGCAAGTTGATTCGGCCTTTTGAGTCCATGCCAAAAGTCACATCTCTTCCCGAGGAAGTTATAGTAATGA
AGAGTCCTAATGAATATAATGTTCTCCATCAAGAAGTTAATGAGAATTTAATTCATTCCTCAATTCCCCCGAAATTGAATGAAAAAGAGTTAATGACAGCTTTAAATTTG
GGGCAAAAGAGTATTAATGTTCCTTCATTGGAAGTAGAAGAGTCTGTCTCTCCTGGTTTTTCTACTCCTAATCCCAAAAATGAAAAGGTTGAAAGAGAGAGAAATTTACT
TAATTCTAAGTTAGCCGAGTTTGAGGGCCTTCTTCATAAGCCATATGTAACACCATCAAATCATGCTCCTGGCAACGAAAGCTCATTTTCAGGGAGTAATTCTCTTTCAA
TTCTACCAAGTGCTCAGAATAACACTCCTTCATTGTCAAGTTTTAGCCAGTTGGGTGCATCACTTAATATGGGATTTGGTGCTAGGCCCTCTAATCCGCCTAATAATCTA
TTCGGCCAGCCAGTTCCTTTTTCAAGCCCCATGCAAAACTCGAGTGCATTTGGAATGACCAATTTTCCATCAACAAGTGCTGTTGCAGTTAGAGGTGCATTTGGTAGCCA
ACTCCCGAGCCAAACATTTGGAGACTCTACCCTAAGTGGCTTCAATAACAGTGGCATAACAAATGCTGAAAGTAATCTTTCCTCTCCAGCGATGCTAACAAACTTCCCTA
CAACGGATGCAAATTCAAGTGTCGGTGGACAAATTGCGCCAAATGGCCAATTGGTAAATAAGTTACAGCAGGAAAATAGTTCTGTGGATGTCAGCATTTGGATGAAAGAG
AAATGGGTTCCTGGAGAGGTACGATCTTTTGTTAATGTTTATTTTATTTTATTTTTCCCACCAATTTTTCAAGCTATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAATTAAGAGTTGTAAAGTGAAGGATAATTTTTTCTGTGCTTGGAAAGAAGGCAATGGTTTCTTTGTTGAAGACTTGAGCAGAAACATCAAAGTTTTCCTTTCACC
AAGTATGTTAAGCTGGTTCGATGAGGCTTTGAATCTCCTACTATCCTTCCCAACCTACAAGTTCTTCTCGAGGAAATCAAGAATCGATCATGGAGTTATCCGCCTTGCTA
AGTTCTGGTCTTCAGGTGCTTGGTTTGTGGAGTGTGCAATTTGGCCTTCATCGGGTGGAAGAAAGAACATTCATATTCCTTCAGGTCTAGAGAAAGAAGAGGTGGCTATT
AGGCCTAAGGATAAAAATGGTATTGGTTACCGTCCAAAGAAATCCTTTGCAGAGGTCGTTCGCTCTTCGCCAAGCACCACTAAGGCCGAGTTACATCCAAAAGTTGTTGA
GAGGAATTCGTATTGGATTCGAAAGGATCATGATATTGTTGAATTAAATCTTCAAGAGTCTTTAGTTGTCACCAAATTGATGGCACATTACTCGTGGGGGAAAATTAAAG
CATCCTTTGAAGATCTTTTAAAGACAAATGTTTCCATTAGTCCGATTATGGATGATAAAGCTTTATTTCAGGTGAATATAGACCTGGGTTCCCTCATCACTATGGATAAG
TGGGCTTTCTATGATAAGTTTCATCTAAAATTTGAGCTTTGGTCCAAGAAAATTCATTCTTTTCCAGAGTTTATTAAGAGTCATGGAGGAAGAATCTGTATTAAAAATTT
GCCCCTACAGTACTGGAAGCGTTCGGTTTTTGAAGCAATTGGCACTCATTTAGATGGATTAATTGGAATCTCATCTCATTCCCTAAATTGTATTATTGCTTCCTCAGCTT
TGATTCATGTTAAGAGCAACTCTTGTGGCTTTATCCCTGCATCCCTTGAGATTACAGATTATGTTCTTGAGAATTTTAGGATCTTGCTTCAAGATCATGGTTATGCTGCA
TCGGAGATTACTCAACCATCAATTACCCCTGTCTTAAATCCGAATGCCTATTCAAATTCCATTGATCAAGACCGAATTCGAAAAGCCTTGGTAGATGAAGAGTTGGATTT
AAATTCAGACATTTTGGCGCCATCCATTTTAAATTCTCCAAGCAAGTTGATTCGGCCTTTTGAGTCCATGCCAAAAGTCACATCTCTTCCCGAGGAAGTTATAGTAATGA
AGAGTCCTAATGAATATAATGTTCTCCATCAAGAAGTTAATGAGAATTTAATTCATTCCTCAATTCCCCCGAAATTGAATGAAAAAGAGTTAATGACAGCTTTAAATTTG
GGGCAAAAGAGTATTAATGTTCCTTCATTGGAAGTAGAAGAGTCTGTCTCTCCTGGTTTTTCTACTCCTAATCCCAAAAATGAAAAGGTTGAAAGAGAGAGAAATTTACT
TAATTCTAAGTTAGCCGAGTTTGAGGGCCTTCTTCATAAGCCATATGTAACACCATCAAATCATGCTCCTGGCAACGAAAGCTCATTTTCAGGGAGTAATTCTCTTTCAA
TTCTACCAAGTGCTCAGAATAACACTCCTTCATTGTCAAGTTTTAGCCAGTTGGGTGCATCACTTAATATGGGATTTGGTGCTAGGCCCTCTAATCCGCCTAATAATCTA
TTCGGCCAGCCAGTTCCTTTTTCAAGCCCCATGCAAAACTCGAGTGCATTTGGAATGACCAATTTTCCATCAACAAGTGCTGTTGCAGTTAGAGGTGCATTTGGTAGCCA
ACTCCCGAGCCAAACATTTGGAGACTCTACCCTAAGTGGCTTCAATAACAGTGGCATAACAAATGCTGAAAGTAATCTTTCCTCTCCAGCGATGCTAACAAACTTCCCTA
CAACGGATGCAAATTCAAGTGTCGGTGGACAAATTGCGCCAAATGGCCAATTGGTAAATAAGTTACAGCAGGAAAATAGTTCTGTGGATGTCAGCATTTGGATGAAAGAG
AAATGGGTTCCTGGAGAGGTACGATCTTTTGTTAATGTTTATTTTATTTTATTTTTCCCACCAATTTTTCAAGCTATTTAG
Protein sequenceShow/hide protein sequence
MEIKSCKVKDNFFCAWKEGNGFFVEDLSRNIKVFLSPSMLSWFDEALNLLLSFPTYKFFSRKSRIDHGVIRLAKFWSSGAWFVECAIWPSSGGRKNIHIPSGLEKEEVAI
RPKDKNGIGYRPKKSFAEVVRSSPSTTKAELHPKVVERNSYWIRKDHDIVELNLQESLVVTKLMAHYSWGKIKASFEDLLKTNVSISPIMDDKALFQVNIDLGSLITMDK
WAFYDKFHLKFELWSKKIHSFPEFIKSHGGRICIKNLPLQYWKRSVFEAIGTHLDGLIGISSHSLNCIIASSALIHVKSNSCGFIPASLEITDYVLENFRILLQDHGYAA
SEITQPSITPVLNPNAYSNSIDQDRIRKALVDEELDLNSDILAPSILNSPSKLIRPFESMPKVTSLPEEVIVMKSPNEYNVLHQEVNENLIHSSIPPKLNEKELMTALNL
GQKSINVPSLEVEESVSPGFSTPNPKNEKVERERNLLNSKLAEFEGLLHKPYVTPSNHAPGNESSFSGSNSLSILPSAQNNTPSLSSFSQLGASLNMGFGARPSNPPNNL
FGQPVPFSSPMQNSSAFGMTNFPSTSAVAVRGAFGSQLPSQTFGDSTLSGFNNSGITNAESNLSSPAMLTNFPTTDANSSVGGQIAPNGQLVNKLQQENSSVDVSIWMKE
KWVPGEVRSFVNVYFILFFPPIFQAI