; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0016789 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0016789
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionNUFIP1 domain-containing protein
Genome locationchr09:9460806..9465861
RNA-Seq ExpressionPI0016789
SyntenyPI0016789
Gene Ontology termsGO:0000492 - box C/D snoRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140922.1 uncharacterized protein LOC101213190 [Cucumis sativus]7.1e-11974.05Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSL NS NGFQNQA FCNPNP  NNLHGN VPTMPPPMFQPGLMMNLQNPLMGLPNN LGASPF PGHMGFANS ANF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PS+TSYGGPNQQ VPMPFQNPGFST Q FGVNQGM P
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNGSNSI NN     N +   +  FQKNQT H+KN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGW
        EKKKF F GGQK KG+
Subjt:  EKKKFEFSGGQKGKGW

XP_008456637.1 PREDICTED: uncharacterized protein LOC103496534 isoform X2 [Cucumis melo]2.0e-12175Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSLANS NGFQNQA FCNPNPH NNL GN VPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPF PGHMGFANS +NF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PSHTSYGGPNQQ VPMPFQNPGFST QPFGVNQGMHP
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNG NSI NN     N +   +  FQKNQT HMKN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGW
        EKK+F F GGQK KG+
Subjt:  EKKKFEFSGGQKGKGW

XP_016902016.1 PREDICTED: uncharacterized protein LOC103496534 isoform X1 [Cucumis melo]4.4e-12162.38Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSLANS NGFQNQA FCNPNPH NNL GN VPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPF PGHMGFANS +NF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PSHTSYGGPNQQ VPMPFQNPGFST QPFGVNQGMHP
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNG NSI NN     N +   +  FQKNQT HMKN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGWIVINRIFLFSLQNVVCGERELARMNYTSCELCAKTSSLRIKIDQIQVNGNLVISQRHSNLSSRQNQSNIAWLNATCLHRKALPS
        EKK+F F GGQK KG       F    +N  CG                         DQ++        Q+ S      +Q    W  A    RK  PS
Subjt:  EKKKFEFSGGQKGKGWIVINRIFLFSLQNVVCGERELARMNYTSCELCAKTSSLRIKIDQIQVNGNLVISQRHSNLSSRQNQSNIAWLNATCLHRKALPS

Query:  DLSLAKYFILQK
          ++ K+ ILQK
Subjt:  DLSLAKYFILQK

XP_023550423.1 uncharacterized protein LOC111808573 [Cucurbita pepo subsp. pepo]2.1e-11068.21Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGF--------QNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANF
        M  PP  SSQQLPNSSLANS+NG         QNQA FCNPN HLNNLHGN VP MPPPMFQPGLMMNLQNPLM LPNNPLGASPF PGHMGFANS ANF
Subjt:  MIPPPGPSSQQLPNSSLANSSNGF--------QNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANF

Query:  LAEGQFNLMPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF----------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQP
         A+GQFN++PNVNQMN+NSCL LAQFFGQNMPNLVQQL QNMGL NGQF                       PSH SYG PNQQ VPMPFQNP  ST+QP
Subjt:  LAEGQFNLMPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF----------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQP

Query:  FGVNQGMHPINQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLG-LFLNQLIRCRFQK
        FGVNQ MHP+NQNPQNFIPQAMG               GN TMP NS TQPQQARNLQSP F G+QGNSSISDGGNGSNS  NNL      +     FQK
Subjt:  FGVNQGMHPINQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLG-LFLNQLIRCRFQK

Query:  NQTRHMKNEKKKFEFSGGQKGKGW
        +Q  HMKNEKKKF   GG KGKG+
Subjt:  NQTRHMKNEKKKFEFSGGQKGKGW

XP_038885674.1 uncharacterized protein LOC120075982 [Benincasa hispida]1.6e-11872.38Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PPG SSQQLPN+SLANS NGFQNQA FCNPNPHLNNLHGN VP MPPPMFQPGLMMNLQNPLMGLPNNPL ASPF PGH+GFANS AN+ A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        +PNVNQMN+N+CL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PSHTSYGGPNQQ +PMPFQNPGFST+QPFGVNQ MHP
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLG-LFLNQLIRCRFQKNQTRHMKNE
        +NQNPQNF PQAMG               GN TM +NSSTQPQQARNLQSP F G+QGNSSISDGGNGSNS  NNL      +  +  FQKNQ  HMKNE
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLG-LFLNQLIRCRFQKNQTRHMKNE

Query:  KKKFEFSGGQKGKGW
        KKKF F GGQKGKG+
Subjt:  KKKFEFSGGQKGKGW

TrEMBL top hitse value%identityAlignment
A0A0A0KE87 NUFIP1 domain-containing protein3.4e-11974.05Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSL NS NGFQNQA FCNPNP  NNLHGN VPTMPPPMFQPGLMMNLQNPLMGLPNN LGASPF PGHMGFANS ANF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PS+TSYGGPNQQ VPMPFQNPGFST Q FGVNQGM P
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNGSNSI NN     N +   +  FQKNQT H+KN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGW
        EKKKF F GGQK KG+
Subjt:  EKKKFEFSGGQKGKGW

A0A1S3C3B2 uncharacterized protein LOC103496534 isoform X29.7e-12275Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSLANS NGFQNQA FCNPNPH NNL GN VPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPF PGHMGFANS +NF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PSHTSYGGPNQQ VPMPFQNPGFST QPFGVNQGMHP
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNG NSI NN     N +   +  FQKNQT HMKN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGW
        EKK+F F GGQK KG+
Subjt:  EKKKFEFSGGQKGKGW

A0A1S4E1B3 uncharacterized protein LOC103496534 isoform X12.2e-12162.38Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSLANS NGFQNQA FCNPNPH NNL GN VPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPF PGHMGFANS +NF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PSHTSYGGPNQQ VPMPFQNPGFST QPFGVNQGMHP
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNG NSI NN     N +   +  FQKNQT HMKN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGWIVINRIFLFSLQNVVCGERELARMNYTSCELCAKTSSLRIKIDQIQVNGNLVISQRHSNLSSRQNQSNIAWLNATCLHRKALPS
        EKK+F F GGQK KG       F    +N  CG                         DQ++        Q+ S      +Q    W  A    RK  PS
Subjt:  EKKKFEFSGGQKGKGWIVINRIFLFSLQNVVCGERELARMNYTSCELCAKTSSLRIKIDQIQVNGNLVISQRHSNLSSRQNQSNIAWLNATCLHRKALPS

Query:  DLSLAKYFILQK
          ++ K+ ILQK
Subjt:  DLSLAKYFILQK

A0A5A7SKM0 Putative basic-leucine zipper transcription factor F isoform X19.7e-12275Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL
        MI PP PSSQQ+PNSSLANS NGFQNQA FCNPNPH NNL GN VPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPF PGHMGFANS +NF A+GQFNL
Subjt:  MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNL

Query:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP
        MPNVNQMN+NSCL LAQFFGQNMPNLVQQLGQNMGL NGQF                      PSHTSYGGPNQQ VPMPFQNPGFST QPFGVNQGMHP
Subjt:  MPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF---------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHP

Query:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN
        +NQNPQNFIPQAMG               GN TMPINSSTQPQQARNLQSP FAGTQGNSSISDGGNG NSI NN     N +   +  FQKNQT HMKN
Subjt:  INQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLGLFLNQLIRCR--FQKNQTRHMKN

Query:  EKKKFEFSGGQKGKGW
        EKK+F F GGQK KG+
Subjt:  EKKKFEFSGGQKGKGW

A0A6J1FNJ1 uncharacterized protein LOC1114455222.9e-11067.9Show/hide
Query:  MIPPPGPSSQQLPNSSLANSSNGF--------QNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANF
        M  PP  SSQQLPNSSLANS+NG         QNQA FCNPN HLNNLHGN VP MPPPMFQPGLMMNLQNPLM LPNNPLGASPF PGHMGFANS ANF
Subjt:  MIPPPGPSSQQLPNSSLANSSNGF--------QNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANF

Query:  LAEGQFNLMPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF----------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQP
          +GQFN++PNVNQMN+NSCL LAQFFGQNMPNLVQQL QNMGL NGQF                       PSH SYG PNQQ VPMPFQNP  ST+QP
Subjt:  LAEGQFNLMPNVNQMNLNSCLSLAQFFGQNMPNLVQQLGQNMGLENGQF----------------------FPSHTSYGGPNQQVVPMPFQNPGFSTVQP

Query:  FGVNQGMHPINQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLG-LFLNQLIRCRFQK
        FGVNQ MHP+NQNPQNFIPQAMG               GN TMP NS TQPQQARNLQSP F G+QGNSSISDGGNGSNS  NNL      +     FQK
Subjt:  FGVNQGMHPINQNPQNFIPQAMG--------------NGNWTMPINSSTQPQQARNLQSPTFAGTQGNSSISDGGNGSNSILNNLG-LFLNQLIRCRFQK

Query:  NQTRHMKNEKKKFEFSGGQKGKGW
        +Q  HMKNEKKKF   GG KGKG+
Subjt:  NQTRHMKNEKKKFEFSGGQKGKGW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCTCCTCCTGGCCCTTCTTCACAACAGTTACCCAATTCATCTCTCGCCAATTCTAGTAATGGGTTTCAAAATCAGGCCTTCTTCTGCAATCCAAACCCCCACTT
GAATAATCTCCATGGAAACCTTGTTCCCACCATGCCACCTCCCATGTTTCAGCCGGGATTGATGATGAATTTGCAAAACCCTCTCATGGGGTTGCCTAATAATCCTCTTG
GTGCTTCCCCTTTTACTCCTGGGCATATGGGTTTTGCAAATTCTTGTGCTAATTTTCTAGCTGAGGGGCAGTTCAATTTGATGCCAAATGTGAATCAGATGAATTTGAAC
TCTTGTTTGTCTCTAGCCCAGTTTTTTGGGCAGAACATGCCGAATTTGGTTCAGCAATTGGGTCAAAATATGGGTTTAGAAAATGGACAGTTTTTTCCTTCTCATACTTC
ATATGGTGGTCCAAATCAACAAGTTGTTCCAATGCCTTTTCAGAATCCTGGCTTCTCTACGGTCCAGCCTTTTGGTGTCAACCAGGGAATGCACCCTATTAACCAGAATC
CCCAAAACTTCATTCCACAAGCAATGGGCAATGGGAATTGGACCATGCCAATTAACTCTTCCACTCAACCACAACAAGCTAGGAACCTGCAGTCACCTACTTTCGCTGGG
ACACAGGGTAATTCTTCAATAAGTGATGGTGGAAATGGATCAAATTCAATTTTGAATAATTTAGGCCTGTTTCTCAACCAACTCATACGATGTAGATTTCAGAAGAATCA
AACTCGTCATATGAAAAATGAGAAGAAAAAGTTTGAGTTTTCTGGCGGACAGAAAGGGAAAGGATGGATCGTGATCAATAGAATCTTTTTGTTTTCCTTACAGAATGTAG
TATGCGGTGAGAGAGAACTCGCGAGGATGAACTATACTTCATGCGAGCTGTGCGCTAAAACAAGCTCCTTGAGGATAAAAATTGATCAAATACAAGTCAATGGCAATTTG
GTAATCTCCCAAAGGCATTCAAATTTATCCTCGCGGCAGAACCAATCCAATATCGCATGGCTCAACGCAACCTGCCTTCACCGCAAGGCATTGCCTTCTGACCTGAGCCT
CGCGAAGTACTTCATTCTGCAGAAGTATATCTTACCTCGCCGCATACTCATCCTTCACCAGCTCGCACTCTTCATTCAACTGACTCAAGCTCGCAACCTTCACCGCAATC
ATCAGCTCGCATGCGATCTGGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTCCTCCTCCTGGCCCTTCTTCACAACAGTTACCCAATTCATCTCTCGCCAATTCTAGTAATGGGTTTCAAAATCAGGCCTTCTTCTGCAATCCAAACCCCCACTT
GAATAATCTCCATGGAAACCTTGTTCCCACCATGCCACCTCCCATGTTTCAGCCGGGATTGATGATGAATTTGCAAAACCCTCTCATGGGGTTGCCTAATAATCCTCTTG
GTGCTTCCCCTTTTACTCCTGGGCATATGGGTTTTGCAAATTCTTGTGCTAATTTTCTAGCTGAGGGGCAGTTCAATTTGATGCCAAATGTGAATCAGATGAATTTGAAC
TCTTGTTTGTCTCTAGCCCAGTTTTTTGGGCAGAACATGCCGAATTTGGTTCAGCAATTGGGTCAAAATATGGGTTTAGAAAATGGACAGTTTTTTCCTTCTCATACTTC
ATATGGTGGTCCAAATCAACAAGTTGTTCCAATGCCTTTTCAGAATCCTGGCTTCTCTACGGTCCAGCCTTTTGGTGTCAACCAGGGAATGCACCCTATTAACCAGAATC
CCCAAAACTTCATTCCACAAGCAATGGGCAATGGGAATTGGACCATGCCAATTAACTCTTCCACTCAACCACAACAAGCTAGGAACCTGCAGTCACCTACTTTCGCTGGG
ACACAGGGTAATTCTTCAATAAGTGATGGTGGAAATGGATCAAATTCAATTTTGAATAATTTAGGCCTGTTTCTCAACCAACTCATACGATGTAGATTTCAGAAGAATCA
AACTCGTCATATGAAAAATGAGAAGAAAAAGTTTGAGTTTTCTGGCGGACAGAAAGGGAAAGGATGGATCGTGATCAATAGAATCTTTTTGTTTTCCTTACAGAATGTAG
TATGCGGTGAGAGAGAACTCGCGAGGATGAACTATACTTCATGCGAGCTGTGCGCTAAAACAAGCTCCTTGAGGATAAAAATTGATCAAATACAAGTCAATGGCAATTTG
GTAATCTCCCAAAGGCATTCAAATTTATCCTCGCGGCAGAACCAATCCAATATCGCATGGCTCAACGCAACCTGCCTTCACCGCAAGGCATTGCCTTCTGACCTGAGCCT
CGCGAAGTACTTCATTCTGCAGAAGTATATCTTACCTCGCCGCATACTCATCCTTCACCAGCTCGCACTCTTCATTCAACTGACTCAAGCTCGCAACCTTCACCGCAATC
ATCAGCTCGCATGCGATCTGGCATAA
Protein sequenceShow/hide protein sequence
MIPPPGPSSQQLPNSSLANSSNGFQNQAFFCNPNPHLNNLHGNLVPTMPPPMFQPGLMMNLQNPLMGLPNNPLGASPFTPGHMGFANSCANFLAEGQFNLMPNVNQMNLN
SCLSLAQFFGQNMPNLVQQLGQNMGLENGQFFPSHTSYGGPNQQVVPMPFQNPGFSTVQPFGVNQGMHPINQNPQNFIPQAMGNGNWTMPINSSTQPQQARNLQSPTFAG
TQGNSSISDGGNGSNSILNNLGLFLNQLIRCRFQKNQTRHMKNEKKKFEFSGGQKGKGWIVINRIFLFSLQNVVCGERELARMNYTSCELCAKTSSLRIKIDQIQVNGNL
VISQRHSNLSSRQNQSNIAWLNATCLHRKALPSDLSLAKYFILQKYILPRRILILHQLALFIQLTQARNLHRNHQLACDLA