; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005893 (gene) of Snake gourd v1 genome

Gene IDTan0005893
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionbox C/D snoRNA protein 1-like
Genome locationLG06:4088408..4091782
RNA-Seq ExpressionTan0005893
SyntenyTan0005893
Gene Ontology termsNA
InterPro domainsIPR007529 - Zinc finger, HIT-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593329.1 Box C/D snoRNA protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.8e-20787.76Show/hide
Query:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA
        MAEGDTV   AAAAASTS NREG  SSLCEECNSNPSKYKCPACSLRSCSL+CVN HKRRSGCTGKR QTQFVP+SQFNDSVLLSDYNLLEEVKRMAESA
Subjt:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA

Query:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN
        QRLRKKLCP+TH Y+RLPFHLKSLRTAASSRRT+IMFLPTGM+KREKNQTRYDKREKTIFWTIEWR NST++ LVDHGVNENT L TVLENHLQPSPWKN
Subjt:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN

Query:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED
        Q+QKFCEQLDSLKF+VRTYPKGA  PFRELDS +PIRQLF NLVFVEYPVIYVFLPSQTPNFEVVKTA     NPEGKNTGK+DLASPEGV FRVEEIED
Subjt:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED

Query:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV
        DDNSFN QVLDLMK S SSP CEV+PQN+ GAT+N STDLMG HEVGNSPNSSSQAKELGV KELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK V
Subjt:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV

Query:  EMEGSGDLLGDAFTVEELEEGEIME
        EMEGSGDLLGD FTV+ELEEGEIME
Subjt:  EMEGSGDLLGDAFTVEELEEGEIME

XP_022959527.1 box C/D snoRNA protein 1-like [Cucurbita moschata]2.6e-20988.24Show/hide
Query:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA
        MAEGDTV   AAAAASTS NREG  SSLC+ECNSNPSKYKCPACSLRSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDSVLLSDYNLLEEVKRMAESA
Subjt:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA

Query:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN
        QRLRKKLCP+TH Y+RLPFHLKSLRTAASSRRT+IMFLPTGM+KREKNQTRYDKREKTIFWTIEWR NST++ LVDHGVNENT L TVLENHLQPSPWKN
Subjt:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN

Query:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED
        Q+QKFCEQLDSLKF+VRTYPKGA +PFRELDS +PIRQLF NLVFVEYPVIYVFLPSQTPNFEVVKTA     NPEGKNTGKNDLASPEGV FRVEEIED
Subjt:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED

Query:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV
        DDNSFN QVLDLMK S SSP CEV+PQN+ GAT+N STDLMGKHEVGNSPNSSSQAKELGV KELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK V
Subjt:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV

Query:  EMEGSGDLLGDAFTVEELEEGEIME
        EMEGSGDLLGD FTV+ELEEGEIME
Subjt:  EMEGSGDLLGDAFTVEELEEGEIME

XP_023004550.1 box C/D snoRNA protein 1 [Cucurbita maxima]4.4e-20988.24Show/hide
Query:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA
        MAEGDTV   AAAAASTS NREG  SSLCEECNSNPSKYKCPACSLRSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDSVLLSDYNLLEEVKRMAESA
Subjt:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA

Query:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN
        QRLRKKLCP+TH Y+RLPFHLKSLRTAASSRRT+IMFLPTGM+KREKNQTRYDKREKTIFWTIEWR NST++ LVDHGVNENT L TVLENHLQPSPWKN
Subjt:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN

Query:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED
        Q+QKFCEQLDSLKF+VRTYPKGA  PFRELDS +PIRQLF NLVFVEYPVIYVFLPSQTPNFEV+KTA     NPEGKNTGKNDL SPEGV FRVEEIED
Subjt:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED

Query:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV
        DDNSFN QVLDLMK S SSP CEV+PQN+ GAT+N STDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK V
Subjt:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV

Query:  EMEGSGDLLGDAFTVEELEEGEIME
        EMEGSGDLLGD FTV+ELEEGEIME
Subjt:  EMEGSGDLLGDAFTVEELEEGEIME

XP_023513608.1 box C/D snoRNA protein 1-like [Cucurbita pepo subsp. pepo]1.1e-20788.03Show/hide
Query:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA
        MAEGDTV   AAAAASTS NREG  SSLCEECNSNPSKYKCPACSLRSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDSVLLSDYNLLEEVKRMAESA
Subjt:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA

Query:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN
        QRLRKKLCP+TH Y+RLPFHLKSLRTAASSRRT+IMFLPTGM+KREKNQTRYDKREKTIFWTIEWR NST++ LVDHGVNENT L TVLENHLQPSPWKN
Subjt:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN

Query:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED
        Q+QKFCEQLDSLKF+VRTYPKGA +PFRELDS +PIRQLF NLVFVEYPVIYVFLPSQTPNFEVVKTA     NPEGKNTGKNDLASPEGV FRVEEIED
Subjt:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED

Query:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHG-ATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKE
        DDNSFN QVLDLMK S SSP CEVDPQN+ G AT+  STDLMGKHEVGNSPNSSSQAKELGV K+LEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK 
Subjt:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHG-ATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKE

Query:  VEMEGSGDLLGDAFTVEELEEGEIME
        VEMEGSGDLLGD FTV+ELEEGEIME
Subjt:  VEMEGSGDLLGDAFTVEELEEGEIME

XP_038900096.1 box C/D snoRNA protein 1-like [Benincasa hispida]3.9e-20587.2Show/hide
Query:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL
        MAEGD  A A ASTSSNR+   SSLC+EC SNPSKYKCPACS+RSCSLNCVN HKRRSGCTGKRKQTQFVPLSQFNDS+LLSDYNLLEEVKRMAESAQR 
Subjt:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL

Query:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ
        RKKLCP+THAYFRLPFHLKSLRTAASSRRT+IMFLPTGM+KRE NQTRYDKREKTIFWT+EWRFNS +I LVDHGVNEN+KL T+LENHLQPSPWKNQL+
Subjt:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ

Query:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIEDDDN
        KFCEQLDSLKF+VRTYPKGATSPFRELDS LPIRQLF NLVFVEYPVIYVFLPSQTPNFEVVKTA     NPEG N GKN+LAS EGV FRVEEIEDDDN
Subjt:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIEDDDN

Query:  SFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEME
        S+NPQVLDLM+VST SPRCEVDPQNLH AT+  S DLMGK E GNSPNSSSQAKELGV+KE EFDFEQDLID YSNIMAQINPDDFLDWEGDFSK VEME
Subjt:  SFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEME

Query:  GSGDLLGDAFTVEELEEGEIME
        GSG+LLGDAFTVEELEEGEIME
Subjt:  GSGDLLGDAFTVEELEEGEIME

TrEMBL top hitse value%identityAlignment
A0A0A0K6N1 HIT-type domain-containing protein1.5e-19483.65Show/hide
Query:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL
        MAE D  A  A STSSN++G  SSLCEEC SNPSKYKCPACS+RSCSLNCVN HKRRSGCTGKRKQTQFVPLSQFNDS+LLSDYNLLEEVKRMAESAQRL
Subjt:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL

Query:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ
        RKKLCP+THAYFRLPFHLKSLR AAS+RRT+IMFLPTGM+KRE NQTRYDKREKTIFWT+EWRFNST+I LVDH VNEN+KL T+LENHL+P PWK QLQ
Subjt:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ

Query:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANP-----EGKNTGKNDLASPEGVPFRVEEIEDDDN
        KF EQLD LKF+VRTYPKGATS F ELDS LPIRQLF NL FVEYPVIYV LPSQTPNFEVVKTANP     EG N  KNDLAS EGV FRVEEIE+D+N
Subjt:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANP-----EGKNTGKNDLASPEGVPFRVEEIEDDDN

Query:  SFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEME
        S NPQVLDLMKVSTSSP C+V P+NLHGAT++ ST L+GK EVGNSP SSSQA+E GV+KELEFDFEQDLID YSNIMAQINPDDFLDW+GDFSKEVEME
Subjt:  SFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEME

Query:  GSGDLLGDAFTVEELEEGEIME
        GSG+LLGDAFTVEELEEGEIME
Subjt:  GSGDLLGDAFTVEELEEGEIME

A0A1S3CEQ2 box C/D snoRNA protein 1-like3.9e-18781.75Show/hide
Query:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL
        MAE D  A  A STSSN +G  SSLCEEC SNPSKYKCPACS+RSCSLNCVN HKRRSGCTGKRKQTQFVPLSQFNDS+LLSDYNLLEEVKRMAESAQRL
Subjt:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL

Query:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ
        RKKLCP+THAYFRLPFHLKSLR AAS+RRT+IMFLPTGM+KRE NQTRYDKREKTIFWT+EWRFNSTNI LVDH VNEN+KL T+L NHL+PSPWK QLQ
Subjt:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ

Query:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANP-----EGKNTGKNDLASPEGVPFRVEEIEDDDN
        KF EQLD LK +VRTYPKGA SPF ELDS LPIRQLF NL FVEYPVIYV LP QTPNFEVVKTANP     EG N  +NDLAS  GV FRVEEIEDD+N
Subjt:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANP-----EGKNTGKNDLASPEGVPFRVEEIEDDDN

Query:  SFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEME
        S NPQVLDLMKVSTSSP C+V P+N           L+GK EVGNSP SSSQA+ELGV+KELEFDFEQDLID YSNIMAQINPDDFLDWEGDFSKEVEME
Subjt:  SFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEME

Query:  GSGDLLGDAFTVEELEEGEIME
        GSG+LLGDAFT EELEEGEIME
Subjt:  GSGDLLGDAFTVEELEEGEIME

A0A6J1DKM0 box C/D snoRNA protein 11.8e-20085.75Show/hide
Query:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL
        MAEGD     AASTSSNRE   SSLC+ECNSN SKY CPACS+RSCSL CVN HKRRSGCTGKRKQTQFVP+SQFNDS+LLSDYNLLEEVKR++ESAQRL
Subjt:  MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRL

Query:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ
        RKKLCP+THAYFRLPF LKSL TAASSRRT+I+FLPTGM+KREKNQTRYDKREKTIFWTIEW+ NST+I L DHGVNENTKL  VLENHLQPSPWKNQLQ
Subjt:  RKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQ

Query:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANP----EGKNTGKNDLASPEGVPFRVEEIEDDDNS
        KF +QLDS+K +VRTYPKGATSPFRELDSGLPIR+LF NLV +EYPVIYVFLPSQTPNFEVVKTANP    EGKNTGK+D ASPEGVPFRVEEIE+DD S
Subjt:  KFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANP----EGKNTGKNDLASPEGVPFRVEEIEDDDNS

Query:  FNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEG
        FNPQVLDLMKVSTSSPRCEV+PQNLHGATYN S DLM  HEV NSPNSSSQAKE+GVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSK VEMEG
Subjt:  FNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEG

Query:  SGDLLGDAFTVEELEEGEIME
        + DLLGD FTVEELEEGEI+E
Subjt:  SGDLLGDAFTVEELEEGEIME

A0A6J1H4S8 box C/D snoRNA protein 1-like1.2e-20988.24Show/hide
Query:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA
        MAEGDTV   AAAAASTS NREG  SSLC+ECNSNPSKYKCPACSLRSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDSVLLSDYNLLEEVKRMAESA
Subjt:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA

Query:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN
        QRLRKKLCP+TH Y+RLPFHLKSLRTAASSRRT+IMFLPTGM+KREKNQTRYDKREKTIFWTIEWR NST++ LVDHGVNENT L TVLENHLQPSPWKN
Subjt:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN

Query:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED
        Q+QKFCEQLDSLKF+VRTYPKGA +PFRELDS +PIRQLF NLVFVEYPVIYVFLPSQTPNFEVVKTA     NPEGKNTGKNDLASPEGV FRVEEIED
Subjt:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED

Query:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV
        DDNSFN QVLDLMK S SSP CEV+PQN+ GAT+N STDLMGKHEVGNSPNSSSQAKELGV KELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK V
Subjt:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV

Query:  EMEGSGDLLGDAFTVEELEEGEIME
        EMEGSGDLLGD FTV+ELEEGEIME
Subjt:  EMEGSGDLLGDAFTVEELEEGEIME

A0A6J1KUW8 box C/D snoRNA protein 12.1e-20988.24Show/hide
Query:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA
        MAEGDTV   AAAAASTS NREG  SSLCEECNSNPSKYKCPACSLRSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDSVLLSDYNLLEEVKRMAESA
Subjt:  MAEGDTV---AAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESA

Query:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN
        QRLRKKLCP+TH Y+RLPFHLKSLRTAASSRRT+IMFLPTGM+KREKNQTRYDKREKTIFWTIEWR NST++ LVDHGVNENT L TVLENHLQPSPWKN
Subjt:  QRLRKKLCPHTHAYFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKN

Query:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED
        Q+QKFCEQLDSLKF+VRTYPKGA  PFRELDS +PIRQLF NLVFVEYPVIYVFLPSQTPNFEV+KTA     NPEGKNTGKNDL SPEGV FRVEEIED
Subjt:  QLQKFCEQLDSLKFYVRTYPKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTA-----NPEGKNTGKNDLASPEGVPFRVEEIED

Query:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV
        DDNSFN QVLDLMK S SSP CEV+PQN+ GAT+N STDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK V
Subjt:  DDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEV

Query:  EMEGSGDLLGDAFTVEELEEGEIME
        EMEGSGDLLGD FTV+ELEEGEIME
Subjt:  EMEGSGDLLGDAFTVEELEEGEIME

SwissProt top hitse value%identityAlignment
O74906 Putative box C/D snoRNA protein SPCC613.074.1e-1626.82Show/hide
Query:  SSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRL
        S NR G    +C  C  N SKY+CP C  R C L C  +HKR + C+G+R    FVP S+  +  L SD+N L  V+R+       RK+   H     R 
Subjt:  SSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRL

Query:  PFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEW--RFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCEQ-------
          +   L+ +       I F P    KR  N+T YDK+   I W+IEW    +ST+  L D   +ENT    +  +H +  P +   +K  E+       
Subjt:  PFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEW--RFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCEQ-------

Query:  --------LDSLKFYVRTYPKGATS-PFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEGKNTGKNDLASPEGVPFRVEEIEDDDNS
                 D ++F +++    +    +++++    +     N    E P I+VF    T   +V   ++ E  ++ ++D +S         E    D+ 
Subjt:  --------LDSLKFYVRTYPKGATS-PFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEGKNTGKNDLASPEGVPFRVEEIEDDDNS

Query:  FN
         N
Subjt:  FN

P38772 Box C/D snoRNA protein 11.5e-1023.81Show/hide
Query:  LCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGK----RKQTQFVPLSQFND------SVLLSDYNLLEEVKRMA-----ESAQRLRKKLCP---
        LC  C     KYKCP C +++CSL C   HK R  C+G+    ++      L Q +D      + +  DYN L ++KRM      ++  + ++ L P   
Subjt:  LCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGK----RKQTQFVPLSQFND------SVLLSDYNLLEEVKRMA-----ESAQRLRKKLCP---

Query:  HTHAYFRLPFHL-KSLRTAASSRR-----TRIMFLPTGMSKREKNQTRYDKREKTIFWTIEW-------RFNSTNIFL-VDHGVNENTKLPTVLENHLQP
        H   + +  + + +  R +   +R        + LP GM +  +N++++DK      W++EW       +     +F  V H + E         + L  
Subjt:  HTHAYFRLPFHL-KSLRTAASSRR-----TRIMFLPTGMSKREKNQTRYDKREKTIFWTIEW-------RFNSTNIFL-VDHGVNENTKLPTVLENHLQP

Query:  SPWKNQLQKFCE---------------------------QLDSLKFYVRTYPKGATSPFRELDSGLPIR---------QLFYNLVFVEYPVIYV
           KN  QK CE                           Q   LKFY +T+P   T     +DS   +          +L  N   +E+P I+V
Subjt:  SPWKNQLQKFCE---------------------------QLDSLKFYVRTYPKGATSPFRELDSGLPIR---------QLFYNLVFVEYPVIYV

Q3UFB2 Box C/D snoRNA protein 11.1e-2127.2Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSLRT
        S CE C +  +KY+CP C   SCSL CV  HK    C+G R +T +V L QF +  LLSDY  LE+V R A+   R      P    Y      L  ++ 
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSLRT

Query:  AASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSP----WKNQLQKFCEQLDSLKFYVRTYPKG
         A  +   +  LP G SKR++N T +D R++   W ++ +F  +    ++  V ++  +  +L+ ++ P       + +L+ + +    ++  +R     
Subjt:  AASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSP----WKNQLQKFCEQLDSLKFYVRTYPKG

Query:  ATS-PFRELDSGLPIRQLFYNL---VFVEYPVIYVFLPSQTPNFEVVKTANPEGKNTGKND
             + ELD   P + L  NL   V +EYP ++V L   + + ++++  +   +  G  +
Subjt:  ATS-PFRELDSGLPIRQLFYNL---VFVEYPVIYVFLPSQTPNFEVVKTANPEGKNTGKND

Q5RF97 Box C/D snoRNA protein 14.1e-2426.82Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQR--LRKKLCPHTHAYFRLPFHLKSL
        S CE C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R    K+   + H YF        +
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQR--LRKKLCPHTHAYFRLPFHLKSL

Query:  RTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSP----WKNQLQKFCEQLDSLKFYVR-TY
        +  A  +   +  LP G +KR++N T +DK+++   W ++ +F  +    ++  V ++  +  +L+ ++ P       + +L+ +      ++  ++  Y
Subjt:  RTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSP----WKNQLQKFCEQLDSLKFYVR-TY

Query:  PKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEG-KNTGKND
         +     + ELD    +     N V +EYP ++V L     + +V++    E  KN G  +
Subjt:  PKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEG-KNTGKND

Q9NWK9 Box C/D snoRNA protein 13.5e-2326.44Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQR--LRKKLCPHTHAYFRLPFHLKSL
        S CE C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R    K+   + + YF        +
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQR--LRKKLCPHTHAYFRLPFHLKSL

Query:  RTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSP----WKNQLQKFCEQLDSLKFYVR-TY
        +  A  +   +  LP G +KR++N T +DK+++   W ++ +F  +    ++  V ++  +  +L+ ++ P       + +L+ +      ++  ++  Y
Subjt:  RTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSP----WKNQLQKFCEQLDSLKFYVR-TY

Query:  PKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEG-KNTGKND
         +     + ELD    +     N V +EYP ++V L     + +V+     E  KN G  +
Subjt:  PKGATSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEG-KNTGKND

Arabidopsis top hitse value%identityAlignment
AT1G04945.1 HIT-type Zinc finger family protein5.4e-9647.99Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSLRT
        S+CEEC  NP KYKCP CS+RSC+L CV  HK+R+GCTGKRK T  VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC + ++Y +LP+ LKSL++
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSLRT

Query:  AASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCE-QLDSLKFYVRTYPKGATS
        AA SRRT++ +LP+GM KRE NQ+RYD R K I WTIEWRF+ST++ LVDHGV E+  L +V++NHL+P PW ++L+ FC+  LDSLK ++R YPKGA +
Subjt:  AASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCE-QLDSLKFYVRTYPKGATS

Query:  PFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTAN--PEGKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSSPRCEVDPQ
        PF+ELD   P+R+    +V +EYPVI+V+LPSQ+  F+V+K  N  P   ++  +      G+ FR EEIE+DD +SF P+VL LMK    +P   V  +
Subjt:  PFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTAN--PEGKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSSPRCEVDPQ

Query:  NLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDLLGDAFTVEELEEGEIME
        +          + +G     N+ N      E      +E +FEQ LIDTYS++ A++NPD                   D + D     +LEEGEI+E
Subjt:  NLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDLLGDAFTVEELEEGEIME

AT1G04945.2 HIT-type Zinc finger family protein8.0e-10048.51Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSLRT
        S+CEEC  NP KYKCP CS+RSC+L CV  HK+R+GCTGKRK T  VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC + ++Y +LP+ LKSL++
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSLRT

Query:  AASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCE-QLDSLKFYVRTYPKGATS
        AA SRRT++ +LP+GM KRE NQ+RYD R K I WTIEWRF+ST++ LVDHGV E+  L +V++NHL+P PW ++L+ FC+  LDSLK ++R YPKGA +
Subjt:  AASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCE-QLDSLKFYVRTYPKGATS

Query:  PFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTAN--PEGKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSSPRCEVDPQ
        PF+ELD   P+R+    +V +EYPVI+V+LPSQ+  F+V+K  N  P   ++  +      G+ FR EEIE+DD +SF P+VL LMK    +P   V  +
Subjt:  PFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTAN--PEGKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSSPRCEVDPQ

Query:  NLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDL--LGDAFTVE--ELEEGEI
        +          + +G     N+ N      E      +E +FEQ LIDTYS++ A++NP D+ ++E +F+K ++ + + +L  L   F  +  +LEEGEI
Subjt:  NLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDL--LGDAFTVE--ELEEGEI

Query:  ME
        +E
Subjt:  ME

AT1G04945.3 HIT-type Zinc finger family protein1.0e-9948.27Show/hide
Query:  SSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSL
        + S+CEEC  NP KYKCP CS+RSC+L CV  HK+R+GCTGKRK T  VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC + ++Y +LP+ LKSL
Subjt:  SSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHAYFRLPFHLKSL

Query:  RTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCE-QLDSLKFYVRTYPKGA
        ++AA SRRT++ +LP+GM KRE NQ+RYD R K I WTIEWRF+ST++ LVDHGV E+  L +V++NHL+P PW ++L+ FC+  LDSLK ++R YPKGA
Subjt:  RTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCE-QLDSLKFYVRTYPKGA

Query:  TSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTAN--PEGKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSSPRCEVD
         +PF+ELD   P+R+    +V +EYPVI+V+LPSQ+  F+V+K  N  P   ++  +      G+ FR EEIE+DD +SF P+VL LMK    +P   V 
Subjt:  TSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTAN--PEGKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSSPRCEVD

Query:  PQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDL--LGDAFTVE--ELEEG
         ++          + +G     N+ N      E      +E +FEQ LIDTYS++ A++NP D+ ++E +F+K ++ + + +L  L   F  +  +LEEG
Subjt:  PQNLHGATYNLSTDLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDL--LGDAFTVE--ELEEG

Query:  EIME
        EI+E
Subjt:  EIME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAAGGAGACACGGTAGCAGCAGCAGCAGCTTCCACAAGCTCTAACCGTGAAGGATCATCTTCATCTCTTTGCGAAGAGTGTAATTCGAACCCATCGAAGTACAA
GTGCCCCGCTTGCTCTTTGCGTTCTTGTAGCCTTAATTGCGTCAATGATCACAAGCGGCGCAGTGGCTGTACAGGCAAAAGGAAGCAGACCCAATTCGTCCCACTTTCTC
AGTTCAACGATAGTGTTCTTCTTTCTGATTATAATTTGCTGGAGGAAGTGAAGAGGATGGCTGAATCAGCTCAAAGGCTCAGAAAGAAATTGTGCCCCCATACGCATGCT
TATTTTCGGCTACCATTTCACCTTAAAAGTTTGCGCACAGCTGCTTCAAGCCGGAGAACAAGAATCATGTTTCTCCCCACTGGAATGTCGAAAAGGGAGAAAAATCAAAC
TCGATATGACAAGAGGGAGAAAACAATCTTCTGGACAATTGAGTGGCGGTTTAACTCTACGAATATTTTTTTAGTTGACCATGGAGTTAATGAAAACACAAAGCTTCCTA
CCGTTCTTGAAAACCATCTACAACCAAGCCCATGGAAGAATCAACTTCAGAAGTTCTGTGAGCAGCTGGATAGCCTCAAATTTTATGTCCGTACATACCCCAAGGGAGCT
ACTTCACCTTTCCGTGAGCTGGACTCGGGGTTGCCAATAAGACAACTGTTTTACAATTTGGTTTTTGTGGAATACCCTGTTATATATGTTTTTCTACCCTCTCAAACTCC
TAACTTTGAAGTAGTTAAAACTGCCAATCCAGAAGGTAAAAACACTGGAAAAAATGATCTTGCTAGTCCTGAAGGTGTTCCCTTCAGGGTGGAAGAAATAGAAGACGACG
ACAACTCCTTCAATCCTCAGGTGCTTGATCTGATGAAAGTATCAACTTCAAGCCCACGTTGCGAAGTCGACCCCCAAAACCTGCATGGTGCAACATATAATCTCTCTACA
GATTTGATGGGGAAGCATGAAGTTGGGAATAGCCCCAATTCGAGCTCCCAGGCAAAGGAGCTAGGAGTTCTGAAAGAATTGGAGTTCGATTTCGAACAAGATCTAATCGA
TACATATTCAAATATTATGGCACAAATCAATCCAGATGATTTTCTTGATTGGGAAGGAGATTTCTCTAAGGAAGTGGAAATGGAGGGAAGTGGCGACCTTCTCGGGGATG
CGTTCACGGTTGAAGAATTGGAGGAAGGAGAGATTATGGAATAG
mRNA sequenceShow/hide mRNA sequence
CGGCGAAATTCGGAACCGGCGCTCGCTCTGGTGTCCAAGATGGCCGAAGGAGACACGGTAGCAGCAGCAGCAGCTTCCACAAGCTCTAACCGTGAAGGATCATCTTCATC
TCTTTGCGAAGAGTGTAATTCGAACCCATCGAAGTACAAGTGCCCCGCTTGCTCTTTGCGTTCTTGTAGCCTTAATTGCGTCAATGATCACAAGCGGCGCAGTGGCTGTA
CAGGCAAAAGGAAGCAGACCCAATTCGTCCCACTTTCTCAGTTCAACGATAGTGTTCTTCTTTCTGATTATAATTTGCTGGAGGAAGTGAAGAGGATGGCTGAATCAGCT
CAAAGGCTCAGAAAGAAATTGTGCCCCCATACGCATGCTTATTTTCGGCTACCATTTCACCTTAAAAGTTTGCGCACAGCTGCTTCAAGCCGGAGAACAAGAATCATGTT
TCTCCCCACTGGAATGTCGAAAAGGGAGAAAAATCAAACTCGATATGACAAGAGGGAGAAAACAATCTTCTGGACAATTGAGTGGCGGTTTAACTCTACGAATATTTTTT
TAGTTGACCATGGAGTTAATGAAAACACAAAGCTTCCTACCGTTCTTGAAAACCATCTACAACCAAGCCCATGGAAGAATCAACTTCAGAAGTTCTGTGAGCAGCTGGAT
AGCCTCAAATTTTATGTCCGTACATACCCCAAGGGAGCTACTTCACCTTTCCGTGAGCTGGACTCGGGGTTGCCAATAAGACAACTGTTTTACAATTTGGTTTTTGTGGA
ATACCCTGTTATATATGTTTTTCTACCCTCTCAAACTCCTAACTTTGAAGTAGTTAAAACTGCCAATCCAGAAGGTAAAAACACTGGAAAAAATGATCTTGCTAGTCCTG
AAGGTGTTCCCTTCAGGGTGGAAGAAATAGAAGACGACGACAACTCCTTCAATCCTCAGGTGCTTGATCTGATGAAAGTATCAACTTCAAGCCCACGTTGCGAAGTCGAC
CCCCAAAACCTGCATGGTGCAACATATAATCTCTCTACAGATTTGATGGGGAAGCATGAAGTTGGGAATAGCCCCAATTCGAGCTCCCAGGCAAAGGAGCTAGGAGTTCT
GAAAGAATTGGAGTTCGATTTCGAACAAGATCTAATCGATACATATTCAAATATTATGGCACAAATCAATCCAGATGATTTTCTTGATTGGGAAGGAGATTTCTCTAAGG
AAGTGGAAATGGAGGGAAGTGGCGACCTTCTCGGGGATGCGTTCACGGTTGAAGAATTGGAGGAAGGAGAGATTATGGAATAGTGATTAACATTTCAAGGAAGAAGTTCC
CAAGGTTAGCTTCACTCATGCTTGACAAAGGGATTCTTCTTATCATTTGATTATGGTTTGGCGCTTCAGCGCTAAATGGTCTATCAGCAGGAACCCACTTTTGCTTGCAG
CAGAGTGCTTCCCAGTTCCTACTACATTACAGGTAAATCTTGATGCCATTTTACTTCATTGATTTGCCAATCGCTGCAAGCAACATCCTGCATTTTGGTCCATGTTAGAT
TAGATTATCTGTCAGCTCGGCCTTGCTGTTGTTTACACAATCTTGATTTGATTCATTTCATCAATAGAGTTGTATGAGAACATGTATCTCATGGAATCTTATTTGTTCAT
CAGTTCAATTTTGTTTAAACAATACTTTATGTAGGATTTTGTTTATATTTATTTCTTACAAAAGTTTGAAAGTTGATAATCAGTTGGAAAATGAATGATGGAAAAAAAAC
ATGATAATCACTTGAA
Protein sequenceShow/hide protein sequence
MAEGDTVAAAAASTSSNREGSSSSLCEECNSNPSKYKCPACSLRSCSLNCVNDHKRRSGCTGKRKQTQFVPLSQFNDSVLLSDYNLLEEVKRMAESAQRLRKKLCPHTHA
YFRLPFHLKSLRTAASSRRTRIMFLPTGMSKREKNQTRYDKREKTIFWTIEWRFNSTNIFLVDHGVNENTKLPTVLENHLQPSPWKNQLQKFCEQLDSLKFYVRTYPKGA
TSPFRELDSGLPIRQLFYNLVFVEYPVIYVFLPSQTPNFEVVKTANPEGKNTGKNDLASPEGVPFRVEEIEDDDNSFNPQVLDLMKVSTSSPRCEVDPQNLHGATYNLST
DLMGKHEVGNSPNSSSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKEVEMEGSGDLLGDAFTVEELEEGEIME