; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002806 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002806
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionbox C/D snoRNA protein 1-like
Genome locationchr4:45817279..45819831
RNA-Seq ExpressionLag0002806
SyntenyLag0002806
Gene Ontology termsNA
InterPro domainsIPR007529 - Zinc finger, HIT-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593329.1 Box C/D snoRNA protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.6e-21188.21Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEG+    A AAAASTS NREGSSLCEECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKR QT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL
        LRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGVNENT LST+LENHLQPSPWKNQ+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL

Query:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD
        QKFCEQLDSLKFFVRTYPKGA  PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK+AN VSHNPE KNTGK+DLASPEGV FRVEEIEDDD
Subjt:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD

Query:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE
        NSFN QVLDLMK S S SP CEVEP+N+ GAT+NYSTDLMG HEVGNSPNS+SQAKELGV KELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK VE
Subjt:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE

Query:  MEGSGDLLGNAFTVEELEEGEIME
        MEGSGDLLG+ FTV+ELEEGEIME
Subjt:  MEGSGDLLGNAFTVEELEEGEIME

KAG7025676.1 Box C/D snoRNA protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-20883.89Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLS-----------------
        MAEG+    A AAAASTS NREGSSLCEECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKR QT+FVP+SQFNDS+LLS                 
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLS-----------------

Query:  ------DYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGV
              DYNLLEEVKRMAESAQRLRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGV
Subjt:  ------DYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGV

Query:  NENTKLSTILENHLQPSPWKNQLQKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKN
        NENT LST+LENHLQPSPWKNQ+QKFCEQLDSLKFFVRTYPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK+AN VSHNPE KN
Subjt:  NENTKLSTILENHLQPSPWKNQLQKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKN

Query:  TGKNDLASPEGVPFRVEEIEDDDNSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYS
        TGKNDLASPEGV FRVEEIEDDDNSFN QVLDLMK S S SP CEVEP+N+ GAT+NYSTDLMG HEVGNSPNS+SQAKELGV KELEFDFEQDL+DTYS
Subjt:  TGKNDLASPEGVPFRVEEIEDDDNSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYS

Query:  NIMAQINPDDFLDWEGDFSKAVEMEGSGDLLGNAFTVEELEEGEIME
        NIMAQINPDDFLDW+ DFSK VEMEGSGDLLG+ FTV+ELEEGEIME
Subjt:  NIMAQINPDDFLDWEGDFSKAVEMEGSGDLLGNAFTVEELEEGEIME

XP_022959527.1 box C/D snoRNA protein 1-like [Cucurbita moschata]2.3e-21388.68Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEG+    A AAAASTS NREGSSLC+ECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL
        LRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGVNENT LST+LENHLQPSPWKNQ+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL

Query:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD
        QKFCEQLDSLKFFVRTYPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK+AN VSHNPE KNTGKNDLASPEGV FRVEEIEDDD
Subjt:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD

Query:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE
        NSFN QVLDLMK S S SP CEVEP+N+ GAT+NYSTDLMGKHEVGNSPNS+SQAKELGV KELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK VE
Subjt:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE

Query:  MEGSGDLLGNAFTVEELEEGEIME
        MEGSGDLLG+ FTV+ELEEGEIME
Subjt:  MEGSGDLLGNAFTVEELEEGEIME

XP_023004550.1 box C/D snoRNA protein 1 [Cucurbita maxima]3.9e-21388.68Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEG+    A AAAASTS NREGSSLCEECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL
        LRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGVNENT LST+LENHLQPSPWKNQ+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL

Query:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD
        QKFCEQLDSLKFFVRTYPKGA  PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEV+K+AN VSHNPE KNTGKNDL SPEGV FRVEEIEDDD
Subjt:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD

Query:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE
        NSFN QVLDLMK S S SP CEVEP+N+ GAT+NYSTDLMGKHEVGNSPNS+SQAKELGVLKELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK VE
Subjt:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE

Query:  MEGSGDLLGNAFTVEELEEGEIME
        MEGSGDLLG+ FTV+ELEEGEIME
Subjt:  MEGSGDLLGNAFTVEELEEGEIME

XP_023513608.1 box C/D snoRNA protein 1-like [Cucurbita pepo subsp. pepo]6.2e-21188Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEG+    A AAAASTS NREGSSLCEECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL
        LRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGVNENT LST+LENHLQPSPWKNQ+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL

Query:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD
        QKFCEQLDSLKFFVRTYPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK+AN VSHNPE KNTGKNDLASPEGV FRVEEIEDDD
Subjt:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD

Query:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHG-ATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAV
        NSFN QVLDLMK S S SP CEV+P+N+ G AT+ YSTDLMGKHEVGNSPNS+SQAKELGV K+LEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK V
Subjt:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHG-ATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAV

Query:  EMEGSGDLLGNAFTVEELEEGEIME
        EMEGSGDLLG+ FTV+ELEEGEIME
Subjt:  EMEGSGDLLGNAFTVEELEEGEIME

TrEMBL top hitse value%identityAlignment
A0A0A0K6N1 HIT-type domain-containing protein1.4e-19785Show/hide
Query:  MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK
        MAE +AT  A STSSN++GSSLCEEC SNPSKYKCPACS+RSCSL CVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK
Subjt:  MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK

Query:  LCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFC
        LCPYTHAYFRLPFHLKSLR AAS+RRTKIMFLPTGMTKRE NQTRYDKREKTIFWT+EWR NST+IVL++H VNEN+KLSTILENHL+P PWK QLQKF 
Subjt:  LCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFC

Query:  EQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFN
        EQLD LKFFVRTYPKGATS F ELDS LPIRQLFSNL FVEYPVIYV LPSQTPNFEVVK+AN VS N E  N  KNDLAS EGV FRVEEIE+D+NS N
Subjt:  EQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFN

Query:  PQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGS
        PQVLDLMKVSTS SP C+V PRNLHGAT++YST L+GK EVGNSP S+SQA+E GV+KELEFDFEQDLID YSNIMAQINPDDFLDW+GDFSK VEMEGS
Subjt:  PQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGS

Query:  GDLLGNAFTVEELEEGEIME
        G+LLG+AFTVEELEEGEIME
Subjt:  GDLLGNAFTVEELEEGEIME

A0A1S3CEQ2 box C/D snoRNA protein 1-like6.1e-18882.38Show/hide
Query:  MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK
        MAE +AT  A STSSN +GSSLCEEC SNPSKYKCPACS+RSCSL CVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK
Subjt:  MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK

Query:  LCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFC
        LCPYTHAYFRLPFHLKSLR AAS+RRTKIMFLPTGMTKRE NQTRYDKREKTIFWT+EWR NST IVL++H VNEN+KLSTIL NHL+PSPWK QLQKF 
Subjt:  LCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFC

Query:  EQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFN
        EQLD LK FVRTYPKGA SPF ELDS LPIRQLFSNL FVEYPVIYV LP QTPNFEVVK+AN  S N E  N  +NDLAS  GV FRVEEIEDD+NS N
Subjt:  EQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFN

Query:  PQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGS
        PQVLDLMKVSTS SP C+V PRN           L+GK EVGNSP S+SQA+ELGV+KELEFDFEQDLID YSNIMAQINPDDFLDWEGDFSK VEMEGS
Subjt:  PQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGS

Query:  GDLLGNAFTVEELEEGEIME
        G+LLG+AFT EELEEGEIME
Subjt:  GDLLGNAFTVEELEEGEIME

A0A6J1DKM0 box C/D snoRNA protein 11.8e-20386.67Show/hide
Query:  MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK
        MAEG+    AASTSSNRE SSLC+ECNSN SKY CPACS+RSCSLTCVN+HKRRSGCTGKRKQT+FVP+SQFNDSILLSDYNLLEEVKR++ESAQRLRKK
Subjt:  MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKK

Query:  LCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFC
        LCPYTHAYFRLPF LKSL TAASSRRTKI+FLPTGMTKREKNQTRYDKREKTIFWTIEW+ NST+IVL +HGVNENTKLS +LENHLQPSPWKNQLQKF 
Subjt:  LCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFC

Query:  EQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFN
        +QLDS+K FVRTYPKGATSPFRELDSGLPIR+LFSNLV +EYPVIYVFLPSQTPNFEVVK+AN VSH+ E KNTGK+D ASPEGVPFRVEEIE+DD SFN
Subjt:  EQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFN

Query:  PQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGS
        PQVLDLMKVSTS SPRCEVEP+NLHGATYN+S DLM  HEV NSPNS+SQAKE+GVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSK VEMEG+
Subjt:  PQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGS

Query:  GDLLGNAFTVEELEEGEIME
         DLLG+ FTVEELEEGEI+E
Subjt:  GDLLGNAFTVEELEEGEIME

A0A6J1H4S8 box C/D snoRNA protein 1-like1.1e-21388.68Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEG+    A AAAASTS NREGSSLC+ECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL
        LRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGVNENT LST+LENHLQPSPWKNQ+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL

Query:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD
        QKFCEQLDSLKFFVRTYPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK+AN VSHNPE KNTGKNDLASPEGV FRVEEIEDDD
Subjt:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD

Query:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE
        NSFN QVLDLMK S S SP CEVEP+N+ GAT+NYSTDLMGKHEVGNSPNS+SQAKELGV KELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK VE
Subjt:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE

Query:  MEGSGDLLGNAFTVEELEEGEIME
        MEGSGDLLG+ FTV+ELEEGEIME
Subjt:  MEGSGDLLGNAFTVEELEEGEIME

A0A6J1KUW8 box C/D snoRNA protein 11.9e-21388.68Show/hide
Query:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEG+    A AAAASTS NREGSSLCEECNSNPSKYKCPACSLRSCSL CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGE----ATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL
        LRKKLCPYTH Y+RLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNST++VL++HGVNENT LST+LENHLQPSPWKNQ+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQL

Query:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD
        QKFCEQLDSLKFFVRTYPKGA  PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEV+K+AN VSHNPE KNTGKNDL SPEGV FRVEEIEDDD
Subjt:  QKFCEQLDSLKFFVRTYPKGATSPFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD

Query:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE
        NSFN QVLDLMK S S SP CEVEP+N+ GAT+NYSTDLMGKHEVGNSPNS+SQAKELGVLKELEFDFEQDL+DTYSNIMAQINPDDFLDW+ DFSK VE
Subjt:  NSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVE

Query:  MEGSGDLLGNAFTVEELEEGEIME
        MEGSGDLLG+ FTV+ELEEGEIME
Subjt:  MEGSGDLLGNAFTVEELEEGEIME

SwissProt top hitse value%identityAlignment
O74906 Putative box C/D snoRNA protein SPCC613.079.2e-1628.11Show/hide
Query:  SSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF
        S NR G  +C  C  N SKY+CP C  R C L C   HKR + C+G+R    FVP S+  +  L SD+N L  V+R+      + +K         R   
Subjt:  SSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF

Query:  HLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLN-----------STEIVLINHGVNENTKLSTILENHLQP-SPWKNQLQKF-C
        +   L+ +       I F P    KR  N+T YDK+   I W+IEW L+           ++E  +I H   E+  L  I    ++  S   +Q+ K   
Subjt:  HLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLN-----------STEIVLINHGVNENTKLSTILENHLQP-SPWKNQLQKF-C

Query:  EQLDSLKFFVRTYPKGATS-PFRELDSGLPIRQLFSNLVFVEYPVIYVF
           D ++F +++    +    +++++    +     N    E P I+VF
Subjt:  EQLDSLKFFVRTYPKGATS-PFRELDSGLPIRQLFSNLVFVEYPVIYVF

P38772 Box C/D snoRNA protein 12.2e-0923.71Show/hide
Query:  LCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGK----RKQTEFVPLSQFND------SILLSDYNLLEEVKRMA-----ESAQRLRKKLCP---
        LC  C     KYKCP C +++CSL C   HK R  C+G+    ++      L Q +D      + +  DYN L ++KRM      ++  + ++ L P   
Subjt:  LCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGK----RKQTEFVPLSQFND------SILLSDYNLLEEVKRMA-----ESAQRLRKKLCP---

Query:  YTHAYFRLPFHL-KSLRTAASSRR-----TKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRL-----NSTEIVLINHGVNENTKLSTILENHLQPSPW
        +   + +  + + +  R +   +R        + LP GM +  +N++++DK      W++EW L        +  L  H V+   K +  L   +     
Subjt:  YTHAYFRLPFHL-KSLRTAASSRR-----TKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRL-----NSTEIVLINHGVNENTKLSTILENHLQPSPW

Query:  KNQLQKFCE---------------------------QLDSLKFFVRTYPKGATSPFRELDSGLPIR---------QLFSNLVFVEYPVIYV
        KN  QK CE                           Q   LKF+ +T+P   T     +DS   +          +L  N   +E+P I+V
Subjt:  KNQLQKFCE---------------------------QLDSLKFFVRTYPKGATSPFRELDSGLPIR---------QLFSNLVFVEYPVIYV

Q3UFB2 Box C/D snoRNA protein 19.5e-2129.1Show/hide
Query:  SSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF
        S  +   S CE C +  +KY+CP C   SCSL CV  HK    C+G R +T +V L QF +  LLSDY  LE+V R A+   R      P    Y     
Subjt:  SSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF

Query:  HLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSP----WKNQLQKFCEQLDSLKFF
         L  ++  A  +   +  LP G +KR++N T +D R++   W ++ +   ++   I   V ++  ++ IL+ ++ P       + +L+ + +    ++  
Subjt:  HLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSP----WKNQLQKFCEQLDSLKFF

Query:  VRTYPKGATS-PFRELDSGLPIRQLFSNL---VFVEYPVIYVFL
        +R          + ELD   P + L  NL   V +EYP ++V L
Subjt:  VRTYPKGATS-PFRELDSGLPIRQLFSNL---VFVEYPVIYVFL

Q5RF97 Box C/D snoRNA protein 11.2e-2327.14Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF---HLKS
        S CE C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R          A+ + P    H+  
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF---HLKS

Query:  LRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSP----WKNQLQKFCEQLDSLKFFVR-T
        ++  A  +   +  LP G TKR++N T +DK+++   W ++ +   ++   I   V ++  ++ IL+ ++ P       + +L+ +      ++  ++  
Subjt:  LRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSP----WKNQLQKFCEQLDSLKFFVR-T

Query:  YPKGATSPFRELDSGLPIRQLFSNL---VFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKND
        Y +     + ELD   P + L  NL   V +EYP ++V L     + +V++       +   KN G  +
Subjt:  YPKGATSPFRELDSGLPIRQLFSNL---VFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKND

Q9NWK9 Box C/D snoRNA protein 11.0e-2227.07Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT
        S CE C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R      P ++ Y      +  ++ 
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT

Query:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSP----WKNQLQKFCEQLDSLKFFVR-TYPK
         A  +   +  LP G TKR++N T +DK+++   W ++ +   ++   I   V ++  ++ IL+ ++ P       + +L+ +      ++  ++  Y +
Subjt:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSP----WKNQLQKFCEQLDSLKFFVR-TYPK

Query:  GATSPFRELDSGLPIRQLFSNL---VFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKND
             + ELD   P + L  NL   V +EYP ++V L     + +V+        +   KN G  +
Subjt:  GATSPFRELDSGLPIRQLFSNL---VFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKND

Arabidopsis top hitse value%identityAlignment
AT1G04945.1 HIT-type Zinc finger family protein3.2e-9650.27Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT
        S+CEEC  NP KYKCP CS+RSC+L CV AHK+R+GCTGKRK T+ VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   H  ++LP+ LKSL++
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT

Query:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCE-QLDSLKFFVRTYPKGATS
        AA SRRTK+ +LP+GM KRE NQ+RYD R K I WTIEWR +ST+++L++HGV E+  L ++++NHL+P PW ++L+ FC+  LDSLK F+R YPKGA +
Subjt:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCE-QLDSLKFFVRTYPKGATS

Query:  PFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSKSPRCE
        PF+ELD   P+R+  + +V +EYPVI+V+LPSQ+  F+V+K  N     P   ++  +      G+ FR EEIE+DD +SF P+VL LMK   + +P   
Subjt:  PFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSKSPRCE

Query:  VEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQ--AKELGVLKELEFDFEQDLIDTYSNIMAQINPD-DFL
        V  ++              K E   + NSN Q    E      +E +FEQ LIDTYS++ A++NPD DF+
Subjt:  VEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQ--AKELGVLKELEFDFEQDLIDTYSNIMAQINPD-DFL

AT1G04945.2 HIT-type Zinc finger family protein7.3e-10148.28Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT
        S+CEEC  NP KYKCP CS+RSC+L CV AHK+R+GCTGKRK T+ VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   H  ++LP+ LKSL++
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT

Query:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCE-QLDSLKFFVRTYPKGATS
        AA SRRTK+ +LP+GM KRE NQ+RYD R K I WTIEWR +ST+++L++HGV E+  L ++++NHL+P PW ++L+ FC+  LDSLK F+R YPKGA +
Subjt:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCE-QLDSLKFFVRTYPKGATS

Query:  PFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSKSPRCE
        PF+ELD   P+R+  + +V +EYPVI+V+LPSQ+  F+V+K  N     P   ++  +      G+ FR EEIE+DD +SF P+VL LMK   + +P   
Subjt:  PFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSKSPRCE

Query:  VEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQ--AKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGSGDL--LGNAFTVE--E
        V  ++              K E   + NSN Q    E      +E +FEQ LIDTYS++ A++NP D+ ++E +F+K ++ + + +L  L   F  +  +
Subjt:  VEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQ--AKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGSGDL--LGNAFTVE--E

Query:  LEEGEIME
        LEEGEI+E
Subjt:  LEEGEIME

AT1G04945.3 HIT-type Zinc finger family protein7.3e-10148.28Show/hide
Query:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT
        S+CEEC  NP KYKCP CS+RSC+L CV AHK+R+GCTGKRK T+ VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   H  ++LP+ LKSL++
Subjt:  SLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRT

Query:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCE-QLDSLKFFVRTYPKGATS
        AA SRRTK+ +LP+GM KRE NQ+RYD R K I WTIEWR +ST+++L++HGV E+  L ++++NHL+P PW ++L+ FC+  LDSLK F+R YPKGA +
Subjt:  AASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCE-QLDSLKFFVRTYPKGATS

Query:  PFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSKSPRCE
        PF+ELD   P+R+  + +V +EYPVI+V+LPSQ+  F+V+K  N     P   ++  +      G+ FR EEIE+DD +SF P+VL LMK   + +P   
Subjt:  PFRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDD-NSFNPQVLDLMKVSTSKSPRCE

Query:  VEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQ--AKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGSGDL--LGNAFTVE--E
        V  ++              K E   + NSN Q    E      +E +FEQ LIDTYS++ A++NP D+ ++E +F+K ++ + + +L  L   F  +  +
Subjt:  VEPRNLHGATYNYSTDLMGKHEVGNSPNSNSQ--AKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGSGDL--LGNAFTVE--E

Query:  LEEGEIME
        LEEGEI+E
Subjt:  LEEGEIME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAAGGAGAAGCAACTGCAGCAGCAGCTTCCACAAGCTCCAACCGCGAAGGATCATCACTTTGTGAAGAGTGTAATTCGAACCCATCGAAGTACAAGTGCCCCGC
TTGTTCTTTGCGTTCTTGTAGCCTCACTTGCGTCAATGCTCACAAGCGGCGCAGTGGATGTACAGGCAAAAGGAAGCAAACCGAATTCGTCCCGCTTTCTCAGTTCAACG
ACAGTATCCTTCTTTCTGATTATAATTTGCTGGAGGAAGTGAAGAGAATGGCTGAATCTGCTCAAAGGCTCCGAAAGAAATTGTGCCCCTATACTCATGCTTATTTTCGA
CTACCATTTCACCTTAAAAGTTTGCGCACTGCTGCTTCAAGCCGGAGAACAAAAATTATGTTTCTCCCCACTGGGATGACGAAAAGGGAGAAAAATCAAACTCGATATGA
CAAGAGGGAAAAAACAATCTTCTGGACAATTGAGTGGCGGTTGAACTCTACAGAAATTGTTTTAATTAACCATGGAGTTAATGAAAACACAAAGCTTTCTACCATTCTTG
AAAACCATCTACAACCAAGCCCATGGAAAAATCAACTTCAGAAGTTCTGTGAGCAGCTGGATAGCCTCAAATTTTTTGTCCGTACATACCCCAAGGGAGCTACTTCGCCT
TTCCGTGAGCTGGACTCGGGGTTGCCAATTAGACAACTGTTTTCCAATTTGGTTTTTGTGGAATACCCTGTTATATATGTTTTTCTTCCCTCTCAAACTCCCAACTTTGA
AGTAGTTAAATCTGCCAATCTAGTAAGTCATAATCCAGAAGTTAAGAACACAGGAAAAAATGATCTTGCTAGTCCTGAAGGTGTTCCTTTCAGAGTAGAAGAAATAGAAG
ACGATGACAACTCCTTCAATCCTCAGGTGCTTGATCTGATGAAAGTATCAACTTCAAAAAGCCCACGTTGCGAAGTCGAGCCCCGAAACCTGCATGGTGCAACATATAAT
TATTCTACAGATTTGATGGGGAAACATGAAGTTGGGAATAGCCCCAATTCAAACTCCCAGGCCAAGGAGCTGGGGGTTCTGAAAGAATTGGAATTTGATTTTGAGCAAGA
TCTGATAGATACATATTCAAATATCATGGCACAAATCAACCCAGATGATTTTCTTGATTGGGAAGGAGACTTTTCCAAGGCAGTGGAAATGGAAGGAAGCGGTGACCTTC
TCGGGAATGCGTTCACGGTGGAAGAATTGGAGGAAGGAGAGATTATGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAAGGAGAAGCAACTGCAGCAGCAGCTTCCACAAGCTCCAACCGCGAAGGATCATCACTTTGTGAAGAGTGTAATTCGAACCCATCGAAGTACAAGTGCCCCGC
TTGTTCTTTGCGTTCTTGTAGCCTCACTTGCGTCAATGCTCACAAGCGGCGCAGTGGATGTACAGGCAAAAGGAAGCAAACCGAATTCGTCCCGCTTTCTCAGTTCAACG
ACAGTATCCTTCTTTCTGATTATAATTTGCTGGAGGAAGTGAAGAGAATGGCTGAATCTGCTCAAAGGCTCCGAAAGAAATTGTGCCCCTATACTCATGCTTATTTTCGA
CTACCATTTCACCTTAAAAGTTTGCGCACTGCTGCTTCAAGCCGGAGAACAAAAATTATGTTTCTCCCCACTGGGATGACGAAAAGGGAGAAAAATCAAACTCGATATGA
CAAGAGGGAAAAAACAATCTTCTGGACAATTGAGTGGCGGTTGAACTCTACAGAAATTGTTTTAATTAACCATGGAGTTAATGAAAACACAAAGCTTTCTACCATTCTTG
AAAACCATCTACAACCAAGCCCATGGAAAAATCAACTTCAGAAGTTCTGTGAGCAGCTGGATAGCCTCAAATTTTTTGTCCGTACATACCCCAAGGGAGCTACTTCGCCT
TTCCGTGAGCTGGACTCGGGGTTGCCAATTAGACAACTGTTTTCCAATTTGGTTTTTGTGGAATACCCTGTTATATATGTTTTTCTTCCCTCTCAAACTCCCAACTTTGA
AGTAGTTAAATCTGCCAATCTAGTAAGTCATAATCCAGAAGTTAAGAACACAGGAAAAAATGATCTTGCTAGTCCTGAAGGTGTTCCTTTCAGAGTAGAAGAAATAGAAG
ACGATGACAACTCCTTCAATCCTCAGGTGCTTGATCTGATGAAAGTATCAACTTCAAAAAGCCCACGTTGCGAAGTCGAGCCCCGAAACCTGCATGGTGCAACATATAAT
TATTCTACAGATTTGATGGGGAAACATGAAGTTGGGAATAGCCCCAATTCAAACTCCCAGGCCAAGGAGCTGGGGGTTCTGAAAGAATTGGAATTTGATTTTGAGCAAGA
TCTGATAGATACATATTCAAATATCATGGCACAAATCAACCCAGATGATTTTCTTGATTGGGAAGGAGACTTTTCCAAGGCAGTGGAAATGGAAGGAAGCGGTGACCTTC
TCGGGAATGCGTTCACGGTGGAAGAATTGGAGGAAGGAGAGATTATGGAATAG
Protein sequenceShow/hide protein sequence
MAEGEATAAAASTSSNREGSSLCEECNSNPSKYKCPACSLRSCSLTCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFR
LPFHLKSLRTAASSRRTKIMFLPTGMTKREKNQTRYDKREKTIFWTIEWRLNSTEIVLINHGVNENTKLSTILENHLQPSPWKNQLQKFCEQLDSLKFFVRTYPKGATSP
FRELDSGLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKSANLVSHNPEVKNTGKNDLASPEGVPFRVEEIEDDDNSFNPQVLDLMKVSTSKSPRCEVEPRNLHGATYN
YSTDLMGKHEVGNSPNSNSQAKELGVLKELEFDFEQDLIDTYSNIMAQINPDDFLDWEGDFSKAVEMEGSGDLLGNAFTVEELEEGEIME