; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G04070 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G04070
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionbox C/D snoRNA protein 1-like
Genome locationClcChr09:3243622..3246389
RNA-Seq ExpressionClc09G04070
SyntenyClc09G04070
Gene Ontology termsNA
InterPro domainsIPR007529 - Zinc finger, HIT-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022959527.1 box C/D snoRNA protein 1-like [Cucurbita moschata]1.1e-20283.92Show/hide
Query:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEGD     A  AASTS NR+ SSLC EC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL
        LRKKLCPY H Y+RLPFHLKSLRTAASSRRTKI+FLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLV+HGVNEN+ LST+LENHLQPSPWK Q+
Subjt:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL

Query:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD
         KFCEQLDSLKFFVR YPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVS NPEG N GKNDLAS EGV FR+EEIEDDD
Subjt:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD

Query:  NSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEM
        NS N QVLD+MK S SSP CEV+ QN+  ATHNYSTDLMGK EVGNSPNSSSQ KELGV K+ EFDFEQDL+D YSNIMAQINPDDFLDW+ DFSKGVEM
Subjt:  NSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEM

Query:  EGSGELLGDTFTIEELEEGEIME
        EGSG+LLGD FT++ELEEGEIME
Subjt:  EGSGELLGDTFTIEELEEGEIME

XP_023513608.1 box C/D snoRNA protein 1-like [Cucurbita pepo subsp. pepo]2.3e-20284.2Show/hide
Query:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEGD     A  AASTS NR+ SSLC+EC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL
        LRKKLCPY H Y+RLPFHLKSLRTAASSRRTKI+FLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLV+HGVNEN+ LST+LENHLQPSPWK Q+
Subjt:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL

Query:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD
         KFCEQLDSLKFFVR YPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVS NPEG N GKNDLAS EGV FR+EEIEDDD
Subjt:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD

Query:  NSSNPQVLDMMKVSTSSPRCEVDSQN-LHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVE
        NS N QVLD+MK S SSP CEVD QN L +ATH YSTDLMGK EVGNSPNSSSQ KELGV KD EFDFEQDL+D YSNIMAQINPDDFLDW+ DFSKGVE
Subjt:  NSSNPQVLDMMKVSTSSPRCEVDSQN-LHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVE

Query:  MEGSGELLGDTFTIEELEEGEIME
        MEGSG+LLGD FT++ELEEGEIME
Subjt:  MEGSGELLGDTFTIEELEEGEIME

XP_031745217.1 box C/D snoRNA protein 1 [Cucumis sativus]2.0e-20687.8Show/hide
Query:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAE DAT+A STSSN+Q SSLC+ECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE
        CPY HAYFRLPFHLKSLR AAS+RRTKI+FLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLV+H VNENSKLSTILENHL+P PWKTQL KF E
Subjt:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE

Query:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP
        QLD LKFFVR YPKGATS F ELDSTLPIRQLFSNL FVEYPVIYV LPSQTPNFEVVKTANPVSRN EG NA KNDLASHEGV FR+EEIE+D+NS NP
Subjt:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP

Query:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
        QVLD+MKVSTSSP C+V  +NLH ATH+YST L+GKQEVGNSP SSSQ +E GVVK+ EFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSK VEMEGSGE
Subjt:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE

Query:  LLGDTFTIEELEEGEIME
        LLGD FT+EELEEGEIME
Subjt:  LLGDTFTIEELEEGEIME

XP_038896194.1 LOW QUALITY PROTEIN: box C/D snoRNA protein 1-like [Benincasa hispida]1.3e-20588.76Show/hide
Query:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAEGDAT  ASTS N QESSLC ECKSNPSKYKCPACSI SCSLNCVNAHKRR GCTGKRKQT+FVPLSQFNDSIL SDYN LE+VKRMAESAQ LRKKL
Subjt:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE
        CPY HAY    FHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEW FNSTDIVLV+HGVNENSKLSTILENH QP PWK QL KFCE
Subjt:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE

Query:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP
        QLDSLKFFVR YPKGATSPFRELDS LPIR+LFSNLVFVEYPVIYVFLPSQTPNFEVVKTANP+SRNPEGTNAG+NDLASHEGVSFR+EEIEDDD     
Subjt:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP

Query:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
           D+MKVSTSSPRCEVD QNLH+ATHNYSTDLMGKQEVGNSPNSSSQ KE+GVVK+FEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
Subjt:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE

Query:  LLGDTFTIEELEEGEIME
        LLGD F  EEL EGEIME
Subjt:  LLGDTFTIEELEEGEIME

XP_038900096.1 box C/D snoRNA protein 1-like [Benincasa hispida]2.7e-22293.3Show/hide
Query:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAEGDAT  ASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQR RKKL
Subjt:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE
        CPY HAYFRLPFHLKSLRTAASSRRTKI+FLPTGMTKRENNQTRYDKREKTIFWTMEWRFNS DIVLV+HGVNENSKLSTILENHLQPSPWK QL KFCE
Subjt:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE

Query:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP
        QLDSLKFFVR YPKGATSPFRELDS LPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANP+SRNPEGTNAGKN+LASHEGVSFR+EEIEDDDNS NP
Subjt:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP

Query:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
        QVLD+M+VST SPRCEVD QNLH+ATH YS DLMGKQE GNSPNSSSQ KELGVVK+FEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
Subjt:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE

Query:  LLGDTFTIEELEEGEIME
        LLGD FT+EELEEGEIME
Subjt:  LLGDTFTIEELEEGEIME

TrEMBL top hitse value%identityAlignment
A0A0A0K6N1 HIT-type domain-containing protein9.9e-20787.8Show/hide
Query:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAE DAT+A STSSN+Q SSLC+ECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE
        CPY HAYFRLPFHLKSLR AAS+RRTKI+FLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLV+H VNENSKLSTILENHL+P PWKTQL KF E
Subjt:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE

Query:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP
        QLD LKFFVR YPKGATS F ELDSTLPIRQLFSNL FVEYPVIYV LPSQTPNFEVVKTANPVSRN EG NA KNDLASHEGV FR+EEIE+D+NS NP
Subjt:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP

Query:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
        QVLD+MKVSTSSP C+V  +NLH ATH+YST L+GKQEVGNSP SSSQ +E GVVK+ EFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSK VEMEGSGE
Subjt:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE

Query:  LLGDTFTIEELEEGEIME
        LLGD FT+EELEEGEIME
Subjt:  LLGDTFTIEELEEGEIME

A0A1S3CEQ2 box C/D snoRNA protein 1-like2.4e-19785.17Show/hide
Query:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAE DAT+A STSSN Q SSLC+ECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE
        CPY HAYFRLPFHLKSLR AAS+RRTKI+FLPTGMTKRENNQTRYDKREKTIFWTMEWRFNST+IVLV+H VNENSKLSTIL NHL+PSPWKTQL KF E
Subjt:  CPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE

Query:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP
        QLD LK FVR YPKGA SPF ELDSTLPIRQLFSNL FVEYPVIYV LP QTPNFEVVKTANP SRN EG+NA +NDLASH GV FR+EEIEDD+NS NP
Subjt:  QLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNP

Query:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE
        QVLD+MKVSTSSP C+V  +N           L+GKQEVGNSP SSSQ +ELGVVK+ EFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSK VEMEGSGE
Subjt:  QVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGE

Query:  LLGDTFTIEELEEGEIME
        LLGD FT EELEEGEIME
Subjt:  LLGDTFTIEELEEGEIME

A0A5D3CJA8 Box C/D snoRNA protein 1-like2.5e-19485.16Show/hide
Query:  LAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAY
        +A STSSN Q SSLC+ECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQT+FVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPY HAY
Subjt:  LAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAY

Query:  FRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCEQLDSLKF
        FRLPFHLKSLR AAS+RRTKI+FLPTGMTKRENNQTRYDKREKTIFWTMEWRFNST+IVLV+H VNENSKLSTIL NHL+PSPWKTQL KF EQLD LK 
Subjt:  FRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCEQLDSLKF

Query:  FVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNPQVLDMMK
        FVR YPKGA SPF ELDSTLPIRQLFSNL FVEYPVIYV LP QTPNFEVVKTANP SRN EG+NA +NDLASH GV FR+EEIEDD+NS NPQVLD+MK
Subjt:  FVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNPQVLDMMK

Query:  VSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGELLGDTFT
        VSTSSP C+V  +N           L+GKQEVGNSP SSSQ +ELGVVK+ EFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSK VEMEGSGELLGD FT
Subjt:  VSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGELLGDTFT

Query:  IEELEEGEIME
         EELEEGEIME
Subjt:  IEELEEGEIME

A0A6J1H4S8 box C/D snoRNA protein 1-like5.1e-20383.92Show/hide
Query:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEGD     A  AASTS NR+ SSLC EC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL
        LRKKLCPY H Y+RLPFHLKSLRTAASSRRTKI+FLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLV+HGVNEN+ LST+LENHLQPSPWK Q+
Subjt:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL

Query:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD
         KFCEQLDSLKFFVR YPKGA +PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVS NPEG N GKNDLAS EGV FR+EEIEDDD
Subjt:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD

Query:  NSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEM
        NS N QVLD+MK S SSP CEV+ QN+  ATHNYSTDLMGK EVGNSPNSSSQ KELGV K+ EFDFEQDL+D YSNIMAQINPDDFLDW+ DFSKGVEM
Subjt:  NSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEM

Query:  EGSGELLGDTFTIEELEEGEIME
        EGSG+LLGD FT++ELEEGEIME
Subjt:  EGSGELLGDTFTIEELEEGEIME

A0A6J1KUW8 box C/D snoRNA protein 12.5e-20283.45Show/hide
Query:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAEGD     A  AASTS NR+ SSLC+EC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQT+FVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEGD-----ATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL
        LRKKLCPY H Y+RLPFHLKSLRTAASSRRTKI+FLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLV+HGVNEN+ LST+LENHLQPSPWK Q+
Subjt:  LRKKLCPYAHAYFRLPFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQL

Query:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD
         KFCEQLDSLKFFVR YPKGA  PFRELDS +PIRQLFSNLVFVEYPVIYVFLPSQTPNFEV+KTANPVS NPEG N GKNDL S EGV FR+EEIEDDD
Subjt:  HKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD

Query:  NSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEM
        NS N QVLD+MK S SSP CEV+ QN+  ATHNYSTDLMGK EVGNSPNSSSQ KELGV+K+ EFDFEQDL+D YSNIMAQINPDDFLDW+ DFSKGVEM
Subjt:  NSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEM

Query:  EGSGELLGDTFTIEELEEGEIME
        EGSG+LLGD FT++ELEEGEIME
Subjt:  EGSGELLGDTFTIEELEEGEIME

SwissProt top hitse value%identityAlignment
O74906 Putative box C/D snoRNA protein SPCC613.071.3e-1423.27Show/hide
Query:  LCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSLRTA
        +C  C+ N SKY+CP C  R C L C   HKR + C+G+R    FVP S+  +  L SD+N L  V+R+    +    ++   A        +   L+ +
Subjt:  LCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSLRTA

Query:  ASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCEQ---------------LDSL
               I F P    KR  N+T YDK+   I W++EW  + +     +   +E S+ + I  +H +  P +    K  E+                D +
Subjt:  ASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCEQ---------------LDSL

Query:  KFFVRMYPKGATS-PFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVS-FRMEEIEDDDNSSNPQVL
        +F ++     +    +++++ +  +     N    E P I+VF  +   + E     +   ++ + +++      S E  S   + E+ ++  +++P   
Subjt:  KFFVRMYPKGATS-PFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVS-FRMEEIEDDDNSSNPQVL

Query:  DMMKVSTSSPRCEVDSQN
            V TSS    V  QN
Subjt:  DMMKVSTSSPRCEVDSQN

P38772 Box C/D snoRNA protein 19.2e-0822.19Show/hide
Query:  LCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGK----RKQTEFVPLSQFND------SILLSDYNLLEEVKRMA-----ESAQRLRKKLCP---
        LC  C     KYKCP C +++CSL C   HK R  C+G+    ++      L Q +D      + +  DYN L ++KRM      ++  + ++ L P   
Subjt:  LCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGK----RKQTEFVPLSQFND------SILLSDYNLLEEVKRMA-----ESAQRLRKKLCP---

Query:  YAHAYFRLPFHL-KSLRTAASSRR-----TKILFLPTGMTKRENNQTRYDKREKTIFWTMEW--------------------RFNSTDIVLVEHGVNENS
        +   + +  + + +  R +   +R        L LP GM +   N++++DK      W++EW                    R   TD ++   G N   
Subjt:  YAHAYFRLPFHL-KSLRTAASSRR-----TKILFLPTGMTKRENNQTRYDKREKTIFWTMEW--------------------RFNSTDIVLVEHGVNENS

Query:  KLSTIL-----------ENHLQPSPWKTQLHKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIR---------QLFSNLVFVEYPVIYVFLPS------
        K                E+  +    +TQ+     Q   LKF+ + +P   T     +DS   +          +L  N   +E+P I+V +        
Subjt:  KLSTIL-----------ENHLQPSPWKTQLHKFCEQLDSLKFFVRMYPKGATSPFRELDSTLPIR---------QLFSNLVFVEYPVIYVFLPS------

Query:  -----QTPN-FEVVKTANPVSRNP-EGTNAGKNDLASHEGVSFRMEEIEDDDNSSN
             Q P   E   T N    N  E  +A ++   + E V    ++  D D+ S+
Subjt:  -----QTPN-FEVVKTANPVSRNP-EGTNAGKNDLASHEGVSFRMEEIEDDDNSSN

Q3UFB2 Box C/D snoRNA protein 11.2e-2029.1Show/hide
Query:  SSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPF
        S  +   S C+ C +  +KY+CP C   SCSL CV  HK    C+G R +T +V L QF +  LLSDY  LE+V R A+   R      P    Y     
Subjt:  SSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPF

Query:  HLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSP----WKTQLHKFCEQLDSLKFF
         L  ++  A  +   +  LP G +KR+ N T +D R++   W ++ +F  +    +E  V ++  ++ IL+ ++ P       + +L  + +    ++  
Subjt:  HLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSP----WKTQLHKFCEQLDSLKFF

Query:  VRMYPKGATS-PFRELDSTLPIRQLFSNL---VFVEYPVIYVFL
        +R+         + ELD   P + L  NL   V +EYP ++V L
Subjt:  VRMYPKGATS-PFRELDSTLPIRQLFSNL---VFVEYPVIYVFL

Q5RF97 Box C/D snoRNA protein 11.2e-2327.02Show/hide
Query:  SLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPF---HLKS
        S C+ C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R          A+ + P    H+  
Subjt:  SLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPF---HLKS

Query:  LRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSP----WKTQLHKFCEQLDSLKFFVRM-
        ++  A  +   +  LP G TKR+ N T +DK+++   W ++ +F  +    +E  V ++  ++ IL+ ++ P       + +L  +      ++  +++ 
Subjt:  LRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSP----WKTQLHKFCEQLDSLKFFVRM-

Query:  YPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK
        Y +     + ELD    +     N V +EYP ++V L     + +V++
Subjt:  YPKGATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVK

Q9NWK9 Box C/D snoRNA protein 11.0e-2227.05Show/hide
Query:  SLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSLRT
        S C+ C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R      P ++ Y      +  ++ 
Subjt:  SLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSLRT

Query:  AASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSP----WKTQLHKFCEQLDSLKFFVRM-YPK
         A  +   +  LP G TKR+ N T +DK+++   W ++ +F  +    +E  V ++  ++ IL+ ++ P       + +L  +      ++  +++ Y +
Subjt:  AASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSP----WKTQLHKFCEQLDSLKFFVRM-YPK

Query:  GATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVV
             + ELD    +     N V +EYP ++V L     + +V+
Subjt:  GATSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVV

Arabidopsis top hitse value%identityAlignment
AT1G04945.1 HIT-type Zinc finger family protein3.1e-9648.45Show/hide
Query:  ESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSL
        + S+C+ECK NP KYKCP CSIRSC+L CV AHK+R+GCTGKRK T+ VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   ++Y +LP+ LKSL
Subjt:  ESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSL

Query:  RTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE-QLDSLKFFVRMYPKGA
        ++AA SRRTK+ +LP+GM KRENNQ+RYD R K I WT+EWRF+STD++LV+HGV E+  L ++++NHL+P PW  +L  FC+  LDSLK F+R YPKGA
Subjt:  RTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE-QLDSLKFFVRMYPKGA

Query:  TSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD-NSSNPQVLDMMKVSTSSPRC
         +PF+ELD   P+R+  + +V +EYPVI+V+LPSQ+  F+V+K  N     P   ++  +      G++FR EEIE+DD +S  P+VL +MK    +P  
Subjt:  TSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD-NSSNPQVLDMMKVSTSSPRC

Query:  EVD----SQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPD-DFLDWEGDFSKGVEME
         V     ++ + +   N   D   +++ GN                 E +FEQ LID YS++ A++NPD DF+    D  +G  +E
Subjt:  EVD----SQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPD-DFLDWEGDFSKGVEME

AT1G04945.2 HIT-type Zinc finger family protein5.5e-10147.69Show/hide
Query:  ESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSL
        + S+C+ECK NP KYKCP CSIRSC+L CV AHK+R+GCTGKRK T+ VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   ++Y +LP+ LKSL
Subjt:  ESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSL

Query:  RTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE-QLDSLKFFVRMYPKGA
        ++AA SRRTK+ +LP+GM KRENNQ+RYD R K I WT+EWRF+STD++LV+HGV E+  L ++++NHL+P PW  +L  FC+  LDSLK F+R YPKGA
Subjt:  RTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE-QLDSLKFFVRMYPKGA

Query:  TSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD-NSSNPQVLDMMKVSTSSPRC
         +PF+ELD   P+R+  + +V +EYPVI+V+LPSQ+  F+V+K  N     P   ++  +      G++FR EEIE+DD +S  P+VL +MK    +P  
Subjt:  TSPFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD-NSSNPQVLDMMKVSTSSPRC

Query:  EVD----SQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGEL--LGDTFTIE
         V     ++ + +   N   D   +++ GN                 E +FEQ LID YS++ A++NP D+ ++E +F+KG++ + +  L  L   F  +
Subjt:  EVD----SQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGEL--LGDTFTIE

Query:  --ELEEGEIME
          +LEEGEI+E
Subjt:  --ELEEGEIME

AT1G04945.3 HIT-type Zinc finger family protein5.5e-10147.92Show/hide
Query:  SLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSLRT
        S+C+ECK NP KYKCP CSIRSC+L CV AHK+R+GCTGKRK T+ VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   ++Y +LP+ LKSL++
Subjt:  SLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRLPFHLKSLRT

Query:  AASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE-QLDSLKFFVRMYPKGATS
        AA SRRTK+ +LP+GM KRENNQ+RYD R K I WT+EWRF+STD++LV+HGV E+  L ++++NHL+P PW  +L  FC+  LDSLK F+R YPKGA +
Subjt:  AASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCE-QLDSLKFFVRMYPKGATS

Query:  PFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD-NSSNPQVLDMMKVSTSSPRCEV
        PF+ELD   P+R+  + +V +EYPVI+V+LPSQ+  F+V+K  N     P   ++  +      G++FR EEIE+DD +S  P+VL +MK    +P   V
Subjt:  PFRELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDD-NSSNPQVLDMMKVSTSSPRCEV

Query:  D----SQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGEL--LGDTFTIE--
             ++ + +   N   D   +++ GN                 E +FEQ LID YS++ A++NP D+ ++E +F+KG++ + +  L  L   F  +  
Subjt:  D----SQNLHSATHNYSTDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGEL--LGDTFTIE--

Query:  ELEEGEIME
        +LEEGEI+E
Subjt:  ELEEGEIME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAAGGAGATGCAACTCTGGCAGCTTCCACAAGCTCCAACCGCCAAGAATCATCACTTTGCCAAGAGTGTAAATCAAACCCATCAAAATACAAGTGCCCCGCTTG
CTCAATCCGTTCTTGTAGCCTCAATTGCGTCAATGCCCACAAGCGCCGCAGTGGCTGTACTGGCAAGAGGAAACAAACCGAATTCGTCCCGCTTTCTCAGTTCAATGATA
GTATCCTTCTTTCTGATTATAATTTGCTGGAGGAAGTGAAAAGGATGGCTGAATCAGCTCAAAGACTTAGAAAGAAATTGTGCCCTTATGCTCATGCTTACTTTCGATTA
CCGTTTCACCTTAAAAGTTTGCGCACTGCTGCTTCAAGCAGGAGAACAAAAATCCTGTTTCTCCCCACAGGAATGACTAAAAGGGAGAACAATCAAACTCGTTATGACAA
GAGGGAAAAAACAATCTTCTGGACGATGGAATGGCGGTTTAACTCTACAGATATTGTTTTAGTTGAGCATGGAGTTAATGAAAACTCAAAGCTTTCTACCATTCTTGAAA
ACCATCTACAACCAAGTCCATGGAAAACTCAACTTCATAAGTTCTGTGAGCAGTTGGATAGCCTCAAATTTTTCGTCCGTATGTACCCCAAGGGAGCTACATCGCCTTTT
CGTGAGCTGGACTCGACGTTGCCAATAAGACAACTGTTTTCCAATTTGGTTTTCGTGGAATACCCTGTTATATATGTTTTTCTACCCTCTCAAACTCCTAACTTTGAAGT
AGTTAAAACTGCCAATCCAGTGAGTCGTAATCCAGAAGGTACAAATGCCGGAAAAAATGATCTTGCTAGTCATGAAGGCGTTTCCTTCAGGATGGAAGAAATAGAAGATG
ATGACAACTCCAGCAATCCGCAGGTTCTTGATATGATGAAAGTATCAACTTCAAGCCCACGTTGCGAAGTCGACTCCCAAAACCTGCATAGTGCAACACATAATTACTCT
ACGGATTTGATGGGGAAGCAGGAAGTTGGGAATAGCCCCAATTCAAGCTCCCAGGGCAAGGAGCTAGGGGTTGTGAAAGACTTCGAGTTTGATTTTGAGCAAGATCTGAT
AGATGCATACTCAAATATCATGGCACAAATCAATCCAGATGATTTTCTTGATTGGGAAGGAGACTTTTCCAAGGGAGTAGAAATGGAAGGAAGTGGTGAACTTCTCGGGG
ACACGTTCACGATTGAAGAACTGGAGGAAGGAGAGATTATGGAATAG
mRNA sequenceShow/hide mRNA sequence
TGGGGCTTTCAATTGTATCGGCCCACGCTTTATTGAAACAAAGTTTCGATTTTGAAGCAATAAATTGCAATTGTGCAAACGAAACTATATAAAGCAGATTCAAATTACGT
AACCTAGTCGTCGTCGCCGGTGAAATTCGGAACCGGCGCTCGCTCCGTTGTCCAAGATGGCCGAAGGAGATGCAACTCTGGCAGCTTCCACAAGCTCCAACCGCCAAGAA
TCATCACTTTGCCAAGAGTGTAAATCAAACCCATCAAAATACAAGTGCCCCGCTTGCTCAATCCGTTCTTGTAGCCTCAATTGCGTCAATGCCCACAAGCGCCGCAGTGG
CTGTACTGGCAAGAGGAAACAAACCGAATTCGTCCCGCTTTCTCAGTTCAATGATAGTATCCTTCTTTCTGATTATAATTTGCTGGAGGAAGTGAAAAGGATGGCTGAAT
CAGCTCAAAGACTTAGAAAGAAATTGTGCCCTTATGCTCATGCTTACTTTCGATTACCGTTTCACCTTAAAAGTTTGCGCACTGCTGCTTCAAGCAGGAGAACAAAAATC
CTGTTTCTCCCCACAGGAATGACTAAAAGGGAGAACAATCAAACTCGTTATGACAAGAGGGAAAAAACAATCTTCTGGACGATGGAATGGCGGTTTAACTCTACAGATAT
TGTTTTAGTTGAGCATGGAGTTAATGAAAACTCAAAGCTTTCTACCATTCTTGAAAACCATCTACAACCAAGTCCATGGAAAACTCAACTTCATAAGTTCTGTGAGCAGT
TGGATAGCCTCAAATTTTTCGTCCGTATGTACCCCAAGGGAGCTACATCGCCTTTTCGTGAGCTGGACTCGACGTTGCCAATAAGACAACTGTTTTCCAATTTGGTTTTC
GTGGAATACCCTGTTATATATGTTTTTCTACCCTCTCAAACTCCTAACTTTGAAGTAGTTAAAACTGCCAATCCAGTGAGTCGTAATCCAGAAGGTACAAATGCCGGAAA
AAATGATCTTGCTAGTCATGAAGGCGTTTCCTTCAGGATGGAAGAAATAGAAGATGATGACAACTCCAGCAATCCGCAGGTTCTTGATATGATGAAAGTATCAACTTCAA
GCCCACGTTGCGAAGTCGACTCCCAAAACCTGCATAGTGCAACACATAATTACTCTACGGATTTGATGGGGAAGCAGGAAGTTGGGAATAGCCCCAATTCAAGCTCCCAG
GGCAAGGAGCTAGGGGTTGTGAAAGACTTCGAGTTTGATTTTGAGCAAGATCTGATAGATGCATACTCAAATATCATGGCACAAATCAATCCAGATGATTTTCTTGATTG
GGAAGGAGACTTTTCCAAGGGAGTAGAAATGGAAGGAAGTGGTGAACTTCTCGGGGACACGTTCACGATTGAAGAACTGGAGGAAGGAGAGATTATGGAATAGTGATTAG
CATTTCAAGGAACAGTTTTTCAAGGGAATGATATATCAGCAGGAACCAAATTTTGCTTAAGTAGAGCTTCTCAGTTCATACACTATACTACAGGTAAATCTTGATACCAT
TGTACTTCATTGATTTGCCCATCTCTGCAAGCAACATCCTGCATTTTGACTCATGCTAGAGTGGAACAATCTGGAATCATCTGTTAGCTCAACTTTGTTGTTTACACTTA
CAGCTAATCTTGATTTGATTTATCATATCAATGAGTTGTATGAGAACATGTATCTTCTGTTTAACTTTATTTCTCACAAAGTTGATAATCAAAATAGGAATATGAATTAT
AGGAGAAAAAA
Protein sequenceShow/hide protein sequence
MAEGDATLAASTSSNRQESSLCQECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTEFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYAHAYFRL
PFHLKSLRTAASSRRTKILFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVEHGVNENSKLSTILENHLQPSPWKTQLHKFCEQLDSLKFFVRMYPKGATSPF
RELDSTLPIRQLFSNLVFVEYPVIYVFLPSQTPNFEVVKTANPVSRNPEGTNAGKNDLASHEGVSFRMEEIEDDDNSSNPQVLDMMKVSTSSPRCEVDSQNLHSATHNYS
TDLMGKQEVGNSPNSSSQGKELGVVKDFEFDFEQDLIDAYSNIMAQINPDDFLDWEGDFSKGVEMEGSGELLGDTFTIEELEEGEIME